Model HQ Documentation

Model HQ includes proprietary high-performance parsers designed to process PDFs and other common document types with exceptional speed and accuracy. These parsers extract text, structure, and metadata efficiently, supporting a wide range of use cases for retrieval, RAG workflows, and agentic AI.

However, some documents—particularly PDFs—may present unique challenges:

Image-based PDFs: Certain PDFs are saved entirely as images (e.g., scanned documents), meaning there is no underlying text layer to extract. These require OCR (Optical Character Recognition) to process the content.

Permission-restricted PDFs: Some PDFs use encrypted permissions that limit actions such as text extraction, copying, or printing. These restrictions may prevent direct parsing without prior decryption or appropriate access rights.

When such files are encountered, additional preprocessing steps may be required before they can be ingested and indexed by Model HQ.
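As an illustration of such a preprocessing step, the following minimal sketch flags PDFs that are likely to need special handling before ingestion. It is not part of Model HQ: the function name `preflight_pdf` and its return labels are hypothetical. It relies on a rough byte-level heuristic: an `/Encrypt` entry suggests encryption or permission restrictions, and image objects with no font resources suggest a scanned, image-only file. PDFs that store objects in compressed streams can hide these markers, so treat the result as a hint, not a guarantee.

```python
import re


def preflight_pdf(data: bytes) -> str:
    """Roughly classify raw PDF bytes before ingestion (heuristic only).

    Hypothetical helper for illustration; not a Model HQ API.
    """
    if not data.lstrip().startswith(b"%PDF-"):
        return "not_pdf"
    # An /Encrypt entry signals encryption or permission restrictions.
    if re.search(rb"/Encrypt\b", data):
        return "restricted"
    # A text layer references font resources; scanned pages usually embed only images.
    has_font = re.search(rb"/Font\b", data) is not None
    has_image = re.search(rb"/Subtype\s*/Image\b", data) is not None
    if has_image and not has_font:
        return "image_only"
    return "text" if has_font else "unknown"


if __name__ == "__main__":
    import sys

    # Usage: python preflight.py file1.pdf file2.pdf ...
    for path in sys.argv[1:]:
        with open(path, "rb") as handle:
            print(path, preflight_pdf(handle.read()))
```

Files reported as `restricted` or `image_only` are the ones most likely to fail native parsing and to benefit from the OCR option described later on this page.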

1. Handling documents that cannot be natively parsed

In some cases, a document cannot be parsed by Model HQ's native proprietary parsers because it is fully image-based or has restrictive permissions.

1.1 In Chat

When attempting to load such a document, the system will respond with:

“Unfortunately, source could not be loaded.”

1.2 In Agents

The Parsing step within the workload will complete but report 0 text blocks parsed, indicating that no extractable text was found. As a result, the RAG Answer step will not return responses.

2. Fallback solution

Model HQ includes an embedded OCR (Optical Character Recognition) capability that can be activated to process such documents. Enabling OCR allows Model HQ to detect and extract text from images, scanned pages, or restricted PDFs, making them fully searchable and usable in RAG and agent workflows.
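The fallback logic can be sketched in a few lines. This is a conceptual illustration only: `parse_with_ocr_fallback`, `native_parse`, and `ocr_parse` are hypothetical names, not Model HQ functions, and the two parsers are passed in as plain callables that return a list of text blocks. The sketch assumes that an empty result from the native parser is the signal to switch to OCR, which mirrors the "0 text blocks parsed" symptom described in section 1.2.

```python
from typing import Callable, List, Tuple


def parse_with_ocr_fallback(
    path: str,
    native_parse: Callable[[str], List[str]],
    ocr_parse: Callable[[str], List[str]],
) -> Tuple[str, List[str]]:
    """Try the native parser first; fall back to OCR when it yields no text blocks.

    Both parser arguments are hypothetical stand-ins supplied by the caller.
    """
    blocks = native_parse(path)
    if blocks:
        return "native", blocks
    # Zero text blocks is the symptom of an image-based or restricted PDF.
    return "ocr", ocr_parse(path)


if __name__ == "__main__":
    # Stub parsers for demonstration: the "scanned" file has no text layer.
    def fake_native(p: str) -> List[str]:
        return [] if p.endswith("scan.pdf") else ["native block"]

    def fake_ocr(p: str) -> List[str]:
        return ["ocr block"]

    print(parse_with_ocr_fallback("scan.pdf", fake_native, fake_ocr))
    print(parse_with_ocr_fallback("report.pdf", fake_native, fake_ocr))
```

Trying the native parser first keeps the fast path for ordinary PDFs and reserves the slower OCR pass for files that actually need it.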

2.1 How to activate OCR:

In Chat or Bots, select the '⚙️' icon beneath the chat box in the Dialogue section.

Once in RAG + Generation Config Options, follow these steps:

Navigate to PDF Parsing Options, select OCR, and click '>'. The document will then be processed via OCR, and a previously locked PDF document will become searchable.

In Agents, when in Visual editing mode, OCR can be selected in place of Parse Documents or RAG Answer. As of January 2026, OCR is available on Intel AI PC only.

For further assistance or to share feedback, please contact us at support@aibloks.com

Troubleshooting Document Parsing and Limitations