The TrueParser Parsing Engine is the core execution layer of the platform. It is a unified orchestration system that sits in front of multiple specialized, independently versioned parsing engines (GIS, CAD, SQL, PDF, etc.) and exposes them through a single, consistent API. Unlike monolithic parsers, TrueParser uses a scalable architecture to handle technical and enterprise document extraction at scale. TrueParser does not expose a direct synchronous parsing mode. All document parsing requests enter the asynchronous worker pipeline and are completed through status polling and result retrieval.Documentation Index
Fetch the complete documentation index at: https://docs.trueparser.com/llms.txt
Use this file to discover all available pages before exploring further.
Core Concepts
Unified Orchestration
TrueParser acts as a product surface for diverse technical formats. You don’t need to implement individual library logic for spatial data, engineering drawings, or SQL dialects. The platform handles the intake, identifies the document type, and routes it to the optimal engine.Streaming-First Philosophy
Built for high-performance and low memory overhead, the engine is “stream-in, stream-out.”- Input: Documents are received as byte streams.
- Output: Results are emitted as NDJSON (Newline Delimited JSON) streams.
Asynchronous Execution
Document parsing is always asynchronous. When you submit a document, the system accepts the job, queues it for worker processing, and persists the final result for retrieval once the job is complete.How it Works
- Intake: You submit a document via the Parsing API.
- Identification: The system detects the format or follows your explicit routing instructions.
- Processing: Specialized worker pipelines consume the document and generate structured data.
- Materialization: The engine’s raw output is transformed into a canonical JSON representation.
- Persistence: The final artifact is stored in tenant-scoped S3-compatible storage for retrieval.
- Retrieval: You poll job status and fetch the final result when processing is complete.

