Public result contracts

These pages describe the public, client-facing result contract for each family. Results are produced asynchronously, and the API is the only supported retrieval path. Each family defines its own fields, but the outer envelope keeps the same shape across families, so you can rely on one general structure and read the family page that matches the result you requested.

Supported formats

Family | Supported formats
CAD | DWG, DXF
GIS | SHPZIP, SHP, GeoJSON, GPKG, KML, KMZ, GML, CSV, FGB, FileGDB, MapInfo, SpatiaLite, EsriJson, GeoJSONSeq, TopoJSON, GeoRss
MailKit | EML, MBOX, EMLX, MHT, MHTML, TNEF, WinmailDat, P7S, P7M, MSG, PST, OST
MsOffice | DOCX, DOC, RTF, DOTX, DOCM, XLSX, XLS, CSV, TSV, XLSM, PPTX, PPT, PPTM, POTX, POTM, HTML, HTM, MD, Markdown, TXT
OpenDoc | ODT, ODS, ODP, ODG, ODF, FODT, FODS, FODP, EPUB, IDML, DBF, DIF, MIF
PDF | PDF
Parquet | Parquet
SQL | SQL
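
The format lists above can be turned into a small lookup. The following is a minimal sketch, not part of the API: the names `SUPPORTED_FORMATS` and `families_for` are hypothetical, and the matching is by format name as written in the table, which may not always equal a file extension. Note that CSV is listed under both GIS and MsOffice, so a lookup can return more than one family.

```python
# Format names copied from the table above, keyed by family.
SUPPORTED_FORMATS = {
    "CAD": ["DWG", "DXF"],
    "GIS": ["SHPZIP", "SHP", "GeoJSON", "GPKG", "KML", "KMZ", "GML", "CSV",
            "FGB", "FileGDB", "MapInfo", "SpatiaLite", "EsriJson",
            "GeoJSONSeq", "TopoJSON", "GeoRss"],
    "MailKit": ["EML", "MBOX", "EMLX", "MHT", "MHTML", "TNEF", "WinmailDat",
                "P7S", "P7M", "MSG", "PST", "OST"],
    "MsOffice": ["DOCX", "DOC", "RTF", "DOTX", "DOCM", "XLSX", "XLS", "CSV",
                 "TSV", "XLSM", "PPTX", "PPT", "PPTM", "POTX", "POTM", "HTML",
                 "HTM", "MD", "Markdown", "TXT"],
    "OpenDoc": ["ODT", "ODS", "ODP", "ODG", "ODF", "FODT", "FODS", "FODP",
                "EPUB", "IDML", "DBF", "DIF", "MIF"],
    "PDF": ["PDF"],
    "Parquet": ["Parquet"],
    "SQL": ["SQL"],
}


def families_for(fmt: str) -> list[str]:
    """Return every family whose list contains fmt (case-insensitive)."""
    wanted = fmt.strip().lstrip(".").casefold()
    return [
        family
        for family, formats in SUPPORTED_FORMATS.items()
        if any(name.casefold() == wanted for name in formats)
    ]


print(families_for("csv"))   # ['GIS', 'MsOffice']
print(families_for(".dxf"))  # ['CAD']
```

When a format maps to more than one family, the caller still has to decide which family's result to request; this page does not say how the service resolves that choice.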

Upload limits

Upload limits depend on the ingestion path:
  • Direct upload: up to 25 MB
  • Presigned upload: up to 100 MB
  • Effective cap: when the JWT sets maxInputBytesPerDocument, the lower of that value and the endpoint default; otherwise the endpoint default
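
The effective-cap rule can be checked client-side before uploading. This is a rough sketch under stated assumptions: the function name and the `"direct"` / `"presigned"` keys are hypothetical, and 1 MB is taken as 1024 × 1024 bytes, which this page does not specify.

```python
from typing import Optional

MB = 1024 * 1024  # assumption: binary megabytes; the page only says "MB"

# Endpoint defaults from the list above.
ENDPOINT_DEFAULT_BYTES = {
    "direct": 25 * MB,
    "presigned": 100 * MB,
}


def effective_cap(path: str, jwt_max_input_bytes: Optional[int] = None) -> int:
    """Return the upload cap in bytes for the given ingestion path.

    jwt_max_input_bytes stands in for the JWT's maxInputBytesPerDocument
    value; pass None when the token does not set it.
    """
    default = ENDPOINT_DEFAULT_BYTES[path]
    if jwt_max_input_bytes is None:
        return default
    return min(default, jwt_max_input_bytes)


print(effective_cap("direct") // MB)                # 25
print(effective_cap("presigned", 10 * MB) // MB)    # 10
print(effective_cap("direct", 50 * MB) // MB)       # 25
```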

CAD

Public JSON contract for CAD drawings.

GIS

Public JSON contract for geospatial outputs.

MailKit

Public JSON contract for mail and archive results.

MsOffice

Public JSON contract for Microsoft Office content.

OpenDoc

Public JSON contract for OpenDocument content.

Parquet

Public JSON contract for Parquet datasets.

PDF

Public JSON contract for PDF results.

SQL

Public JSON contract for SQL statement results.

What clients can rely on

  • The contract is public and stable for a given version.
  • Results are returned after processing completes.
  • The envelope stays client-facing and family-based.
  • Optional fields may be omitted when they do not apply.
  • Family-specific fields live in document or attributes, not in internal transport details.
  • Order stays deterministic within each family.
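
Because stability is promised per version, a client may want to check the version before reading a result. The sketch below assumes that `schema_version` is a dotted string whose first component is a major version and that minor changes stay compatible; neither assumption is stated on this page, and `SUPPORTED_MAJOR` and `is_supported_version` are hypothetical names.

```python
SUPPORTED_MAJOR = 1  # assumption: this client was written against "1.x"


def is_supported_version(schema_version: str) -> bool:
    """Accept any version whose leading component matches SUPPORTED_MAJOR."""
    major, _, _ = schema_version.partition(".")
    return major.isdigit() and int(major) == SUPPORTED_MAJOR


print(is_supported_version("1.0"))  # True
print(is_supported_version("2.0"))  # False
```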

Shared envelope

{
  "schema_version": "1.0",
  "document": {},
  "warnings": [],
  "content": []
}

Shared rules

  • schema_version identifies the public contract version.
  • document always appears.
  • warnings always appears.
  • content always appears for families that expose ordered content records.
  • attributes holds family-specific data.
  • source_ref carries provenance when the family exposes one.
  • No page in this section documents internal worker behavior or transport mechanics.
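
The rules above suggest a simple client-side check: require the keys that always appear, and tolerate a missing `content` for families that do not expose ordered records. The following is a minimal sketch using only the Python standard library; `read_envelope` and `REQUIRED_KEYS` are hypothetical names, and the check covers the envelope only, not family-specific fields.

```python
import json
from typing import Any

# Keys the shared rules describe as always present.
REQUIRED_KEYS = ("schema_version", "document", "warnings")


def read_envelope(raw: str) -> dict[str, Any]:
    """Parse a result and verify the shared envelope keys."""
    envelope = json.loads(raw)
    if not isinstance(envelope, dict):
        raise ValueError("envelope must be a JSON object")
    missing = [key for key in REQUIRED_KEYS if key not in envelope]
    if missing:
        raise ValueError(f"envelope is missing required keys: {missing}")
    # content only appears for families with ordered content records,
    # so default it to an empty list instead of failing.
    envelope.setdefault("content", [])
    return envelope


sample = '{"schema_version": "1.0", "document": {}, "warnings": [], "content": []}'
envelope = read_envelope(sample)
print(envelope["schema_version"], len(envelope["content"]))  # 1.0 0
```

Defaulting `content` to an empty list lets the same reading code handle every family; callers that need to distinguish "absent" from "empty" should check for the key before calling `setdefault`.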
Last modified on April 28, 2026