Documentation
These pages are the long-form manual for tito-pdf.
tito-pdf --helpis the CLI contract.- The docs explain what the flags mean, why they exist, and how the pipeline behaves.
Start here
Reference (by topic)
- CLI reference (every parameter)
- Output contract (explicit vs convenience)
- Assets JSON (schema + rationale)
How it works (internal pipeline)
- Design rationale (why multiple tools)
- Implementation details (thresholds + heuristics)
- Pipeline overview
- OCR (ocrmypdf + tesseract)
- Tables (strict vs lenient; audit JSON)