tito-PDF Documentation

FAQ

Why does --out-dir sometimes do nothing?

Because you are in explicit output mode.

If you set any explicit output path (like --md-out or --tables-out), tito-pdf writes only to those paths and ignores --out-dir.

See: Output contract.
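The precedence described above can be sketched in a few lines of Python. This is a hypothetical model, not tito-pdf's actual implementation: the function names, the dictionary layout, and the mode label "out-dir" are assumptions made for illustration, and only the two explicit-path flags named in this answer are modeled.

```python
from typing import Dict, Optional

# Explicit output flags named in this FAQ. The real list may be longer
# (assumption: only these two are modeled here).
EXPLICIT_PATH_FLAGS = ("--md-out", "--tables-out")


def resolve_output_mode(args: Dict[str, Optional[str]]) -> str:
    """Return "explicit" if any explicit output path is set, else "out-dir"."""
    if any(args.get(flag) for flag in EXPLICIT_PATH_FLAGS):
        return "explicit"
    return "out-dir"


def effective_targets(args: Dict[str, Optional[str]]) -> Dict[str, str]:
    """Return the flags (and paths) that would actually be written to."""
    if resolve_output_mode(args) == "explicit":
        # In explicit mode --out-dir is ignored, even if it was passed.
        return {f: p for f, p in args.items() if f in EXPLICIT_PATH_FLAGS and p}
    out_dir = args.get("--out-dir")
    return {"--out-dir": out_dir} if out_dir else {}
```

With both `--out-dir` and `--md-out` set, `effective_targets` returns only the `--md-out` entry, which mirrors the "does nothing" behavior asked about here.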

Why does --tables-audit-out require --tables-out?

Because the audit is defined as a companion to the Markdown tables output.

It is a contract rule: --tables-audit-out is valid only when --tables-out is also set.
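As a hedged illustration, a companion rule of this kind can be checked before any extraction work starts. The sketch below is hypothetical: the function name `check_audit_companion`, the error text, and the placeholder paths are not taken from tito-pdf. It encodes only the one dependency this answer describes.

```python
from typing import Optional


def check_audit_companion(tables_out: Optional[str],
                          tables_audit_out: Optional[str]) -> None:
    """Raise ValueError if the audit output is requested without the tables output."""
    if tables_audit_out and not tables_out:
        raise ValueError("--tables-audit-out requires --tables-out")


# Valid: the tables output and its audit companion are requested together.
# Both paths are placeholders chosen for this example.
check_audit_companion("out/tables.md", "out/tables_audit.out")
```

Failing fast on the flag combination keeps the error independent of the input PDF, so the same command line always succeeds or fails the same way.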

Why can’t I request only --assets-json?

--assets-json is treated as a companion output. In explicit output mode you must also request at least one “content” output:
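The same idea can be sketched for --assets-json. This is a minimal, hypothetical check, not tito-pdf's code. Which flags count as "content" outputs is not assumed here, so the sketch takes them as a parameter. Treating --md-out as a content output in the usage lines is an assumption made only for illustration.

```python
from typing import Iterable, Set


def check_assets_json(requested: Set[str], content_flags: Iterable[str]) -> None:
    """Raise ValueError if --assets-json is requested with no content output."""
    if "--assets-json" in requested and not requested.intersection(content_flags):
        raise ValueError("--assets-json requires at least one content output")


# Assumption for illustration: --md-out counts as a content output.
ASSUMED_CONTENT_FLAGS = {"--md-out"}

# Accepted: a content output accompanies --assets-json.
check_assets_json({"--assets-json", "--md-out"}, ASSUMED_CONTENT_FLAGS)
```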

Rationale:

Why do outputs differ across machines?

tito-pdf is deterministic (no network, no randomness), but results can still differ because:

If you need forensic metadata, enable --assets-json.

Why is OCR enabled in robust mode?

Because robust mode tries to maximize extraction quality when nothing is known about the PDF in advance.

If you want speed and your PDFs have a good text layer, use:

tito-pdf input.pdf --mode fast --md-out out/input.md

Why are there no sessions/ folders?

This is deliberate: the absence of sessions/ folders is an explicit no-regress contract.

tito-pdf runs in a temporary working directory and deletes intermediates by default.

Where should I report bugs or request features?

Use GitHub issues: