tito-PDF Documentation

CLI reference

The installed tito-pdf --help output is the source of truth for flags.

tito-pdf --help

This page explains every parameter, including interactions that are hard to express in --help.

Metavars

In --help, placeholders mean:

Positional argument

input_path

Path to a .pdf or .docx.

Rules:

Version

-V, --version

Prints a single-line version header and exits.

Mode (high level)

--mode {fast,robust,best}

A convenience knob that maps to lower-level OCR/tables behavior.

Important: explicit flags win over --mode:

Output selection: explicit vs convenience

tito-pdf has two output styles:

1) Explicit output mode. Triggered when any explicit output path is set:

In explicit output mode:

2) Convenience mode. Used when no explicit output paths are set.

In convenience mode (TITO-aligned folder structure):

Output paths (explicit)

--md-out PATH

Write primary Markdown output to PATH.

Notes:

--raw-text-out PATH

Write extracted plaintext (UTF-8) to PATH.

Why it exists:

--tables-out PATH

Write extracted tables as Markdown to PATH.

Notes:

--tables-audit-out PATH

Write a JSON audit describing accepted tables.

Rules:

--assets-json PATH

Write a JSON payload with runtime metadata and metrics.

Important:

See: Assets JSON.
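The rule from "Output selection" (explicit output mode applies as soon as any explicit output path is set) can be sketched as a small Python helper for scripts that wrap tito-pdf. This is a minimal sketch, not tito-pdf's own logic: `output_style` and `build_argv` are hypothetical helpers, and it assumes that each of the five flags listed in this section counts as an explicit output path.

```python
from typing import List, Mapping, Optional

# Flags listed under "Output paths (explicit)" on this page.
# Assumption: each of them counts as an "explicit output path".
EXPLICIT_OUTPUT_FLAGS = (
    "--md-out",
    "--raw-text-out",
    "--tables-out",
    "--tables-audit-out",
    "--assets-json",
)


def output_style(flags: Mapping[str, Optional[str]]) -> str:
    """Return "explicit" if any explicit output path is set, else "convenience"."""
    for name in EXPLICIT_OUTPUT_FLAGS:
        if flags.get(name):
            return "explicit"
    return "convenience"


def build_argv(input_path: str, flags: Mapping[str, Optional[str]]) -> List[str]:
    """Build a tito-pdf command line (hypothetical helper, not part of tito-pdf)."""
    argv = ["tito-pdf", input_path]
    for name, value in flags.items():
        if value:  # skip flags left unset (None or empty string)
            argv += [name, value]
    return argv


flags = {"--md-out": "out/report.md", "--tables-out": None}
print(output_style(flags))
print(build_argv("report.pdf", flags))
```

The resulting list can be passed to `subprocess.run` as is. Run `tito-pdf --help` first to confirm the flags available in the installed version.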

Convenience directory

--out-dir DIR

Base directory for deliverables.

--id ID

Identifier for output filenames.

--keep-sessions

Preserve intermediate files in sessions/run-YYYYMMDD_HHMMSS/.
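For scripts that need to locate or predict a preserved session folder, the `run-YYYYMMDD_HHMMSS` pattern corresponds to the `strftime` format shown below. This is a sketch under assumptions: the helper names are hypothetical, the `base` directory that contains `sessions/` is not specified on this page, and whether tito-pdf stamps the folder in local time or UTC is not stated here.

```python
from datetime import datetime
from pathlib import Path


def session_dir_name(now: datetime) -> str:
    """Format a timestamp as run-YYYYMMDD_HHMMSS."""
    return now.strftime("run-%Y%m%d_%H%M%S")


def session_dir(base: Path, now: datetime) -> Path:
    """Join a base directory (assumption: caller knows it) with sessions/run-.../."""
    return base / "sessions" / session_dir_name(now)


stamp = datetime(2024, 3, 5, 14, 7, 9)
print(session_dir_name(stamp))                      # run-20240305_140709
print(session_dir(Path("out"), stamp).as_posix())   # out/sessions/run-20240305_140709
```

Because the timestamp fields are zero-padded and ordered from year to second, sorting the folder names alphabetically also sorts them chronologically.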

Convenience toggles

These toggles only matter in convenience mode.

--text

Write Markdown output.

--tables

Write tables Markdown output (<stem>.tables.md).

--all

Write both Markdown and tables Markdown.
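The relationship between the three toggles can be sketched as follows: `--all` behaves like `--text` and `--tables` together. The function name and the output labels are hypothetical, and the sketch does not model what tito-pdf writes when no toggle is given, since that default is not described in this section.

```python
from typing import Set


def requested_outputs(text: bool = False, tables: bool = False, all_: bool = False) -> Set[str]:
    """Map the convenience toggles to the set of outputs they request."""
    outputs: Set[str] = set()
    if text or all_:
        outputs.add("markdown")          # --text
    if tables or all_:
        outputs.add("tables-markdown")   # --tables
    return outputs


print(sorted(requested_outputs(all_=True)))
print(sorted(requested_outputs(tables=True)))
```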

Tables behavior

--tables-lenient

Enable text-based table detection (higher recall, more false positives).

Notes:

See: Tables.

OCR behavior

--no-ocr

Disable the OCR stage.

--force-ocr

Force OCR even if the PDF already has a text layer.

Notes:

See: OCR.

Debug

--max-pages N

Limit the number of pages processed to N. Useful for debugging performance and false positives.
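A typical debugging run combines `--max-pages` with `--keep-sessions` so that the intermediate files for the first few pages can be inspected. The helper below is a hypothetical sketch that only assembles the command line from flags documented on this page; it does not run tito-pdf and does not validate N, because the accepted range is not stated here.

```python
from typing import List


def debug_argv(input_path: str, max_pages: int, keep_sessions: bool = True) -> List[str]:
    """Assemble a tito-pdf debug invocation (hypothetical helper)."""
    argv = ["tito-pdf", input_path, "--max-pages", str(max_pages)]
    if keep_sessions:
        argv.append("--keep-sessions")  # preserve intermediate files for inspection
    return argv


print(debug_argv("report.pdf", 3))
```

Pass the returned list to `subprocess.run` to execute it.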

Exit codes