docimprint

use cases

Every document workflow, verifiable

DocImprint turns PDFs, images, and URLs into agent-ready output with tamper-evident evidence bundles. One API surface — from quick text extraction to full chain of custody.

What document workflows does DocImprint support?

DocImprint supports cited Q&A, claim-check, evidence bundle capture, invoice parsing, legal hold, URL monitoring, matter corpus search, and cross-document ask — all via POST /v1/extract with tamper-evident output.

Which industries use verifiable document memory?

Law firms and compliance teams use chain-of-custody bundles. Finance teams parse invoices with audit trails. Insurance adjusters verify claims against policy PDFs. Research teams cite papers with Merkle proofs.

How do use-case workflows connect to the API?

Every workflow maps to extract modes: bundle (default), claim-check, structured, compare, and monitor. Agents discover endpoints via OpenAPI and MCP at api.docimprint.com.

AI & agent grounding

Give your agent cited, tamper-evident memory of the documents it reads.

Cited document Q&A

Ask any question against a PDF or URL and get paragraph-cited answers. Agents can prove exactly what they read when decisions are audited.

mode: qa

Claim verification

Submit a list of assertions and get supported / contradicted / not_found verdicts with source references — no hallucination, every verdict backed by a quote.

mode: claim-check

Verifiable agent memory

Every extraction produces a signed evidence bundle — SHA-256 manifest, Merkle tree, screenshot. Provable chain of custody your agent can hand to an auditor.

store: true

Financial documents

Structured extraction for invoices, filings, and audit-trail workflows.

Invoice & receipt parsing

Extract merchant name, line items, subtotals, tax, and payment terms from any invoice or receipt format in a single API call.

mode: invoice

Custom schema extraction

Bring your own JSON schema and get cited, structured output from annual reports, 10-Ks, or any financial filing — fields are traced back to source paragraphs.

mode: extract-structured

Table extraction from PDFs

Spatial OCR preserves table layout and returns Markdown syntax your agents can reliably parse for reconciliation and compliance checks.

ocr_type: tables

Legal & compliance

Evidence preservation, chain of custody, and notarization for regulated workflows.

Evidence preservation

Capture any web page or document with a cryptographic timestamp. Tamper-evident bundles with optional on-chain notarization anchored to Base.

notarize: true

Legal hold & retention

Lock bundles against deletion during litigation with PUT /hold. Set retention expiry at capture time. Every access and action is audit-logged.

PUT /:id/hold

Due diligence at scale

Organize documents into Collections. Run cross-document semantic search and cited Q&A across entire matter corpora in one API call.

collections

Monitoring & comparison

Track document changes over time with cryptographically signed diffs.

Regulatory document monitoring

Detect when a policy, filing, or compliance page changes. Cron-based checks with webhook delivery any time a diff is detected.

monitor: diff

Document version comparison

Compare the current state of a document against a previous bundle. Signed diff with changed sections highlighted — useful for contract redlines.

mode: compare

Content authenticity

Verify bundle integrity at any time with the free verify endpoint — offline capable, no API key required, usable by any downstream consumer.

GET /:id/verify

who uses this

Built for high-stakes documents

Any workflow where you need to prove what a document said, when you read it, and that it hasn't changed.

Law & e-discovery
Finance & audit
Insurance claims
M&A due diligence
Journalism & fact-check
Research & academia
Regulatory compliance
Web archiving

deep dives

Detailed workflow guides

Step-by-step API walkthroughs for specific industries.