Document APIs for agents and humans

Cloud APIs for extracting, generating, converting, validating, and reviewing high-stakes documents — with structured output agents can use and humans can trust.

Used by Lufthansa, Disney, Autodesk, UBS, Dropbox, IBM
Lufthansa
Disney
Autodesk
UBS
Dropbox
IBM

Use case finder

What do you want to do with your documents?

Extract data

curl -X POST https://api.nutrient.io/build \
-H "Authorization: Bearer your_api_key_here" \
-o result.json \
--fail \
-F invoice=@invoice.webp \
-F instructions='{
"parts": [
{
"file": "invoice"
}
],
"output": {
"type": "json-content",
"keyValuePairs": true
}
}'

Turn PDFs, scans, forms, invoices, contracts, and Office files into typed JSON or LLM-ready Markdown. Preserve tables, key-value pairs, line items, coordinates, and confidence signals so downstream agents can validate before acting.

Explore data extraction

Generate PDFs

curl -X POST https://api.nutrient.io/processor/generate_pdf \
-H "Authorization: Bearer your_api_key_here" \
-o result.pdf \
--fail \
-F html=@index.html

Create PDFs from HTML, templates, Markdown, or structured data. Add headers, footers, dynamic content, and accessibility-ready output for reports, agreements, statements, and customer-facing documents.

Explore PDF generation

Convert documents

curl -X POST https://api.nutrient.io/processor/convert_to_pdf \
-H "Content-Type: application/vnd.openxmlformats-officedocument.wordprocessingml.document" \
-H "Authorization: Bearer your_api_key_here" \
-o result.pdf \
--fail \
--data-binary @document.docx

Convert between PDF, Word, Excel, PowerPoint, HTML, images, and Markdown while preserving layout and structure. Prepare documents for review, storage, automation, or AI ingestion.

Explore conversion

OCR scanned files

curl -X POST https://api.nutrient.io/processor/ocr \
-H "Authorization: Bearer your_api_key_here" \
-o result.pdf \
--fail \
-F file=@document.pdf \
-F data='{
"language": "english"
}'

Make scans, images, and photographed pages searchable, extractable, and ready for automation. Use OCR as the first step in document intake, classification, extraction, and review workflows.

Explore OCR

View documents in my app

curl -X POST https://api.nutrient.io/processor/generate_pdf \
-H "Authorization: Bearer your_api_key_here" \
-o result.pdf \
--fail \
-F html=@index.html

Embed a production-ready document viewer for human review. Let users view, annotate, fill forms, compare, collaborate, and approve documents inside your product.

Explore document viewer

Sign documents

curl -X POST https://api.nutrient.io/sign \
-H "Authorization: Bearer your_api_key_here" \
-o result.pdf \
--fail \
-F file=@document.pdf \
-F data='{
"signatureType": "cades",
"cadesLevel": "b-lt"
}'

Add signing and approval flows to generated or uploaded documents. Support governed completion with audit trails and downstream workflow handoff.

Explore eSignatures

Redact sensitive data

curl -X POST https://api.nutrient.io/processor/redact \
-H "Authorization: Bearer your_api_key_here" \
-o result.pdf \
--fail \
-F file=@document.pdf \
-F data='{
"strategy": "text",
"strategyOptions": {
"text": "macaque",
"caseSensitive": false
},
"redactionState": "apply"
}'

Find and remove sensitive information such as names, emails, SSNs, account numbers, and custom patterns. Build secure, audit-ready workflows for regulated document handling.

Explore redaction

Validate compliance

curl -X POST https://api.nutrient.io/validate_pdfa \
-H "Authorization: Bearer your_api_key_here" \
-o result.json \
--fail \
-F file=@pdfa.pdf

Check PDF/A and PDF/UA requirements, accessibility readiness, archival formats, and compliance-sensitive outputs before documents move forward.

Explore compliance

Built for the workflows you’re actually building

Invoice processing
OCR Extract Check Post

Extract line items, totals, VAT, dates, vendors, and payment terms into ERP-ready JSON.

Contract automation
Generate Review Sign Archive

Generate agreements from templates, extract key terms, compare versions, collect signatures, and archive final documents.

Document AI and RAG
Parse Structure Enrich Retrieve

Convert document collections into clean Markdown and structured data for semantic embedding and agentic workflows.

Document viewing
View Annotate Collaborate Approve

Add document review, annotation, form filling, collaboration, and approval workflows directly inside your product.

Compliance workflows
Check Redact Audit Archive

Validate PDF/A and PDF/UA, redact sensitive information, preserve audit trails, and prepare compliant documents for regulated storage or review.

Healthcare documents
Classify Extract Redact Review

Classify patient forms, referrals, lab reports, and prescriptions into structured data; redact sensitive information; and route documents for review.

Construction drawing
Convert View Annotate Compare

Convert, view, annotate, and compare construction drawings, patents, and architectural plans aligned with structured review and markup workflows.

Customer portals
Upload View Fill Sign

Let customers upload, view, annotate, sign, and download documents in a secure, branded portal.


Cloud APIs for deterministic document workflows

Infrastructure

Document tools for agents

Give agents reliable document capabilities: extraction, conversion, OCR, generation, redaction, validation, and Markdown output for downstream automation.

Trust and audit

High-stakes compliance

Use structured responses, confidence signals, audit-friendly processing, and compliance-ready outputs where mistakes are costly.

Interface

Human review when it matters

Embed viewing, annotation, signing, comparison, and approval experiences so people can review and control important document decisions.

Showcase

Built with Nutrient

See all customer stories
Scaled legal document workflows 50 percent month over month

Replaced a patchwork of open source PDF tooling with Nutrient’s Web SDK and Document Engine — absorbing Harvey’s growth without adding internal infrastructure.

FULL STORY
Scaled to 50,000 students in a single day

Subject scaled 5× with Nutrient’s PDF SDK, handling peak usage without rebuilding its document layer or managing additional infrastructure.

FULL STORY
Powering seamless eSignatures at enterprise scale

DocuSign uses Nutrient to streamline eSignature workflows, enhance performance, and ensure a reliable and customizable document experience for millions of users.

FULL STORY

Frequently asked questions

How does pricing work?

Pricing depends on the API you use. Processor, Viewer, Accessibility, Data Extraction, signing, and enterprise deployments may have different pricing models, limits, and evaluation paths. Use the pricing comparison above to choose the right starting point. Then view the detailed pricing page for current plan details.

Is the API SOC 2 compliant? HIPAA?

Yes — It’s SOC 2 Type 2-audited, HIPAA-eligible, and GDPR-compliant. All API traffic is TLS-encrypted. Uploaded files are processed and discarded, not retained. Hosted in US and EU regions, with on-premises and dedicated deployments available.

Can Nutrient support agentic document workflows?

Yes. Nutrient APIs help agents and applications work with documents in predictable ways: extracting structured data, converting files, generating documents, OCRing scans, redacting sensitive content, validating compliance, and preparing documents for human review. Responses can include structured data, Markdown, coordinates, and confidence signals so downstream systems can check outputs before acting.

What file formats do Nutrient APIs support?

Nutrient APIs support common document formats, including PDFs, images, Microsoft Office files, HTML, Markdown, and more. Supported formats vary by API, so check the relevant API documentation for exact input and output options.

Build your first deterministic document workflow today

Get your API key. Make your first call in 60 seconds. Scale when you’re ready.