The PDF to text API that handles real-world documents.
Convert PDFs to clean text with one HTTP call. Scanned, multi-column, or faxed pages all return the same typed JSON. No templates to set up, no model to fine-tune.
Get started today for free. No credit card required.
POST a document, get back page-by-page text.
OCRWell processes PDFs asynchronously and returns typed JSON. Text mode gives you pages[] with raw text per page, ready to index, search, or pipe straight into an LLM.
# Submit a PDF for text extraction
$ curl https://api.ocrwell.com/v1/documents \
    -H "X-API-Key: $OCRWELL_KEY" \
    -d '{"upload_id":"upl_01H7","mode":"text"}'

{
  "job": {
    "id": "019539a6-6c3d-7e5f",
    "status": "completed",
    "mode": "text",
    "page_count": 3
  },
  "result": {
    "pages": [
      { "page": 1, "text": "INVOICE 4172..." },
      { "page": 2, "text": "Line items..." },
      { "page": 3, "text": "Totals..." }
    ]
  }
}

Where teams use the PDF to text API.
Index a back-catalogue
Run thousands of PDFs through the same endpoint. Idempotency keys and per-organisation rate limits keep bulk backfills safe alongside live traffic.
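For a backfill, one way to make retries safe is to derive the idempotency key from the upload itself, so a resubmitted document always carries the same key. The sketch below only builds the requests and does not send them. The endpoint, `X-API-Key` header, and body fields come from the curl example on this page; the `Idempotency-Key` header name, the JSON `Content-Type`, and the helper names are assumptions for illustration, so check the API reference for the actual header.

```python
import json
import urllib.request
import uuid

API_URL = "https://api.ocrwell.com/v1/documents"  # from the curl example
IDEMPOTENCY_HEADER = "Idempotency-Key"  # assumed header name, not confirmed


def idempotency_key(upload_id: str, mode: str) -> str:
    """Derive a stable key so a retried submission reuses the same value."""
    return str(uuid.uuid5(uuid.NAMESPACE_URL, f"{API_URL}#{upload_id}:{mode}"))


def build_request(api_key: str, upload_id: str, mode: str = "text") -> urllib.request.Request:
    """Build (but do not send) a POST for one previously uploaded PDF."""
    body = json.dumps({"upload_id": upload_id, "mode": mode}).encode("utf-8")
    return urllib.request.Request(
        API_URL,
        data=body,
        method="POST",
        headers={
            "X-API-Key": api_key,
            "Content-Type": "application/json",  # assumed: body is sent as JSON
            IDEMPOTENCY_HEADER: idempotency_key(upload_id, mode),
        },
    )


def build_backfill(api_key: str, upload_ids: list[str]) -> list[urllib.request.Request]:
    """One request per upload; duplicates in the input produce identical keys."""
    return [build_request(api_key, upload_id) for upload_id in upload_ids]
```

Each request can then be passed to `urllib.request.urlopen` at whatever pace your rate limit allows. Because the key depends only on the upload ID and mode, rerunning the whole backfill after a crash repeats the same keys instead of minting new ones.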
Feed LLMs with text, not pixels
Sending extracted text to Opus 4.7 costs a fraction of sending the raw PDF image. This cuts frontier-model input by up to 80% on document-heavy pipelines.
Scanned and faxed pages, too
Image-based PDFs, two-column reports, and faxed scans return the same shape as digital-native PDFs. No branching in your code.
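Because every document comes back in one shape, a single parser can cover scanned and digital-native PDFs alike. Here is a minimal sketch, assuming the response matches the sample JSON on this page (`job.status`, `result.pages[].page`, `result.pages[].text`). The `Page` type and the function names are illustrative, not part of any OCRWell SDK, and the form-feed page separator is just one convention.

```python
from dataclasses import dataclass
from typing import Any


@dataclass(frozen=True)
class Page:
    number: int
    text: str


def parse_pages(response: dict[str, Any]) -> list[Page]:
    """Turn a text-mode response into Page records ordered by page number."""
    job = response["job"]
    if job["status"] != "completed":
        # Any other status means the text is not ready to read yet.
        raise ValueError(f"job {job['id']} is not finished: {job['status']}")
    pages = [Page(p["page"], p["text"]) for p in response["result"]["pages"]]
    return sorted(pages, key=lambda p: p.number)


def join_text(pages: list[Page]) -> str:
    """Concatenate page text, separating pages with a form feed character."""
    return "\f".join(p.text for p in pages)
```

The joined string can go straight into a search index or an LLM prompt, while the `Page` list keeps page numbers available for citations.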
Start free. Pay only when you scale.
- No credit card required.
- Hard cap, no overage charges.
- Paid plans from $20/mo when you grow.
Turn your PDFs into text today.
Generate an API key, send your first document, and have page-by-page text back in seconds. The free-forever tier covers 200 OCR pages per month.