The image to text API built for messy real-world photos.
Skewed angles, mixed lighting, low resolution, busy backgrounds. JPG, PNG, TIFF, WebP, and BMP all return the same typed JSON. No client-side preprocessing, no per-format branches in your code.
Get started today for free. No credit card required.
POST an image. Get text back.
The API handles upload, processing, and retrieval through a single async flow. Submit a phone photo, scanned page, or screenshot. Poll for the job or receive an HMAC-signed webhook.
# Send a phone photo, get text back
$ curl https://api.ocrwell.com/v1/documents \
-H "X-API-Key: $OCRWELL_KEY" \
-d '{"upload_id":"upl_01H7","mode":"text"}'{
"job": {
"id": "019539a6-6c3d-7e5f",
"status": "completed",
"mode": "text",
"page_count": 1
},
"result": {
"pages": [
{ "page": 1, "text": "MELBOURNE MUSEUM\nOpen daily 10am to 5pm..." }
]
}
}Where teams use the image to text API.
Real photos, not just scans
Phone photos taken at angles, with shadows, in low light, or against busy backgrounds. The same endpoint that handles a flatbed scan handles a snapshot from a queue.
Every common format
JPEG, PNG, TIFF, WebP, and BMP. Up to 20 MB per file. No client-side conversion step required, no separate endpoints per format.
Screenshots and signage too
Browser screenshots for accessibility tools, photos of signs and menus, product packaging, whiteboards. Returns the same typed JSON shape as any other source.
Start free. Pay only when you scale.
- No credit card required.
- Hard cap, no overage charges.
- Paid plans from $20/mo when you grow.
Turn images into text today.
Generate an API key, upload your first photo, and have clean text back in seconds. Free forever tier covers 200 OCR pages per month.