files.co

OCR PDF — extract text

Extract searchable text from scanned PDFs. Runs locally with Tesseract.

100% in-browser · 0 uploads

Frequently asked questions

Is this OCR tool free?

Yes. It is completely free, with no usage limits and no account required.

Will my PDF be uploaded to a server?

No. Tesseract runs in your browser via WebAssembly — your PDF never leaves your device.

Does the first use require an internet connection?

Yes. The first time you select a language, we download about 12 MB of trained data from the Tesseract CDN. After that, OCR works fully offline.
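The download-once behaviour can be pictured as a check against a local cache. This is a minimal sketch only: the function name `languagesToDownload` and the idea of tracking cached codes in an array are assumptions for illustration, not a description of how this tool stores its data.

```typescript
// Hypothetical helper: given the language codes a user selected and the
// codes whose trained data is already stored locally, return the codes
// that still need a network download. An empty result means OCR can run
// without an internet connection.
function languagesToDownload(requested: string[], cached: string[]): string[] {
  return requested.filter((code) => cached.indexOf(code) === -1);
}
```

For example, with `["eng"]` already cached, requesting `["eng", "spa"]` leaves only `["spa"]` to fetch.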

Which languages are supported?

English, Spanish, French, German, Italian, Portuguese, Dutch, Korean and Japanese. For multilingual documents, join two language codes with a "+" (e.g. eng+spa).
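As an illustration of the "+" syntax, the sketch below maps the listed languages to their standard Tesseract codes and joins a selection into a single string. The helper name `buildLanguageString` and the error messages are hypothetical; only the codes themselves and the two-language limit come from Tesseract and the answer above.

```typescript
// Standard Tesseract codes for the languages listed above.
const LANGUAGE_CODES: Record<string, string> = {
  English: "eng",
  Spanish: "spa",
  French: "fra",
  German: "deu",
  Italian: "ita",
  Portuguese: "por",
  Dutch: "nld",
  Korean: "kor",
  Japanese: "jpn",
};

// Hypothetical helper: turn one or two language names into a
// "+"-joined code string such as "eng+spa".
function buildLanguageString(names: string[]): string {
  if (names.length === 0 || names.length > 2) {
    throw new Error("Select one or two languages");
  }
  const codes = names.map((name) => {
    const code = LANGUAGE_CODES[name];
    if (code === undefined) {
      throw new Error("Unsupported language: " + name);
    }
    return code;
  });
  // Drop a repeated selection so "English" twice yields "eng", not "eng+eng".
  const unique = codes.filter((code, i) => codes.indexOf(code) === i);
  return unique.join("+");
}
```

Calling `buildLanguageString(["English", "Spanish"])` gives `"eng+spa"`, the form shown in the example above.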

What format is the output?

A plain .txt file containing the extracted text of each page. PDF/A export with a searchable text layer is coming in a future update.
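One way a per-page text file could be assembled is sketched below. The page separator (`--- Page N ---`) and the function name `formatPagesAsText` are assumptions for illustration; the exact layout of the tool's .txt output is not specified here.

```typescript
// Hypothetical layout: each page's text under a numbered separator line,
// with a blank line between pages and a trailing newline at the end.
function formatPagesAsText(pages: string[]): string {
  const sections = pages.map(
    (text, index) => "--- Page " + (index + 1) + " ---\n" + text.trim()
  );
  return sections.join("\n\n") + "\n";
}
```

Trimming each page keeps stray whitespace from the OCR step out of the file while leaving line breaks inside a page untouched.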