Image to Text (OCR)
Extract text from images using optical character recognition. Everything runs locally in your browser — your images never leave your device.
Privacy First
OCR runs entirely in your browser. Your images are never uploaded, stored, or sent to any server.
Tesseract — a mature engine that's reliable for clean, printed documents and scanned pages.
The model downloads once on first use, then is cached by your browser.
Drag and drop an image, or paste one from the clipboard with Ctrl+V
Supports PNG, JPG, WebP, GIF, BMP
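As a rough illustration of the format list above, here is a minimal sketch of a file-type check. The helper name `isSupportedImage` and the constant `SUPPORTED_IMAGE_TYPES` are hypothetical. This page does not describe how the tool actually validates uploads, so treat this as one plausible approach based on standard MIME types.

```typescript
// MIME types corresponding to the formats listed above (PNG, JPG, WebP, GIF, BMP).
// Assumption: the tool's real validation logic is not documented here.
const SUPPORTED_IMAGE_TYPES: ReadonlySet<string> = new Set([
  "image/png",
  "image/jpeg", // covers both .jpg and .jpeg files
  "image/webp",
  "image/gif",
  "image/bmp",
]);

// Hypothetical helper: true when the MIME type is one of the supported formats.
// Comparison ignores case and surrounding whitespace.
function isSupportedImage(mimeType: string): boolean {
  return SUPPORTED_IMAGE_TYPES.has(mimeType.trim().toLowerCase());
}
```

In a browser, a check like this would typically be applied to the `type` property of a dropped or pasted `File` before passing it to the OCR engine.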
About Image to Text (OCR)
Optical Character Recognition (OCR) reads the text inside an image and converts it into editable, selectable text you can copy elsewhere.
This tool processes everything on your device and never sends your images anywhere. Each language's model downloads once on first use, after which that language works offline.
Tesseract vs. PaddleOCR
- Tesseract is a long-established engine with 100+ languages. It's dependable on clean, high-contrast documents and scanned pages, and downloads a separate language pack per language.
- PaddleOCR (PP-OCRv5) is a modern deep-learning pipeline that detects and recognizes text in two stages. It's often more accurate on real-world photos, natural-scene text, and rotated or curved lines, and can use WebGPU for a 2–5× speedup when your browser supports it.
The default is Tesseract. Switch engines at the top and re-run the same image to compare results — try PaddleOCR when Tesseract struggles with a photo or slanted text.
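The WebGPU fallback mentioned for PaddleOCR can be sketched as a simple capability check. This is an assumption about how such a choice might be made, not the tool's actual code. `pickBackend`, `OcrBackend`, and `NavigatorLike` are hypothetical names; the strings `"webgpu"` and `"wasm"` mirror the execution provider names used by onnxruntime-web.

```typescript
// Backend names mirror onnxruntime-web execution providers.
type OcrBackend = "webgpu" | "wasm";

// Minimal shape of the browser's `navigator` object that this sketch needs.
interface NavigatorLike {
  gpu?: unknown;
}

// Hypothetical helper: prefer WebGPU when the browser exposes `navigator.gpu`,
// otherwise fall back to WebAssembly.
// Note: the presence of `navigator.gpu` does not guarantee a usable adapter;
// a fuller check would also await `navigator.gpu.requestAdapter()`.
function pickBackend(nav: NavigatorLike | undefined): OcrBackend {
  const hasWebGpu = nav !== undefined && nav.gpu !== undefined && nav.gpu !== null;
  return hasWebGpu ? "webgpu" : "wasm";
}
```

In a page script this would be called as `pickBackend(navigator)`. Taking the navigator as a parameter keeps the function easy to exercise outside a browser.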
Tips for best results:
- Use clear, high-contrast images with sharp text
- Pick the language that matches the text in the image
- Crop out noisy backgrounds before scanning
- Straighten skewed or rotated photos when possible
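To make the "high-contrast" tip concrete, the sketch below binarizes RGBA pixel data, turning each pixel pure black or pure white. It is an illustration of one common preprocessing step, not something this tool is stated to do. The function name `binarize` and the default threshold of 128 are assumptions; the luminance weights are the standard Rec. 601 coefficients.

```typescript
// Hypothetical preprocessing helper: convert RGBA pixels to pure black or white.
// `rgba` holds 4 bytes per pixel (R, G, B, A), as in a canvas ImageData buffer.
function binarize(rgba: Uint8ClampedArray, threshold: number = 128): Uint8ClampedArray {
  const out = new Uint8ClampedArray(rgba.length);
  for (let i = 0; i + 3 < rgba.length; i += 4) {
    // Rec. 601 luminance approximation of perceived brightness.
    const luma = 0.299 * rgba[i] + 0.587 * rgba[i + 1] + 0.114 * rgba[i + 2];
    const value = luma >= threshold ? 255 : 0;
    out[i] = value;         // red
    out[i + 1] = value;     // green
    out[i + 2] = value;     // blue
    out[i + 3] = rgba[i + 3]; // keep the original alpha
  }
  return out;
}
```

A fixed threshold works best on evenly lit scans. Photos with shadows or gradients usually need an adaptive threshold instead, which is beyond this sketch.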
Powered by tesseract.js and ppu-paddle-ocr (PaddleOCR via onnxruntime-web). Both run entirely in the browser on WebAssembly, and PaddleOCR can additionally use WebGPU when your browser supports it.