OCRopus / OCRopy Python-based OCR package using recurrent neural networks.

OCRopus is really a collection of document analysis programs, not a turn-key OCR system.

In addition to the recognition scripts themselves, there are a number of scripts for ground truth editing and correction, measuring error rates, determining confusion matrices, etc. OCRopus commands will generally print a stack trace along with an error message; this is not generally indicative of a problem (in a future release, we'll suppress the stack trace by default since it seems to confuse too many users).

To recognize pages of text, you need to run separate commands: binarization, page layout analysis, and text line recognition.

The OCRopus OCR system is hosted at: https://github.com/tmbdev/ocropy

Showing 21-38 of 826 topics
caracter train with ocropus khadija gajoui 6/5/14
Solution to Problems With python setup.py download_models m00tpoint 5/7/14
IOError: Not a gzipped file ./models/uw3unlv.pyrnn.gz Jacob Reese 5/7/14
Best way to get lines from files Nikolai Lusan 4/2/14
ocropus in windows nandhu...@gmail.com 3/7/14
How fast is Ocropus? Aaron Handford 3/5/14
Re: [ocropus] Digest for ocr...@googlegroups.com - 1 Message in 1 Topic Adnan ul Hasan 2/27/14
How to use ocropus-rtrain Wesley Willians 2/26/14
Ocropus iOS development Nick Porter 2/6/14
line generation with llpy Mehdi Ghanimifard 2/4/14
Speeding up Ocropus in Ubuntu. johneri...@gmail.com 1/17/14
ocropus doesn't detect word breaks correctly in Fraktur text wasi99 1/14/14
Can ocropus give coordinates of bounding rectangle of scanned page? jimfunderburk 12/11/13
Create searchable pdfs Harald Heigl 12/10/13
ocropus for Windows 7 talha qaiser 11/21/13
Ocropus 0.6 Lattice Size MattJ 10/23/13
Stylized layout and "language" ken...@gmail.com 10/18/13
Tesseract installation - leptonica not found mara con 10/17/13
More topics »