OCRopus / OCRopy Python-based OCR package using recurrent neural networks.

OCRopus is really a collection of document analysis programs, not a turn-key OCR system.

In addition to the recognition scripts themselves, there are a number of scripts for ground truth editing and correction, measuring error rates, determining confusion matrices, etc. OCRopus commands will generally print a stack trace along with an error message; this is not generally indicative of a problem (in a future release, we'll suppress the stack trace by default since it seems to confuse too many users).

To recognize pages of text, you need to run separate commands: binarization, page layout analysis, and text line recognition.

The OCRopus OCR system is hosted at: https://github.com/tmbdev/ocropy

Showing 1-20 of 840 topics
ocropus-ctrain not working Raj Julha 3/1/18
Does a sample that recognize the Chinese Text and accuracy above the tesseract? Lofty Cool 6/12/17
run ocr prediction using predefined ROIs Youcef 6/7/17
What feature extraction method is used in OCROPUS? farzaneh tabatabaee 4/14/17
Training ocropus for handwritten text Subhodeep Maji 3/2/17
State of handwriting recognizer Jussi Pakkanen 2/28/17
How to OCR a Multipage TIFF? Pedro Correia 2/22/17
avoid columns segmentation fabio forno 12/23/16
Segmenting line to characters khadija EL Gajoui 10/12/16
loading a model GilGui 7/8/16
OCRopus / ocropy updates Tom 6/10/16
Web GUI for Ocropus Patricia 5/9/16
OCRopus for Mac OSX, and a very simple GUI Michael Moore 5/7/16
compiling ocropus under windows with visual studio 2008 mrbigman 4/21/16
Trying to understand implementation details of lstm.py sudeep raja putta 2/19/16
can i recognize Chinese by ocropus ? yang 1/29/16
Help with installing Philip Gwyn 1/28/16
nginx + lua + ocropus romajke 12/4/15
ocropus recognizers khadija EL Gajoui 11/9/15
Why is Tesseract so much more popular than Ocropus? maxim...@gmail.com 10/23/15
More topics »