OCRopus / OCRopy Python-based OCR package using recurrent neural networks.

OCRopus is really a collection of document analysis programs, not a turn-key OCR system.

In addition to the recognition scripts themselves, there are a number of scripts for ground truth editing and correction, measuring error rates, determining confusion matrices, etc. OCRopus commands will generally print a stack trace along with an error message; this is not generally indicative of a problem (in a future release, we'll suppress the stack trace by default since it seems to confuse too many users).

To recognize pages of text, you need to run separate commands: binarization, page layout analysis, and text line recognition.

The OCRopus OCR system is hosted at: https://github.com/tmbdev/ocropy

Showing 1-20 of 833 topics
compiling ocropus under windows with visual studio 2008 mrbigman 4/21/16
Web GUI for Ocropus Patricia 4/21/16
Trying to understand implementation details of lstm.py sudeep raja putta 2/19/16
can i recognize Chinese by ocropus ? yang 1/29/16
Help with installing Philip Gwyn 1/28/16
nginx + lua + ocropus romajke 12/4/15
ocropus recognizers khadija gajoui 11/9/15
avoid columns segmentation fabio forno 10/23/15
Why is Tesseract so much more popular than Ocropus? maxim...@gmail.com 10/23/15
OCRopus 0.6: possibility for command-line use only & for operating with Tesseract Martin Reynaert 9/8/15
Demo code available? jbest 9/8/15
No Output after ocroscript recognize Debayan 9/8/15
rtrain - input file selected at random? Ankit Agarwal 8/28/15
How to avoid segmenting pages into columns? Gyula Sámuel Karli 7/30/15
OCRopus / ocropy updates Tom 7/8/15
Border Noise Removal Everest 6/22/15
Training error when using ocropus-rtrain Faida 6/17/15
Book layout element recognition Christoph Holtermann 6/5/15
Network in details 贺盼 3/24/15
implementation details of ctc in lstm.py Ajinkya Kulkarni 3/9/15
More topics »