ocropus

OCRopus / OCRopy Python-based OCR package using recurrent neural networks.

OCRopus is really a collection of document analysis programs, not a turn-key OCR system.

In addition to the recognition scripts themselves, there are a number of scripts for ground truth editing and correction, measuring error rates, determining confusion matrices, etc. OCRopus commands will generally print a stack trace along with an error message; this is not generally indicative of a problem (in a future release, we'll suppress the stack trace by default since it seems to confuse too many users).

To recognize pages of text, you need to run separate commands: binarization, page layout analysis, and text line recognition.

The OCRopus OCR system is hosted at: https://github.com/tmbdev/ocropy






Showing 21-40 of 840 topics
Demo code available? jbest 9/8/15
No Output after ocroscript recognize Debayan 9/8/15
rtrain - input file selected at random? Ankit Agarwal 8/28/15
How to avoid segmenting pages into columns? Gyula Sámuel Karli 7/30/15
Border Noise Removal Everest 6/22/15
Training error when using ocropus-rtrain Faida 6/17/15
Book layout element recognition Christoph Holtermann 6/5/15
Network in details 贺盼 3/24/15
implementation details of ctc in lstm.py Ajinkya Kulkarni 3/9/15
ocropus-rpred compile error in python2.7 on windows Sen.T 2/28/15
clstm python setup.py install errors 贺盼 1/14/15
Blog post on running & training Ocropus Dan Vanderkam 1/12/15
Had this project been closed? why i could not get the correct source code from the moved website? libin sui 12/17/14
how to develop models of default instead of creating new models? Andrzej 11/20/14
Recognition of Polish words. Models with Polish characters or Training Ocropus 0.7 for Polish Andrzej 11/20/14
How to Read Prepared/Generated Forms (known layout) and Obtain Check-Box Data kingIZZZY 11/4/14
Re: Tesseract vs OCRopus Tom Morris 10/24/14
Is there a way to get the paragraph coordinates? Amanda García Pérez 9/4/14
Please suggest a way to output the match_score of each word in a given Document. rahul reddy 6/14/14
hOCR to PDF converter Florian Hackenberger 6/5/14
More topics »