Scan different areas of documents

58 views
Skip to first unread message

clickonl...@gmail.com

unread,
Apr 21, 2017, 4:05:08 AM4/21/17
to tesseract-ocr
Hello, I need to read some areas of different PDF documents. How could I do this? 
Other solutions do this with templates. 

Kind Regards

Dominik Jesiolowski

unread,
Apr 21, 2017, 9:45:05 AM4/21/17
to tesseract-ocr
Hi,

Tesseract can't process pdf input files. You can try some lib (iText?) to extract images from PDF,
and then feed them to Tesseract. Use SetRectangle to focus on a particular area in an image (if using Tesseract API).
Don't understand your mention of templates.

Regards,
Dominik
Reply all
Reply to author
Forward
0 new messages