OCR in online application

Stanley Denman

Feb 15, 2018, 3:03:20 AM

to tesseract-ocr

I want to create an online application that takes a large PDF file and extracts the information that is valuable to the user. The key to the application is going to be speed: I basically want to provide a minimal service for free that builds up an e-mail address list. When I OCRed one of these files in Foxit, it took about 20 minutes. Here is my question: most of the information I need is in the bookmarks, but not all of it. One piece of info I need is an address, which I could get either by calling an API such as Google Maps or by doing a partial OCR. I can see OCRing 10-12 pages to get my info. I am wondering about speed. Does anyone have ideas about which approach would be the fastest?
