OCRing existing PDF

68 views
Skip to first unread message

robert.j...@icloud.com

unread,
Apr 2, 2019, 9:56:11 AM4/2/19
to tesseract-ocr
Hello,

I need to OCR several PDF, what command line can I use to batch (French text)?

Can you point where I can find this information in the tesseract manual.

Thanks,

Robert Richard

Archiviste en ethnologie acadienne
Centre d'études acadiennes Anselme-Chiasson
Université de Moncton
Moncton (Nouveau-Brunswick) E1A 3E9
(506) 858-4724

robert....@umoncton.ca
http://www.umoncton.ca/umcm/

Robert Richard

unread,
Apr 3, 2019, 6:59:47 AM4/3/19
to tesser...@googlegroups.com
Hello, and thanks, for reply,

Question which third application use tesseract?

Cheers,

Robert Richard

Archiviste en ethnologie acadienne
Centre d'études acadiennes Anselme-Chiasson
Université de Moncton
Moncton (Nouveau-Brunswick) E1A 3E9
(506) 858-4724

robert....@umoncton.ca
http://www.umoncton.ca/umcm/


Envoyé de mon iPad

Le 2 avr. 2019 à 12:27, Shree Devi Kumar <shree...@gmail.com> a écrit :

Tesseract does not take pdfs as direct input. You have to convert pdf to images and provide that to tesseract.

However there are many 3rd party applications which take pdf as input and have tesseract as backend to do OCR.

Shree Devi Kumar

unread,
Apr 3, 2019, 7:10:29 AM4/3/19
to tesser...@googlegroups.com

--
You received this message because you are subscribed to the Google Groups "tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-oc...@googlegroups.com.
To post to this group, send email to tesser...@googlegroups.com.
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/3F7738AF-3E77-4502-A746-45D946948EC5%40icloud.com.
For more options, visit https://groups.google.com/d/optout.


--

____________________________________________________________
भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com
Reply all
Reply to author
Forward
0 new messages