Tesseract API to output PDF with txt layer

60 views
Skip to first unread message

PSK

unread,
Jul 27, 2018, 5:46:15 AM7/27/18
to tesseract-ocr
I know that Tesseract v4 CLI is able to produce the output as PDF with txt layer. The question is whether this functionality is also available via its API?
If so, the other question is whether Tess4J will expose that API to Java, too (I know that this is a separate product, but maybe someone is familiar with both products, otherwise I will go to Tess4J form to ask if such API is planned to be exposed).


Quan Nguyen

unread,
Jul 27, 2018, 5:23:02 PM7/27/18
to tesseract-ocr
Yes, the PDF functionality is exposed in C-API interface, which Tess4J fully supports.

PSK

unread,
Jul 30, 2018, 6:57:54 AM7/30/18
to tesseract-ocr
Thanks!
Reply all
Reply to author
Forward
0 new messages