How to get all the tesseract fonts in tiff format

52 views
Skip to first unread message

Anand Akella

unread,
Feb 1, 2018, 2:31:20 AM2/1/18
to tesseract-ocr
Hi,
Im trying to convert HOCR to pdf using reportlab library. The library only supports english fonts. Given that tesseract supports different fonts for different languages. Is there a way we can fetch all the fonts in tiff and register it with reportlab. Kindly let me know.

Thanks,
Anand

ShreeDevi Kumar

unread,
Feb 1, 2018, 9:15:02 AM2/1/18
to tesser...@googlegroups.com
Gimagereader offers HOCR to pdf output with tesseract as the OCR engine.

ShreeDevi
____________________________________________________________
भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com

--
You received this message because you are subscribed to the Google Groups "tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-ocr+unsubscribe@googlegroups.com.
To post to this group, send email to tesser...@googlegroups.com.
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/aa956b92-7ad8-4bb3-b6fc-af2b3d9a8c15%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

Anand Akella

unread,
Feb 1, 2018, 12:46:25 PM2/1/18
to tesseract-ocr
Hi Shree,
Thanks for the info. This not want I'm looking for. Hocr output contains font names now if I have to output to PDF or some other document. The library expects font to be registered. It is like writing gibberish to file if the font is not registered.

Thanks,
Anand 
Reply all
Reply to author
Forward
0 new messages