Creating single-line ground-truth from PDF with corresponding hocr

45 views
Skip to first unread message

Nazar Kotsur

unread,
Apr 12, 2023, 6:23:33 AM4/12/23
to tesseract-ocr
I have a PDF scan and hocr file with fixed OCR mistakes, and would like to try to train a model. Is it possible to create ground-truth using this hocr file?
Reply all
Reply to author
Forward
0 new messages