How to deal with multi page images through tess4j

41 views
Skip to first unread message

Shawn Chen

unread,
Sep 9, 2017, 9:10:39 AM9/9/17
to tesseract-ocr

I am using tess4j 4.0.0 to process the image and pdf files.
For multiple page files if i use TessBaseAPIGetComponentImages(), all the boxa from all the pages will be returned,right?
If so how can i distinguish the boxes from different pages?

Also if I use TessBaseAPISetRectangle() and TessBaseAPIGetUTF8Text() how can i know the page number for the returned text?

Thanks!

v-room

unread,
Sep 9, 2017, 5:03:03 PM9/9/17
to tesseract-ocr
just confess.
it's what i would've done
Reply all
Reply to author
Forward
0 new messages