Multiple rotation on one image

28 views

Skip to first unread message

Jakub Fridrich

unread,

Sep 10, 2019, 4:23:46 AM9/10/19

to tesseract-ocr

I have next image, with multiple orientation on text, 0 and 180 degree, without red notes.

I need convert to text, but text on the middle block return trash of data. I don't edit image in external editor.

How to get data for all paragraph correctly?

I'm testing this command:

tesseract --oem 1 --psm 1 -l eng "img.jpg" "imgdata"

Thanks

David Maung

unread,

Sep 10, 2019, 10:38:03 AM9/10/19

to tesseract-ocr

>I don't edit image in external editor.

Why not use a tool such as OpenCV? You could use an operation such as erode until you have 3 contiguous blocks of black to identify paragraphs. You could then present each paragraph individually to Tesseract in 2 orientations.

Reply all

Reply to author

Forward

0 new messages