Multiple rotation on one image


Jakub Fridrich

Sep 10, 2019, 4:23:46 AM
to tesseract-ocr
I have the following image, which contains text in multiple orientations (0 and 180 degrees). The red notes are not part of the original image.

I need to convert it to text, but the text in the middle block comes back as garbage. I don't edit the image in an external editor.
How can I get the text of every paragraph correctly?
 
I'm testing this command:
tesseract --oem 1 --psm 1 -l eng "img.jpg" "imgdata"


DRjoxOqVAAEpDWU.jpg

Thanks

David Maung

Sep 10, 2019, 10:38:03 AM
to tesseract-ocr
>I don't edit the image in an external editor.

Why not use a tool such as OpenCV? You could apply an operation such as erode until you are left with 3 contiguous blocks of black, one per paragraph. You could then pass each paragraph to Tesseract individually, in both orientations.