tesseract yields different results when image is rotated


George Tsai

Sep 30, 2015, 3:13:01 PM
to tesseract-ocr
I am wondering if anyone can give me guidance on this interesting issue. I have a shipping label, and tesseract performs much better when the image is rotated than when it is not. The command I used was "tesseract input_image_name output_text_name -psm 0" for both image files. I have also dumped tessinput to see what tesseract is actually seeing, and the two images look the same, except that one is rotated and the other is not. Thank you for your help.
tessinput_normal.tif
normal.jpg
rotated.jpg
tessinput_rotated.tif
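
For anyone who wants to reproduce the comparison, here is a minimal Python sketch that builds the command quoted above for both attachments. The helper names (`tesseract_command`, `run_ocr`, `compare_images`) and the output base names are made up for illustration. The `double_dash` switch is there because newer tesseract releases spell the option `--psm` rather than `-psm`. Note also that in tesseract's documented page segmentation modes, 0 means orientation and script detection only, so the mode is left as a parameter, and the sketch does not assume a text file is always produced.

```python
import os
import subprocess


def tesseract_command(image_path, output_base, psm=0, double_dash=False):
    """Build the argument list for one tesseract CLI run.

    Mirrors the command from the post:
        tesseract input_image_name output_text_name -psm 0
    """
    flag = "--psm" if double_dash else "-psm"
    return ["tesseract", image_path, output_base, flag, str(psm)]


def run_ocr(image_path, output_base, psm=0, double_dash=False):
    """Run tesseract; return the recognized text, or None if no .txt appeared."""
    subprocess.run(
        tesseract_command(image_path, output_base, psm, double_dash),
        check=True,
    )
    text_path = output_base + ".txt"
    if not os.path.exists(text_path):
        return None
    with open(text_path, encoding="utf-8") as handle:
        return handle.read()


def compare_images(psm=0, double_dash=False):
    """Run the same command on both attachments and print the raw results."""
    for name in ("normal", "rotated"):
        # Output base names such as "normal_out" are hypothetical.
        text = run_ocr(name + ".jpg", name + "_out", psm, double_dash)
        print(name, repr(text))
```

Calling `compare_images()` requires tesseract on the PATH and the two JPEG files in the working directory. Trying a different mode, for example `compare_images(psm=1)`, may help show whether the page segmentation setting plays a part in the difference.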

Allistair C

Sep 30, 2015, 3:28:27 PM
to tesser...@googlegroups.com
Can you describe what "much better" means? What do your results look like? What is the target text you are trying to recognise?

> On 30 Sep 2015, at 16:27, George Tsai <george....@gmail.com> wrote:
>
> <normal.jpg>

George Tsai

Sep 30, 2015, 8:54:14 PM
to tesseract-ocr
I am trying to recognize the recipient's name and address. This is the output from rotated.jpg. 

NICK WM
355 H 31' a 9795
BLAINE WA 98230—4124

HI!”IIIIIHNIIIHHIHH IIIIIHHIHH

Ul 3:01.119anan ol l/6047803396d1l?

:IIJHHII m I"!!!

IRACKING ID' YBABIMIMQBOG?

It's not perfect, but still allows me to look up the recipient from the database. This is the output from normal.jpg (not rotated):

'7‘”"NU{WIWUMUflJIlUUUWWW"

H

II‘[(6151{mil/IUflI/IIMIII mug/MW 1:

mo M960

D~085

It is really bad. I am puzzled why the results differ so much when the input image and the command are the same, except that one image is rotated. Thank you.
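
One way to put a number on "much better" is to check which fragments of the wanted address survive in each output. The sketch below is only an illustration: the helper names are hypothetical, the two excerpts are shortened from the outputs quoted in this message, and the `expected` fragments are an assumption about what the database lookup needs, not the confirmed recipient data.

```python
import difflib


def fragment_hits(ocr_text, fragments):
    """Return the fragments that occur in the OCR text, ignoring case."""
    haystack = ocr_text.upper()
    return [fragment for fragment in fragments if fragment.upper() in haystack]


def similarity(text_a, text_b):
    """Rough character-level similarity between two OCR outputs (0.0 to 1.0)."""
    return difflib.SequenceMatcher(None, text_a, text_b).ratio()


# Shortened excerpts of the two outputs quoted in this message.
rotated_excerpt = "NICK WM\n355 H 31' a 9795\nBLAINE WA 98230"
normal_excerpt = "H\nmo M960\nD~085"

# Assumed lookup keys; the real recipient record is not given in the thread.
expected = ["BLAINE", "WA 98230"]

print("rotated hits:", fragment_hits(rotated_excerpt, expected))
print("normal hits:", fragment_hits(normal_excerpt, expected))
print("similarity:", similarity(rotated_excerpt, normal_excerpt))
```

On these excerpts, both assumed fragments are found in the rotated output and none in the unrotated one, which matches the description above.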