Tesseract: OCR trimming of text is not required, need exact text as in image

141 views
Skip to first unread message

RUTURAJ R. Raval

unread,
Apr 29, 2015, 9:46:29 AM4/29/15
to tesser...@googlegroups.com

I am using tesseract library in my app, there when I capture image of text, and text gets extracted, if there are multiple spaces in image, they are trimmed to only 1 space.

Suppose, in image, the text is kind of,

HELLO       WORLD
HI HOW ARE YOU?
I    AM    FINE
THANK           YOU.

So when I get text, the perfect margin should be maintained, as per in image, but I get output as,

HELLO WORLD
HI HOW ARE YOU?
I AM FINE
THANK YOU.

So I don't want this.

So can I make change in any function, to achieve this task?

Please help me, what should I change, and where?

Because I don't know how to get the same text as in image as any trimming function is there in tesseract? Help me out.

Thank you.

Quan Nguyen

unread,
Apr 30, 2015, 7:52:18 PM4/30/15
to tesser...@googlegroups.com

buyi wen

unread,
Sep 17, 2015, 11:23:12 PM9/17/15
to tesseract-ocr
if you like tesseract ocr, you may like this free online tesseract ocr tool

Reply all
Reply to author
Forward
0 new messages