Re: OCR Text in Specific Color

2,066 views
Skip to first unread message

zdenko podobny

unread,
Jan 28, 2013, 3:05:42 AM1/28/13
to tesser...@googlegroups.com
Tesseract converts input image data to 2 colors mode (black & white). So it do no have information (at the output stage) about color of the input symbols...

Zdenko


On Sun, Jan 27, 2013 at 10:52 PM, <ipe...@gmail.com> wrote:

Im new to the community but did some searching around before posting. My question in of itself is not unique, however, the application is.

Question:

Does anyone know if tesseract engine that can convert only specific text (select and convert only green colored text) from an image to actual copyable text?

Example: A US Dollar bill has a green serial number. However, it may not always be in the same place (depending on how the bill is photographed). I need an OCR that can first recognize the green text and then convert it into editable text (output in .txt or anything else).

Any thoughts are welcome.

Igor

--
--
You received this message because you are subscribed to the Google
Groups "tesseract-ocr" group.
To post to this group, send email to tesser...@googlegroups.com
To unsubscribe from this group, send email to
tesseract-oc...@googlegroups.com
For more options, visit this group at
http://groups.google.com/group/tesseract-ocr?hl=en
 
 
 

Nick White

unread,
Jan 28, 2013, 3:09:39 AM1/28/13
to tesser...@googlegroups.com
Hi Igor,

> Does anyone know if tesseract engine that can convert only specific text
> (select and convert only green colored text) from an image to actual copyable
> text?

Tesseract doesn't process in colour at all, so it can't do what you
want by itself. You'll have to pre-process the images first
isolating the section you want, and then send that on to Tesseract.
The ImageMagick API or OpenCV may be good things to look at to
figure out good ways of analysing the image in this way.

Best of luck.

Nick
Reply all
Reply to author
Forward
0 new messages