Recognise only highlighted text

400 views
Skip to first unread message

SR

unread,
May 21, 2010, 7:24:08 AM5/21/10
to tesseract-ocr
Hi all
I want to do OCR process to get the text highlighted in the
document. Could you let me know whether it possible and how?

Thanks

--
You received this message because you are subscribed to the Google Groups "tesseract-ocr" group.
To post to this group, send email to tesser...@googlegroups.com.
To unsubscribe from this group, send email to tesseract-oc...@googlegroups.com.
For more options, visit this group at http://groups.google.com/group/tesseract-ocr?hl=en.

Jimmy O'Regan

unread,
May 21, 2010, 8:08:36 AM5/21/10
to tesser...@googlegroups.com, tesseract-ocr
On 21 May 2010, at 12:24, SR <senthi...@gmail.com> wrote:

> Hi all
> I want to do OCR process to get the text highlighted in the
> document. Could you let me know whether it possible and how?

You would really need to ask a more specific question if you want to
have any hope of an answer.

If you mean you want to select an area of an image and have only that
piece be sent for OCR then the usual way to do that is to make a copy
of that selection and use that instead of the whole image.

SR

unread,
May 21, 2010, 8:38:41 AM5/21/10
to tesseract-ocr
Here more details about requirement:
The scanned document has some of the important texts in specific
color highlighted, i need to get those text separately. It is not
possible to pass those highlighted text image part to do ocr alone.

Thanks
On May 22, 12:08 am, Jimmy O'Regan <jore...@gmail.com> wrote:

Sven Pedersen

unread,
Mar 20, 2013, 8:24:08 AM3/20/13
to tesser...@googlegroups.com
that is really an image processing problem -- you would need to find an image process that eliminates all but the highlighted text and supply that image to tesseract as black and white (binary).
--Sven


On Tue, Mar 19, 2013 at 3:39 PM, Waldemar Pross <wapr...@gmail.com> wrote:
SR, were you able to find a solution for your problem? To OCR just a highlighted text?
--
--
You received this message because you are subscribed to the Google
Groups "tesseract-ocr" group.
To post to this group, send email to tesser...@googlegroups.com
To unsubscribe from this group, send email to
tesseract-oc...@googlegroups.com
For more options, visit this group at
http://groups.google.com/group/tesseract-ocr?hl=en
 
---
You received this message because you are subscribed to the Google Groups "tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-oc...@googlegroups.com.
For more options, visit https://groups.google.com/groups/opt_out.
 
 



--
``All that is gold does not glitter,
  not all those who wander are lost;
the old that is strong does not wither,
  deep roots are not reached by the frost.
From the ashes a fire shall be woken,
  a light from the shadows shall spring;
renewed shall be blade that was broken,
  the crownless again shall be king.”


--
``All that is gold does not glitter,
  not all those who wander are lost;
the old that is strong does not wither,
  deep roots are not reached by the frost.
From the ashes a fire shall be woken,
  a light from the shadows shall spring;
renewed shall be blade that was broken,
  the crownless again shall be king.”
Reply all
Reply to author
Forward
0 new messages