Multiple colours text in an image

51 views
Skip to first unread message

Iago Giné

unread,
Sep 21, 2023, 7:04:33 AM9/21/23
to tesseract-ocr
Hi all,

Is there some option to tell tesseract-ocr that there is text with multiple colours, so it detect all the text? For example, in my case, I have a pdf with the cover of a book, with yellow background and text both in black and also in white. Depending on how I proceed, I get only the text in black or the text in white, but not both.

I have only found the next issue, but no answer or anything more :https://github.com/tesseract-ocr/tesseract/issues/3078

Thank you for your time!

Iago

Zdenko Podobny

unread,
Oct 7, 2023, 8:26:41 AM10/7/23
to tesser...@googlegroups.com
Hello,

this is about image preprocessing/thresholding rather than tesseract...
Please post an example image so tesseract users can test it and suggest a possible solution.

Zdenko


št 21. 9. 2023 o 13:04 Iago Giné <let7once10a...@gmail.com> napísal(a):
--
You received this message because you are subscribed to the Google Groups "tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-oc...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/6610a558-975c-4ce4-8bba-c2b56fd9c50an%40googlegroups.com.
Reply all
Reply to author
Forward
0 new messages