Converting colored background and colored characters to text with the Tesseract library

101 views
Skip to first unread message

Emre Batu

unread,
Aug 5, 2024, 1:27:09 AM8/5/24
to tesseract-ocr
20240804211345.png  Hello everyone. I am using the Tesseract library in a C# application to analyze images. However, the image I want to convert to text contains colored characters and a colored background. As a result, the output is not accurate. How can I convert this image to text correctly? Thank you.  

Zdenko Podobny

unread,
Aug 5, 2024, 1:29:24 AM8/5/24
to tesser...@googlegroups.com
Captcha was created to fool OCR.


Zdenko


po 5. 8. 2024 o 7:27 Emre Batu <emreb...@gmail.com> napísal(a):
20240804211345.png  Hello everyone. I am using the Tesseract library in a C# application to analyze images. However, the image I want to convert to text contains colored characters and a colored background. As a result, the output is not accurate. How can I convert this image to text correctly? Thank you.  

--
You received this message because you are subscribed to the Google Groups "tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-oc...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/f0b4bb22-6e1a-41ab-b38a-d31440c12074n%40googlegroups.com.

Emre Batu

unread,
Aug 6, 2024, 1:59:21 PM8/6/24
to tesser...@googlegroups.com
The link generating the mentioned captcha is as follows: https://medeczane.sgk.gov.tr/eczane/SayiUretenImageYeniServlet

Do you have any idea how I can convert the numbers generated on this link into text? I have tried some C# code that changes the background color to white and the text to black. It is partially successful, but it does not achieve the desired result completely.


Zdenko Podobny <zde...@gmail.com>, 5 Ağu 2024 Pzt, 08:29 tarihinde şunu yazdı:
Reply all
Reply to author
Forward
0 new messages