Hello,
I've been working with Tessearct 4.1.1 (in C#, Visual Studio). I've been taking screenshots of small regions of my screen to capture text from youtube comments, TikTok chats, etc., and it has done a great job of converting the print in the images to text. I thought it did a great job on this text.
However, it failed when I took a screenshot of text in my notepad application, which is the simplest text imaginable. A list of words. It's black text on a white background. It only extracted a word or two from the document. The image snapshot is just 190 by 260 pixels. It says it's 300x300 dpi.
Here is the text from the document:
Up
Hey
Down
What
Left
Left
So
That's
It
Start
and this
what
Down
Here is the screenshot image I had Tesseract extract from.
Is there a way to fix this problem?
Many thanks for any help.
Regards,
...John