convert f1.png f2.png f3.png f4.png f5.png f6.png f7.png -evaluate-sequence Min -threshold 60% -negate output.png. I thought that was pretty good result. Clean image with decent text. Tesseract is about %50. My question is this: Can i train tesseract without the full alphabet? Since these are all labeled by network and Vanderbilt only records a few i'll have FOX, ABC, CBS, NBC, and CNN. Not too many letters to train with. Also could anyone point out instructions on getting the training tools installed on Mac os X? Macports doesn't have the training part, I did install v3 from source but the training programs won't compile. Any help is appreciated
Good work extracting text. But not sufficient for Tesseract. Try blurring your result image until characters become less blocky. This way you probably wouldn't need training.
Completely different approach is to use fixed pattern matching. Go find my post about pulling text out of game screenshots. You'll need to program yourself then.
The last thing I'd try is training. Wiki is your friend.
-Dmitri
Okay so my project is i want to extract the text imbedded in video. After experimenting with Imagemagick i was able to isolate the text and put it on a white background. I thought that would be the hard part. But every command line OCR software i try is pretty bad at converting what i have. In the sample image, f2.png, you can see what i'm working with. It is just the network name and date i need. With this imagemagick command:convert f1.png f2.png f3.png f4.png f5.png f6.png f7.png -evaluate-sequence Min -threshold 60% -negate output.png. I thought that was pretty good result. Clean image with decent text. Tesseract is about %50. My question is this: Can i train tesseract without the full alphabet? Since these are all labeled by network and Vanderbilt only records a few i'll have FOX, ABC, CBS, NBC, and CNN. Not too many letters to train with. Also could anyone point out instructions on getting the training tools installed on Mac os X? Macports doesn't have the training part, I did install v3 from source but the training programs won't compile. Any help is appreciated
--
You received this message because you are subscribed to the Google Groups "tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-oc...@googlegroups.com.
To post to this group, send email to tesser...@googlegroups.com.
Visit this group at http://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/52275c37-543e-4b85-ab44-6c51f890ca6b%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.
If you have limited letters and numbers that you want to recognize, also look at the whitelist
- sent from my phone. excuse the brevity and typos.
To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/1340659d-b291-4ad8-ba95-9ed6976a1d15%40googlegroups.com.
Text color - somehow you need to replicate or take into account the logic behind color selection to extract as much correct pixels as possible.
Text position - just work with the cropped text.
High compression - see below.
When you use fixed pattern matching, it's about fixed patterns but not necessarily about "fixed matching". Here you can go with "fuzzy" matching, e.g. when a defined percentage of pixels match to a pattern.
Another "big thing" that came to my mind is to rectify italics by unshifting respective scanlines. This would make characters closer to what Tesseract is trained for.
-Dmitri
To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/1340659d-b291-4ad8-ba95-9ed6976a1d15%40googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/0fc63467-5f89-459c-a0f6-0841d7e46dac%40googlegroups.com.