License plate recognition with Tesseract

124 views
Skip to first unread message

Douglas Carpenedo

unread,
Mar 28, 2017, 12:01:23 PM3/28/17
to tesseract-ocr
Hello everyone,
(sorry for the bad english, isn't my first language)
I am using the Tesseract to make car plate recognition (OCR) in my final paper. I trained the database with (at least) 20 samples of each character, manually extracted from actual plate images, without preprocessing. However, the results I have are not good enough.

Real-world images can be noisy and blurred, but tesseract can not learn the patterns in training? Due to the complexity of the backgrounds (ilumination variance, shadows, ...), even with preprocessing it is not possible to remove all the noise in all cases, some dirty are still present in the image. So, I decided train with samples withou preprocessing. I do wrong?

The minimum proposed character height is 10 pixels. Is small? Is the number of samples not enough? 
I send a complete license plate image (all characters) to tesseract. Would I have a better result if I sent character by character? I've read that tesseract has segmentation algorithms internally, which are probably better than one implemented by me.

Please, is there any suggestion to improve it? Any article recommendation that can help me?

Reply all
Reply to author
Forward
0 new messages