want to train tesseract 2.04 for new font OCR A or OCR A extended

231 views
Skip to first unread message

Will Smith

unread,
Dec 11, 2011, 10:53:58 AM12/11/11
to tesseract-ocr
Hi,

My problem-
I using tessnet2 .NET assembly in C#.NET (http://www.pixel-
technology.com/freeware/tessnet2/).
It is using tesseract 2.04 as OCR Engine.
I have to recognize only numbers on electric fuses in font OCR A or
OCR A extended
But OCR results are as follows.
OCR A extended font results
Digit= its OCR output
0=11
.=.
1=1
2=2
3=3
4=.1, 9
5=5
6=5
7=7
8=5
9=.1

training tesseract 2.04 and tesseract 3.01 is little different.I m
getting worse outputs after training.I want to train tesseract 2.04
for new font OCR-A or OCR A extended.Please send me training files tif
image, .box, .tr and trained data on yogeshjo...@gmail.com
urgent.Thanks for help in advance.

Ben Phung

unread,
Apr 18, 2012, 3:18:32 PM4/18/12
to tesser...@googlegroups.com
Are you preprocessing the image beforehand by any chance?

Ben Phung

unread,
Apr 18, 2012, 3:19:08 PM4/18/12
to tesser...@googlegroups.com
Have you preprocessed the image by any chance?


On Sunday, December 11, 2011 7:53:58 AM UTC-8, Will Smith wrote:
Reply all
Reply to author
Forward
Message has been deleted
0 new messages