Tesseract basic Recognition

69 views
Skip to first unread message

Thomas Bourrieau

unread,
Jan 15, 2021, 5:49:04 AM1/15/21
to tesseract-ocr
Hi, 
I m trying to use tesseract for basic string recognition but i m facing a bad text identification. 
you can find the basic image i m trying to identify in attachement. 

the following C# code is used for the identification : 
_ocr = OcrApi.Create();
_ocr.Init(Languages.French);
string r2 =  _ocr.GetTextFromImage(tempFileName);

the extracted text is : 
ABUSÿN7EUUCHÿSRVU1JTCJU2\n\n
instead of : 
ABLIS_N70031_SRV01_RTC_R02

I m using .net tesseract sdk patagame.ocr.dll with dependancy tesseract.dll downloaded from  the following link : Welcome to the Tesseract.Net SDK (patagames.com)  
These files are also in attachment . 
And i also imported basic fra and eng file in tessdata folder. 
In the other hand, my folder TessData/Config is empty. Is It Normal ? 

Am I missing some config files or specific settings in my code ? 

Thank you. 
Best regards

48124369-790e-4eae-acbb-3ee4610de558.bmp
Reply all
Reply to author
Forward
0 new messages