Read other languages with Tesseract in C#

Alessandro Latella

Oct 4, 2011, 6:02:35 AM
to tesseract-ocr
Hi guys, I'm trying to run Tesseract from C#.
The program works well with English:
ocr.Init(@"C:\Program Files\Tesseract-OCR\tessdata", "eng", false);
If I change the language from "eng" to "ita", the program
generates an error and does not work.
I use the tessnet2.dll library.

Thanks,
Alessandro.

Quan Nguyen

Oct 4, 2011, 10:27:47 PM
to tesseract-ocr
What's the error exactly? Does the image work with tesseract.exe?

Alessandro Latella

Oct 7, 2011, 12:20:45 PM
to tesseract-ocr
The error is "Unable to load unicharset file
C:\Program Files\Tesseract-OCR\ita.unicharset", but all the "ita." files
are in the ...\tessdata directory.
Yes, the image works correctly with tesseract.exe.

Quan Nguyen

Oct 7, 2011, 7:23:33 PM
to tesseract-ocr
The error indicates that the program was looking for the data files in the
C:\Program Files\Tesseract-OCR folder, not in
C:\Program Files\Tesseract-OCR\tessdata as you specified in your code.
Can you try to determine why there is a discrepancy?

You may want to take a look at VietOCR.NET 2.0x, which makes use of
the tessnet2 library.

http://vietocr.sf.net

Zdenko Podobný

Oct 7, 2011, 4:12:18 PM
to tesser...@googlegroups.com
If the error message is:

"Unable to load unicharset file C:\Program Files\Tesseract-OCR\ita.unicharset"

then it means that your program expects the language files (ita.*) in the directory "C:\Program Files\Tesseract-OCR\" and not in "...\tessdata".
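
One way to narrow this down is to check from C# which directory actually contains the unicharset file before calling Init. The following is only a minimal sketch using System.IO; the class and method names (TessdataCheck, FindUnicharset) are hypothetical and are not part of tessnet2, and the directory is the one quoted in the first post.

```csharp
using System;
using System.IO;

static class TessdataCheck
{
    // Hypothetical helper: returns the full path of "<lang>.unicharset"
    // if it exists in dataDir, otherwise null.
    public static string FindUnicharset(string dataDir, string lang)
    {
        string candidate = Path.Combine(dataDir, lang + ".unicharset");
        return File.Exists(candidate) ? candidate : null;
    }

    static void Main()
    {
        // Path taken from the first post; adjust it to your installation.
        string dataDir = @"C:\Program Files\Tesseract-OCR\tessdata";
        string lang = "ita";

        string found = FindUnicharset(dataDir, lang);
        Console.WriteLine(found != null
            ? "Found " + found
            : "Missing " + lang + ".unicharset in " + dataDir);
    }
}
```

If the file is found here but the library still reports a different folder in its error message, the path passed to Init is probably not the one the library ends up using.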

Zdenko

Alessandro Latella

Oct 26, 2011, 6:48:01 AM
to tesseract-ocr
I'm stupid, sorry for the simple error.
OK, the program now works with the standard language data.
Now I am trying to run the program with a language I trained myself, and this
error occurs:

"Error: Illegal malloc request size!
Fatal error: No error trap defined!
Signal_termination_handler called with signal 2001"

If the solution is the one described at
http://www.win.tue.nl/~aeb/linux/ocr/tesseract.html
how can I apply this patch when using the imported "tessnet2" library
in a C# project?

Thanks,
Alessandro.

abdelaziz mokrani

Mar 23, 2017, 6:55:54 AM
to tesseract-ocr
Please, can you send the code? I have the same problem: "Unable to load unicharset file C:\Program Files (x86)\Tesseract-OCR\\eng.unicharset"