VietOCR v3.6 & VietOCR.NET v3.6 Release


Quan Nguyen

Mar 5, 2015, 9:20:14 PM
to tesser...@googlegroups.com

A Java/.NET GUI frontend for the Tesseract OCR engine. The releases include the following improvements:

  • Add Split TIFF function
  • Add thumbnail bar for ease of page navigation
  • Display useful info in the status bar
  • Update links to OpenOffice dictionaries
  • Add support for reading specific config files that set control parameters
  • Improve 64-bit support
http://vietocr.sf.net

sibi kanagaraj

Apr 7, 2015, 10:34:54 AM
to tesser...@googlegroups.com
Dear Quan,

What is the status of Unicode support in VietOCR and jTessBox Editor?
I find jTess somewhat awkward to work with when sequences like கோ கொ கா have to be handled.

This is my problem:

கோ must never be represented as the separate pieces ெ    க    ா , but I think Tesseract is segmenting off the leading ெ and recognising it as எ . This accounts for a major part of my problem in jTess as well as in Tesseract.

English is a different case, since a single character represents each letter, and I know very little about Vietnamese. I am not sure whether you have faced the same issue and resolved it. Could you please tell me whether this is a jTess or VietOCR issue, or whether I should start a new thread?
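
To make the segmentation problem above concrete, here is a minimal Python sketch (standard library only, not taken from VietOCR, jTess, or Tesseract) of how Unicode stores these clusters. The consonant க always comes first in stored order, even though part of the vowel sign is drawn to its left. Under canonical decomposition, the vowel sign in கொ splits into ெ + ா and the one in கோ splits into ே + ா. The helper names `code_points` and `describe` are only for illustration.

```python
import unicodedata


def code_points(text):
    """Return the code points of text in stored (logical) order."""
    return [f"U+{ord(ch):04X}" for ch in text]


def describe(text):
    """Return 'U+XXXX NAME' for each character of text."""
    return [f"U+{ord(ch):04X} {unicodedata.name(ch)}" for ch in text]


KO_SHORT = "\u0b95\u0bca"  # கொ: TAMIL LETTER KA + TAMIL VOWEL SIGN O
KO_LONG = "\u0b95\u0bcb"   # கோ: TAMIL LETTER KA + TAMIL VOWEL SIGN OO

for label, cluster in (("KA + O", KO_SHORT), ("KA + OO", KO_LONG)):
    # NFD splits the two-part vowel sign into its left and right halves,
    # but the consonant still comes first in the stored sequence.
    decomposed = unicodedata.normalize("NFD", cluster)
    # NFC joins the halves back into the single two-part vowel sign.
    recomposed = unicodedata.normalize("NFC", decomposed)
    print(label)
    print("  stored:    ", describe(cluster))
    print("  decomposed:", describe(decomposed))
    print("  round-trip:", recomposed == cluster)
```

A segmenter that cuts the image left to right sees the left half of the vowel sign before the consonant, which is the reverse of the stored order. That may be why the isolated left piece gets matched to a standalone letter such as எ (U+0B8E) instead of being kept with its consonant.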

-Sibi