Tesseract Trainer GUI for GNU/Linux

969 views
Skip to first unread message

Nalin Linux

unread,
Sep 14, 2016, 6:55:40 AM9/14/16
to tesseract-ocr
Dear list members,
   Currently I am developing a tesseract training GUI based on the manual sited at https://github.com/tesseract-ocr/tesseract/wiki/TrainingTesseract. The deb installer package is attached with this mail which is tested on ubuntu 16.04. Please test the trainer and report your feedback. 

Installing from git
Dependecy list : tesseract-ocr,imagemagick,cuneiform,python3-imaging-sane|python3-sane,espeak,poppler-utils,python3-enchant,aspell-en,python3-speechd
cd lios-3
python3 setup.py install --install-data=/usr


Thanking you, Nalin
lios_2.2_all.deb

Toderel Adrian-Aurel

unread,
Sep 15, 2016, 11:57:30 AM9/15/16
to tesser...@googlegroups.com
If you intend to do such a project please consider the path taken by an very old, and now defunct, project ... Clara OCR - Cooperative optical recognition
I mean just skip the final automatic AI full char recognition and only use the char segmentation engine then group all real segmented chars to logical labels and final part of recognition to be manual, assisted by you training UI, revised and aided by real human eyes not that stupid AI who manage to misslabel a char every now and then. Is my belief that using this approach the accuracy of recognition will skyrocket trough the roof of 100% with a very modest time increase necessary for a brief and final human revision not too time consuming because this is what can humans do best spotting the black wolf in a set of white sheep.

=======================================
linux is free, but needed expertise to use this little beast
is a personal, time consuming, continuous accumulation of knowledge
 and wasted time can not be rolled back no matter how much money you have

selling free software can not bring to you too much money,
but USING free software you can make a lot of money
like Google ... or IBM

your little help to free software development does not bring
to you any money but can help you use it more efficiently,
so you can make more money ... meanwhile, other users of
that little free software program, using your contribution
can make more money! nobody loses anything, all those who know
how to use a free software program, in continuous evolution, wins
registered linux user #352479

--
You received this message because you are subscribed to the Google Groups "tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-ocr+unsubscribe@googlegroups.com.
To post to this group, send email to tesser...@googlegroups.com.
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/08ea1575-d457-4893-aa5d-f96c130e3904%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

Nalin Linux

unread,
Sep 18, 2016, 8:54:45 AM9/18/16
to tesseract-ocr
I have done two more commits to avoid starting trouble. Now one have to start using "train-tesseract" command. The updated deb package is attached with this post. 
lios_2.2_all.deb

Nalin Linux

unread,
Nov 10, 2016, 2:00:45 AM11/10/16
to tesseract-ocr
There are too many updates on Tesseract-Trainer GUI which is shipped with Lios package. Please visit our sourceforge page to get updated packages. https://sourceforge.net/projects/lios/

universal reseller

unread,
Nov 10, 2016, 3:03:49 AM11/10/16
to tesser...@googlegroups.com
is this work on cube and rtl languages!?​

Nalin Linux

unread,
Dec 2, 2016, 11:22:56 AM12/2/16
to tesseract-ocr


On Thursday, November 10, 2016 at 1:33:49 PM UTC+5:30, peiman F. wrote:
is this work on cube and rtl languages!?​

Will be enabled soon. Latest updates listed below  
1 Training with font_properties enabled
2 Training Image zoom strengthened
3 Dictionary editing enabled
4 progress-bar enabled
5 Bug fixes on imageview

Nalin Linux

unread,
Dec 29, 2016, 11:33:39 AM12/29/16
to tesseract-ocr
Reply all
Reply to author
Forward
0 new messages