bash script to help finetune training for Korean


ShreeDevi Kumar

Mar 1, 2018, 2:29:16 AM
to tesser...@googlegroups.com, 이경준
I am attaching a bash script that makes it easy to run all the commands required for finetune training of a language. I recently tested it in response to the queries about Korean. Hopefully this will be easier to follow than descriptive text.

While the commands are currently set up for Korean, they can easily be changed for other languages.

The directories etc. need to be set based on your local setup.

Please note: for Indic languages and RTL languages, the combine_lang_model command needs additional flags to be set:

  --lang_is_rtl  True if lang being processed is written right-to-left  (type:bool default:false)
  --pass_through_recoder  If true, the recoder is a simple pass-through of the unicharset. Otherwise, potentially a compression of it  (type:bool default:false)
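
As a rough illustration of how these two flags could be added per language, here is a minimal sketch. It is not taken from the attached script: the helper name extra_lang_flags, the sample language lists, and the directory paths are all assumptions for illustration. In particular, the mapping of RTL languages to --lang_is_rtl and Indic languages to --pass_through_recoder is my assumption, so please check it against your own language before relying on it. The command is only printed, not run.

```shell
#!/usr/bin/env bash
# Sketch only: helper name, language lists and paths are illustrative
# assumptions, not part of the attached tesstrain_pluschars.sh.

# Print the extra combine_lang_model flags for a given language code.
extra_lang_flags() {
  local lang="$1"
  local flags=()
  case "$lang" in
    ara|heb|fas|urd)            # assumed sample of RTL languages
      flags+=(--lang_is_rtl) ;;
  esac
  case "$lang" in
    hin|ben|tam|tel|kan|mal)    # assumed sample of Indic languages
      flags+=(--pass_through_recoder) ;;
  esac
  echo "${flags[*]}"
}

LANG_CODE=kor

# Hypothetical directories; change them to match your local setup.
# echo prints the command so it can be reviewed before running it.
echo combine_lang_model \
  --input_unicharset "./train/${LANG_CODE}.unicharset" \
  --script_dir ./langdata \
  --output_dir ./train \
  --lang "${LANG_CODE}" \
  $(extra_lang_flags "${LANG_CODE}")
```

For Korean the helper adds nothing, so the printed command has no extra flags. Changing LANG_CODE to one of the sample codes above shows where the extra flag would be appended.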

kor-tesstrain_pluschars-log.txt
tesstrain_pluschars.sh

ShreeDevi Kumar

Mar 1, 2018, 2:37:51 AM
to tesser...@googlegroups.com, 이경준
The log file sent earlier covered only the training steps.

The complete log file, which also shows the console output from building the training data with tesstrain.sh, is attached now.

ShreeDevi
____________________________________________________________
Bhajan - Kirtan - Aarti @ http://bhajans.ramparivar.com
kor-tesstrain_pluschars-log.txt

이경준

Mar 1, 2018, 7:19:11 AM
to tesseract-ocr
Thank you, I really appreciate your kindness.

Thank you.

On Thursday, March 1, 2018 at 4:37:51 PM UTC+9, shree wrote: