Re: [tesseract-ocr] Re: Font List

ShreeDevi Kumar

Jun 14, 2017, 9:52:19 AM
to tesser...@googlegroups.com
> what is the difference between the engtrain and engeval? 

It will depend on what fonts and training text you use for each.

One is used for training; the other is used to evaluate the trained model.

ShreeDevi
____________________________________________________________
भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com

On Wed, Jun 14, 2017 at 5:58 PM, Ibr <ibr.h...@gmail.com> wrote:
UPDATE

I figured out how to use the list, and it seems the two commands are the same, so the question remains: what is the difference between engtrain and engeval?

Ibr

Jun 18, 2017, 8:07:49 AM
to tesseract-ocr
Hi,

The engtrain and engeval commands are almost the same. For engeval you specify the font with the --fontlist argument, while for engtrain the fonts are defined in language-specific.sh. I ran both commands and noticed that they produce the same result files, except that in the engtrain case there is an lstmf file for each font in language-specific.sh.
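
To check this observation rather than eyeball it, something like the following could list which lstmf files exist in one output directory but not in the other. This is only a sketch: the function names `list_lstmf` and `only_in_first` are made up for illustration, and it assumes the lstmf files sit directly inside each output directory.

```shell
#!/bin/sh
# Hypothetical helpers, not part of the Tesseract training tools.

# Print the sorted base names of the .lstmf files directly inside directory $1.
list_lstmf() {
    find "$1" -maxdepth 1 -type f -name '*.lstmf' -exec basename {} \; | LC_ALL=C sort
}

# Print the .lstmf file names that exist in directory $1 but not in directory $2.
only_in_first() {
    tmp_a=$(mktemp)
    tmp_b=$(mktemp)
    list_lstmf "$1" > "$tmp_a"
    list_lstmf "$2" > "$tmp_b"
    # comm -23 keeps only the lines unique to the first (sorted) input.
    LC_ALL=C comm -23 "$tmp_a" "$tmp_b"
    rm -f "$tmp_a" "$tmp_b"
}
```

For example, `only_in_first ~/tesstutorial/engtrain ~/tesstutorial/engeval` would print the per-font files that only the training run produced, assuming the training output was written to ~/tesstutorial/engtrain.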

But now I have another question. The fine-tuning command is:
training/lstmtraining --model_output ~/tesstutorial/impact_from_small/impact \
  --continue_from ~/tesstutorial/engoutput/base_checkpoint \
  --train_listfile ~/tesstutorial/engeval/eng.training_files.txt \
  --max_iterations 1200

and the evaluation command also uses engeval:
training/lstmeval --model ~/tesstutorial/impact_from_small/impact_checkpoint \
  --eval_listfile ~/tesstutorial/engeval/eng.training_files.txt

To me this looks like a typo in the fine-tuning command, since the --eval_listfile argument of the evaluation command also points at ~/tesstutorial/engeval, so both commands contain engeval. My question: since I don't want to run the fine tuning many times, can I use the training files that already exist in engtrain, instead of creating engeval files for every font and then running the fine tuning against those training files?
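
If reusing the engtrain files turns out to be acceptable, a per-font list file could be built from them along these lines. This is a rough sketch, not a confirmed workflow: `make_font_listfile` is a made-up name, and it assumes the lstmf files are named like eng.Impact_Condensed.exp0.lstmf (font name with spaces replaced by underscores) and that a list file is plain text with one lstmf path per line. It also only helps if the font in question was among the fonts used for the training run.

```shell
#!/bin/sh
# Hypothetical helper, not part of the Tesseract training tools.
# Write a list file (one .lstmf path per line) for a single font.
#   $1: directory holding the .lstmf files
#   $2: font name, e.g. "Impact Condensed"
#   $3: path of the list file to write
make_font_listfile() {
    # Assumption: spaces in the font name appear as underscores in file names.
    font_tag=$(printf '%s' "$2" | tr ' ' '_')
    find "$1" -maxdepth 1 -type f -name "*.${font_tag}.exp*.lstmf" | LC_ALL=C sort > "$3"
    # Return non-zero if nothing matched, so an empty list is not passed on to training.
    [ -s "$3" ]
}
```

The resulting file could then be given to --train_listfile in place of ~/tesstutorial/engeval/eng.training_files.txt. Whether fine tuning against those files gives the same result as using a separate engeval set is a different question that this sketch does not answer.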