Help in TrainingTesseract 4.00 Finetune


Ahmad Moawad

Apr 12, 2017, 1:09:48 AM
to tesseract-ocr
Hello all,

I need help with fine-tuning Tesseract 4.00, following https://github.com/tesseract-ocr/tesseract/wiki/TrainingTesseract-4.00---Finetune
I would like to understand some of the parameters, such as:
1- langdata_dir: is that a local copy of the files in https://github.com/tesseract-ocr/langdata ?

training/tesstrain.sh --fonts_dir /usr/share/fonts --lang ara  --linedata_only \
  --training_text ../langdata/ara/arabic1.txt \
  --langdata_dir ../langdata --tessdata_dir ./tessdata \
  --fontlist "Times New Roman," \
  --output_dir ~/tesstutorial/aratest

2- linedata_only: I do not know what this option does.

Thanks.

ShreeDevi Kumar

Apr 12, 2017, 4:46:09 AM
to tesser...@googlegroups.com
--linedata_only means that tesstrain.sh will only try to create the .lstmf files used for LSTM training, and not the files for 3.0x training.
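
One way to check the result is to count the .lstmf files in the output directory after the run. This is only a minimal sketch: it assumes the --output_dir from the command in the question (~/tesstutorial/aratest) and that the generated files end in .lstmf. The count_lstmf helper is a hypothetical name used for illustration, not part of Tesseract.

```shell
#!/bin/sh
# Count the .lstmf files directly inside a tesstrain.sh output directory.
# count_lstmf is a hypothetical helper; the default path below is assumed
# from the --output_dir used in the question.
count_lstmf() {
  dir="$1"
  # find prints one line per matching file; wc -l counts those lines.
  # Errors (for example a missing directory) are silenced, giving 0.
  find "$dir" -maxdepth 1 -type f -name '*.lstmf' 2>/dev/null | wc -l | tr -d ' '
}

out_dir="${1:-$HOME/tesstutorial/aratest}"
echo "lstmf files in $out_dir: $(count_lstmf "$out_dir")"
```

If the count is 0, the run probably did not complete, or the output went to a different directory.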

- excuse the brevity, sent from mobile
