Unrecognized argument --linedata_only

169 views
Skip to first unread message

Zohreh Khosrobeygi

unread,
Jun 8, 2018, 9:19:43 AM6/8/18
to tesseract-ocr
Hi,
I have been training tesseract but i have this errore"

Unrecognized argument --linedata_only
 
And it's my version of tesseract"
tesseract 4.0.0-beta.1
 leptonica-1.74.4
  libjpeg 8d (libjpeg-turbo 1.4.2) : libpng 1.2.54 : libtiff 4.0.6 : zlib 1.2.8

 Found AVX2
 Found AVX
 Found SSE

Besides it's my command:
sudo tesstrain.sh --fonts_dir /usr/share/fonts --lang fas    --training_text /home/kddlab/Desktop/tesseract-master/1MyData/fas/fas.training_text     --linedata_only \
  --noextract_font_properties --langdata_dir /home/kddlab/Desktop/tesseract-master/langdata \
  --tessdata_dir /home/kddlab/Desktop/tesseract-master/tessdata \
  --fontlist "B Mitra" --output_dir /home/kddlab/Desktop/tesseract-master/1MyData/testfas

And i have config file:
# Use LSTM
tessedit_ocr_engine_mode 1
tessedit_pageseg_mode 6

How can i solve this?

ShreeDevi Kumar

unread,
Jun 8, 2018, 10:14:27 AM6/8/18
to tesser...@googlegroups.com
Are you using the correct version of tesstrain.sh?

It should be in src/training/tesstrain.sh


ShreeDevi
____________________________________________________________
भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com


--
You received this message because you are subscribed to the Google Groups "tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-oc...@googlegroups.com.
To post to this group, send email to tesser...@googlegroups.com.
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/a692d903-34be-4a51-99c5-11ed34bb6cef%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.
Message has been deleted

Zohreh Khosrobeygi

unread,
Jun 9, 2018, 2:27:12 AM6/9/18
to tesseract-ocr
Yes, i am using   src/training/tesstrain.sh

ShreeDevi Kumar

unread,
Jun 9, 2018, 3:04:33 AM6/9/18
to tesser...@googlegroups.com
--linedata_only should work.

> tesseract 4.0.0-beta.1

Do you know which commit? Please try with latest code.

 i am using   src/training/tesstrain.sh

The command you used was:

sudo tesstrain.sh

Why do you need sudo?

Please run the script with

bash -x   src/training/tesstrain.sh etc ... and report with the console log.

ShreeDevi
____________________________________________________________
भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com

Khosrobeigy.zohreh

unread,
Jun 9, 2018, 4:28:31 AM6/9/18
to tesser...@googlegroups.com
Thank. by your command fixed.
 but next i used this:

lstmtraining   \
  --traineddata /home/kddlab/Desktop/tesseract-master/1MyData/testfas/fas/fas.traineddata   --net_spec '[1,48,0,1Ct3,3,16Mp3,3Lfys64Lfx96Lrx96Lfx192O1c1]' \
  --model_output /home/kddlab/Desktop/tesseract-master/1MyData/testfasout/base --learning_rate 20e-4 \
  --train_listfile /home/kddlab/Desktop/tesseract-master/1MyData/testfas/fas.training_files.txt \
  --eval_listfile /home/kddlab/Desktop/tesseract-master/1MyData/testfas1/fas.training_files.txt \
  --max_iterations 5000 &>/home/kddlab/Desktop/tesseract-master/1MyData/testfasout/basetrain.log  
 and i have this error now

Segmentation fault (core dumped)


Could you please help me again?

To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-ocr+unsubscribe@googlegroups.com.

--
You received this message because you are subscribed to a topic in the Google Groups "tesseract-ocr" group.
To unsubscribe from this topic, visit https://groups.google.com/d/topic/tesseract-ocr/GLlgILi5xOA/unsubscribe.
To unsubscribe from this group and all its topics, send an email to tesseract-ocr+unsubscribe@googlegroups.com.

To post to this group, send email to tesser...@googlegroups.com.
Visit this group at https://groups.google.com/group/tesseract-ocr.

For more options, visit https://groups.google.com/d/optout.



--
Zohreh Khosrobeygi
University of Tehran, 2016

ShreeDevi Kumar

unread,
Jun 9, 2018, 5:08:50 AM6/9/18
to tesser...@googlegroups.com
Try without   --eval_listfile /home/kddlab/Desktop/tesseract-master/1MyData/testfas1/fas.training_files.txt \

ShreeDevi
____________________________________________________________
भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com

--
You received this message because you are subscribed to a topic in the Google Groups "tesseract-ocr" group.
To unsubscribe from this topic, visit https://groups.google.com/d/topic/tesseract-ocr/GLlgILi5xOA/unsubscribe.
To unsubscribe from this group and all its topics, send an email to tesseract-oc...@googlegroups.com.

To post to this group, send email to tesser...@googlegroups.com.
Visit this group at https://groups.google.com/group/tesseract-ocr.



--
Zohreh Khosrobeygi
University of Tehran, 2016

--
You received this message because you are subscribed to the Google Groups "tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-oc...@googlegroups.com.
To post to this group, send email to tesser...@googlegroups.com.
Visit this group at https://groups.google.com/group/tesseract-ocr.

Khosrobeigy.zohreh

unread,
Jun 11, 2018, 6:22:25 AM6/11/18
to tesser...@googlegroups.com
I am using this command and it is true
But i have trained 5000000 lines. but when tesseract 48000 images tiff, show an error:
No space left on device
 My RAM is 16 g
and swap is: 20g 
tiff file's size is 4 g too.

On Sat, Jun 9, 2018 at 11:33 AM, ShreeDevi Kumar <shree...@gmail.com> wrote:
To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-ocr+unsubscribe@googlegroups.com.

--
You received this message because you are subscribed to a topic in the Google Groups "tesseract-ocr" group.
To unsubscribe from this topic, visit https://groups.google.com/d/topic/tesseract-ocr/GLlgILi5xOA/unsubscribe.
To unsubscribe from this group and all its topics, send an email to tesseract-ocr+unsubscribe@googlegroups.com.

To post to this group, send email to tesser...@googlegroups.com.
Visit this group at https://groups.google.com/group/tesseract-ocr.

For more options, visit https://groups.google.com/d/optout.



--
Reply all
Reply to author
Forward
0 new messages