Make lstm for some files

54 views
Skip to first unread message

Zohreh Khosrobeygi

unread,
Aug 16, 2018, 8:10:45 AM8/16/18
to tesseract-ocr
I have some tif and box files for each font for example:
fas.B_Mitra.exp0.box
fas.B_Mitra.exp0.tif
fas.B_Mitra.exp1.box
fas.B_Mitra.exp1.tif
fas.B_Mitra.exp2.box
fas.B_Mitra.exp2.tif
.
.
.
How can I make lstm for each of them?
Thx.

Shree Devi Kumar

unread,
Aug 16, 2018, 9:58:28 AM8/16/18
to tesser...@googlegroups.com
You need to make lstmf file for each of these.

eg.  tesseract  fas.B_Mitra.exp0.tif  fas.B_Mitra.exp0 --psm 6 lstm.train

will create  fas.B_Mitra.exp0.lstmf



--
You received this message because you are subscribed to the Google Groups "tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-ocr+unsubscribe@googlegroups.com.
To post to this group, send email to tesser...@googlegroups.com.
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/c011d8f3-75b1-471f-a772-35327390bf78%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.



--

____________________________________________________________
भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com

Khosrobeigy.zohreh

unread,
Aug 19, 2018, 9:15:36 AM8/19/18
to tesser...@googlegroups.com
Hi, when I run tesstrain.sh I get this error:
+ err_exit '/tmp/tmp.N31LQSCg1a/fas/fas.Times_New_Roman.exp0.lstmf does not exist or is not readable'
+ echo -e 'ERROR: /tmp/tmp.N31LQSCg1a/fas/fas.Times_New_Roman.exp0.lstmf' does not exist or is not readable
+ tee -a /tmp/tmp.N31LQSCg1a/fas/tesstrain.log
ERROR: /tmp/tmp.N31LQSCg1a/fas/fas.Times_New_Roman.exp0.lstmf does not exist or is not readable
+ exit 1

Tesseract -v:
tesseract 4.0.0-beta.1
 leptonica-1.74.4
  libjpeg 8d (libjpeg-turbo 1.4.2) : libpng 1.2.54 : libtiff 4.0.6 : zlib 1.2.8

 Found AVX2
 Found AVX
 Found SSE





On Thu, Aug 16, 2018 at 6:28 PM Shree Devi Kumar <shree...@gmail.com> wrote:
You need to make lstmf file for each of these.

eg.  tesseract  fas.B_Mitra.exp0.tif  fas.B_Mitra.exp0 --psm 6 lstm.train

will create  fas.B_Mitra.exp0.lstmf


On Thu, Aug 16, 2018 at 5:40 PM, Zohreh Khosrobeygi <beigy....@gmail.com> wrote:
I have some tif and box files for each font for example:
fas.B_Mitra.exp0.box
fas.B_Mitra.exp0.tif
fas.B_Mitra.exp1.box
fas.B_Mitra.exp1.tif
fas.B_Mitra.exp2.box
fas.B_Mitra.exp2.tif
.
.
.
How can I make lstm for each of them?
Thx.

--
You received this message because you are subscribed to the Google Groups "tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-oc...@googlegroups.com.



--

____________________________________________________________
भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com

--
You received this message because you are subscribed to a topic in the Google Groups "tesseract-ocr" group.
To unsubscribe from this topic, visit https://groups.google.com/d/topic/tesseract-ocr/QpAIHg4SPME/unsubscribe.
To unsubscribe from this group and all its topics, send an email to tesseract-oc...@googlegroups.com.

To post to this group, send email to tesser...@googlegroups.com.
Visit this group at https://groups.google.com/group/tesseract-ocr.

For more options, visit https://groups.google.com/d/optout.


--
Zohreh Khosrobeygi
University of Tehran, 2016

Shree Devi Kumar

unread,
Aug 19, 2018, 11:38:46 AM8/19/18
to tesser...@googlegroups.com
> tesseract 4.0.0-beta.1

Please upgrade to latest code.

To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-ocr+unsubscribe@googlegroups.com.



--

____________________________________________________________
भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com

--
You received this message because you are subscribed to a topic in the Google Groups "tesseract-ocr" group.
To unsubscribe from this topic, visit https://groups.google.com/d/topic/tesseract-ocr/QpAIHg4SPME/unsubscribe.
To unsubscribe from this group and all its topics, send an email to tesseract-ocr+unsubscribe@googlegroups.com.


--
Zohreh Khosrobeygi
University of Tehran, 2016

--
You received this message because you are subscribed to the Google Groups "tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-ocr+unsubscribe@googlegroups.com.

To post to this group, send email to tesser...@googlegroups.com.
Visit this group at https://groups.google.com/group/tesseract-ocr.

For more options, visit https://groups.google.com/d/optout.
Reply all
Reply to author
Forward
0 new messages