Can't run tesseract with LSTM

228 مرّة مشاهدة
التخطي إلى أول رسالة غير مقروءة

Jenkar Smithy

غير مقروءة،
22‏/03‏/2017، 3:57:47 م22‏/3‏/2017
إلى tesseract-ocr
When trying the following command :
tesseract input.tiff ./result --psm 7 -c tessedit_char_whitelist=0123456789 --oem 2 --tessdata-dir ~/

I get the following error :
Tesseract Open Source OCR Engine v4.00.00alpha with Leptonica
Page 1
int_mode_
:Error:Assert failed:in file weightmatrix.cpp, line 180
int_mode_
:Error:Assert failed:in file weightmatrix.cpp, line 180

The data is the correct one (for 4.0), from the git.

ShreeDevi Kumar

غير مقروءة،
23‏/03‏/2017، 6:00:13 ص23‏/3‏/2017
إلى tesser...@googlegroups.com
There might be some problem with your input file - all the following work for me.
Please note that whitelist has no effect in 4.0

$ tesseract input.tif input
Tesseract Open Source OCR Engine v4.00.00alpha with Leptonica
Page 1
$ tesseract input.tif input --psm 7
Tesseract Open Source OCR Engine v4.00.00alpha with Leptonica
Page 1
$ tesseract input.tif input --psm 7 -c tessedit_char_whitelist=0123456789
Tesseract Open Source OCR Engine v4.00.00alpha with Leptonica
Page 1
$ tesseract input.tif input --psm 7 -c tessedit_char_whitelist=0123456789 --oem 2
Tesseract Open Source OCR Engine v4.00.00alpha with Leptonica
Page 1
$ tesseract input.tif input --psm 7 -c tessedit_char_whitelist=0123456789 --oem 2 --tessdata-dir ./tessdata
Tesseract Open Source OCR Engine v4.00.00alpha with Leptonica
Page 1
$

ShreeDevi
____________________________________________________________
भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com

--
You received this message because you are subscribed to the Google Groups "tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-ocr+unsubscribe@googlegroups.com.
To post to this group, send email to tesser...@googlegroups.com.
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/05c29148-7d90-4690-82d8-74077f1003ec%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

input.txt
input.tif

Jenkar Smithy

غير مقروءة،
23‏/03‏/2017، 6:58:26 ص23‏/3‏/2017
إلى tesseract-ocr
Tried with your input file - still no dice, throws the same error.

Interesting about the charwhitelist, thanks for pointing that out!

ShreeDevi Kumar

غير مقروءة،
23‏/03‏/2017، 7:49:57 ص23‏/3‏/2017
إلى tesser...@googlegroups.com
what version of tesseract are you running? If you built it, which commit source have you used?

ShreeDevi
____________________________________________________________
भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com

On Thu, Mar 23, 2017 at 4:28 PM, Jenkar Smithy <quentin...@free.fr> wrote:
Tried with your input file - still no dice, throws the same error.

Interesting about the charwhitelist, thanks for pointing that out!

--
You received this message because you are subscribed to the Google Groups "tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-ocr+unsubscribe@googlegroups.com.
To post to this group, send email to tesser...@googlegroups.com.
Visit this group at https://groups.google.com/group/tesseract-ocr.

Jenkar Smithy

غير مقروءة،
23‏/03‏/2017، 8:35:06 ص23‏/3‏/2017
إلى tesseract-ocr
Using tesseract 4.00.00alpha , built from commit 2b52915a740a39944157fd0fda0524fd1d71ef83

ShreeDevi Kumar

غير مقروءة،
23‏/03‏/2017، 8:46:18 ص23‏/3‏/2017
إلى tesser...@googlegroups.com
Ok. I am using an older version ...

git log -1


commit 0ff26ee3de166659970d80e50aef4000ff2557b2
Author: zdenop <zde...@gmail.com>
Date:   Fri Feb 3 08:15:15 2017 +0100

    Merge pull request #698 from stweil/configure

    configure: Run AVX test only with 64 bit compiler

Please try with that. If that works, some newer commit might be a problem.


ShreeDevi
____________________________________________________________
भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com

On Thu, Mar 23, 2017 at 6:05 PM, Jenkar Smithy <quentin...@free.fr> wrote:
Using tesseract 4.00.00alpha , built from commit 2b52915a740a39944157fd0fda0524fd1d71ef83

--
You received this message because you are subscribed to the Google Groups "tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-ocr+unsubscribe@googlegroups.com.
To post to this group, send email to tesser...@googlegroups.com.
Visit this group at https://groups.google.com/group/tesseract-ocr.

Jenkar Smithy

غير مقروءة،
23‏/03‏/2017، 9:19:58 ص23‏/3‏/2017
إلى tesseract-ocr
Cleaned up and installed from that commit, LSTMs are now functionnal.
Thanks!

الرد على الكل
رد على الكاتب
إعادة توجيه
0 رسالة جديدة