Can't run tesseract with LSTM

Jenkar Smithy

2017-03-22 15:57:47
to: tesseract-ocr
When I run the following command:
tesseract input.tiff ./result --psm 7 -c tessedit_char_whitelist=0123456789 --oem 2 --tessdata-dir ~/

I get the following error:
Tesseract Open Source OCR Engine v4.00.00alpha with Leptonica
Page 1
int_mode_
:Error:Assert failed:in file weightmatrix.cpp, line 180
int_mode_
:Error:Assert failed:in file weightmatrix.cpp, line 180

The tessdata is the correct one (for 4.0), taken from the Git repository.

ShreeDevi Kumar

2017-03-23 06:00:13
to: tesser...@googlegroups.com
There might be some problem with your input file; all of the following commands work for me.
Please note that the whitelist has no effect in 4.0.

$ tesseract input.tif input
Tesseract Open Source OCR Engine v4.00.00alpha with Leptonica
Page 1
$ tesseract input.tif input --psm 7
Tesseract Open Source OCR Engine v4.00.00alpha with Leptonica
Page 1
$ tesseract input.tif input --psm 7 -c tessedit_char_whitelist=0123456789
Tesseract Open Source OCR Engine v4.00.00alpha with Leptonica
Page 1
$ tesseract input.tif input --psm 7 -c tessedit_char_whitelist=0123456789 --oem 2
Tesseract Open Source OCR Engine v4.00.00alpha with Leptonica
Page 1
$ tesseract input.tif input --psm 7 -c tessedit_char_whitelist=0123456789 --oem 2 --tessdata-dir ./tessdata
Tesseract Open Source OCR Engine v4.00.00alpha with Leptonica
Page 1
$

ShreeDevi
____________________________________________________________
भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com

--
You received this message because you are subscribed to the Google Groups "tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-ocr+unsubscribe@googlegroups.com.
To post to this group, send email to tesser...@googlegroups.com.
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/05c29148-7d90-4690-82d8-74077f1003ec%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

input.txt
input.tif

Jenkar Smithy

2017-03-23 06:58:26
to: tesseract-ocr
Tried with your input file - still no dice, throws the same error.

Interesting about the char whitelist, thanks for pointing that out!
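
Since the whitelist reportedly has no effect in 4.0, one possible workaround is to filter the recognized text after the fact. This is only a sketch: `digits_only` is a hypothetical helper name, not part of tesseract, and it simply drops every character that is not a digit or a newline, which is not the same as constraining the recognizer.

```shell
# Hypothetical helper (not a tesseract feature): read OCR text on stdin
# and keep only the digits 0-9 and newlines.
digits_only() {
  tr -cd '0-9\n'
}
```

Assuming the command from the first post wrote its text to `./result.txt`, it could be applied as `digits_only < ./result.txt`.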

ShreeDevi Kumar

2017-03-23 07:49:57
to: tesser...@googlegroups.com
What version of Tesseract are you running? If you built it from source, which commit did you use?
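
For anyone answering the same question, here is a rough sketch of how both pieces of information could be collected. The function names are made up for illustration, and the source directory passed to `source_commit` is whatever path your checkout lives in. `banner_version` only parses the banner line that tesseract prints in the transcripts in this thread.

```shell
# Hypothetical helper: extract the version string from the banner line
# "Tesseract Open Source OCR Engine v... with Leptonica" read on stdin.
banner_version() {
  sed -n 's/^Tesseract Open Source OCR Engine v\([^ ]*\).*/\1/p'
}

# Hypothetical helper: print the commit hash of the source checkout at "$1".
source_commit() {
  git -C "$1" rev-parse HEAD
}
```

`tesseract --version` also prints the installed version directly; some builds write it to stderr, so `2>&1` may be needed when capturing it.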

ShreeDevi

Jenkar Smithy

2017-03-23 08:35:06
to: tesseract-ocr
Using tesseract 4.00.00alpha, built from commit 2b52915a740a39944157fd0fda0524fd1d71ef83

ShreeDevi Kumar

2017-03-23 08:46:18
to: tesser...@googlegroups.com
Ok. I am using an older version ...

$ git log -1
commit 0ff26ee3de166659970d80e50aef4000ff2557b2
Author: zdenop <zde...@gmail.com>
Date:   Fri Feb 3 08:15:15 2017 +0100

    Merge pull request #698 from stweil/configure

    configure: Run AVX test only with 64 bit compiler

Please try with that. If that works, some newer commit might be a problem.
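
A minimal sketch of rebuilding at that commit, assuming the usual autotools build and that it is run from inside the tesseract source checkout. `rebuild_at` is a hypothetical helper name, and the install steps assume `sudo` is available on your system.

```shell
# Known-good commit quoted above in this thread.
GOOD_COMMIT=0ff26ee3de166659970d80e50aef4000ff2557b2

# Hypothetical helper: check out commit "$1" and rebuild and reinstall.
# Must be called from the root of the tesseract source tree; each step
# runs only if the previous one succeeded.
rebuild_at() {
  git checkout "$1" &&
  ./autogen.sh &&
  ./configure &&
  make &&
  sudo make install &&
  sudo ldconfig
}
```

It would be invoked as `rebuild_at "$GOOD_COMMIT"`. If the older commit works, `git bisect` between it and the failing commit is one way to narrow down which newer change introduced the problem.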


ShreeDevi

Jenkar Smithy

2017-03-23 09:19:58
to: tesseract-ocr
Cleaned up and installed from that commit; LSTMs are now functional.
Thanks!
