Tesseract Crashes for Spanish Language

103 views
Skip to first unread message

Pablo Andres Araya Melo

unread,
Jan 27, 2019, 2:25:14 PM1/27/19
to tesseract-ocr
Hi,

I have:

tesseract 4.0.0-beta.1
 leptonica-1.75.3
  libgif 5.1.4 : libjpeg 8d (libjpeg-turbo 1.5.2) : libpng 1.6.34 : libtiff 4.0.9 : zlib 1.2.11 : libwebp 0.6.1 : libopenjp2 2.3.0

 Found AVX2
 Found AVX
 Found SSE

it crashes (computer reboots) when I use 

tesseract image.jpg out -l spa -psm 6

It therefore crashes when I use pytesseract inside python package.

I am using Kubuntu 18.04.

Any ideas how to fix this?

Zdenko Podobny

unread,
Jan 28, 2019, 2:50:05 AM1/28/19
to tesser...@googlegroups.com
Your tesseract version is outdated. Please update.
Next: try to use  spa data from tessdata_best...

Zdenko


ne 27. 1. 2019 o 20:25 Pablo Andres Araya Melo <panb...@gmail.com> napísal(a):
--
You received this message because you are subscribed to the Google Groups "tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-oc...@googlegroups.com.
To post to this group, send email to tesser...@googlegroups.com.
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/97df2b72-c1c7-49e7-9337-6c9d294ddedd%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

Pablo Andres Araya Melo

unread,
Jan 28, 2019, 8:03:35 PM1/28/19
to tesseract-ocr
I updated tesseract from github, I now have

tesseract 4.0.0-251-g44038

 leptonica-1.75.3
  libgif 5.1.4 : libjpeg 8d (libjpeg-turbo 1.5.2) : libpng 1.6.34 : libtiff 4.0.9 : zlib 1.2.11 : libwebp 0.6.1 : libopenjp2 2.3.0

 Found AVX2
 Found AVX
 Found SSE

It works with '-l spa', but when I do '--psm 6', it crashes.

Any ideas?

Thank you!

Shree Devi Kumar

unread,
Jan 28, 2019, 8:45:12 PM1/28/19
to tesser...@googlegroups.com
You have not mentioned which traineddata file you are using. 

>It works with '-l spa', but when I do '--psm 6', it crashes.

Please share the image. Also note the commands used and their output.


For more options, visit https://groups.google.com/d/optout.


--

____________________________________________________________
भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com

PA

unread,
Jan 29, 2019, 6:38:06 AM1/29/19
to tesser...@googlegroups.com
I am using spa.tessdata
What do you mean with commands used and their output?

Zdenko Podobny

unread,
Jan 29, 2019, 6:47:59 AM1/29/19
to tesser...@googlegroups.com
There are 3 version of "spa" data. Which one use?
Regarding command: how to you run tesseract?

Zdenko


ut 29. 1. 2019 o 12:38 PA <panb...@gmail.com> napísal(a):

PA

unread,
Jan 29, 2019, 6:56:15 AM1/29/19
to tesser...@googlegroups.com
I use spa.tessdata from here


As for usage, I do

tesseract norm.jpg out -l spa --psm 6

this works ok on my laptop, but on my desktop, it crashes. Both systems are Kubuntu 18.04

Thank you!


tesseract

Shree Devi Kumar

unread,
Jan 29, 2019, 11:09:08 AM1/29/19
to tesser...@googlegroups.com
>this works ok on my laptop, but on my desktop, it crashes. Both systems are Kubuntu 18.04

Please check tesseract version on both.

tesseract -v

Also check the version of of the traineddata file on each.


For more options, visit https://groups.google.com/d/optout.

PA

unread,
Feb 1, 2019, 7:13:27 AM2/1/19
to tesser...@googlegroups.com
On my laptop:

tesseract 4.0.0-beta.1
 leptonica-1.75.3
  libgif 5.1.4 : libjpeg 8d (libjpeg-turbo 1.5.2) : libpng 1.6.34 : libtiff 4.0.9 : zlib 1.2.11 : libwebp 0.6.1 : libopenjp2 2.3.0

 Found AVX2
 Found AVX
 Found SSE

This was installed from Kubuntu packages, so the tessdata comes from there.

On my desktop:

tesseract 4.0.0-251-g44038
 leptonica-1.75.3
  libgif 5.1.4 : libjpeg 8d (libjpeg-turbo 1.5.2) : libpng 1.6.34 : libtiff 4.0.9 : zlib 1.2.11 : libwebp 0.6.1 : libopenjp2 2.3.0

 Found AVX2
 Found AVX
 Found SSE

This was installed from github, and tessdata comes from  https://github.com/tesseract-ocr/tessdata/blob/master/spa.traineddata

Thank you!

Shree Devi Kumar

unread,
Feb 1, 2019, 7:39:47 AM2/1/19
to tesser...@googlegroups.com
>This was installed from github, and tessdata comes from  https://github.com/tesseract-ocr/tessdata/blob/master/spa.traineddata

Please try with traineddata file from tessdata_best and tessdata_fast

Also give the exact error message/console output.


For more options, visit https://groups.google.com/d/optout.

PA

unread,
Feb 1, 2019, 7:45:57 AM2/1/19
to tesser...@googlegroups.com
Are those test data for Spanish language?

Also I can not give error message as tesseract crashes making the desktop to reboot. Do you know a way to save to text file? 

Zdenko Podobny

unread,
Feb 1, 2019, 8:04:36 AM2/1/19
to tesser...@googlegroups.com
IMO if any program can cause crash of computer/reboot of system you have a big problem (not related to tesseract). 

Please try to search forum - I think there was already somebody with similar issue.

Zdenko


pi 1. 2. 2019 o 13:45 PA <panb...@gmail.com> napísal(a):

Shree Devi Kumar

unread,
Feb 1, 2019, 8:06:28 AM2/1/19
to tesser...@googlegroups.com

Alternately, look up the file size of spa.traineddata on your desktop and laptop. You can try copying the one from laptop (working version) to desktop.


For more options, visit https://groups.google.com/d/optout.

PA

unread,
Feb 1, 2019, 8:24:42 AM2/1/19
to tesser...@googlegroups.com
I was actually thinking the same thing, however, plain tesseract (with ou options) works, so I don't know what to think.

Will look in the forum for similar issues. 


PA

unread,
Feb 1, 2019, 8:25:00 AM2/1/19
to tesser...@googlegroups.com
Reply all
Reply to author
Forward
0 new messages