Hi,
I've downloaded some of the *.traineddata files from
<
https://github.com/tesseract-ocr/tessdata_best> --- as far as I can
tell, all the ones I have tested work on the command line, e.g.,
$ tesseract --tessdata-dir /opt/data/tessdata-new/ --list-langs
...
swe
...
$ tesseract --tessdata-dir /opt/data/tessdata-new/ -l swe test.png stdout
[produces output with no errors]
$ tesseract --version
tesseract 4.1.0
leptonica-1.78.0
libgif 5.1.4 : libjpeg 8d (libjpeg-turbo 2.0.1) : libpng 1.6.37 :
libtiff 4.0.10 : zlib 1.2.11 : libwebp 0.6.1 : libopenjp2 2.3.0
Found AVX2
Found AVX
Found SSE
but when I try to use the same swe.traineddata file in a web service
built with grails and running on Tomcat, something causes a segfault and
such a massive problem that the whole Tomcat server has to be killed and
restarted. The grails service has the following dependency:
compile group: 'org.bytedeco', name: 'tesseract-platform', version:
'4.0.0-1.5'
which is a slightly lower version, but the data files are supposed to
work with Tesseract 4.
Any ideas why?
Thanks,
Adam