Tesseract couldn't load any languages!

236 views
Skip to first unread message

Walter Weinmann

unread,
Jan 4, 2022, 2:14:53 PM1/4/22
to tesseract-ocr
How is this possible?

walter@ubuntu:/mnt/hgfs/Projects/docker_images$ tesseract -v
tesseract 5.0.0
 leptonica-1.82.0
  libjpeg 8d (libjpeg-turbo 2.0.3) : libpng 1.6.37 : libtiff 4.1.0 : zlib 1.2.11
 Found AVX2
 Found AVX
 Found FMA
 Found SSE4.1
 Found OpenMP 201511

walter@ubuntu:/mnt/hgfs/Projects/docker_images$ tesseract --list-langs
List of available languages in "/home/walter/tesseract/" (1):
tessdata/eng

walter@ubuntu:/mnt/hgfs/Projects/docker_images$ tesseract *.pdf output
Error opening data file /home/walter/tesseract/eng.traineddata
Please make sure the TESSDATA_PREFIX environment variable is set to your "tessdata" directory.
Failed loading language 'eng'
Tesseract couldn't load any languages!
Could not initialize tesseract.

Zdenko Podobny

unread,
Jan 5, 2022, 6:59:12 AM1/5/22
to tesser...@googlegroups.com
  1. tesseract --list-langs only print filenames in TESSDATA_PREFIX - e.g. it does not guarantee that files are not corrupted or they are compatible with your tesseract version
  2. tesseract without specification of language (-l) expect to find "eng". Your setting provides "tessdata/eng" (see information you provided) e.g. you either provide option "-l  tessdata/eng" or you set TESSDATA_PREFIX=/home/walter/tesseract/tessdata (or "--tessdata-dir /home/walter/tesseract/tessdata")

Zdenko


ut 4. 1. 2022 o 20:14 Walter Weinmann <walter.at....@gmail.com> napísal(a):
--
You received this message because you are subscribed to the Google Groups "tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-oc...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/5bce98d1-b681-4ed0-ba5f-267de301ae71n%40googlegroups.com.
Reply all
Reply to author
Forward
0 new messages