Reg: Legacy Components not found

25 views
Skip to first unread message

Surya VaraPrasad Alla

unread,
Apr 22, 2024, 11:43:46 AM4/22/24
to tesseract-ocr
Hello,

I have the similar response

pytesseract.pytesseract.TesseractError: (1, "read_params_file: Can't open tessedit_char_blacklist=,;: Error: Tesseract (legacy) engine requested, but components are not present in external/tesstrain/data/eng_pcb/eng_pcb.traineddata!! Failed loading language 'eng_pcb' Tesseract couldn't load any languages! Could not initialize tesseract.")

tesseract --version:
tesseract -v
tesseract 4.1.1
 leptonica-1.82.0
  libgif 5.1.9 : libjpeg 8d (libjpeg-turbo 2.1.1) : libpng 1.6.37 : libtiff 4.3.0 : zlib 1.2.11 : libwebp 1.2.2 : libopenjp2 2.4.0
 Found AVX512BW
 Found AVX512F
 Found AVX2
 Found AVX
 Found FMA
 Found SSE
 Found libarchive 3.6.0 zlib/1.2.11 liblzma/5.2.5 bz2lib/1.0.8 liblz4/1.9.3 libzstd/1.4.8

I am using best float tessdata files from: https://github.com/tesseract-ocr/tessdata_best/blob/main/eng.traineddata

also tried some of possibilities in https://github.com/ocrmypdf/OCRmyPDF/issues/209

I am looking for the source of the issue ---> could someone help if understood the source. so I can work further.
Reply all
Reply to author
Forward
0 new messages