How to install an additional language for tesseract, e.g. "deu" (Deutsch)?


Fred Philip

May 8, 2026, 11:42:06 AM
to tesseract-ocr
Hi,
I am not very experienced with tesseract, even though I have run some examples before.
I'm running tesseract on Linux Mint (Ubuntu-based).
Now I'm facing a situation where I want to read some documents with special German characters [üÜöÖäÄß], and for that I tried to install a tessdata package (deu).
However, I wasn't able to find any explanation in the manual of how exactly to install the data file (zip) on my system.
Do I install it with "APT", with a fetch from the repository, or do I use the combine_tessdata tool?
It would be helpful if someone could point me to the part of the manual where the installation process is explained.
Any suggestion is much appreciated.
TIA

Zdenko Podobny

May 8, 2026, 11:43:25 AM
to tesser...@googlegroups.com
Hello,

How did you install tesseract?

Zdenko



Fred Philip

May 8, 2026, 2:38:34 PM
to tesseract-ocr
Tesseract has been installed on my Linux machine for a long time, and I no longer remember how I installed it.
However, I can tell you that the package manager Synaptic shows that tesseract v. 5.3.4-1build5 is installed.

But how does that relate to the question of where in the manual the instructions for installing tessdata???.zip can be found?
[/usr/share/tesseract-ocr/5/tessdata/] has only [eng.traineddata] installed.

:-)

Zdenko Podobny

May 8, 2026, 2:40:55 PM
to tesser...@googlegroups.com
If you used a package manager to install tesseract, you should also use it for the language data.
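
On Debian-based systems such as Ubuntu and Linux Mint, that usually means something like `sudo apt install tesseract-ocr-deu`. The sketch below is only an illustration of that advice, not an official tool: it asks the local tesseract binary which languages it can see and, if one is missing, prints the apt command to try. The helper names (`apt_package_for`, `parse_list_langs`, `installed_languages`) are made up for this example, and the package-name pattern `tesseract-ocr-<code>` is an assumption that holds for plain language codes on Debian-style distributions; other distributions may name their packages differently.

```python
import shutil
import subprocess


def apt_package_for(lang: str) -> str:
    """Guess the Debian/Ubuntu package name for a Tesseract language code.

    Assumption: packages follow the pattern tesseract-ocr-<code>, with
    underscores replaced by hyphens (e.g. chi_sim -> tesseract-ocr-chi-sim).
    """
    return "tesseract-ocr-" + lang.strip().lower().replace("_", "-")


def parse_list_langs(output: str) -> list[str]:
    """Extract language codes from the text printed by `tesseract --list-langs`.

    The first line is a header ("List of available languages ..."); every
    following non-empty line is treated as one language code.
    """
    lines = [line.strip() for line in output.splitlines()]
    return [line for line in lines[1:] if line]


def installed_languages() -> list[str]:
    """Ask the local tesseract binary which languages it can see."""
    if shutil.which("tesseract") is None:
        return []  # tesseract is not on PATH
    result = subprocess.run(
        ["tesseract", "--list-langs"],
        capture_output=True,
        text=True,
        check=False,
    )
    # Fall back to stderr in case this build prints the list there.
    return parse_list_langs(result.stdout or result.stderr)


if __name__ == "__main__":
    wanted = "deu"
    if wanted in installed_languages():
        print(f"{wanted} is already installed")
    else:
        print(f"try: sudo apt install {apt_package_for(wanted)}")
```

After installing the package, running `tesseract --list-langs` again should show the new language next to `eng`, and it can then be selected with `-l deu`.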

Zdenko



Fred Philip

May 8, 2026, 3:41:48 PM
to tesseract-ocr
Damn, I should have thought of doing it that way.
I was a little misled by what I had read in the manual about the tessdata files.
All is good now.
Thanks, @zdenop