Groups
Groups
Sign in
Groups
Groups
tesseract-ocr
Conversations
About
Send feedback
Help
tesseract-ocr
Contact owners and managers
1–30 of 7338
Welcome,
Before sending an email to the group:
Please read
Frequently Asked Questions
Make sure you read the
Tesseract documentation
Search internet sources (including this group) for a solution
If you have a problem:
Provide all steps (including input resources) for its replication.
So not send a screenshot of the terminal - send the logs or copy text from a terminal.
Mark all as read
Report group
0 selected
Alessandro Griseta
, …
Milan Hauth
4
Aug 30
[questions] what happened to `tessdata_best` in Tesseract 5?
works for me with tesseract 5.5.1 git clone --depth=1 https://github.com/tesseract-ocr/tessdata_best
unread,
[questions] what happened to `tessdata_best` in Tesseract 5?
works for me with tesseract 5.5.1 git clone --depth=1 https://github.com/tesseract-ocr/tessdata_best
Aug 30
Andrus Moor
Aug 24
How recognize text with background
Tried https://github.com/Sicos1977/TesseractOCR and Leptonica to convert jpg receipt slip to text:
unread,
How recognize text with background
Tried https://github.com/Sicos1977/TesseractOCR and Leptonica to convert jpg receipt slip to text:
Aug 24
Pavel Hanák
Aug 21
Tesseract returns exotic characters while processing standard latin-script document
Short version: Ghostscipt uses Tesseract, but their data exchange interface may contain a bug.
unread,
Tesseract returns exotic characters while processing standard latin-script document
Short version: Ghostscipt uses Tesseract, but their data exchange interface may contain a bug.
Aug 21
Yuwen Hsieh
Aug 15
Can't install 5.5.1
Hello I tried to install Tesseract with docker base image python:3.11-trixie, but it's installing
unread,
Can't install 5.5.1
Hello I tried to install Tesseract with docker base image python:3.11-trixie, but it's installing
Aug 15
Cary Lewis
Aug 14
OCR of iPhone Screen shots
I have been trying with some success to have tesseract recognize text from iPhone's about screen.
unread,
OCR of iPhone Screen shots
I have been trying with some success to have tesseract recognize text from iPhone's about screen.
Aug 14
Thomas McGrew
,
Zdenko Podobny
7
Aug 10
Incorrect text detection
You are correct, I did miss that section. Inverting the image seems to produce better results. I
unread,
Incorrect text detection
You are correct, I did miss that section. Inverting the image seems to produce better results. I
Aug 10
Jan-Erik Lärka
,
Nikola Smolenski
4
Aug 4
TESSDATA_PREFIX doesn't work with national character(s)
The problem is that there are two places attempting to use TESSDATA_PREFIX and they have conflicting
unread,
TESSDATA_PREFIX doesn't work with national character(s)
The problem is that there are two places attempting to use TESSDATA_PREFIX and they have conflicting
Aug 4
Terasgr
Jul 25
Modern Greek depends on Ancient Greek language?
Hello people. When I tried to OCR a Greek text using tesseract I found that the modern Greek data (
unread,
Modern Greek depends on Ancient Greek language?
Hello people. When I tried to OCR a Greek text using tesseract I found that the modern Greek data (
Jul 25
Graham Toal
Jul 20
Re: [tesseract-ocr] How can I find out the version of current Tesseract from cmdline?
'--' not '-' gtoal@linux:~/github/uparse-main$ tesseract Usage: tesseract --help | --
unread,
Re: [tesseract-ocr] How can I find out the version of current Tesseract from cmdline?
'--' not '-' gtoal@linux:~/github/uparse-main$ tesseract Usage: tesseract --help | --
Jul 20
Tom Vercauteren
, …
Fly Night Society
7
Jul 17
Best settings to OCR an image of some cyphered text (base64)
I already have, and yes to all. On Wednesday, July 16, 2025 at 5:33:53 PM UTC-4 tfmo...@gmail.com
unread,
Best settings to OCR an image of some cyphered text (base64)
I already have, and yes to all. On Wednesday, July 16, 2025 at 5:33:53 PM UTC-4 tfmo...@gmail.com
Jul 17
Maxim Kizub
Jul 17
Training lstm with symbol boxes
Hello. I need to OCR text with mix of latin and cyrillic letters plus emoji-like icons. Text font is
unread,
Training lstm with symbol boxes
Hello. I need to OCR text with mix of latin and cyrillic letters plus emoji-like icons. Text font is
Jul 17
nahye koo
Jul 15
Can I train a model for multiple languages at the same time?
Hello! I want to use custom traineddata, but the performance is bad, so I want to ask for advice. I
unread,
Can I train a model for multiple languages at the same time?
Hello! I want to use custom traineddata, but the performance is bad, so I want to ask for advice. I
Jul 15
jollysalmon
,
Zdenko Podobny
3
Jul 6
OCR multiple pngs into one PDF
Sorry about that, looks like it wasn't so clear at al; - anyway, I ended up completing a script,
unread,
OCR multiple pngs into one PDF
Sorry about that, looks like it wasn't so clear at al; - anyway, I ended up completing a script,
Jul 6
Jay S
Jul 1
What build pipeline produces the most performant static binary for windows?
Hi all, Right now I am building a static x64 binary of tesseract 5.5.1 using vcpkg + Visual Studio
unread,
What build pipeline produces the most performant static binary for windows?
Hi all, Right now I am building a static x64 binary of tesseract 5.5.1 using vcpkg + Visual Studio
Jul 1
JoeBlack
,
Lorenzo Bolzani
3
Jun 30
(Py)Tesseract does not detect all expected hex codes in my image despite multiple preprocessing strategies
https://pastebin.com/4J40QcJE Template matching works better than Tesseract, and the code is much
unread,
(Py)Tesseract does not detect all expected hex codes in my image despite multiple preprocessing strategies
https://pastebin.com/4J40QcJE Template matching works better than Tesseract, and the code is much
Jun 30
Michael
,
tang...@gmail.com
2
Jun 29
How to find out the version of current Tessaract installation (on Win)?
Tesseract —version On Sun, 29 Jun 2568 BE at 13:19 Michael <mha...@gmail.com> wrote: Assume I
unread,
How to find out the version of current Tessaract installation (on Win)?
Tesseract —version On Sun, 29 Jun 2568 BE at 13:19 Michael <mha...@gmail.com> wrote: Assume I
Jun 29
Ajinkya Bobade
, …
jannes hoekman
5
Jun 19
Handling text scans and cleaning
You can use BIQE archive Op do 19 jun 2025 om 17:36 schreef Kliai Louay <kliai...@gmail.com>
unread,
Handling text scans and cleaning
You can use BIQE archive Op do 19 jun 2025 om 17:36 schreef Kliai Louay <kliai...@gmail.com>
Jun 19
محمود محمد
, …
Kalyan Karthik
4
Jun 19
organizing a Zoom-based training workshop to discuss and practice advanced applications of Tesseract OCR
Happy to join in this workshop On Sunday, June 15, 2025 at 10:58:08 PM UTC+5:30 mahmoud...@gmail.com
unread,
organizing a Zoom-based training workshop to discuss and practice advanced applications of Tesseract OCR
Happy to join in this workshop On Sunday, June 15, 2025 at 10:58:08 PM UTC+5:30 mahmoud...@gmail.com
Jun 19
הברנש
, …
Tech Tactics
3
Jun 15
help in ocr pdf
Hey Tom, I am working with bengli OCR text extraction using tesseract. How can I train, fine tune
unread,
help in ocr pdf
Hey Tom, I am working with bengli OCR text extraction using tesseract. How can I train, fine tune
Jun 15
gwapes
Jun 14
Inconsistent Tesseract performance
Hello, I am reaching out to get some advice on inconsistent performance running tesseract on
unread,
Inconsistent Tesseract performance
Hello, I am reaching out to get some advice on inconsistent performance running tesseract on
Jun 14
ossama khalyl
Jun 11
OCR-fine-tunning ~ LSTMTraining
I followed the steps for fine-tuning Tesseract for handwriting recognition. I have the character
unread,
OCR-fine-tunning ~ LSTMTraining
I followed the steps for fine-tuning Tesseract for handwriting recognition. I have the character
Jun 11
Vasudevakrishna R joshi
,
Zdenko Podobny
3
Jun 10
Compiled tesseract(Both in windows and linux) is giving junk results on most of the images
Even with simple images also I am not getting proper results. But If I ran tesseract.exe which is
unread,
Compiled tesseract(Both in windows and linux) is giving junk results on most of the images
Even with simple images also I am not getting proper results. But If I ran tesseract.exe which is
Jun 10
James Head
,
Zdenko Podobny
2
Jun 5
Poor recognition of scanned typewriter produced pages
Hello, To help troubleshoot this issue, please first try reproducing it using only Tesseract. This
unread,
Poor recognition of scanned typewriter produced pages
Hello, To help troubleshoot this issue, please first try reproducing it using only Tesseract. This
Jun 5
Ryan Bijoy
May 29
Need help with OCR accuracy
I am working on extracting scoreboard data from game images in real-time, with high accuracy and
unread,
Need help with OCR accuracy
I am working on extracting scoreboard data from game images in real-time, with high accuracy and
May 29
Massimiliano Carloni
, …
Tom Morris
5
May 25
Using tesseract_best (or other models?) for 18th-century English printed text
On Monday, April 21, 2025 at 12:03:33 PM UTC-4 mcarlo...@gmail.com wrote: Honestly, I am having the
unread,
Using tesseract_best (or other models?) for 18th-century English printed text
On Monday, April 21, 2025 at 12:03:33 PM UTC-4 mcarlo...@gmail.com wrote: Honestly, I am having the
May 25
Jean-Marc Spaggiari
, …
Tom Morris
6
May 23
pytesseract speed improvement?
That's odd that you get better results with the German model for English text. That might be
unread,
pytesseract speed improvement?
That's odd that you get better results with the German model for English text. That might be
May 23
C. D.
May 22
Confusion on 2 letters
Hi, I hope I can get some help/advise here... I'm using Tesseract 5.5.0 + LSTM within the latest
unread,
Confusion on 2 letters
Hi, I hope I can get some help/advise here... I'm using Tesseract 5.5.0 + LSTM within the latest
May 22
Azeem Qureshi
,
Azeem Qureshi
3
May 12
undefined symbols for arm64
Hello thanks for the reply okay so im using the tesseractOCR iOS library on this repo https://github.
unread,
undefined symbols for arm64
Hello thanks for the reply okay so im using the tesseractOCR iOS library on this repo https://github.
May 12
Davide Cavestro
2
May 9
RPM not available
There is some work in progress, more info at https://github.com/tesseract-ocr/tesseract/issues/3673#
unread,
RPM not available
There is some work in progress, more info at https://github.com/tesseract-ocr/tesseract/issues/3673#
May 9
Mitya
, …
ZeroCool Zero
3
May 7
Tesseract Training: Error 'Integer (fast) model' When Using Apex.lstm
You should use eng.traineddata file from the tesseract "best" repository as your
unread,
Tesseract Training: Error 'Integer (fast) model' When Using Apex.lstm
You should use eng.traineddata file from the tesseract "best" repository as your
May 7