tesseract-ocr
1–30 of 7348
Welcome,
Before sending an email to the group:
Please read the Frequently Asked Questions.
Make sure you read the Tesseract documentation.
Search internet sources (including this group) for a solution.
If you have a problem:
Provide all steps (including input resources) needed to replicate it.
Do not send a screenshot of the terminal; send the logs or copy the text from the terminal.
Stéphane Brunner, Zdenko Podobny
4
2:01 PM
Image just some color converted into black and white
Hello, You are using old version of tesseract - please update to current version 5.5.1 Zdenko so 22.
Jozef M., Ger Hobbelt
2
Nov 11
Tesseract LSTM competitive word recognition (at least for certain use cases)
Thank you for publishing this. Question: the -1 confidence numbers for T3 and T4 in the charts: could
Mattia Mirri, Ger Hobbelt
2
Nov 10
Help with auto island detection
Relevant (if only sideways at first glance): - https://tesseract-ocr.github.io/tessdoc/ImproveQuality
Jeremy C. Reed, Ger Hobbelt
2
Nov 9
training using a page at a time?
In answer to your question: AFAIK there is no 'simple' solution/answer. Reading, OCRing (old)
Harshit Goel, Ger Hobbelt
5
Nov 5
tesseract via gosseract returns empty text for one image, but CLI detects correctly ("NO SMOKING")
"tav output modes": typo! I meant to say "TSV output mode". Sorry. Met
Sandeep G
Nov 3
Issue with Colon Recognition After Fine-Tuning Tesseract 5.5.3 on Russian Dataset
I'm currently working on fine-tuning the Tesseract OCR model (version 5.5.3) and encountered an
Michael Schuh, … Ger Hobbelt
11
Nov 1
Trouble extracting date and time from image
You're welcome! Good luck and take care! .... (For posterity / google search, here's a
Coure 2011
Oct 29
Deserialize header failed: 1.lstmf
I need to train the default eng data, so that it can also recognize new characters. I created box
Jean-Marc Spaggiari, Zdenko Podobny
2
Sep 28
Same command for 2 files
Hi, But for the Aurochs file I'm getting "Empty page!!". I have not been able to get a
pascal 06, Tom Morris
4
Sep 18
Carriage return after each word
Good evening Tom, I assume you are a French speaker :) Thank you for your reply! I will continue in English
Alessandro Griseta, … Milan Hauth
4
Aug 30
[questions] what happened to `tessdata_best` in Tesseract 5?
works for me with tesseract 5.5.1 git clone --depth=1 https://github.com/tesseract-ocr/tessdata_best
Andrus Moor
Aug 24
How recognize text with background
Tried https://github.com/Sicos1977/TesseractOCR and Leptonica to convert jpg receipt slip to text:
Pavel Hanák
Aug 21
Tesseract returns exotic characters while processing standard latin-script document
Short version: Ghostscript uses Tesseract, but their data exchange interface may contain a bug.
Yuwen Hsieh
Aug 15
Can't install 5.5.1
Hello I tried to install Tesseract with docker base image python:3.11-trixie, but it's installing
Cary Lewis
Aug 14
OCR of iPhone Screen shots
I have been trying with some success to have tesseract recognize text from iPhone's about screen.
Thomas McGrew, Zdenko Podobny
7
Aug 10
Incorrect text detection
You are correct, I did miss that section. Inverting the image seems to produce better results. I
Jan-Erik Lärka, Nikola Smolenski
4
Aug 4
TESSDATA_PREFIX doesn't work with national character(s)
The problem is that there are two places attempting to use TESSDATA_PREFIX and they have conflicting
Terasgr
Jul 25
Modern Greek depends on Ancient Greek language?
Hello people. When I tried to OCR a Greek text using tesseract I found that the modern Greek data (
Graham Toal
Jul 20
Re: [tesseract-ocr] How can I find out the version of current Tesseract from cmdline?
'--' not '-' gtoal@linux:~/github/uparse-main$ tesseract Usage: tesseract --help | --
Tom Vercauteren, … Fly Night Society
7
Jul 17
Best settings to OCR an image of some cyphered text (base64)
I already have, and yes to all. On Wednesday, July 16, 2025 at 5:33:53 PM UTC-4 tfmo...@gmail.com
Maxim Kizub
Jul 17
Training lstm with symbol boxes
Hello. I need to OCR text with mix of latin and cyrillic letters plus emoji-like icons. Text font is
nahye koo
Jul 15
Can I train a model for multiple languages at the same time?
Hello! I want to use custom traineddata, but the performance is bad, so I want to ask for advice. I
jollysalmon, Zdenko Podobny
3
Jul 6
OCR multiple pngs into one PDF
Sorry about that, looks like it wasn't so clear at all - anyway, I ended up completing a script,
Jay S
Jul 1
What build pipeline produces the most performant static binary for windows?
Hi all, Right now I am building a static x64 binary of tesseract 5.5.1 using vcpkg + Visual Studio
JoeBlack, Lorenzo Bolzani
3
Jun 30
(Py)Tesseract does not detect all expected hex codes in my image despite multiple preprocessing strategies
https://pastebin.com/4J40QcJE Template matching works better than Tesseract, and the code is much
Michael, tang...@gmail.com
2
Jun 29
How to find out the version of current Tesseract installation (on Win)?
tesseract --version On Sun, 29 Jun 2568 BE at 13:19 Michael <mha...@gmail.com> wrote: Assume I
Ajinkya Bobade, … jannes hoekman
5
Jun 19
Handling text scans and cleaning
You can use BIQE archive On Thu, 19 Jun 2025 at 17:36, Kliai Louay <kliai...@gmail.com> wrote
محمود محمد, … Kalyan Karthik
4
Jun 19
organizing a Zoom-based training workshop to discuss and practice advanced applications of Tesseract OCR
Happy to join in this workshop On Sunday, June 15, 2025 at 10:58:08 PM UTC+5:30 mahmoud...@gmail.com
הברנש, … Tech Tactics
3
Jun 15
help in ocr pdf
Hey Tom, I am working with Bengali OCR text extraction using tesseract. How can I train, fine tune
gwapes
Jun 14
Inconsistent Tesseract performance
Hello, I am reaching out to get some advice on inconsistent performance running tesseract on