Groups
Conversations
All groups and messages
Send feedback to Google
Help
Training
Sign in
Groups
tesseract-ocr
Conversations
About
Groups keyboard shortcuts have been updated
Dismiss
See shortcuts
tesseract-ocr
Contact owners and managers
1–30 of 7310
Welcome,
Before sending an email to the group:
Please read
Frequently Asked Questions
Make sure you read the
Tesseract documentation
Search internet sources (including this group) for a solution
If you have a problem:
Provide all steps (including input resources) for its replication.
So not send a screenshot of the terminal - send the logs or copy text from a terminal.
Mark all as read
Report group
0 selected
Jiansen Chan
,
Zdenko Podobny
2
Apr 28
Facing issues with unicharset when trying to automate model training
❌ Training failed: Command '['make', 'unicharset', 'lists', 'proto-
unread,
Facing issues with unicharset when trying to automate model training
❌ Training failed: Command '['make', 'unicharset', 'lists', 'proto-
Apr 28
Jiansen Chan
2
Apr 27
Trying to automate model training in tesseract OCR
It's okay, solved the issue. I didn't put enough training data into the folder hence no
unread,
Trying to automate model training in tesseract OCR
It's okay, solved the issue. I didn't put enough training data into the folder hence no
Apr 27
Jay S
,
TheComplete BookOfMormon
2
Apr 23
Using Tesseract 5.5.0 to recognize source code, but need a way to maintain original indentation.
I expect you will need to use the bounding boxes tesseract.exe "input.jpg" stdout -l eng --
unread,
Using Tesseract 5.5.0 to recognize source code, but need a way to maintain original indentation.
I expect you will need to use the bounding boxes tesseract.exe "input.jpg" stdout -l eng --
Apr 23
Massimiliano Carloni
, …
Graham Toal
3
Apr 21
Using tesseract_best (or other models?) for 18th-century English printed text
On Mon, Apr 21, 2025 at 2:02 PM RuePat07 <patil.ruc...@gmail.com> wrote: Try
unread,
Using tesseract_best (or other models?) for 18th-century English printed text
On Mon, Apr 21, 2025 at 2:02 PM RuePat07 <patil.ruc...@gmail.com> wrote: Try
Apr 21
Foong Gah Seng
,
Zdenko Podobny
2
Apr 21
Tesseract failed to read .lstm file and checkpoint file
Dear friends, do not try to train tesseract if you have a clue how tesseract works. You will fail,
unread,
Tesseract failed to read .lstm file and checkpoint file
Dear friends, do not try to train tesseract if you have a clue how tesseract works. You will fail,
Apr 21
Ishak DÖLEK
,
TheComplete BookOfMormon
2
Apr 20
Tesseract training with Custom Dataset
Yes you can. This video is very good. https://www.youtube.com/watch?v=SvhoBT-PnME&lc=
unread,
Tesseract training with Custom Dataset
Yes you can. This video is very good. https://www.youtube.com/watch?v=SvhoBT-PnME&lc=
Apr 20
Vijayakumar. T
Apr 17
Getting error while install tesseract-ocr v5.5.0 in windows machine
hi Team, I am getting below error while install tesseract-ocr v5.5.0 in windows machine. please help
unread,
Getting error while install tesseract-ocr v5.5.0 in windows machine
hi Team, I am getting below error while install tesseract-ocr v5.5.0 in windows machine. please help
Apr 17
TheComplete BookOfMormon
Apr 17
How can I train the 22MB eng model?
I am using the following 22MB eng.traineddata in my app and it is working very well https://github.
unread,
How can I train the 22MB eng model?
I am using the following 22MB eng.traineddata in my app and it is working very well https://github.
Apr 17
Jiansen Chan
,
RuePat07
3
Apr 16
'No rule to make target' issue.
convert .png to .tif On Wednesday, April 16, 2025 at 5:32:32 PM UTC-4 RuePat07 wrote: you need to
unread,
'No rule to make target' issue.
convert .png to .tif On Wednesday, April 16, 2025 at 5:32:32 PM UTC-4 RuePat07 wrote: you need to
Apr 16
Daniel Migowski
3
Apr 16
5.5.0 on SW
OK, nobody cares anyway, when I use vcpkg to build it, I get v5.5. Thanks for nothing. So where do I
unread,
5.5.0 on SW
OK, nobody cares anyway, when I use vcpkg to build it, I get v5.5. Thanks for nothing. So where do I
Apr 16
Elias Tsolis (Estatistics)
Apr 16
ocr to gt the username on the last extracted frame from tiktok video fails.
original applying filters with im6 in linux convert -resize 1480% -monochrome -sharpen 0x3 original.
unread,
ocr to gt the username on the last extracted frame from tiktok video fails.
original applying filters with im6 in linux convert -resize 1480% -monochrome -sharpen 0x3 original.
Apr 16
RuePat07
,
Jamie sturgeon
2
Apr 11
Adding new characters and Finetuning for a new font, how to preserve the accuracy? Merge models?
Try inferencing with a different combo and check tesseract settings. On Fri, 11 Apr 2025 at 12:47 am,
unread,
Adding new characters and Finetuning for a new font, how to preserve the accuracy? Merge models?
Try inferencing with a different combo and check tesseract settings. On Fri, 11 Apr 2025 at 12:47 am,
Apr 11
Francesc Juliana
Apr 10
Error with 3,5GB tiff file
Hello: I use tesseract with many files and it works perfect. When I use a tiff that is 3,5GB size ,
unread,
Error with 3,5GB tiff file
Hello: I use tesseract with many files and it works perfect. When I use a tiff that is 3,5GB size ,
Apr 10
Ajinkya Bobade
,
Zdenko Podobny
3
Apr 9
Handling text scans and cleaning
Thank you, just saw from your link that it is posted !! I'm so glad to hear this news Ajinkya On
unread,
Handling text scans and cleaning
Thank you, just saw from your link that it is posted !! I'm so glad to hear this news Ajinkya On
Apr 9
Kyle Foley
,
Tom Morris
2
Apr 7
tesseract whitelist not working
That looks like it's probably a character encoding issue with how pytesseract constructs/uses its
unread,
tesseract whitelist not working
That looks like it's probably a character encoding issue with how pytesseract constructs/uses its
Apr 7
Michael Lee
,
zdenop
2
Apr 7
Minimum required files for Simplest Invocation to OCR an image
Tesseract installation includes 135 files. Can you be more specific? Eg how did you install tesseract
unread,
Minimum required files for Simplest Invocation to OCR an image
Tesseract installation includes 135 files. Can you be more specific? Eg how did you install tesseract
Apr 7
Rajeswari Gopal
Apr 6
Tesseract to train for our custom image is not working as expected.
Hi Team, We are trying to create custom trained data for our own Tiff image. These are the steps
unread,
Tesseract to train for our custom image is not working as expected.
Hi Team, We are trying to create custom trained data for our own Tiff image. These are the steps
Apr 6
Mitya
,
Ger Hobbelt
3
Apr 5
Traineddata distorted and provides bad read, last trained sample is as usual
Haven't checked your info further, but note your remark: IMPORTANT: I use images in same color
unread,
Traineddata distorted and provides bad read, last trained sample is as usual
Haven't checked your info further, but note your remark: IMPORTANT: I use images in same color
Apr 5
Mitya
, …
Fish Money
6
Apr 5
Would training Tesseract with different binarization filters affect eng.traineddata?
Hi Lorenzo, thanks for reaching me out! I decided to train one source image (without any filters),
unread,
Would training Tesseract with different binarization filters affect eng.traineddata?
Hi Lorenzo, thanks for reaching me out! I decided to train one source image (without any filters),
Apr 5
Ajg
,
Zdenko Podobny
3
Apr 4
Any way to stop ocr after set time period?
Thanks for the tip. I'll look into this On Thursday, April 3, 2025 at 12:12:52 PM UTC-4 Ajg wrote
unread,
Any way to stop ocr after set time period?
Thanks for the tip. I'll look into this On Thursday, April 3, 2025 at 12:12:52 PM UTC-4 Ajg wrote
Apr 4
Vrinda Gopinath
Apr 2
Normalization error
While training a Tesseract 5 OCR model for Malayalam handwritten text, we encountered Unicode
unread,
Normalization error
While training a Tesseract 5 OCR model for Malayalam handwritten text, we encountered Unicode
Apr 2
Ilso
Mar 29
"--loglevel" doesn't output anything
Hi, I wanted to learn more about how Tesseract works and tried specifying the --loglevel option, but
unread,
"--loglevel" doesn't output anything
Hi, I wanted to learn more about how Tesseract works and tried specifying the --loglevel option, but
Mar 29
Pei
Mar 27
Tesseract lstmtraining in windows:Load of page 0 failed! Load of images failed!
Hi I am trying to use Tesseract and add my own data to fine-tune it, but I am stuck here and cannot
unread,
Tesseract lstmtraining in windows:Load of page 0 failed! Load of images failed!
Hi I am trying to use Tesseract and add my own data to fine-tune it, but I am stuck here and cannot
Mar 27
MadhuSudan Kaka
Mar 22
SIGSEGV Error when using libtessract4.0.0.so in Pega Application
Hi, I am trying to use tess4j in my Pega Application which is running on Linux Ubuntu and getting a
unread,
SIGSEGV Error when using libtessract4.0.0.so in Pega Application
Hi, I am trying to use tess4j in my Pega Application which is running on Linux Ubuntu and getting a
Mar 22
Quan Nguyen
, …
MadhuSudan Kaka
6
Mar 22
Tess4J - a Java wrapper for Tesseract OCR DLL
Hi Quan, I am trying to use tess4j in my Pega Application which is running on Linux Ubuntu and
unread,
Tess4J - a Java wrapper for Tesseract OCR DLL
Hi Quan, I am trying to use tess4j in my Pega Application which is running on Linux Ubuntu and
Mar 22
Mitya
,
Zdenko Podobny
2
Mar 22
Tesseract Training: Error 'Integer (fast) model' When Using Apex.lstm
Hello, I notice there may be some gaps in your understanding of Tesseract and its training
unread,
Tesseract Training: Error 'Integer (fast) model' When Using Apex.lstm
Hello, I notice there may be some gaps in your understanding of Tesseract and its training
Mar 22
Adam Seligson
,
MadhuSudan Kaka
2
Mar 21
Installing/bundling libtesseract.so file(s) in tesseract jar file for Java application
Hi, I have a similar requirement. Did anyone generate .so files for tess4j? On Wednesday, 21 August
unread,
Installing/bundling libtesseract.so file(s) in tesseract jar file for Java application
Hi, I have a similar requirement. Did anyone generate .so files for tess4j? On Wednesday, 21 August
Mar 21
ruby goel
Mar 20
OCR for making document not only searchable but accessible too
To make documents accessible ie read by screen readers, I am facing some issue with tabular data, as
unread,
OCR for making document not only searchable but accessible too
To make documents accessible ie read by screen readers, I am facing some issue with tabular data, as
Mar 20
Raphy Ubuntu
Mar 19
tesseract building: it finds icu 74 but says undefined reference to icu 72 files
Following the indications found here: https://tesseract-ocr.github.io/tessdoc/Compiling I'm
unread,
tesseract building: it finds icu 74 but says undefined reference to icu 72 files
Following the indications found here: https://tesseract-ocr.github.io/tessdoc/Compiling I'm
Mar 19
Mitya
Mar 19
could not update eng_custom.traineddata properly
Description: I have some text, which is single word on tiff file , designed to train eng_custom.
unread,
could not update eng_custom.traineddata properly
Description: I have some text, which is single word on tiff file , designed to train eng_custom.
Mar 19