Groups
Conversations
All groups and messages
Send feedback to Google
Help
Training
Sign in
Groups
tesseract-ocr
Conversations
About
tesseract-ocr
Contact owners and managers
1–30 of 7137
Welcome,
Before sending an email to the group:
Please read
Frequently Asked Questions
Make sure you read the
Tesseract documentation
Search internet sources (including this group) for a solution
If you have a problem:
Provide all steps (including input resources) for its replication.
So not send a screenshot of the terminal - send the logs or copy text from a terminal.
Mark all as read
Report group
0 selected
Liam Doherty
, …
Tom Morris
5
1:38 AM
why are there no new trained models since 2018?
Thanks, that's helpful. Is the collaboration with Google ongoing then? Can you give me a sense of
unread,
why are there no new trained models since 2018?
Thanks, that's helpful. Is the collaboration with Google ongoing then? Can you give me a sense of
1:38 AM
Jan Ploska
Mar 16
Chinise characters.
Hello, I am making a transcrypt of YT wideos using tessaract. Images I input to tessaract look like
unread,
Chinise characters.
Hello, I am making a transcrypt of YT wideos using tessaract. Images I input to tessaract look like
Mar 16
Mark Pellegrino
, …
Art Rhyno
11
Mar 15
Post OCR Verification and Editing
Hi Art, Thanks so much for this. These are very intriguing tools. I'll definitely give Alethia a
unread,
Post OCR Verification and Editing
Hi Art, Thanks so much for this. These are very intriguing tools. I'll definitely give Alethia a
Mar 15
roei shlezinger
Mar 14
fine tuning on images
Hello, I have relatively clear images in Hebrew and Tesseract produces reasonable but not perfect
unread,
fine tuning on images
Hello, I have relatively clear images in Hebrew and Tesseract produces reasonable but not perfect
Mar 14
Quan Nguyen
,
JB Data31
3
Mar 13
VietOCR v6.3.0 & VietOCR.NET v6.3.0 Releases
VietOCR v6.13.0 & VietOCR.NET v6.11.0 Releases A Java/.NET WPF GUI frontend for Tesseract OCR
unread,
VietOCR v6.3.0 & VietOCR.NET v6.3.0 Releases
VietOCR v6.13.0 & VietOCR.NET v6.11.0 Releases A Java/.NET WPF GUI frontend for Tesseract OCR
Mar 13
Ravil R
,
Zdenko Podobny
2
Mar 13
Leptonica directory
It seems like you are not following the official documented way for compiling leptonica and tesseract
unread,
Leptonica directory
It seems like you are not following the official documented way for compiling leptonica and tesseract
Mar 13
Cain Pian
Mar 13
Does training new images increase the size of the traindata file?
I've trained thousands of images. But the traineddata file size didn't change at all. Did I
unread,
Does training new images increase the size of the traindata file?
I've trained thousands of images. But the traineddata file size didn't change at all. Did I
Mar 13
Roman Seidel
, …
Zdenko Podobny
7
Mar 12
user patterns with tesserocr python API
One correction: I checked the example in the below mentioned url with the Tesseract executable and
unread,
user patterns with tesserocr python API
One correction: I checked the example in the below mentioned url with the Tesseract executable and
Mar 12
Jan F
Mar 12
Some PDF readers see double spaces in tesseract PDF output
Dear readers, I'm experimenting with Tesseract 5.3.3.20231005 on Windows and I keep running into
unread,
Some PDF readers see double spaces in tesseract PDF output
Dear readers, I'm experimenting with Tesseract 5.3.3.20231005 on Windows and I keep running into
Mar 12
Panumeth Khongsawatkiat
Mar 12
Training Tesseract 5 for a New Font in Thai not wroking
I tried to train Tesseract 5 with a new font in Thai but The BCER value keeps increasing. This is the
unread,
Training Tesseract 5 for a New Font in Thai not wroking
I tried to train Tesseract 5 with a new font in Thai but The BCER value keeps increasing. This is the
Mar 12
Mridul Davesar
Mar 12
LSTM training tesseract OCR high error rate
Hey everyone , I am train my own lstm model based using some specific images that I want tesseract to
unread,
LSTM training tesseract OCR high error rate
Hey everyone , I am train my own lstm model based using some specific images that I want tesseract to
Mar 12
Ali öksüzoglu
Mar 11
I can't create OCR traindata
Hello, I am trying to solve the Captcha in this image, but I am getting an error. Is there anyone who
unread,
I can't create OCR traindata
Hello, I am trying to solve the Captcha in this image, but I am getting an error. Is there anyone who
Mar 11
thangaraj r
Mar 8
i got Failed to continue from: data/eng/eng_num_vert.lstm
Warning: LSTMTrainer deserialized an LSTMRecognizer! Error, data/eng/eng_num_vert.lstm is an integer
unread,
i got Failed to continue from: data/eng/eng_num_vert.lstm
Warning: LSTMTrainer deserialized an LSTMRecognizer! Error, data/eng/eng_num_vert.lstm is an integer
Mar 8
Minh Nguyen
Mar 7
How to get path tesseract_cmd
I'm using sam cli to build and deploy images to AWS ECR. The code snippet has been packaged into
unread,
How to get path tesseract_cmd
I'm using sam cli to build and deploy images to AWS ECR. The code snippet has been packaged into
Mar 7
thangaraj r
Mar 5
train tesseract
how to train tesseract ,prepare dataset
unread,
train tesseract
how to train tesseract ,prepare dataset
Mar 5
Sambhav Soni
Mar 4
pdf to ocr enabled world
i have a pdf of a document that i scanned via microsoft lens app , i want to convert that to editable
unread,
pdf to ocr enabled world
i have a pdf of a document that i scanned via microsoft lens app , i want to convert that to editable
Mar 4
Rhythm
Mar 4
Facing issue with Accuracy
Hi all, What can be the issue if whitelist characters are fixed(0-9) and In my example 200 is read as
unread,
Facing issue with Accuracy
Hi all, What can be the issue if whitelist characters are fixed(0-9) and In my example 200 is read as
Mar 4
Omar Samir
Mar 3
Transfer learning
How can I use transfer learning to fine-tune a tessdata_best model??
unread,
Transfer learning
How can I use transfer learning to fine-tune a tessdata_best model??
Mar 3
Omar Samir
Mar 3
Dataset used to train tessdata_best models
What is the dataset used to train ara.traineddata model in tessdata_beset
unread,
Dataset used to train tessdata_best models
What is the dataset used to train ara.traineddata model in tessdata_beset
Mar 3
Omar Samir
Mar 3
Result are worse after fine-tune
I was fine-tuning the ara.traineddata in tessdata_best and I have benchmarked the output model and
unread,
Result are worse after fine-tune
I was fine-tuning the ara.traineddata in tessdata_best and I have benchmarked the output model and
Mar 3
Iman Firouzian
, …
Tom Morris
7
Feb 29
Digits only recognized when mixed with letters
ps You can find the Farsi training data here: https://github.com/tesseract-ocr/langdata_lstm/blob/
unread,
Digits only recognized when mixed with letters
ps You can find the Farsi training data here: https://github.com/tesseract-ocr/langdata_lstm/blob/
Feb 29
Will Fetherolf
,
René JM Clais
6
Feb 27
Help recognizing text from image
The code ! I hope it will help. Le lun. 26 févr. 2024 à 17:56, Will Fetherolf <will.fetherolf@
unread,
Help recognizing text from image
The code ! I hope it will help. Le lun. 26 févr. 2024 à 17:56, Will Fetherolf <will.fetherolf@
Feb 27
Daniel
, …
Tom Morris
14
Feb 22
tesseract training flags to rtl languages
I only skimmed Ger's long reply, but didn't see a link to the issue, which I think is the
unread,
tesseract training flags to rtl languages
I only skimmed Ger's long reply, but didn't see a link to the issue, which I think is the
Feb 22
Glenn C
2
Feb 22
generic meme extraction?
Hi experts, I've read that tesseract is not good at image OCR, for images like internet photos,
unread,
generic meme extraction?
Hi experts, I've read that tesseract is not good at image OCR, for images like internet photos,
Feb 22
Mert T
,
Ger Hobbelt
6
Feb 21
Prescription scan recognition
Thank you for your detailed answer. g...@hobbelt.com schrieb am Donnerstag, 15. Februar 2024 um 18:51
unread,
Prescription scan recognition
Thank you for your detailed answer. g...@hobbelt.com schrieb am Donnerstag, 15. Februar 2024 um 18:51
Feb 21
Chaitanya Vermani
Feb 21
Trying to train a new font (LCD screen style), unable to get error rate under 40%
I have been trying to retrain tesseract to read characters on a LCD screen, like 0 with a slash,
unread,
Trying to train a new font (LCD screen style), unable to get error rate under 40%
I have been trying to retrain tesseract to read characters on a LCD screen, like 0 with a slash,
Feb 21
Raphael Stonehorse
, …
Zdenko Podobny
5
Feb 20
How to correctly define CMakeLists.txt for Tesseract?
Any reason why to use an external 3rd party app that is not available on all platforms instead of
unread,
How to correctly define CMakeLists.txt for Tesseract?
Any reason why to use an external 3rd party app that is not available on all platforms instead of
Feb 20
Philippe Argouarch
,
Tom Morris
4
Feb 19
Traineddata files
On Monday, February 19, 2024 at 1:30:37 AM UTC-5 argo...@gmail.com wrote: ... My question now is why
unread,
Traineddata files
On Monday, February 19, 2024 at 1:30:37 AM UTC-5 argo...@gmail.com wrote: ... My question now is why
Feb 19
Quan Nguyen
, …
Shravani Adivarekar
34
Feb 19
jTessBoxEditor - Tesseract box editor & trainer
Can you please guide me on how to use it and create box files also on the installation...I am new to
unread,
jTessBoxEditor - Tesseract box editor & trainer
Can you please guide me on how to use it and create box files also on the installation...I am new to
Feb 19
LTS LTS
Feb 19
Need help OCR digits from a bingo lotto board
I'm really sorry if this seems really dumb, I'm really new to OCR, other people make it seem
unread,
Need help OCR digits from a bingo lotto board
I'm really sorry if this seems really dumb, I'm really new to OCR, other people make it seem
Feb 19