Groups
Conversations
All groups and messages
Send feedback to Google
Help
Sign in
Groups
tesseract-ocr
Conversations
About
tesseract-ocr
1–30 of 6911
Mark all as read
Report abusive group
0 selected
Ray Lutz
May 13
"best" training data fails but "fast" data works using v5.3.1
Hello Friends: We just got done working for days to isolate a problem to the "best"
unread,
"best" training data fails but "fast" data works using v5.3.1
Hello Friends: We just got done working for days to isolate a problem to the "best"
May 13
Gokul
May 12
Grantha scipt trained data request
I want trained data file for grantha script please share the link for the trained data . Thanks
unread,
Grantha scipt trained data request
I want trained data file for grantha script please share the link for the trained data . Thanks
May 12
Augustin Fourcaud
,
Zdenko Podobny
3
May 10
OCR problem with condensed text
Thanks for your answer, I'll try it. Le dimanche 7 mai 2023 à 21:49:29 UTC+2, zdenop a écrit :
unread,
OCR problem with condensed text
Thanks for your answer, I'll try it. Le dimanche 7 mai 2023 à 21:49:29 UTC+2, zdenop a écrit :
May 10
Daniel Kennedy
May 9
Is reading this image accurately realistic?
I'd like to be able to read this image to text for further machine parsing. Is this realistic? I
unread,
Is reading this image accurately realistic?
I'd like to be able to read this image to text for further machine parsing. Is this realistic? I
May 9
Erez Arnon
,
Zdenko Podobny
2
May 9
Should tesseract work on handwritten text?
No it should not. As far as we know tesseract is trained on printed text. Zdenko ut 9. 5. 2023 o 6:57
unread,
Should tesseract work on handwritten text?
No it should not. As far as we know tesseract is trained on printed text. Zdenko ut 9. 5. 2023 o 6:57
May 9
Rob Aaldijk
,
Zdenko Podobny
2
May 8
Specify target file name patterns?
Hello, your request is not clear to me (eg tesseract does not ocr pdf). Maybe it would be good if you
unread,
Specify target file name patterns?
Hello, your request is not clear to me (eg tesseract does not ocr pdf). Maybe it would be good if you
May 8
Are
,
Zdenko Podobny
2
May 1
Tesseract completely fails to recognize consolas font from high resolution image
Try to use the tesseract executable if there are any problems when using API/tesseract wrappers Did
unread,
Tesseract completely fails to recognize consolas font from high resolution image
Try to use the tesseract executable if there are any problems when using API/tesseract wrappers Did
May 1
Artur Giżycki
,
Zdenko Podobny
2
May 1
How to use tesseract with low resolution data
try to post an example image. Make sure you tried suggested operation in tesseract documentation.
unread,
How to use tesseract with low resolution data
try to post an example image. Make sure you tried suggested operation in tesseract documentation.
May 1
Vishnu Madhu
Apr 27
How to enable batched inference in tesseract-ocr
I am exploring ways to improve the performance(single-thread / multi-thread) of tesseract-ocr
unread,
How to enable batched inference in tesseract-ocr
I am exploring ways to improve the performance(single-thread / multi-thread) of tesseract-ocr
Apr 27
Madhav Pandey
,
Zdenko Podobny
4
Apr 26
Getting Error: No such file or directory: 'data/foo/all-lstmf'
make training TESSDATA=./usr/local/share/tessdata unicharset_extractor --output_unicharset "data
unread,
Getting Error: No such file or directory: 'data/foo/all-lstmf'
make training TESSDATA=./usr/local/share/tessdata unicharset_extractor --output_unicharset "data
Apr 26
Scaly Green Orc
, …
Tom Morris
4
Apr 25
Parameters to improve detection of sparse text
On Tuesday, April 25, 2023 at 8:30:53 AM UTC-4 Scaly Green Orc wrote: Yes, it's a regular PDF.
unread,
Parameters to improve detection of sparse text
On Tuesday, April 25, 2023 at 8:30:53 AM UTC-4 Scaly Green Orc wrote: Yes, it's a regular PDF.
Apr 25
Ada Gomiz
,
Zdenko Podobny
2
Apr 25
help, tesseract not fun in windows 11!
Seems like you are not very familiar with the operating system you are using. Tesseract (executable)
unread,
help, tesseract not fun in windows 11!
Seems like you are not very familiar with the operating system you are using. Tesseract (executable)
Apr 25
Madhav Pandey
Apr 25
Model for Devnagri handwritten text
Hi Everyone, Is there a model available for handwritten text in Devnagri. I have been trying to build
unread,
Model for Devnagri handwritten text
Hi Everyone, Is there a model available for handwritten text in Devnagri. I have been trying to build
Apr 25
kapil beyondkey
,
La Monte H. P. Yarroll
2
Apr 24
System requirements to run just OSD command
You probably don't want tesseract at all. Take a look at deskew in the Leptonica package. On Mon,
unread,
System requirements to run just OSD command
You probably don't want tesseract at all. Take a look at deskew in the Leptonica package. On Mon,
Apr 24
Ankur Rana
Apr 24
Invalid Start of grapheme sequence
Hi, We am trying to extended Devanagari OCR training data with the four more unicode Devanagari
unread,
Invalid Start of grapheme sequence
Hi, We am trying to extended Devanagari OCR training data with the four more unicode Devanagari
Apr 24
Giuseppe Coniglio
Apr 17
File tesseract.log is empty in Linux Server
Hi to all :-) I developed a Spring boot microservice that uses Tesseract 4.3.1 and PDFBox 2.0.22 (
unread,
File tesseract.log is empty in Linux Server
Hi to all :-) I developed a Spring boot microservice that uses Tesseract 4.3.1 and PDFBox 2.0.22 (
Apr 17
Olivia
Apr 13
Is it possible to tune Tesseract's text line detection mechanism?
Hello, I have some old, busy documents that I'm trying to OCR. Tesseract does an incredible job
unread,
Is it possible to tune Tesseract's text line detection mechanism?
Hello, I have some old, busy documents that I'm trying to OCR. Tesseract does an incredible job
Apr 13
Nazar Kotsur
Apr 12
Creating single-line ground-truth from PDF with corresponding hocr
I have a PDF scan and hocr file with fixed OCR mistakes, and would like to try to train a model. Is
unread,
Creating single-line ground-truth from PDF with corresponding hocr
I have a PDF scan and hocr file with fixed OCR mistakes, and would like to try to train a model. Is
Apr 12
Harisankar
Apr 5
sir,
string extractedText; using (var engine = new TesseractEngine(HostingEnvironment.MapPath(@"~\
unread,
sir,
string extractedText; using (var engine = new TesseractEngine(HostingEnvironment.MapPath(@"~\
Apr 5
Ali Abedian
, …
Shree Devi Kumar
5
Apr 1
Tesseract training for New font/language
Is it best to train a new language? On Saturday, April 1, 2023 at 7:54:30 am UTC-7 shree wrote:
unread,
Tesseract training for New font/language
Is it best to train a new language? On Saturday, April 1, 2023 at 7:54:30 am UTC-7 shree wrote:
Apr 1
Kyle Zeneki
,
Zdenko Podobny
2
Apr 1
Tesseract accuracy.
As the first step, I would suggest you read https://github.com/tesseract-ocr/tessdoc/blob/main/
unread,
Tesseract accuracy.
As the first step, I would suggest you read https://github.com/tesseract-ocr/tessdoc/blob/main/
Apr 1
Daniele
,
nguyen ngoc hai
4
Mar 29
Fails to recognize short codes
Cool! Thank you very much! Daniele Il giorno mercoledì 29 marzo 2023 alle 13:53:15 UTC+2 nguyenng...@
unread,
Fails to recognize short codes
Cool! Thank you very much! Daniele Il giorno mercoledì 29 marzo 2023 alle 13:53:15 UTC+2 nguyenng...@
Mar 29
Isidore Paris
,
Andrea Rossato
3
Mar 26
use of unicharambigs
Ciao, Thanks for sharing! I have the same problem with script / Fraktur.traineddata, which is far
unread,
use of unicharambigs
Ciao, Thanks for sharing! I have the same problem with script / Fraktur.traineddata, which is far
Mar 26
javalover
, …
Rodhad
6
Mar 24
Tesseract doesn't recognize some numbers from an image
Hi, i'm actually trying to do the same thing that you've already tried on the same kind of
unread,
Tesseract doesn't recognize some numbers from an image
Hi, i'm actually trying to do the same thing that you've already tried on the same kind of
Mar 24
Kunal Athreya
,
Zdenko Podobny
2
Mar 24
Training a new language to perform ocr on tesseract ?
Did you follow instructions in https://github.com/tesseract-ocr/tesstrain#language-data ? Zdenko ut
unread,
Training a new language to perform ocr on tesseract ?
Did you follow instructions in https://github.com/tesseract-ocr/tesstrain#language-data ? Zdenko ut
Mar 24
Z. Jay
, …
Zdenko Podobny
3
Mar 24
tesseract returns random and spurious characters
Hello, unless you provide a test case for reproducing problem (+ information about tesseract,
unread,
tesseract returns random and spurious characters
Hello, unless you provide a test case for reproducing problem (+ information about tesseract,
Mar 24
Zdenko Podobny
,
nguyen ngoc hai
6
Mar 17
Re: [tesseract-ocr] How to get the correct text orientation with tesseract
Dear Zdenko and everyone, Thank you for your help last time. Apologize for getting back a bit late, I
unread,
Re: [tesseract-ocr] How to get the correct text orientation with tesseract
Dear Zdenko and everyone, Thank you for your help last time. Apologize for getting back a bit late, I
Mar 17
David R Anderson
,
Isidore Paris
5
Mar 16
Counting True Pixel Height of a Capital Letter
Thank you. Although, after thinking about it. I think that converting JPG to PBM in order to get full
unread,
Counting True Pixel Height of a Capital Letter
Thank you. Although, after thinking about it. I think that converting JPG to PBM in order to get full
Mar 16
Karen G
,
Ger Hobbelt
2
Mar 15
Mixed Language (Greek-Latin Alphabet) OCR: OCR of Scientific Documents
- or up-to-date Windows executables (and installers), see Uni of Mannheim (Stefan Weil). google
unread,
Mixed Language (Greek-Latin Alphabet) OCR: OCR of Scientific Documents
- or up-to-date Windows executables (and installers), see Uni of Mannheim (Stefan Weil). google
Mar 15
Prashant Sharma
,
Zdenko Podobny
2
Mar 11
Facing trouble with Tesseract OCR (from v4 to v5) for python version upgrade (from Python 3.6 to Python 3.10)
First of all: it is a good manner to provide a test case (working code + input &output) Next:
unread,
Facing trouble with Tesseract OCR (from v4 to v5) for python version upgrade (from Python 3.6 to Python 3.10)
First of all: it is a good manner to provide a test case (working code + input &output) Next:
Mar 11