Groups
Conversations
All groups and messages
Send feedback to Google
Help
Sign in
Groups
tesseract-ocr
Conversations
About
tesseract-ocr
1–30 of 6890
Mark all as read
Report abusive group
0 selected
Kyle Zeneki
Mar 25
Tesseract accuracy.
Hello, I have these images and I'm trying to print their output using Tesseract. I spent 2 hours
unread,
Tesseract accuracy.
Hello, I have these images and I'm trying to print their output using Tesseract. I spent 2 hours
Mar 25
javalover
, …
Rodhad
6
Mar 24
Tesseract doesn't recognize some numbers from an image
Hi, i'm actually trying to do the same thing that you've already tried on the same kind of
unread,
Tesseract doesn't recognize some numbers from an image
Hi, i'm actually trying to do the same thing that you've already tried on the same kind of
Mar 24
Kunal Athreya
,
Zdenko Podobny
2
Mar 24
Training a new language to perform ocr on tesseract ?
Did you follow instructions in https://github.com/tesseract-ocr/tesstrain#language-data ? Zdenko ut
unread,
Training a new language to perform ocr on tesseract ?
Did you follow instructions in https://github.com/tesseract-ocr/tesstrain#language-data ? Zdenko ut
Mar 24
Z. Jay
, …
Zdenko Podobny
3
Mar 24
tesseract returns random and spurious characters
Hello, unless you provide a test case for reproducing problem (+ information about tesseract,
unread,
tesseract returns random and spurious characters
Hello, unless you provide a test case for reproducing problem (+ information about tesseract,
Mar 24
Isidore Paris
,
Andrea Rossato
2
Mar 20
use of unicharambigs
Hi, no, unicharambigs is not used by LSTM files. It was used in the legacy mode. I'm having
unread,
use of unicharambigs
Hi, no, unicharambigs is not used by LSTM files. It was used in the legacy mode. I'm having
Mar 20
nguyen ngoc hai
,
Zdenko Podobny
7
Mar 17
How to get the correct text orientation with tesseract
Dear Zdenko and everyone, Thank you for your help last time. Apologize for getting back a bit late, I
unread,
How to get the correct text orientation with tesseract
Dear Zdenko and everyone, Thank you for your help last time. Apologize for getting back a bit late, I
Mar 17
David R Anderson
,
Isidore Paris
5
Mar 16
Counting True Pixel Height of a Capital Letter
Thank you. Although, after thinking about it. I think that converting JPG to PBM in order to get full
unread,
Counting True Pixel Height of a Capital Letter
Thank you. Although, after thinking about it. I think that converting JPG to PBM in order to get full
Mar 16
Karen G
,
Ger Hobbelt
2
Mar 15
Mixed Language (Greek-Latin Alphabet) OCR: OCR of Scientific Documents
- or up-to-date Windows executables (and installers), see Uni of Mannheim (Stefan Weil). google
unread,
Mixed Language (Greek-Latin Alphabet) OCR: OCR of Scientific Documents
- or up-to-date Windows executables (and installers), see Uni of Mannheim (Stefan Weil). google
Mar 15
Prashant Sharma
,
Zdenko Podobny
2
Mar 11
Facing trouble with Tesseract OCR (from v4 to v5) for python version upgrade (from Python 3.6 to Python 3.10)
First of all: it is a good manner to provide a test case (working code + input &output) Next:
unread,
Facing trouble with Tesseract OCR (from v4 to v5) for python version upgrade (from Python 3.6 to Python 3.10)
First of all: it is a good manner to provide a test case (working code + input &output) Next:
Mar 11
Sandra M.
, …
Vuh doo
3
Mar 1
Problem with deactivating dictionary in tesseract using Python
Even though it's 3 years later now, the disabling is possible with the -c flag. pytesseract.
unread,
Problem with deactivating dictionary in tesseract using Python
Even though it's 3 years later now, the disabling is possible with the -c flag. pytesseract.
Mar 1
Sofie Geens
Mar 1
OCR on number 0 usualy working but not always
I want to read numbers with pytesseract and it does it with 100% accuracy until a certain point, from
unread,
OCR on number 0 usualy working but not always
I want to read numbers with pytesseract and it does it with 100% accuracy until a certain point, from
Mar 1
Константин Михеев
,
Lorenzo Bolzani
2
Feb 28
Russian + English characters recognition
Hi, try rus+eng as a language or eng+rus and see what works best. You can also use more than two
unread,
Russian + English characters recognition
Hi, try rus+eng as a language or eng+rus and see what works best. You can also use more than two
Feb 28
Guo JianHeng
2
Feb 28
ld: library not found for -lpango-1.0 for Mac M1
This is config.log And tesseract version 1s 4.1.1 On Tuesday, February 28, 2023 at 5:41:08 PM UTC+8
unread,
ld: library not found for -lpango-1.0 for Mac M1
This is config.log And tesseract version 1s 4.1.1 On Tuesday, February 28, 2023 at 5:41:08 PM UTC+8
Feb 28
Ger Hobbelt
Feb 25
RFC: get a report from tesseract which '-c' variables it did use this time.
> Also posted as issue #4021 and here, a la #3998. RFC: ParamUtils::ReportParamsUsageStatistics()
unread,
RFC: get a report from tesseract which '-c' variables it did use this time.
> Also posted as issue #4021 and here, a la #3998. RFC: ParamUtils::ReportParamsUsageStatistics()
Feb 25
Ek Villain
,
Zdenko Podobny
2
Feb 25
unable to decode below mentioned php generated images.
tesseract does not support the breaking of captcha. Zdenko so 25. 2. 2023 o 9:40 Ek Villain <
unread,
unable to decode below mentioned php generated images.
tesseract does not support the breaking of captcha. Zdenko so 25. 2. 2023 o 9:40 Ek Villain <
Feb 25
Alex Porter
,
Ger Hobbelt
5
Feb 24
Improve tesseract accuracy.
:+1: Glad it works out so well for you! Met vriendelijke groeten / Best regards, Ger Hobbelt --------
unread,
Improve tesseract accuracy.
:+1: Glad it works out so well for you! Met vriendelijke groeten / Best regards, Ger Hobbelt --------
Feb 24
Vishak Raj
Feb 24
Training Details
Hello, I am training the tesseract with own dataset, in the training process, I got Error values,
unread,
Training Details
Hello, I am training the tesseract with own dataset, in the training process, I got Error values,
Feb 24
Willus Dotkom
, …
Lorenzo Bolzani
8
Feb 22
Optimal image resolution (dpi/ppi) for Tesseract 4.0.0 and eng.traineddata?
Looks like the "fast" models are better or on par with the "best" ones and more
unread,
Optimal image resolution (dpi/ppi) for Tesseract 4.0.0 and eng.traineddata?
Looks like the "fast" models are better or on par with the "best" ones and more
Feb 22
kacper
,
Zdenko Podobny
2
Feb 20
Get more accurate results from specific image pattern
So you tried all the easy parts and leave difficult parts to the forum :-) First of all - yes - this
unread,
Get more accurate results from specific image pattern
So you tried all the easy parts and leave difficult parts to the forum :-) First of all - yes - this
Feb 20
M
Feb 17
Train for Microfiche Data
I'm trying to process microfiche data that has been digitally scanned. I tried the default
unread,
Train for Microfiche Data
I'm trying to process microfiche data that has been digitally scanned. I tried the default
Feb 17
Flávio.
,
Merlijn B.W. Wajer
8
Feb 14
sending image data directly to Tesseract
Hi, On 14/02/2023 21:59, Flávio. wrote: > I'll look into that Linux option :) as for the save
unread,
sending image data directly to Tesseract
Hi, On 14/02/2023 21:59, Flávio. wrote: > I'll look into that Linux option :) as for the save
Feb 14
Robert Zhou
Feb 14
Train a handwriting English model problem
I use Tesstrain instruction to train a handwriting model from starting tessdata English model. The
unread,
Train a handwriting English model problem
I use Tesstrain instruction to train a handwriting model from starting tessdata English model. The
Feb 14
Trinitite Team
Feb 13
System.Runtime.InteropServices.SEHException
I have a program in C# that reads the text of a video displayed in a pictureBox. Currently using
unread,
System.Runtime.InteropServices.SEHException
I have a program in C# that reads the text of a video displayed in a pictureBox. Currently using
Feb 13
WeekSky
2
Feb 13
External component has thrown an exception
My bad, I meant to post on the account I reposted from. On Monday, 13 February 2023 at 12:59:01 UTC+1
unread,
External component has thrown an exception
My bad, I meant to post on the account I reposted from. On Monday, 13 February 2023 at 12:59:01 UTC+1
Feb 13
Petar Mihić
,
Zdenko Podobny
2
Feb 9
Passing bounding box coordinates of an detected object from image to extract text
Hello, Unfortunately you did not provide any examples (input image, code), so we can just guess how
unread,
Passing bounding box coordinates of an detected object from image to extract text
Hello, Unfortunately you did not provide any examples (input image, code), so we can just guess how
Feb 9
Massimo
,
Zdenko Podobny
2
Feb 8
TesserAct implementation help
Please read tesseract documentation https://github.com/tesseract-ocr/tessdoc - there is a simple and
unread,
TesserAct implementation help
Please read tesseract documentation https://github.com/tesseract-ocr/tessdoc - there is a simple and
Feb 8
Lars Aronsson
,
Zdenko Podobny
2
Feb 7
Hyphenation postprocessing
there is a (similar) feature request: https://github.com/tesseract-ocr/tesseract/issues/728 Zdenko po
unread,
Hyphenation postprocessing
there is a (similar) feature request: https://github.com/tesseract-ocr/tesseract/issues/728 Zdenko po
Feb 7
Victor Roza
Feb 7
Pytesseract. Preset a reading order LLLNLNN
Guys, does anyone know how to modify pytesseract internally to read a predefined sequence of letters
unread,
Pytesseract. Preset a reading order LLLNLNN
Guys, does anyone know how to modify pytesseract internally to read a predefined sequence of letters
Feb 7
Zisha
, …
Zdenko Podobny
3
Feb 4
How to extract non-text regions
The task you mention is called "The document layout segmentation" or "Document layout
unread,
How to extract non-text regions
The task you mention is called "The document layout segmentation" or "Document layout
Feb 4
Mars
,
Zdenko Podobny
2
Feb 4
How to enable tessedit_write_images on pytesseract ?
py-tesseract is wrapped of tesseract executable, so I suggest to use dirrecty tesseeract if something
unread,
How to enable tessedit_write_images on pytesseract ?
py-tesseract is wrapped of tesseract executable, so I suggest to use dirrecty tesseeract if something
Feb 4