tesseract-ocr
vizier87
, …
Lorenzo Blz
9
7/20/24
thermography image issue
Bye, Lorenzo
On Saturday, 20 July 2024 at 08:13:08 UTC+2, abdulh...@gmail.com wrote: > Hi Tom. > > The position of text in thermography images is usually arbitrary
Simon
, …
Lorenzo Bolzani
14
11/29/23
Training from Scratch
diagrams. Lorenzo
On Wed, 29 Nov 2023 at 09:36, Simon wrote: > Hey Lorenzo, > > thanks a lot for your response. I've seen in the HOCR files of different >
TroTro
11/4/23
Detection of the word working randomly
Hi everyone, I am a beginner in Python and I started a project for myself. This project is for Pokemon games, and I need to get the name of the Pokemon I am
Ali hussain
, …
Des Bw
41
10/22/23
accuracy problem after trained in fine-tune
Thanks. I will try this method as soon as possible. On Sunday, 22 October 2023 at 3:49:46 pm UTC+6, desal...@gmail.com wrote: > here it is: >
astro
, …
Ger Hobbelt
13
8/4/23
Trying to understand why Tesseract-ocr fails on some images
. Bye Lorenzo
On Wed, 26 Jul 2023 at 21:09, nor s wrote: > OK I think I found the sweet spot. Setting the location for the crop > rectangle to +933+1013 from the top left
DASSS
, …
Zdenko Podobny
3
7/1/23
Is there any way to capture any type of formatting?
. Bye Lorenzo
On Fri, 30 Jun 2023 at 07:29, DASSS wrote: > Is there any way to gather info about the actual text formatting (markup, > markdown), etc. > > Just
Константин Михеев
,
Lorenzo Bolzani
2
2/28/23
Russian + English characters recognition
decide. Lorenzo
On Tue, 28 Feb 2023 at 13:35, Константин Михеев < mihkostya@gmail.com> wrote: > Hello. I have a problem with recognition of this image (attached
Willus Dotkom
, …
Lorenzo Bolzani
8
2/22/23
Optimal image resolution (dpi/ppi) for Tesseract 4.0.0 and eng.traineddata?
Thanks, Lorenzo
On Tue, 21 Feb 2023 at 22:22, wil...@gmail.com wrote: > Sorry it took a while. Take a look here > . > > On Sunday, February 27, 2022 at 9:08:
Chris E.
, …
Tom Morris
8
11/13/22
Difficult image, any tips would be appreciated
> Hi Lorenzo, > > thank you so much for your ideas! Unfortunately, I don't think I can get a > better image quality. It's a VGA signal that's being grabbed, and
Atef Chatty
,
Lorenzo Bolzani
2
7/22/22
Improve text extraction
. Bye Lorenzo
On Wed, 20 Jul 2022 at 22:33, Atef Chatty wrote: > Hi, > I want to extract information from unclear images. I tried many filters > but it doesn'
Yash Mistry
,
Lorenzo Bolzani
4
6/24/22
Tesseract confused between a character and a digit which look-alike
Bye, Lorenzo
On Fri, 24 Jun 2022 at 09:22, 'Yash Mistry' via tesseract-ocr < tesseract-ocr@googlegroups.com> wrote: > Hi Lorenzo, > > Thank
juan carlos hernández
, …
Merlijn B.W. Wajer
6
10/20/21
Doubt about using 5.0.0-beta-20210916 before release version is available
47, Lorenzo Bolzani wrote: > > Hi Merlijn, > > out of curiosity, did you note an improvement over the previous version? > > Yes. Speed and stability are better, and accuracy
vis li
, …
Zdenko Podobny
8
9/17/21
The pictures captured by the camera did not identify well after preprocessing
>> Lorenzo >>> >>> On Thu, 16 Sep 2021 at 07:59, vis li >>> wrote: >>> >>>> Tesseract Version:4.1.1 >
Hamzeh abu-ajameia
,
Lorenzo Bolzani
2
3/26/21
Improve OCR Accuracy
. Bye Lorenzo
On Fri, 26 Mar 2021 at 14:46, Hamzeh abu-ajameia < hamzehaboajamia@gmail.com> wrote: > Hi Everyone, > > I am a newbie to OCR, I've been
Alex Zetaeffesse
,
Lorenzo Bolzani
2
3/25/21
Pytesseract processing images already in memory
. Bye Lorenzo
On Thu, 25 Mar 2021 at 05:44, Alex Zetaeffesse wrote: > Hi all, > > I'm already using a Python library (pyvips) for cropping images with text
Soumik Ranjan Dasgupta
, …
Shree Devi Kumar
13
1/7/21
Tesseract Performance
. Bye Lorenzo
On Thu, 24 Dec 2020 at 16:06, Soumik Ranjan Dasgupta < ranjansoumik@gmail.com> wrote: > Hi everyone, > I wanted to fine-tune the ben.traineddata
agentbond009
, …
Martin Läuter
11
11/16/20
Recognising numbers in sudoku
:30 Lorenzo Blz wrote: > Use a Hough line detector to detect the lines and draw a thick white line > over them. > > > https://opencv-python-tutroals.readthedocs.io
Jean-Marc Spaggiari
, …
Zdenko Podobny
4
10/2/20
Guidance for not recognized text
4, Lorenzo Blz wrote: > >> Invert the image. >> >> >> >> On Thu, 1 Oct 2020, 14:58 Jean-Marc Spaggiari >> wrote: >> >>>
MysteriousGuy
, …
Lorenzo Bolzani
6
7/15/20
Tesseract makes different predictions on seemingly equal images. How to make it more robust?
models. Lorenzo
On Wed, 15 Jul 2020 at 20:55, MysteriousGuy wrote: > This seems like an ad-hoc approach. I am already converting images to > grayscale. If I apply
Dtractus
,
Lorenzo Bolzani
3
5/31/20
Pytesseract cant read my image(close letter problem)
3, Lorenzo Blz wrote: > > Hi, first invert the image. > > $ tesseract -l eng test2.png - > FUTLutz > > FUTSalkay > FUTLovazin > FUTRaum > >
Thilina Jayathilaka
, …
Lorenzo Bolzani
9
5/5/20
What is the "Confidence"value returned by Tesseract and how it is calculated?
etc. Lorenzo
On Tue, 5 May 2020 at 06:00, Sathyanarayana Gorla < sathya@sukshi.com> wrote: > 1. I would like to know how the HOCR algorithm works and give confidence
O CR
, …
Shree Devi Kumar
8
4/10/20
As good as Latin.traineddata (fast integer) but faster
this. Lorenzo
On Fri, 10 Apr 2020 at 17:27, Lorenzo Bolzani < l.bolzani@gmail.com> wrote: > Hi, > I started writing this email thinking that removing some
hmaster
, …
Lorenzo Bolzani
5
4/3/20
fine tuning from traineddata_best
. Bye Lorenzo
On Fri, 3 Apr 2020 at 17:48, hmaster wrote: > > 1. So essentially, I need to create a box file and ground-truth file > for each image I have, and run
smarty pokemon
,
Lorenzo Bolzani
6
4/3/20
Unable detect number in box
. Bye Lorenzo
On Fri, 27 Mar 2020 at 12:11, smarty pokemon < smarty.pokemon@gmail.com> wrote: > Hi Lorenzo Biz, >> >> > Thanks for your response
Bill Upham
, …
Lorenzo Bolzani
8
4/2/20
Looking to hire a pytesseract consultant via skype
. Bye Lorenzo
On Thu, 26 Mar 2020 at 06:24, Bill Upham wrote: > Thank you Aaron for the information, it was an improvement, I'm attaching > one of the png files
Teo
, …
Lorenzo Bolzani
18
3/28/20
How to improve ocr reader?
1, Lorenzo Blz wrote: > > If you'd like to improve the OCR accuracy too, a simple contrast > enhancement (with a simple S-shaped curve) and a little sharpening helps
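The snippet above is cut off, so what follows is only a rough sketch of what an S-shaped contrast curve could look like, not the method from the thread. It assumes 8-bit grayscale pixels held in a plain Python list and uses a rescaled logistic function as the curve. The names `s_curve_lut` and `apply_s_curve` and the parameters `gain` and `midpoint` are made up for this illustration. Sharpening is not covered.

```python
import math


def s_curve_lut(gain=8.0, midpoint=0.5):
    """Build a 256-entry lookup table for an S-shaped contrast curve.

    Hypothetical helper: a logistic function rescaled so that 0 maps to 0
    and 255 maps to 255. A larger `gain` gives a steeper curve.
    """
    def sigmoid(x):
        return 1.0 / (1.0 + math.exp(-gain * (x - midpoint)))

    lo, hi = sigmoid(0.0), sigmoid(1.0)
    return [round(255 * (sigmoid(v / 255) - lo) / (hi - lo)) for v in range(256)]


def apply_s_curve(pixels, gain=8.0, midpoint=0.5):
    """Map every 8-bit grayscale value through the S-curve lookup table."""
    lut = s_curve_lut(gain, midpoint)
    return [lut[p] for p in pixels]
```

With the default parameters, dark values are pushed darker and light values lighter, which widens the gap between text and background before the image is handed to the OCR engine.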
Essam Zaky
, …
Shree Devi Kumar
10
3/25/20
How to prepare fonts folder to train from scratch
. Bye Lorenzo
On Wed, 25 Mar 2020 at 09:54, Essam Zaky wrote: > @Lorenozo > I need to do that because the accuracy of the current Arabic is not as > good as English
aby tesh
, …
Shree Devi Kumar
30
3/24/20
Tesseract not recognizing ancient language's code
3, Lorenzo Blz wrote: > > Common fonts do not cover every Unicode symbol (about 100000). > > If one font works and another does not, the text is correct and you just > need
Lorenzo Bolzani
,
Jonathan Dahan
2
2/26/20
Re: [tesseract-ocr] Best filter/preprocess for these type of images?
2, Lorenzo Blz wrote: > > Do a threshold (Otsu), count the white and black pixels, this will tell > you if you have white text on dark background or the opposite. > If necessary
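The polarity check described in this snippet could be sketched roughly as follows. This is an illustration under stated assumptions, not code from the thread: pixels are 8-bit grayscale values in a plain list, text is assumed to cover fewer pixels than the background, and the function names `otsu_threshold`, `is_light_text_on_dark` and `normalize_polarity` are hypothetical. The snippet is truncated after "If necessary", so the inversion step is a guess at the follow-up.

```python
def otsu_threshold(pixels):
    """Return the Otsu threshold for a list of 8-bit grayscale values.

    Pixels <= threshold are treated as dark, pixels > threshold as light.
    """
    hist = [0] * 256
    for p in pixels:
        hist[p] += 1
    total = sum(hist)
    sum_all = sum(i * h for i, h in enumerate(hist))

    best_t, best_var = 0, -1.0
    w_dark = 0    # pixel count at or below the candidate threshold
    sum_dark = 0  # summed intensity of those pixels
    for t in range(256):
        w_dark += hist[t]
        if w_dark == 0:
            continue
        w_light = total - w_dark
        if w_light == 0:
            break
        sum_dark += t * hist[t]
        mean_dark = sum_dark / w_dark
        mean_light = (sum_all - sum_dark) / w_light
        # Between-class variance; Otsu picks the threshold that maximizes it.
        var_between = w_dark * w_light * (mean_dark - mean_light) ** 2
        if var_between > best_var:
            best_var, best_t = var_between, t
    return best_t


def is_light_text_on_dark(pixels):
    """Guess polarity: fewer light than dark pixels suggests light text.

    Assumption: text occupies less area than the background.
    """
    t = otsu_threshold(pixels)
    white = sum(1 for p in pixels if p > t)
    black = len(pixels) - white
    return white < black


def normalize_polarity(pixels):
    """Invert the image when it looks like light text on a dark background."""
    if is_light_text_on_dark(pixels):
        return [255 - p for p in pixels]
    return list(pixels)
```

For example, `normalize_polarity([10] * 90 + [240] * 10)` inverts the values because the light pixels are the minority, while an image that is mostly light is returned unchanged.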
Albretch Mueller
, …
Zdenko Podobny
4
2/4/20
approches used for language detection on images ...
:22 Lorenzo Bolzani wrote: > You can try some machine-learning-based text detection, like this one for > example: > > > https://www.pyimagesearch.com/2018