tesseract-ocr
Welcome,
Before sending an email to the group:
Please read the Frequently Asked Questions.
Make sure you read the Tesseract documentation.
Search internet sources (including this group) for a solution.
If you have a problem:
Provide all steps (including input resources) needed to replicate it.
Do not send a screenshot of the terminal; send the logs or copy the text from the terminal.
lalit joshi
2:42 AM
Tesseract output not correct in Hindi text.
I am trying to build an app where I have to extract some data from a PDF containing election roll data
Adriel Matei
Jun 25
Help setting params to distinguish close-together characters
Hi! I am trying to use Tesseract to programmatically read numbers. The program works fine, except it
Quan Nguyen, JB Data31 (4)
Jun 25
VietOCR v6.3.0 & VietOCR.NET v6.3.0 Releases
VietOCR v6.14.0 & VietOCR.NET v6.13.0 Releases - Upgrade to Tesseract 5.4.1 http://vietocr.sf.net
Oran Mor
Jun 24
Suppressing ERROR_PTR Messages in Java Spring Application with Tesseract OCR Integration
I am integrating Tesseract OCR with my Java Spring application and encountered an issue where the
Danny, Zdenko Podobny (4)
Jun 21
Error when trying to build Tesseract DLL from scratch on Arch Linux via CMake
Cross-compiling is tricky: you need to know what you are doing and how to solve problems. Better
John Roxton, Ger Hobbelt (4)
Jun 19
Guide me on training or better/practical pre-processing?
Couple of general notes, some of which I'm sure you already tried: - all input images: convert to
Saanvi Bhagat, …, Sundara Ganesh (9)
Jun 18
Inconsistencies in detection and extraction of text using tesseract
You said: Now I am trying to save it in a CSV. For that, I am using the coordinates of the detected
Deborah, John Roxton (2)
Jun 17
Inconsistencies (sometimes) on similar characters. Is there a map for characters that are very similar?
Hello Deborah, Hopefully this isn't off-topic, and I don't mean to derail your thread, but I
Zaid Vss, …, Osama Hussein (3)
Jun 13
What is the Arabic language code?
Hi, I believe it is 'ara'. In order to install it on a server or Linux, use the following
yanshan jiang
Jun 12
Factors such as image brightness, contrast, and sharpness affect the direction of text extraction
The contrast and brightness of these two images are different; the txt file is OCR-extracted for the
Ahmed Khalid
Jun 11
Tesseract
I have a problem: Tesseract sometimes reads only the next line and misses the first one. How can I handle that?
Misti Hamon, …, Jeremiah (10)
Jun 9
Manual review and correction for characters outside of the Latin-1 character set
Ger, your problem set/end goal is similar to mine (textbooks/manuals not magazines and datasheets and
neelima preeti
Jun 9
Training with new Bangla font and a little change in ben.training_text. #Please help me
Hello everyone, I am new to training tesseract. So I tried with little data. Please help me. I am
Fred Andrews, Zdenko Podobny (3)
Jun 7
Problem using "--oem 0" in Tesseract 5.4.0
I already did, and found nothing applicable. Why make false accusations? On Friday, June 7, 2024 at
Keith Gorlen
Jun 1
Frequently recognizes '5' as '9'
I just started using ocrmypdf 16.3.0 with Tesseract 5.3.4.20240503 to extract text from Pacific Gas
Ronny Zimmermann
May 31
Train tesseract with a font for European car license plates
I'm trying to improve tesseract's recognition for European license plates. The corresponding
Miraç Göksu Öztürk
May 30
Original LSTM Training Details
Hello, I went through the docs, papers given in the documentation, and this group's conversations
Duy Hoàng, Zdenko Podobny (2)
May 29
Error when running "make training" command
So: If you have a problem - use example data (ocrd-testset.zip) or provide your data set for
Kassim Papa, …, Ger Hobbelt (8)
May 27
OpenMP cannot be disabled
For what it's worth, I ran into the same issue on the same platform (MS Windows) about 2 years
achille sadjang, …, Yaofu Zhou (4)
May 26
Tesseract to recognize images or shapes
Did you fine-tune an existing model or train a new model from scratch? Fine-tuning without
Ahmed Khalid
May 23
Tesseract Arabic numbers
I want to use Tesseract to do OCR for Arabic data, and this is my first time doing a vision project, so I
Panumeth Khongsawatkiat, …, Yaofu Zhou (3)
May 21
Training Tesseract 5 for a New Font in Thai not working
You were fine-tuning an existing model, and it could take MUCH MORE than a few hundred images and a
Leder Extreme BR, Yaofu Zhou (2)
May 21
Cursive letters
Yes, please take a look at Tesstrain, and particularly its Makefile, so that you know what is
Yaofu Zhou (2)
May 21
How to Build Tesseract from Source Code on M1 MacBook?
The particular solution to my gcc/g++ issue was found at https://github.com/pyenv/pyenv/issues/2862.
testcoal, Yaofu Zhou (2)
May 21
Train Tesseract with my own Data
Hi. You seem to be missing a lot of input. Please take a look at Tesstrain, and particularly its
Ulan Bator, Yaofu Zhou (2)
May 21
Tesseract recognition issues.
It is going to be a project for you but one way to achieve your goal is to fine-tune the model using
William John Adam Trindade
May 17
Is there a way to verify image complexity before recognition?
Hi, I have a problem when images like the attached image are submitted to Tesseract. As I have a batch
testcoal, Tom Morris (2)
May 13
Train Tesseract 5 German for new font
It would be much easier to quote, and comment on, your commands and errors if they were in text
Antonio Jimeno Yepes, Jeremiah (2)
May 11
Tesseract fine-tuning questions
I don't know the answer to most of these questions; however, one thing I noticed in your question
La Monte H. P. Yarroll
May 8
Response to Xinyuan Yu on compiling tesseract
I've had two messages rejected as spam. I'm trying one last time. I think you need the -devel