Reading large gray images with only numbers yields incorrect results


inKi Wang

Mar 26, 2024, 8:41:44 AM
to tesseract-ocr

Hi everyone, I wish you all a good day.

I'm getting incorrect results from image_to_string when reading large grayscale images that contain only numbers. Here's what I'm using:

  • pytesseract version 0.3.10
  • Tesseract OCR version 5.3.3
  • Language: English (eng)
  • PSM: 7
  • OEM: 3
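
For reference, the setup above could be expressed roughly as follows. This is a minimal sketch, not the poster's actual code: the helper names build_config and read_number are hypothetical, and the digit whitelist (tessedit_char_whitelist) is an optional addition that the original post does not mention.

```python
from typing import Optional


def build_config(psm: int = 7, oem: int = 3, whitelist: Optional[str] = None) -> str:
    """Build the Tesseract config string for the settings listed above."""
    parts = [f"--oem {oem}", f"--psm {psm}"]
    if whitelist:
        # Assumption: restricting output to digits and "." helps for
        # numbers-only images. This is not part of the original setup.
        parts.append(f"-c tessedit_char_whitelist={whitelist}")
    return " ".join(parts)


def read_number(image_path: str) -> str:
    """Run OCR on one image with PSM 7 / OEM 3 and the English model."""
    # Imported lazily so build_config can be used without the OCR stack.
    from PIL import Image
    import pytesseract

    config = build_config(psm=7, oem=3, whitelist="0123456789.")
    with Image.open(image_path) as img:
        text = pytesseract.image_to_string(img, lang="eng", config=config)
    return text.strip()
```

Calling read_number("9.0.png") would run the same configuration on the attached image. Whether the whitelist changes the result for this particular image is untested here.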

For the attached image (9.0.png), image_to_string returns 9.5. When I resize the image, it returns 9.0, 9.5, or sometimes 9.9, depending on the size. In one instance resizing gave 5.5, but it was incorrect for other cases with different numbers.
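
Since the result changes with the image size, one way to make resizing repeatable is to scale every image so its characters end up at the same pixel height, instead of trying sizes by hand. The sketch below assumes the character height is measured manually and uses a target of 30 px. That target is an illustrative assumption, not a value from this thread, and the helper names are hypothetical.

```python
def scale_factor(text_height_px: float, target_height_px: float = 30.0) -> float:
    """Return the factor that brings the measured text height to the target."""
    if text_height_px <= 0:
        raise ValueError("text_height_px must be positive")
    return target_height_px / text_height_px


def resize_for_ocr(img, text_height_px: float, target_height_px: float = 30.0):
    """Convert a PIL image to grayscale and rescale it to the target text height."""
    # Imported lazily so scale_factor can be used without Pillow installed.
    from PIL import Image

    factor = scale_factor(text_height_px, target_height_px)
    new_size = (
        max(1, round(img.width * factor)),
        max(1, round(img.height * factor)),
    )
    return img.convert("L").resize(new_size, Image.LANCZOS)
```

For example, if the digits in a large image are about 120 px tall, scale_factor(120) gives 0.25, so the image is reduced to a quarter of its size before OCR.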

Do you have any suggestions for me to improve the accuracy of the results? Thank you all, and I wish you a great day!

9.0.png

inKi Wang

Mar 26, 2024, 8:47:27 AM
to tesseract-ocr
I have also tried different PSMs, but the results are not very promising: a setting that is correct for this image is incorrect for other images. I hope you can help me improve the accuracy. Thank you all.
At 19:41:44 UTC+7 on Tuesday, March 26, 2024, inKi Wang wrote:
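
A comparison across page segmentation modes like the one described could be scripted along these lines. This is a sketch under assumptions: the function name compare_psms and the particular modes tried (6, 7, 8, 13) are illustrative, not what the poster actually ran. The ocr parameter exists only so the loop can be exercised without Tesseract installed.

```python
from typing import Callable, Dict, Iterable, Optional


def compare_psms(
    image,
    psms: Iterable[int] = (6, 7, 8, 13),
    ocr: Optional[Callable[..., str]] = None,
) -> Dict[int, str]:
    """Run OCR once per page segmentation mode and collect the outputs."""
    if ocr is None:
        # Default to pytesseract; imported lazily so a stub can be used in tests.
        import pytesseract
        ocr = pytesseract.image_to_string

    results = {}
    for psm in psms:
        config = f"--oem 3 --psm {psm}"
        results[psm] = ocr(image, lang="eng", config=config).strip()
    return results
```

Running this over a set of images with known values shows which mode, if any, is correct for all of them rather than for a single image.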

Zdenko Podobny

Mar 26, 2024, 8:52:18 AM
to tesser...@googlegroups.com
Yes, we have suggestions for improving the accuracy of the results. They are already in the documentation. Just read it.

Zdenko


On Tue, Mar 26, 2024 at 13:41, inKi Wang <inki.p...@gmail.com> wrote: