Many thanks for your reply - useful to know.
I now find that pytesseract is returning the wrong coordinates for individual characters. For example, for this image (which has a 10-pixel border):
image_to_boxes returns:
A: 17 32 10 22
L: 17 32 24 33
etc.
These, I believe, are interpreted as (left bottom right top), and when I extract the image for the letter A I get:
However, the same code works correctly for:
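For reference, here is a minimal sketch of how I understand the conversion from box values to a crop rectangle. It assumes each line of the box output has the form "char left bottom right top page" and that the y values are measured from the bottom edge of the image (the Tesseract box-file convention), whereas a Pillow crop rectangle is measured from the top edge. The helper name boxes_to_crop_rects and the sample values are made up for illustration; they are not the values from my image.

```python
from typing import List, Tuple

# (character, (left, upper, right, lower)) with the origin at the top-left
CropRect = Tuple[str, Tuple[int, int, int, int]]


def boxes_to_crop_rects(box_text: str, image_height: int) -> List[CropRect]:
    """Convert box lines ("char left bottom right top page") to crop rectangles.

    Assumes box y values are measured from the bottom of the image, so they
    are flipped against image_height to get top-left-origin coordinates.
    """
    rects = []
    for line in box_text.splitlines():
        parts = line.split()
        if len(parts) < 5:
            continue  # skip blank or malformed lines
        char = parts[0]
        left, bottom, right, top = (int(v) for v in parts[1:5])
        # Flip the y axis: the box "top" becomes the crop "upper" edge.
        upper = image_height - top
        lower = image_height - bottom
        rects.append((char, (left, upper, right, lower)))
    return rects


# Hypothetical box output for a 50-pixel-high image.
sample = "A 10 12 22 30 0\nL 24 12 33 30 0"
rects = boxes_to_crop_rects(sample, image_height=50)
```

With Pillow, each rectangle could then be passed to image.crop(rect), using image.height as image_height. If the flip is left out, the crop lands at the wrong vertical position unless the text happens to be vertically centred, which may explain why the same code appears to work on some images and not others.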
On Thursday, 11 June 2020 19:30:50 UTC+1, zdenop wrote:
I cannot understand why Tesseract fails on this (cropped) image:
Yet if I add a random white border it works:
Can anyone shed any light please?