Low DPI means game over?

56 views
Skip to first unread message

Abs

unread,
Oct 28, 2019, 1:06:19 PM10/28/19
to tesseract-ocr
I'm struggling to get the square footage of the attached floor plan image.

It partially works. Tesseract returns "1474 SQ" but I am hoping for the full string "1474 SQ.FT."

$ tesseract --version
tesseract
4.1.0
 leptonica
-1.78.0
  libgif
5.1.4 : libjpeg 9c : libpng 1.6.37 : libtiff 4.0.10 : zlib 1.2.11 : libwebp 1.0.3 : libopenjp2 2.3.1
 
Found AVX2
 
Found AVX
 
Found SSE

Is my image simply too low of a DPI and nothing can be done to enhance this result?

I've tried greyscale, invert and scaling the image and couldn't improve the result.

Can anyone improve the results for the attached image? If so how?
test-floor-plan.gif

Timothy Snyder

unread,
Oct 28, 2019, 5:34:02 PM10/28/19
to tesser...@googlegroups.com
Which part are you trying to OCR? There's a lot of non-text likely interfering with recognition.

--
You received this message because you are subscribed to the Google Groups "tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-oc...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/606e7614-84d9-4356-a7c7-571f83f38521%40googlegroups.com.

Abs

unread,
Oct 28, 2019, 6:47:03 PM10/28/19
to tesseract-ocr
Hey Timothy, the bit I’m interested in is the bit that shows the total square footage at the bottom. I’m trying to extract the 1474.
Reply all
Reply to author
Forward
0 new messages