How to detect page orientation correctly

67 views
Skip to first unread message

hrishikesh kaulwar

unread,
Jul 5, 2019, 1:40:07 AM7/5/19
to tesseract-ocr
I have a scanned documents in which few pages are scanned and oriented wrongly 90, 180, 270
But --psm 0 flag on tesseract to give orientation, Opencv Hough lines, Opencv Bounding box are not working.
Could any one of you please suggest a method to detect orientation correctly and rotate to make it correctly oriented?
Thanks in advance.

Zdenko Podobny

unread,
Jul 5, 2019, 4:46:39 AM7/5/19
to tesser...@googlegroups.com
I am sorry but is not clear what you did...
Tesseract does not provide  Opencv Hough lines, Opencv Bounding box...
Post testing image, tesseract command you used, your result, expected results etc...
 
Zdenko


pi 5. 7. 2019 o 7:40 hrishikesh kaulwar <hpka...@gmail.com> napísal(a):
--
You received this message because you are subscribed to the Google Groups "tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-oc...@googlegroups.com.
To post to this group, send email to tesser...@googlegroups.com.
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/51e3b4fd-4a3c-4806-9899-c54cabf40025%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

hrishikesh kaulwar

unread,
Jul 5, 2019, 5:23:25 AM7/5/19
to tesseract-ocr
I have tried 3 methods.
1. Tesseract IMG output --psm 0 command on terminal which gives orientation info but it's not always correct
2. In 2nd method I tried Hough lines Opencv algorithm which is also used to detect orientation detection but it's not always giving correct results.
3. In 3rd I tried Opencv Bounding box algorithm

So, I wanted to know if there is some method that exists and some of you might know of which given accurate results for orientation detection

Thanks again.

Zdenko Podobny

unread,
Jul 5, 2019, 6:02:42 AM7/5/19
to tesser...@googlegroups.com
Once again provide image there tesseract does not provide correct output so other people can test it.


Zdenko


pi 5. 7. 2019 o 11:23 hrishikesh kaulwar <hpka...@gmail.com> napísal(a):
--
You received this message because you are subscribed to the Google Groups "tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-oc...@googlegroups.com.
To post to this group, send email to tesser...@googlegroups.com.
Visit this group at https://groups.google.com/group/tesseract-ocr.
Reply all
Reply to author
Forward
0 new messages