--
--
You received this message because you are subscribed to the Google
Groups "tesseract-ocr" group.
To post to this group, send email to tesser...@googlegroups.com
To unsubscribe from this group, send email to
tesseract-oc...@googlegroups.com
For more options, visit this group at
http://groups.google.com/group/tesseract-ocr?hl=en
---
You received this message because you are subscribed to the Google Groups "tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-oc...@googlegroups.com.
For more options, visit https://groups.google.com/groups/opt_out.
--
--
You received this message because you are subscribed to the Google
Groups "tesseract-ocr" group.
To post to this group, send email to tesser...@googlegroups.com
To unsubscribe from this group, send email to
tesseract-oc...@googlegroups.com
For more options, visit this group at
http://groups.google.com/group/tesseract-ocr?hl=en
---
You received this message because you are subscribed to a topic in the Google Groups "tesseract-ocr" group.
To unsubscribe from this topic, visit https://groups.google.com/d/topic/tesseract-ocr/tfi9RrPk3_E/unsubscribe.
To unsubscribe from this group and all its topics, send an email to tesseract-oc...@googlegroups.com.
You need to use leptonica (e.g. via ctypes, but there is also python module pylepthonica).
Modify relevant part of script with something like this:
lep = ctypes.cdll.LoadLibrary("liblept.so") pix_image = lep.pixRead(filename) print "image width:", lep.pixGetWidth(pix_image) print "image height:", lep.pixGetHeight(pix_image) tesseract.TessBaseAPISetImage2(api, pix_image) tesseract.TessBaseAPISetRectangle(api, 30, 60, 400, 100) text_out = tesseract.TessBaseAPIGetUTF8Text(api) result_text = ctypes.string_at(text_out) print result_text
You received this message because you are subscribed to the Google Groups "tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-oc...@googlegroups.com.
Hi Zdenko,
"More complex solution" is giving better results in detecting text than earlier one. In addition it points out the layout of where the text is.
But I don't have clear idea as to what this script is doing better compared to earlier one.
Please help me in understanding what the following methods are for:
TessBaseAPISetPageSegMode
TessBaseAPIGetComponentImages
TessBaseAPISetVariable
Also,What are Page Segmentation Modes?
What are Page Iterator Levels?
What is OSD?