Identifying side notes

16 views
Skip to first unread message

Mona

unread,
Jul 16, 2013, 8:16:36 AM7/16/13
to ocr...@googlegroups.com
Hi,

I want to use OCRopus to identify side notes in documents. For eg, text blocks that are written vertically on the border around a larger text block, or grouped together in small paragraphs at the foot of a page. 

I've run the 'ocropus-gpageseg' under the debug mode and I've obtained the intermediate images where I can see the seeded text lines. The text regions that I'm interested in seem to be highlighted as seeds fairly well here. I'd like to extract the image that these regions correspond to.

Any suggestions on how to go about this?

Thanks!








Reply all
Reply to author
Forward
0 new messages