Page segmentation


zm

Jan 30, 2010, 3:48:53 AM
to ocropus

Hi,

Given a screenshot of a web page, I need to identify the regions
containing navigation menus, content areas and advertisement blocks.
Based on the following posts in the group

http://groups.google.com/group/ocropus/browse_thread/thread/056d0ead0532829e/7e101d7c50ee1968?lnk=raot
http://groups.google.com/group/ocropus/browse_thread/thread/f394c0ea4889c65/d7107fad9db87a6d?lnk=gst&q=page+segmentation#d7107fad9db87a6d

I understand that ocropus is, at least to some degree, capable of
doing this. Yet I cannot find any information on how someone totally
unfamiliar with the tool can start working with its segmentation
components. Can someone point me to the relevant documents or
(preferably) give a short example utilising ocropus segmentation
features?

Thanks in advance,
zm

Faisal Shafait

Feb 4, 2010, 6:32:32 AM
to ocr...@googlegroups.com
Hi,
Web pages are quite different from document images because of their very low resolution and anti-aliasing effects. You can try Voronoi segmentation (after binarizing the image, of course), as outlined in my post that you referred to.
If you are using OCRopus 0.4, you can put that code in the commands folder and run scons at the top level. You should then get an executable for the Voronoi segmentation module.
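
For the binarization step mentioned above, here is a minimal sketch in plain Python, not OCRopus code. It assumes the screenshot has already been loaded as a 2D list of 8-bit grayscale values and that text is darker than the background (invert the image first for light-on-dark pages). The function names `otsu_threshold` and `binarize` are made up for this illustration, and Otsu's global threshold is just one common choice; it is not necessarily what OCRopus uses internally.

```python
def otsu_threshold(gray):
    """Return a global threshold (0..255) chosen by Otsu's method.

    `gray` is a 2D list of integer pixel values in the range 0..255.
    """
    # Build the intensity histogram.
    hist = [0] * 256
    for row in gray:
        for v in row:
            hist[v] += 1

    total = sum(hist)
    sum_all = sum(i * h for i, h in enumerate(hist))

    best_t, best_var = 0, -1.0
    w_dark = 0    # number of pixels with value <= t
    sum_dark = 0  # sum of intensities of those pixels
    for t in range(256):
        w_dark += hist[t]
        sum_dark += t * hist[t]
        if w_dark == 0:
            continue  # no pixels in the dark class yet
        w_light = total - w_dark
        if w_light == 0:
            break  # every pixel is already in the dark class
        mean_dark = sum_dark / w_dark
        mean_light = (sum_all - sum_dark) / w_light
        # Between-class variance (up to a constant factor).
        var = w_dark * w_light * (mean_dark - mean_light) ** 2
        if var > best_var:
            best_var, best_t = var, t
    return best_t


def binarize(gray):
    """Return a 2D list with 1 for dark (text) pixels and 0 for background."""
    t = otsu_threshold(gray)
    return [[1 if v <= t else 0 for v in row] for row in gray]


if __name__ == "__main__":
    # Tiny synthetic "screenshot": dark glyph pixels on a light background.
    page = [
        [230, 230, 230, 230],
        [230, 20, 20, 230],
        [230, 20, 20, 230],
        [230, 230, 230, 230],
    ]
    for row in binarize(page):
        print(row)
```

Because anti-aliased text in screenshots produces many intermediate gray levels, a single global threshold may not separate text cleanly; the result is worth inspecting visually before running the segmentation on it.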

Cheers,
Faisal




philip

Apr 26, 2013, 7:56:15 AM
to ocr...@googlegroups.com
How did you go with this segmentation? I also want to do the same thing.