Information about line and word positions

Al Byers

Mar 4, 2013, 7:06:55 AM
to ocr...@googlegroups.com
I would like to have ocropus analyze a document and report the positions of keywords so that I can go back and do a more detailed analysis of particular areas. Is there output that will give me that information?

Tom

Apr 10, 2013, 1:36:32 AM
to ocr...@googlegroups.com
The page segmentation file (.pseg.png) contains pixel-accurate information about where the lines are.
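
As a rough illustration of reading line positions out of such a segmentation image, here is a minimal sketch. It is not taken from the ocropus tools. It assumes the image has already been loaded as rows of (r, g, b) pixels, that white and black are background, and that each remaining distinct color marks one text line; check those assumptions against your own .pseg.png files. The function name `line_bounding_boxes` is hypothetical.

```python
def line_bounding_boxes(pixels):
    """Return {label: (x0, y0, x1, y1)} for every labeled line region.

    `pixels` is a list of rows, each row a list of (r, g, b) tuples.
    Assumption: white and black are background, and every other
    distinct color identifies exactly one text line.
    """
    boxes = {}
    for y, row in enumerate(pixels):
        for x, (r, g, b) in enumerate(row):
            # Pack the three channels into one integer label.
            label = (r << 16) | (g << 8) | b
            if label in (0x000000, 0xFFFFFF):
                continue  # background pixel, not part of any line
            if label in boxes:
                x0, y0, x1, y1 = boxes[label]
                boxes[label] = (min(x0, x), min(y0, y),
                                max(x1, x), max(y1, y))
            else:
                boxes[label] = (x, y, x, y)
    return boxes
```

The boxes are inclusive pixel coordinates with the origin at the top-left of the pixel grid as passed in; each one can then be used to crop the page for a closer look at a particular line.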

How to find words within lines depends on the recognizer and the script. ocropus-lattices gives you bounding boxes relative to the text line. ocropus-rpred (the new recognizer) outputs a sequence of classification vectors that you could use.
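
Since those bounding boxes are relative to the text line, they need to be shifted by the line's position on the page before they can be used to revisit an area of the original document. A minimal sketch of that shift follows. It assumes both the line position and the relative box use the same origin and axis directions, which should be verified against the actual output; the function name `to_page_coords` and the tuple layouts are hypothetical.

```python
def to_page_coords(line_origin, rel_box):
    """Shift a line-relative box into page coordinates.

    `line_origin` is the (x, y) page position of the text line's origin.
    `rel_box` is (x0, y0, x1, y1) measured from that origin.
    Assumption: both use the same origin corner and axis directions.
    """
    ox, oy = line_origin
    x0, y0, x1, y1 = rel_box
    return (x0 + ox, y0 + oy, x1 + ox, y1 + oy)
```

For example, a box (12, 3, 48, 20) inside a line whose origin sits at (100, 250) on the page would map to (112, 253, 148, 270).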

Tom