Questions about ocropus-binarize and descender touching underlines

16 views
Skip to first unread message

Jason Culverhouse

unread,
Jun 2, 2011, 2:42:32 PM6/2/11
to ocr...@googlegroups.com
Questions about ocropus-binarize
When I run the default ocropus-binarize The output removes any "descender" connected to the box underline

Original

PastedGraphic-1.tiff
PastedGraphic-2.tiff

Tom

unread,
Jun 29, 2011, 5:52:05 PM6/29/11
to ocr...@googlegroups.com
The default binarizer consists of a number of steps, one of which performs this operation.  It's implemented by the class StandardPreprocessing in C++.  It has a number of thresholds and parameters you can set.  You can see the settable parameters by typing

ocropus params StandardPreprocessing

You can either change the rmbig_... parameters or just remove the RmBig component from the preprocessing stack altogether.

Tom
Reply all
Reply to author
Forward
0 new messages