Tesseract page segmentation algorithm?

chulwoo pack

Sep 22, 2018, 1:53:03 PM

to tesseract-ocr

Hi everyone,

Does anyone know what method or algorithm Tesseract uses for its fully automatic page segmentation?

I am specifically interested in the segmentation step itself rather than in pre-processing steps such as deskewing or noise removal. I have tried hard to find documentation that specifies the sequence of the process, or whether the algorithm is based on a particular paper, but without success.

Thank you.

Balachandar Suresh

Sep 30, 2019, 4:32:04 PM

to tesseract-ocr

Hi,
If you are still looking into this, here you go:
https://static.googleusercontent.com/media/research.google.com/en//pubs/archive/35094.pdf
