Tesseract v5 architecture

231 views
Skip to first unread message

Giridharan Kumaravelu

unread,
May 30, 2022, 7:55:54 PM5/30/22
to tesseract-ocr
I am looking to understand the architecture of OCR pipeline in tesseract v5.0.1 to know about the preprocessing that happen before the LSTM network during inference and training

I could only find these 7 year old documentation notes (https://github.com/tesseract-ocr/docs/tree/main/das_tutorial2016) and I am not sure if they are still accurate. 
  1. Is the information I am looking for present anywhere in the online documentation (https://tesseract-ocr.github.io/tessdoc/)? 
  2. Is there a way to turn off the pagelayout analysis and other preprocessing before the LSTM modules? 

Amine

unread,
May 26, 2023, 2:10:50 PM5/26/23
to tesseract-ocr
Hello,

Have you found any information regarding the architecture of v5 or v4?
I'm searching as well to understand how it works.

Best regards.
Reply all
Reply to author
Forward
0 new messages