Denoising Documentation

83 views
Skip to first unread message

Pedro Correia

unread,
Nov 16, 2016, 8:36:51 AM11/16/16
to tesseract-ocr
Hi there,
I've been studying tesseract for a while now, but I still haven't found any documentation on it's denoising process. Does anyone know where to find it ?
Thanks in advance.

Quan Nguyen

unread,
Nov 18, 2016, 10:13:02 AM11/18/16
to tesseract-ocr
Probably there is none. Tesseract is an OCR engine. Images usually are preprocessed before sending them to the engine.

Pedro Correia

unread,
Nov 22, 2016, 9:27:31 AM11/22/16
to tesseract-ocr
Seems to me that tesseract does quite a lot of denoising during the layou analysis, like in ccnontextdetect.cpp (thanks to ComputeNoiseDensity method) and in strokewidht.cpp (DetectAndRemoveNosie method).
However, the one thing that I still don't understand is the vigorous_noise_removal method (from textord/makerow.cpp). Seems like it is disabled by default, is that correct? If so, why is that? What does it stand for exactly?
Reply all
Reply to author
Forward
0 new messages