Hi,
this looks hard. You have two problems here, straighten the text and clean it up.
Once you have straighten the text to something like this:
I do not know how they pre-process it, maybe they do not and throw it at a neural network trained with any kind of text.
If you want to try to clean up you need to look into edge detection (gaussian difference) and/or wavelet decomposition. I tried with gimp with weak results.
When you have something that barely looks like black on white you can try to fine tune the tesseract model but you need a lot of samples with hand transcribed text, unless you are so lucky to have pre-classified images.
I would also try to fine tune the existing model with the image as it is, with no pre-processing at all other than straightening. It may even work better.
To straighten the text you may try EAST text detection, rotate the bounding boxes. Or detect curved lines and dewarp it according to the radius. Or do component analysis, detect the letter boxes and dewarp accordingly. Not easy to do reliably on a random picture.
Bye
Lorenzo