i finetuned myanmar traineddata and i got accuracy above 95%.
But something is wired.
I render some text with different exposure to eval and i run
tesseract in exp_minus_1.png exp_minus_1 -l mya --psm 6
etc.
exposure miuns 1 output is ok.
In exp miuns 5 and 10 output, some lines output are not really exist in image. output result of exist line is still ok.
i try default psm and not different.
what is wrong?
My system is
tesseract v5.0.0-alpha.20200328
leptonica-1.78.0
libgif 5.1.4 : libjpeg 8d (libjpeg-turbo 1.5.3) : libpng 1.6.34 : libtiff 4.0.9 : zlib 1.2.11 : libwebp 0.6.1 : libopenjp2 2.3.0
Found AVX2
Found AVX
Found FMA
Found SSE
Found libarchive 3.3.2 zlib/1.2.11 liblzma/5.2.3 bz2lib/1.0.6 liblz4/1.7.5
Found libcurl/7.59.0 OpenSSL/1.0.2o (WinSSL) zlib/1.2.11 WinIDN libssh2/1.7.0 nghttp2/1.31.0
traineddata file