--
You received this message because you are subscribed to the Google Groups "tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-oc...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/b63523ed-0e81-483b-a224-ada4c786fa3d%40googlegroups.com.
Have you tried
On Fri, Aug 2, 2019 at 9:26 PM Cristobal Jesus Muñoz Solano <cmun...@gmail.com> wrote:
Hello, I am trying to use Tesseract. I have read all the documentation and run many tests. Sorry if this is not the place to ask this question, but I have been researching for several days, I still have many doubts, and I do not know what to do or where to look next. I'm frustrated.
1) If I want to train Tesseract to improve its accuracy on images in the OCR-B font, should I first fine-tune by adding the OCR-B font? Or can I create a traineddata file directly from the image/box files and then combine it with the best model?
2) How do I add many image/box pairs to the best model?
3) Once a .traineddata file is ready and saved in tessdata, is that enough for Tesseract to use that data when I run it on my test images?
I already tried this script
https://github.com/Shreeshrii/tessdata_ocrb
but I still don't understand how to add new training images to the best model
please help me, I don't want to kill myself so young
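On question 3: Tesseract selects a model by name, so a file saved as NAME.traineddata in the tessdata directory is used when you pass `-l NAME`. Below is a minimal Python sketch of that call. The model name `ocrb`, the image name, and the helper names are placeholders invented for illustration; `-l`, `--psm` and `--tessdata-dir` are standard tesseract command-line options.

```python
import shutil
import subprocess


def build_tesseract_command(image_path, lang, tessdata_dir=None, psm=6):
    """Build the tesseract command line for a given traineddata name.

    `lang` is the file name without the .traineddata extension, so
    "ocrb" refers to a hypothetical ocrb.traineddata in tessdata.
    """
    cmd = ["tesseract", image_path, "stdout", "-l", lang, "--psm", str(psm)]
    if tessdata_dir is not None:
        # Only needed when the file is not in the default tessdata directory.
        cmd += ["--tessdata-dir", tessdata_dir]
    return cmd


def run_ocr(image_path, lang, tessdata_dir=None, psm=6):
    """Run tesseract and return the recognized text."""
    if shutil.which("tesseract") is None:
        raise RuntimeError("tesseract executable not found on PATH")
    result = subprocess.run(
        build_tesseract_command(image_path, lang, tessdata_dir, psm),
        capture_output=True,
        text=True,
        check=True,
    )
    return result.stdout
```

For example, `run_ocr("sample.png", "ocrb")` would read sample.png with ocrb.traineddata, assuming both exist.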
Hello, I have already tried mrz.traineddata. It is quite good, but it is not accurate enough. How can I improve it? Is it possible to use box/png files to improve its accuracy?
mrz.traineddata was generated using thousands of images. I doubt you'll be able to increase the accuracy just by adding more data.
Most of the time the accuracy issues are related to poor image pre-processing.
You can try https://www.doubango.org/webapps/mrz/, which uses mrz.traineddata, with the failing images to see if it works. If it works, this means the issue is in the pre-processing.
If you share some sample images it would be easier to help you.
I can already generate the .box files from png images using listbox, but I don't understand what comes next.
How can I use them to improve the best model, eng.traineddata?
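For what comes after the box files, the usual route described in the Tesseract 4 training documentation is fine-tuning: turn each image/box pair into an .lstmf file, extract the LSTM network from the starting traineddata, continue training from it, then pack the checkpoint into a new traineddata. The Python sketch below only builds those command lines; it does not run them. All paths, the model name, the helper names and the iteration count are placeholders, and as far as I know the box files must be in the format the LSTM trainer expects rather than the legacy per-character format.

```python
def lstmf_command(image_path, output_base):
    """Command that writes output_base.lstmf from an image whose
    .box file sits next to it with the same base name."""
    return ["tesseract", image_path, output_base, "--psm", "6", "lstm.train"]


def finetune_commands(base_traineddata, list_file, work_dir, model_name,
                      max_iterations=400):
    """Return the three commands of a fine-tuning run, in order.

    list_file is a text file with one .lstmf path per line.
    """
    base_lstm = f"{work_dir}/base.lstm"
    checkpoint_prefix = f"{work_dir}/{model_name}"
    return [
        # 1. Extract the LSTM network from the starting model.
        ["combine_tessdata", "-e", base_traineddata, base_lstm],
        # 2. Continue training that network on the new samples.
        ["lstmtraining",
         "--continue_from", base_lstm,
         "--traineddata", base_traineddata,
         "--train_listfile", list_file,
         "--model_output", checkpoint_prefix,
         "--max_iterations", str(max_iterations)],
        # 3. Convert the last checkpoint into a usable traineddata file.
        ["lstmtraining", "--stop_training",
         "--continue_from", f"{checkpoint_prefix}_checkpoint",
         "--traineddata", base_traineddata,
         "--model_output", f"{work_dir}/{model_name}.traineddata"],
    ]
```

The starting model should be one of the tessdata_best files, since the fast models are integer-only and not meant to be trained further.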
This image returns L2007190588S37<<<<<<<<<<<<\n77F1912157PER22344783<K<3\n<RODRIGUEZ<<LORENZA<SO but on https://www.doubango.org/webapps/mrz/ it works well. I don't know why :(
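One way to tell automatically that a result like this is a misread is to check it against the MRZ layout rules from ICAO Doc 9303: every line of an MRZ has the same fixed length (3 lines of 30, 2 of 36, or 2 of 44 characters), and the check digits use the repeating weights 7, 3, 1. The Python sketch below is an illustration under those rules, not part of Tesseract or of the doubango tool; the function names are made up.

```python
# Allowed (number of lines, characters per line) for TD1, TD2 and TD3.
MRZ_SHAPES = {(3, 30), (2, 36), (2, 44)}


def mrz_char_value(ch):
    """Numeric value of one MRZ character: 0-9, A=10 ... Z=35, '<'=0."""
    if ch.isdigit():
        return int(ch)
    if "A" <= ch <= "Z":
        return ord(ch) - ord("A") + 10
    if ch == "<":
        return 0
    raise ValueError(f"not a valid MRZ character: {ch!r}")


def mrz_check_digit(field):
    """Check digit of a field using the repeating weights 7, 3, 1."""
    weights = (7, 3, 1)
    total = sum(mrz_char_value(c) * weights[i % 3] for i, c in enumerate(field))
    return total % 10


def has_valid_mrz_shape(text):
    """True if the text has a line count and line length of a known MRZ type."""
    lines = [ln.strip() for ln in text.splitlines() if ln.strip()]
    if not lines:
        return False
    lengths = {len(ln) for ln in lines}
    return len(lengths) == 1 and (len(lines), lengths.pop()) in MRZ_SHAPES


# The OCR result quoted above, with real line breaks.
OCR_OUTPUT = (
    "L2007190588S37<<<<<<<<<<<<\n"
    "77F1912157PER22344783<K<3\n"
    "<RODRIGUEZ<<LORENZA<SO"
)
```

`has_valid_mrz_shape(OCR_OUTPUT)` is False here because the three lines have different lengths, which suggests characters were dropped at the line edges. That is consistent with a cropping or pre-processing problem rather than a problem with the model.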
--
You received this message because you are subscribed to the Google Groups "tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-oc...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/b182f7cd-cb15-439d-a4a0-105aeedd65bb%40googlegroups.com.
<Selección_001.png>