Different Outputs on creating my own traineddata

250 views
Skip to first unread message

Mobeen Ali

unread,
Nov 17, 2019, 2:24:52 AM11/17/19
to tesseract-ocr
Hi everyone!

i have successfully created my own custom traineddata file. I've done the training on ubuntu OS and it was giving me perfect results. But now when i run the same data file on windows, it gives me very poor results. this has happened twice with me. 

What could be the reason and solution for this issue?

Shree Devi Kumar

unread,
Nov 17, 2019, 2:55:06 AM11/17/19
to tesseract-ocr
tesseract --version

Share output of above command on each platform.

Share an image and output on each platform.

--
You received this message because you are subscribed to the Google Groups "tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-oc...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/dc5e5804-23a8-4677-b03d-0226a9e21f84%40googlegroups.com.


--

____________________________________________________________
भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com

Mobeen Ali

unread,
Nov 17, 2019, 3:19:50 AM11/17/19
to tesseract-ocr
Hi shree,
following are the screenshots of details you have asked

tesseract --version (ubuntu)

Screenshot from 2019-11-17 11-00-56.png


tesseract --version (windows 10)

ss1.png


input image (for both ubuntu and windows10):

correspondence1.png

output on ubuntu:

output2.txt file (attached)


output on windows10:

aratext.txt




On Sunday, November 17, 2019 at 10:55:06 AM UTC+3, shree wrote:
tesseract --version

Share output of above command on each platform.

Share an image and output on each platform.

On Sun, Nov 17, 2019 at 12:54 PM Mobeen Ali <moby...@gmail.com> wrote:
Hi everyone!

i have successfully created my own custom traineddata file. I've done the training on ubuntu OS and it was giving me perfect results. But now when i run the same data file on windows, it gives me very poor results. this has happened twice with me. 

What could be the reason and solution for this issue?

--
You received this message because you are subscribed to the Google Groups "tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email to tesser...@googlegroups.com.
output2.txt
aratext.txt

Shree Devi Kumar

unread,
Nov 17, 2019, 3:31:14 AM11/17/19
to tesseract-ocr
I have reported it in tesseract issue https://github.com/tesseract-ocr/tesseract/issues/1530

Please confirm that you are using the  same traineddata file in both platforms. 

To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-oc...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/ce98b803-2991-4d9c-b59c-93378e4b5bb2%40googlegroups.com.

Mobeen Ali

unread,
Nov 17, 2019, 3:36:49 AM11/17/19
to tesseract-ocr
Yes they are same i have confirmed

Stefan Weil

unread,
Nov 17, 2019, 5:31:50 AM11/17/19
to tesseract-ocr
Could you please provide your traineddata file, too? Did you add special options when running Tesseract on Windows and Linux? If yes, I need those to reproduce the issue.

Mobeen Ali

unread,
Nov 17, 2019, 6:31:00 AM11/17/19
to tesseract-ocr
No i haven't added any special option the traineddata file is attached.
the command line i used is

tesseract theInputImage outputName -l ara
ara.traineddata

Mobeen Ali

unread,
Nov 20, 2019, 7:24:32 AM11/20/19
to tesseract-ocr
Any solution??


On Sunday, November 17, 2019 at 1:31:50 PM UTC+3, Stefan Weil wrote:

Mobeen Ali

unread,
Nov 24, 2019, 2:04:33 AM11/24/19
to tesseract-ocr
Dear Stefan have you reached any conclusion for this issue?


On Sunday, November 17, 2019 at 1:31:50 PM UTC+3, Stefan Weil wrote:
Reply all
Reply to author
Forward
0 new messages