Help

50 views
Skip to first unread message

Ishak DÖLEK

unread,
Oct 31, 2019, 5:34:45 PM10/31/19
to tesser...@googlegroups.com
I want to train the manuscripts using tesseract v4. 
Do I need to convert the image of manuscript  into binary pictures to train? What do you suggest to convert binary pictures if you need to?
Thank in advance.
Sincerely

Zdenko Podobny

unread,
Nov 1, 2019, 4:04:55 AM11/1/19
to tesser...@googlegroups.com
Please provide example image you try to OCR.

Zdenko


št 31. 10. 2019 o 22:34 Ishak DÖLEK <ishakd...@gmail.com> napísal(a):
--
You received this message because you are subscribed to the Google Groups "tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-oc...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/CAA%3DdkuZA0MPw%2BDS6UyPt4rp4LeDBm%2B27J2MKXf37kjvOjjpmxQ%40mail.gmail.com.

Ishak DÖLEK

unread,
Nov 1, 2019, 4:59:54 AM11/1/19
to tesser...@googlegroups.com
I'm sending the image below. All image are like this.

image.png


Zdenko Podobny <zde...@gmail.com>, 1 Kas 2019 Cum, 11:04 tarihinde şunu yazdı:

Zdenko Podobny

unread,
Nov 1, 2019, 6:24:54 AM11/1/19
to tesser...@googlegroups.com
No, you do not need binarize image if you plan train tesseract from custom images.

Zdenko


pi 1. 11. 2019 o 9:59 Ishak DÖLEK <ishakd...@gmail.com> napísal(a):

Ishak DÖLEK

unread,
Nov 7, 2019, 1:56:46 AM11/7/19
to tesser...@googlegroups.com
I would like to create a traineddata to recognize the Ottoman manuscripts.
I have prepared both synthetic and orijinal (tif/box) data for this purpose. I also prepared a dictionary list.
For this, it would be correct to convert the Ottoman mannuscripts images into binary images?
or what would you suggest me to do this task?

Thanks in advance.

Zdenko Podobny <zde...@gmail.com>, 1 Kas 2019 Cum, 13:24 tarihinde şunu yazdı:
Reply all
Reply to author
Forward
0 new messages