About the language training data made available at Parichit

Anunad Singh

Jan 8, 2013, 8:01:22 AM
to parich...@googlegroups.com
First, thank you for starting this great project, Parichit. It is a great hope for Indic languages.

I want to know the following:

1) Is the training data different from that provided at the Tesseract project site?

2) If different, will it work with Tesseract?

3) Parichit_WIN_LIN.tar.gz: what front end does it use?

With regards,

-- Anunad Singh

Indu

Jan 10, 2013, 12:07:42 AM
to parich...@googlegroups.com
On Tue, Jan 8, 2013 at 6:31 PM, Anunad Singh <anu...@gmail.com> wrote:
> First, thank you for starting this great project, Parichit. It is a great hope for Indic languages.
>
> I want to know the following:
>
> 1) Is the training data different from that provided at the Tesseract project site?

Yes, the training data is different from that provided at the Tesseract project site.

> 2) If different, will it work with Tesseract?

Yes, it will work with Tesseract 3.01. I have not tried it with the current version of Tesseract.

> 3) Parichit_WIN_LIN.tar.gz: what front end does it use?

The front end of Parichit is a modification of VietOCR. Earlier versions of Tesseract did not have the matraclipping support needed to process Devanagari script, so we added matraclipping support to the VietOCR GUI that was available at the time. The current Tesseract code has the matraclipping facility built in.

> With regards,
>
> -- Anunad Singh



--
Thanks & Regards

Indu

Indu

Jan 10, 2013, 4:20:42 AM
to parich...@googlegroups.com
For Hindi, you can use the training data available from the Tesseract site itself [http://code.google.com/p/tesseract-ocr/downloads/detail?name=tesseract-ocr-3.02.hin.tar.gz&can=2&q=]. The results are almost accurate.
hinout.txt
PairaviKyomAurKaise2.tif
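
For readers who want to try the Hindi training data from the command line, here is a minimal sketch in Python. It assumes the `tesseract` executable is on the PATH and that `hin.traineddata` from the archive linked above has been unpacked into Tesseract's `tessdata` directory. The helper names `build_tesseract_command` and `run_ocr` are hypothetical and exist only for this illustration; the thread does not say how the attached output was actually produced.

```python
import shutil
import subprocess


def build_tesseract_command(image_path, output_base, lang="hin"):
    """Return the argument list for a Tesseract command-line call.

    Tesseract adds the ".txt" extension to output_base on its own.
    """
    return ["tesseract", image_path, output_base, "-l", lang]


def run_ocr(image_path, output_base, lang="hin"):
    """Run Tesseract if it is installed; return True when it exits cleanly."""
    if shutil.which("tesseract") is None:
        return False  # Tesseract is not installed or not on PATH
    result = subprocess.run(build_tesseract_command(image_path, output_base, lang))
    return result.returncode == 0
```

With the attachment names listed above, a call such as `run_ocr("PairaviKyomAurKaise2.tif", "hinout")` would write the recognised text to `hinout.txt`, assuming the image is in the current directory.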

Anunad Singh

Jan 11, 2013, 9:17:03 AM
to parich...@googlegroups.com
Thank you Indu ji.

I downloaded Parichit_WIN_LIN.tar.gz and unzipped it, but I did not find an installer or any executable file. Does that mean I have to compile it before I can use it?

with respect
ANUNAD

Sriranga(78yrsold)

Jan 11, 2013, 9:20:37 AM
to parich...@googlegroups.com
Are you using Linux or Windows?

Indu

Jan 12, 2013, 11:27:14 AM
to parich...@googlegroups.com

For Linux there is run.sh, and for Windows there is run.bat. Also check the readme file for instructions.
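
As a rough illustration of the point above, the following Python sketch picks the launcher script by operating system. It assumes that `run.sh` and `run.bat` sit at the top level of the unpacked archive, which the thread does not confirm. The function names and the directory name `Parichit` in the usage note are hypothetical, and the readme remains the authoritative source for start-up instructions.

```python
import platform
from pathlib import Path


def launcher_for(system_name):
    """Map an OS name, as returned by platform.system(), to the launcher script."""
    return "run.bat" if system_name == "Windows" else "run.sh"


def launcher_command(install_dir, system_name=None):
    """Build the command that would start Parichit from its unpacked directory."""
    system_name = system_name or platform.system()
    script = Path(install_dir) / launcher_for(system_name)
    if system_name == "Windows":
        return ["cmd", "/c", str(script)]  # run the batch file through cmd
    return ["sh", str(script)]  # run the shell script through sh
```

The resulting list can be passed to `subprocess.run`, for example `subprocess.run(launcher_command("Parichit"), cwd="Parichit")`, so that no compilation step is needed.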

Kirtikumar Patel 9825409607

Jul 31, 2014, 8:28:26 AM
to parich...@googlegroups.com

Hello,
I installed Tesseract on a Windows 7 system,
but how do I install another Gujarati font for the output .txt file?

Please help.

Thanks and regards,
Kirti 9825409607

khyati pandya

Mar 10, 2015, 9:40:01 PM
to parich...@googlegroups.com
Hi, I am Khyati Pandya. I am doing a diploma in IT and am implementing an OCR app as my final-year project.
I have successfully implemented it using tess-two for English, but it is not working for other languages. It would be a pleasure if you could help me.
I tried downloading the Hindi traineddata and using it, but it does not work. No error is shown in the program, but after capturing an image it shows a runtime error saying that unfortunately the program has stopped. Please help.
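
The thread does not identify the cause of this crash. One thing that can be checked first is the file layout: Tesseract loads a language from a file named `<lang>.traineddata` inside a `tessdata` directory under the data path given at initialisation. The Python sketch below checks a local copy of such a directory for missing language files. The function names are hypothetical, and a missing file is only one possible cause, not a confirmed diagnosis.

```python
from pathlib import Path


def traineddata_path(data_path, lang):
    """Location where Tesseract looks for a language file under data_path."""
    return Path(data_path) / "tessdata" / f"{lang}.traineddata"


def missing_languages(data_path, langs):
    """Return the languages whose .traineddata file is absent under data_path."""
    return [lang for lang in langs if not traineddata_path(data_path, lang).is_file()]
```

For example, `missing_languages("/path/to/data", ["eng", "hin"])` returns `["hin"]` when only `eng.traineddata` is present in `/path/to/data/tessdata`.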