Regarding Parichit OCR system


Dost Singh

Jun 23, 2016, 2:01:05 AM
to Parichit-OCR
Hi All,

I am working on an academic project where I need to convert PDF/scanned documents in various Indic languages to text files that can be inserted into a database for search indexing. I have tried Tesseract OCR and even Banti OCR, but the results are not great. I would like to get some information about Parichit OCR and to contact people at CDAC who still maintain this project or used to maintain it.
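
For readers following the thread, here is a minimal sketch of the "insert into a database for search indexing" step, assuming the OCR text is already available one page at a time. The table names (`pages`, `terms`) and the functions `create_index`, `add_page`, and `search` are hypothetical and are not part of Parichit, Tesseract, or any tool named here. Whitespace tokenization is a simplification for illustration only; real Indic text search would likely need proper normalization.

```python
import sqlite3

# Punctuation stripped from word edges, including the danda used in Devanagari.
EDGE_PUNCT = "।॥.,;:!?()\"'"

def create_index(conn):
    """Create a table for page text plus a simple word-to-page inverted index."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS pages "
        "(id INTEGER PRIMARY KEY, doc TEXT, page INTEGER, body TEXT)"
    )
    conn.execute("CREATE TABLE IF NOT EXISTS terms (term TEXT, page_id INTEGER)")
    conn.execute("CREATE INDEX IF NOT EXISTS terms_by_term ON terms (term)")

def add_page(conn, doc, page, body):
    """Store one page of OCR output and index its distinct words."""
    cur = conn.execute(
        "INSERT INTO pages (doc, page, body) VALUES (?, ?, ?)", (doc, page, body)
    )
    page_id = cur.lastrowid
    # Assumption: splitting on whitespace is good enough for this sketch.
    words = {w.strip(EDGE_PUNCT) for w in body.split()}
    words.discard("")
    conn.executemany(
        "INSERT INTO terms (term, page_id) VALUES (?, ?)",
        [(w, page_id) for w in words],
    )
    return page_id

def search(conn, term):
    """Return (doc, page) pairs whose text contains the exact word."""
    rows = conn.execute(
        "SELECT p.doc, p.page FROM terms t JOIN pages p ON p.id = t.page_id "
        "WHERE t.term = ? ORDER BY p.doc, p.page",
        (term,),
    )
    return rows.fetchall()

if __name__ == "__main__":
    conn = sqlite3.connect(":memory:")
    create_index(conn)
    add_page(conn, "book.pdf", 1, "नमस्ते दुनिया।")
    add_page(conn, "book.pdf", 2, "यह एक परीक्षण है।")
    print(search(conn, "दुनिया"))
```

Keeping one row per page, rather than one per document, means a search hit can point back to the exact scanned page.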

I request all concerned people to please respond.

Thanks & regards

Dost

Raj Mashruwala

Jun 23, 2016, 4:05:46 AM
to parich...@googlegroups.com
Google provides OCR for 70-plus languages, including most Indic scripts. Search for the instructions online; it is offered as part of Google Drive.
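
As an illustration only, the Drive route can also be scripted. The sketch below assumes the Google Drive API v3 accessed through the `google-api-python-client` package: uploading an image or PDF with a Google Docs target MIME type asks Drive to convert it (which is where OCR happens), and `ocrLanguage` is a language hint. The function name `drive_ocr` and the file names in the comments are hypothetical, and obtaining OAuth credentials is left out.

```python
DOC_MIME = "application/vnd.google-apps.document"

def drive_ocr(service, media, name, lang="hi"):
    """Upload one image/PDF, let Drive convert it to a Google Doc, return its text.

    `service` is assumed to be a Drive v3 client and `media` an upload object
    such as googleapiclient.http.MediaFileUpload; both are built by the caller.
    """
    created = service.files().create(
        body={"name": name, "mimeType": DOC_MIME},  # request conversion to a Doc
        media_body=media,
        ocrLanguage=lang,  # ISO 639-1 hint, e.g. "hi" for Hindi
        fields="id",
    ).execute()
    file_id = created["id"]
    try:
        data = service.files().export(fileId=file_id, mimeType="text/plain").execute()
    finally:
        # Remove the temporary Doc so the Drive folder does not fill up.
        service.files().delete(fileId=file_id).execute()
    # Plain-text exports may start with a byte order mark; utf-8-sig strips it.
    return data.decode("utf-8-sig") if isinstance(data, bytes) else data

# Possible usage (assumes `creds` holds valid OAuth credentials):
#   from googleapiclient.discovery import build
#   from googleapiclient.http import MediaFileUpload
#   service = build("drive", "v3", credentials=creds)
#   media = MediaFileUpload("page-001.png", mimetype="image/png")
#   text = drive_ocr(service, media, "page-001", lang="hi")
```

Passing the client and the upload object in from outside keeps the function free of credential handling and easy to exercise with a stand-in service.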
--
You received this message because you are subscribed to the Google Groups "Parichit-OCR" group.
To unsubscribe from this group and stop receiving emails from it, send an email to parichit-ocr...@googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

Hariharan Ramamurthy

Mar 27, 2017, 6:01:53 PM
to Parichit-OCR, mas...@gmail.com
The problem with Google Drive OCR for Indic languages is that it is OK to some level for single images, but it doesn't work for anything more than a page. It then goes to something called Cloud Convert, which just hangs and doesn't work.

RKVS Raman

unread,
Mar 27, 2017, 9:06:49 PM
to parich...@googlegroups.com, mas...@gmail.com
Have you tried the Indic-OCR system (https://indic-ocr.github.io/)?



Best Regards
-Raman

-----------------------------------------------
RKVS Raman
http://sites.google.com/site/rkvsraman
------------------------------------------------




Shashwat Bishwen

Mar 28, 2017, 1:39:40 AM
to parich...@googlegroups.com, mas...@gmail.com
I am already trying my hand at Indic-OCR. I have had a certain level of success, but it will take some more time to evolve into a stable product. However, I figured out a way to use Google Docs to do the job. It is a slow process, but it works. I recently completed a book of 400+ pages, and it took a little over 7 hours for my Python script to complete the OCR of this book. The script is not polished; I will share a polished version in a while so that others can benefit too.
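
The script itself was not posted in this thread, so the following is only a hedged sketch of how such a long page-by-page job could be organized, not the author's method. It assumes the book has already been split into one image per page and that `ocr_page` is any callable (hypothetical, supplied by the caller) that turns one page file into text. Finished pages are skipped on a rerun, so a job that runs for hours can resume after a failure. The helper `seconds_per_page` just works out the throughput implied by the figures above: 7 hours for 400 pages is about 63 seconds per page.

```python
import time
from pathlib import Path

def ocr_book(page_files, out_dir, ocr_page, retries=3, delay=0.0):
    """OCR pages one at a time, writing one .txt per page.

    Pages whose output file already exists are skipped, so the job is resumable.
    Returns the number of pages processed in this run.
    """
    out_dir = Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)
    done = 0
    for page in map(Path, page_files):
        target = out_dir / (page.stem + ".txt")
        if target.exists():
            continue  # finished in an earlier run
        for attempt in range(1, retries + 1):
            try:
                text = ocr_page(page)  # caller-supplied OCR backend (assumption)
                break
            except Exception:
                if attempt == retries:
                    raise  # give up on this page after the last attempt
                time.sleep(delay)
        target.write_text(text, encoding="utf-8")
        done += 1
    return done

def seconds_per_page(hours, pages):
    """Average wall-clock seconds spent per page."""
    return hours * 3600 / pages

if __name__ == "__main__":
    print(seconds_per_page(7, 400))
```

Writing each page to its own file before moving on is what makes the resume check possible; the per-page files can be concatenated once the whole book is done.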


-| If you have the desire to behold, then develop the vision.
