Extract Telugu text from PDF file --> 'patrika padakosam'

dinu

Nov 17, 2010, 11:27:41 AM
to తెలుగుబ్లాగు
Dear Telugu Gurus,

I've created a free, open-source English-Telugu dictionary for mobile phones
using the DICTIONARY FOR MID software.

The present version of the dictionary is based on the English-Telugu
Dictionary (Reprint of 1853 Edition) by CHARLES PHILIP BROWN, which is
now available on the internet as an open-source dictionary database,
but it is very old.
The download link is available at -----> http://dinu-learningisfun.blogspot.com

Recently, I found a high-quality PDF known as 'patrikapadakosam',
which is up to date and an excellent reference guide.

But the problem is that the PDF uses EMBEDDED FONTS, so we are unable
to extract the Telugu text from it into Notepad, Word, or any other editor.
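
One possible way around this, sketched in Python: if the text copied out
of the PDF arrives as the font's own glyph codes rather than Unicode, a
lookup table can translate those codes into Unicode Telugu. This is only a
rough sketch under that assumption. The table HYPOTHETICAL_GLYPH_MAP and
the function legacy_to_unicode are made up for illustration; the real
table would have to be built by hand by inspecting the embedded font.

```python
# Sketch: convert text in a legacy (non-Unicode) font encoding to Unicode
# Telugu using a lookup table. The table below is HYPOTHETICAL; the real
# codes depend on the font embedded in the PDF.
HYPOTHETICAL_GLYPH_MAP = {
    "t": "\u0c24",                # TELUGU LETTER TA
    "e": "\u0c46",                # TELUGU VOWEL SIGN E
    "l": "\u0c32",                # TELUGU LETTER LA
    "u": "\u0c41",                # TELUGU VOWEL SIGN U
    "g": "\u0c17",                # TELUGU LETTER GA
    "k": "\u0c15",                # TELUGU LETTER KA
    "ksh": "\u0c15\u0c4d\u0c37",  # conjunct KA + VIRAMA + SSA
}


def legacy_to_unicode(text, glyph_map):
    """Replace legacy glyph codes with Unicode, trying longer codes first."""
    # Longest keys first, so a multi-character code such as "ksh" is
    # matched before its prefix "k".
    keys = sorted(glyph_map, key=len, reverse=True)
    out = []
    i = 0
    while i < len(text):
        for key in keys:
            if text.startswith(key, i):
                out.append(glyph_map[key])
                i += len(key)
                break
        else:
            # No mapping known: keep the character so gaps stay visible.
            out.append(text[i])
            i += 1
    return "".join(out)


if __name__ == "__main__":
    print(legacy_to_unicode("telugu", HYPOTHETICAL_GLYPH_MAP))
```

Unmapped characters are passed through unchanged, which makes it easy to
spot which glyph codes are still missing from the table.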

pdf: available at www.scribd.com
size: ~1.7MB
If we can make this work, we can build an excellent dictionary.

Thank you,
Dinesh. k
A.P
INDIA