Devnagari Fonts to Unicode


Nityanand Misra

Dec 2, 2011, 6:52:05 PM
to sams...@googlegroups.com
Changing the subject, as this is a separate topic from the original thread.

Thanks Eddie Mahodaya and Shankara Ji for the links. I did not try the converter, because the fonts I have (APS-DV-Stardust and APS-DV-Tulsi) are not listed as options in the dropdown.

Meanwhile, I came across a very useful piece of software, DangiSoft Prakhar Devnagari Font Parivartak. It supports many more fonts (250+), works with copy and paste, and preserves formatting; one can also copy and paste from Excel. An introduction to the software (in Hindi) is available here.

A limited trial version is available from here. The full version costs only Rs 1500 (< USD 30), which is good value for money given the features and the wide range of fonts supported.

Thanks, Nityanand

On Tue, Nov 29, 2011 at 12:55 AM, Eddie Hadley <Eddie...@ontology.demon.co.uk> wrote:
 
Nityanand,
 
“ . . . I am interested in converting texts in some Devanāgarī fonts to Unicode as well.”
 
You might want to take a look at the Unicode Conversion Gateway.
 
 
 
There, you can upload an Indian language text file in a legacy encoding and have it converted into Unicode.
I have tried it for DV-TTYogesh, and it works.
 
 
 
Also, a facility to transliterate a Web site, given its URL, from one Indian language script to another is available at the same site, though I haven’t tried that myself.
 
 
    Eddie
 
 

No virus found in this message.
Checked by AVG - www.avg.com
Version: 2012.0.1873 / Virus Database: 2101/4643 - Release Date: 11/27/11

--
You received this message because you are subscribed to the Google Groups "samskrita" group.
To post to this group, send email to sams...@googlegroups.com.
To unsubscribe from this group, send email to samskrita+...@googlegroups.com.
For more options, visit this group at http://groups.google.com/group/samskrita?hl=en.



--
Nityānanda Miśra
http://nmisra.googlepages.com

|| आत्मा तत्त्वमसि श्वेतकेतो ||
(Thou art from/for/of/in That Ātman, O Śvetaketu)
     - Ṛṣi Uddālaka to his son, Chāndogyopaniṣad 6.8.7, The Sāma Veda

Eddie Hadley

Dec 4, 2011, 3:55:15 PM
to sams...@googlegroups.com, Eddie Hadley
Nityanand,
 
For serious work in extracting and converting the contents of a .pdf file into other common editable formats, it seems best to avoid working with material containing proprietary fonts, and also to go straight to the factory for all the best tools: Adobe.
 
 
At £15.37,  Adobe ExportPDF  is obtainable from https://www.acrobat.com/exportpdf/en/convert-pdf-to-word.html.
 
That’s good value in any currency. And that price includes support from Adobe themselves.
 
I’ve not actually tried it myself, but I have a great many .pdf’s that I would very much like to have in .rtf form.
 
Open Office already does .rtf <==> HTML for me.
 
Eddie


Anunad Singh

Dec 5, 2011, 9:07:12 AM
to sams...@googlegroups.com
To convert PDF files containing legacy Devanagari fonts to DOC files in Unicode Devanagari, I use the following process:

1) Use PDF Nitro (free trial version) to convert the PDF into a DOC file. In this process, all the formatting of the PDF file is maintained.

2) Write an OpenOffice macro to convert text in the above legacy font into Unicode Devanagari.

3) Open the above DOC file in OpenOffice and run the macro, which gives me a DOC file in Unicode Devanagari.

(By selecting only text in the legacy font, I avoid converting text written in English or other languages. Only the Devanagari content is converted.)
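The core of steps 2 and 3 can be sketched in Python rather than as an OpenOffice macro. This is only an illustration under stated assumptions: the glyph table below belongs to an imaginary legacy font (every real font needs its own, much larger table), the document is modelled as a plain list of (text, font name) runs, and the font names "LegacyDV" and "Mangal" are placeholders. The one font-independent detail shown is that visual-order legacy fonts place the short i matra before its consonant, whereas Unicode stores it after.

```python
import re

# Hypothetical glyph table for an imaginary legacy font.
# A real font needs its own table, usually with hundreds of entries.
LEGACY_TO_UNICODE = {
    "d": "\u0915",  # क
    "e": "\u092E",  # म
    "y": "\u0932",  # ल
    "k": "\u093E",  # ा (aa matra)
    "f": "\u093F",  # ि (i matra), typed BEFORE the consonant in visual-order fonts
}

I_MATRA = "\u093F"
# i matra followed by one consonant in the range क..ह
I_MATRA_BEFORE_CONSONANT = re.compile(f"{I_MATRA}([\u0915-\u0939])")


def convert_legacy(text):
    """Map legacy glyph codes to Unicode, then fix the i matra order."""
    mapped = "".join(LEGACY_TO_UNICODE.get(ch, ch) for ch in text)
    # Unicode stores the i matra after its consonant, so swap the pair.
    return I_MATRA_BEFORE_CONSONANT.sub(r"\1" + I_MATRA, mapped)


def convert_runs(runs, legacy_font, unicode_font="Mangal"):
    """Convert only the runs set in the legacy font; leave all others alone."""
    return [
        (convert_legacy(text), unicode_font) if font == legacy_font else (text, font)
        for text, font in runs
    ]


if __name__ == "__main__":
    document = [("dey", "LegacyDV"), (" means lotus", "Arial")]
    for text, font in convert_runs(document, "LegacyDV"):
        print(font, text)
```

Filtering on the font name is what keeps English text untouched: a run set in any other font passes through unchanged. Conjuncts, the reph, and other reordering cases are not handled in this toy version.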

-- Anunad Singh

Anunad Singh

Dec 5, 2011, 9:16:21 AM
to sams...@googlegroups.com
I want to add to the above that if some text from a PDF file is selected, copied and pasted into OpenOffice Writer, it also maintains the original formatting. BUT some of the 'extended Roman characters' get converted to the corresponding simpler characters, which results in wrong Devanagari output after font conversion.
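A small Python sketch of why this matters. It assumes, purely for illustration, that the paste step strips accents (the exact substitution OpenOffice performs is not known here) and uses a hypothetical two-entry glyph table in which 'e' and 'é' stand for different Devanagari letters.

```python
import unicodedata


def flatten(text):
    """Simulate a lossy paste: strip accents, so 'é' becomes 'e'."""
    decomposed = unicodedata.normalize("NFKD", text)
    return "".join(ch for ch in decomposed if not unicodedata.combining(ch))


# Hypothetical table: two legacy codes that differ only by an accent.
TABLE = {
    "e": "\u092E",       # 'e' -> म (assumed mapping)
    "\u00E9": "\u0928",  # 'é' -> न (assumed mapping)
}


def convert(text):
    """Look up each character in the table; pass unknown characters through."""
    return "".join(TABLE.get(ch, ch) for ch in text)


def at_risk_keys(table):
    """Table keys that would not survive the simulated flattening."""
    return sorted(key for key in table if flatten(key) != key)


if __name__ == "__main__":
    original = "\u00E9"
    print(convert(original))           # converted from the intact character
    print(convert(flatten(original)))  # converted after the lossy paste
    print(at_risk_keys(TABLE))
```

Once the accent is lost, the lookup silently lands on a different table entry, so the output is valid Devanagari but the wrong letter. Listing the at-risk keys for a given font table shows which characters to check after pasting.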

-- Anunad 

Eddie Hadley

Dec 5, 2011, 10:29:35 AM
to sams...@googlegroups.com, Eddie Hadley
Anunad,
 
    “ Write an OpenOffice macro to convert the above legacy font into Unicode Devanagari”
 
That is the problem – so many legacy fonts, so little time, and let’s not forget those with legacy Romanised offerings.
 
Yes, I have written one or two of these conversion routines (in C# .NET), already.
But for someone like myself, for whom reading Devanagari is a one-finger, one-character-at-a-time exercise, there is a real need to fully convert all the scripts from the original .pdf’s. Once I have all of that in .rtf or .htm, or even plain Unicode text, I’m on my way.
 
The answer (for me) would appear to be to splash out a little money on Adobe.
 
This is what excites me about the Adobe offering: despite the .doc in its name, it can not only give me my .rtf format, but would also appear to “Make scanned text editable with optical character recognition”. And I have a few of those .pdf’s as well . . .
 
Copying and pasting out of .pdf’s appears to be done with extended ASCII and not with Unicode. That’s what puts the p in the portable, no doubt.
But one size does not fit all  - which makes for an interesting challenge . . .
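One practical consequence: before converting pasted text, it helps to know whether it is already Unicode Devanagari or still legacy glyph codes sitting in the Latin range. A rough Python heuristic, offered as a sketch only (the function names and the 0.5 threshold are arbitrary choices), is to measure how much of the text falls in the Unicode Devanagari block, U+0900 to U+097F.

```python
def devanagari_ratio(text):
    """Fraction of non-space characters in the Devanagari block (U+0900..U+097F)."""
    chars = [ch for ch in text if not ch.isspace()]
    if not chars:
        return 0.0
    hits = sum(1 for ch in chars if "\u0900" <= ch <= "\u097F")
    return hits / len(chars)


def looks_legacy_encoded(text, threshold=0.5):
    """Guess that text is legacy-encoded if few characters are Unicode Devanagari.

    This cannot tell legacy-encoded Devanagari from ordinary English;
    it only says the text is not yet Unicode Devanagari.
    """
    return devanagari_ratio(text) < threshold


if __name__ == "__main__":
    for sample in ["dey fd", "\u0915\u092E\u0932"]:
        print(repr(sample), looks_legacy_encoded(sample))
```

Text copied out of a PDF set in a legacy font would typically score near zero here, because what was copied is the underlying character codes and not the Devanagari shapes the font draws for them.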
 
Eddie
 
 
 
