How to convert sansknet texts to Unicode

96 views
Skip to first unread message

Pankajashree R

unread,
Oct 24, 2017, 2:39:26 AM10/24/17
to sanskrit-programmers
May I know how to convert the text corpus found in this website to Unicode? 

http://www.wilbourhall.org/sansknet/vedanta/index.htm

I obtained one text online (in PDF) which cited its source as the above link. 


I tried using TTY-Yogesh and other fonts mentioned in the Sansknet site but I got junk characters. There are so many valuable texts but all I can see is junk letters. Has anyone tried to convert them? 

Narayan Prasad

unread,
Oct 24, 2017, 2:44:14 AM10/24/17
to sanskrit-p...@googlegroups.com

--
You received this message because you are subscribed to the Google Groups "sanskrit-programmers" group.
To unsubscribe from this group and stop receiving emails from it, send an email to sanskrit-programmers+unsub...@googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

Anunad Singh

unread,
Oct 24, 2017, 4:25:18 AM10/24/17
to sanskrit-p...@googlegroups.com
DV-TT-Vedic to Unicode Converter_01.html

seems most appropriate for this pdf.
>> email to sanskrit-program...@googlegroups.com.
>> For more options, visit https://groups.google.com/d/optout.
>
>
> --
> You received this message because you are subscribed to the Google Groups
> "sanskrit-programmers" group.
> To unsubscribe from this group and stop receiving emails from it, send an
> email to sanskrit-program...@googlegroups.com.

Bhasha IME

unread,
Oct 24, 2017, 8:27:41 AM10/24/17
to sanskrit-p...@googlegroups.com
namaste

Extract PDF text using Foxit reader. This preserves font info. Convert using Bhashaime's "All non-Unicode -> Unicode" menu selection. This will automatically identify texts with different fonts and apply respective conversions.

regards
Venkatesh


--
You received this message because you are subscribed to the Google Groups "sanskrit-programmers" group.
To unsubscribe from this group and stop receiving emails from it, send an email to sanskrit-programmers+unsub...@googlegroups.com.

Anunad Singh

unread,
Oct 24, 2017, 9:33:18 AM10/24/17
to sanskrit-p...@googlegroups.com
Bhasha IME,

I have also found text Foxit reader preserving more info than others.
But don't you think text input derived from MS Office pdf extraction
facility will be still more accurate?
>> email to sanskrit-program...@googlegroups.com.
>> For more options, visit https://groups.google.com/d/optout.
>
>
> --
> You received this message because you are subscribed to the Google Groups
> "sanskrit-programmers" group.
> To unsubscribe from this group and stop receiving emails from it, send an
> email to sanskrit-program...@googlegroups.com.

Bhasha IME

unread,
Oct 24, 2017, 9:46:46 AM10/24/17
to sanskrit-p...@googlegroups.com
I am not aware of this. I am on XP, MSO 2003.

If it does, well and good. The IME requires RTF to analyse. Foxit/MSO do not matter.

BTW, Which ver of MSO offers's this

regds


>> For more options, visit https://groups.google.com/d/optout.
>
>
> --
> You received this message because you are subscribed to the Google Groups
> "sanskrit-programmers" group.
> To unsubscribe from this group and stop receiving emails from it, send an

> For more options, visit https://groups.google.com/d/optout.

--
You received this message because you are subscribed to the Google Groups "sanskrit-programmers" group.
To unsubscribe from this group and stop receiving emails from it, send an email to sanskrit-programmers+unsub...@googlegroups.com.

Pankajashree R

unread,
Oct 24, 2017, 9:58:02 AM10/24/17
to sanskrit-programmers
Namaste,

Thanks a lot. I did as per your instructions for the PDF file and it works like a charm! Foxit reader is better than acrobat - for copy-pasting non-unicode fonts like the one found in the file I shared.

How do I use BhashaIME for the text found in the website I mentioned, such as this - http://www.wilbourhall.org/sansknet/vedanta/vedantadeepa/index.htm ?

Pankajashree R

unread,
Oct 24, 2017, 10:06:37 AM10/24/17
to sanskrit-programmers
I am not aware of MS Office PDF extraction. How to do it ?
>> For more options, visit https://groups.google.com/d/optout.
>
>
> --
> You received this message because you are subscribed to the Google Groups
> "sanskrit-programmers" group.
> To unsubscribe from this group and stop receiving emails from it, send an

Anunad Singh

unread,
Oct 24, 2017, 10:07:04 AM10/24/17
to sanskrit-p...@googlegroups.com
Reading PDF files is available feom MSO2013. Following short intro may
be useful-


With Word 2013 and 2016, you can convert a PDF into a Word document
that you can edit. What this video shows about editing PDFs in Word
2013 also applies to Word 2016.

To convert a PDF into an editable Word document, you open it like you
would any other document.

Click File > Open.

Choose the location of the PDF and click Browse.

Find the PDF and click Open.

The converted document might not have a perfect page-to-page
correspondence with the original. For example, lines and pages may
break at different locations. For more information, see Why does my
PDF look different in Word?
(https://support.office.com/en-us/article/Why-does-my-PDF-look-different-in-Word-1d1d2acc-afa0-46ef-891d-b76bcd83d9c8)
>>> email to sanskrit-program...@googlegroups.com.
>>> For more options, visit https://groups.google.com/d/optout.
>>
>>
> --
> You received this message because you are subscribed to the Google Groups
> "sanskrit-programmers" group.
> To unsubscribe from this group and stop receiving emails from it, send an
> email to sanskrit-program...@googlegroups.com.

Bhasha IME

unread,
Oct 24, 2017, 10:45:46 AM10/24/17
to sanskrit-p...@googlegroups.com
select, copy (HTML formatted text on CB), paste to WPS writer (free). (NB. MSW 2003 changes the font details of certain text. Also its RTF breaks text at unlikely places like between क and ि in कि, this prevents coalescing. However, later ver of MSW may be good. You can try them. For myself, have used WPS writer with good results)

Select, copy (RTF text on CB). Run IME ...

If you have multiple html files (as in this case), batch download them using eg. FDM, convert al to RTF using a utility like 'docto' (for %f in (*.htm) do <...>\docto -f %f -T wdFormatRTF).
Select all .rtf files in WinExplorer, press CTL+ALT+SHF+F2. Hold the CTL,ALT,SHF till a popo-up says it's batch processing 'n' files and release. Wait tille all files are convreted to <file>_U.rtf.

docto uses MSW. Do a trial with a html file and then proceed. Else you may have to use WPS writer.

regds



To unsubscribe from this group and stop receiving emails from it, send an email to sanskrit-programmers+unsubscrib...@googlegroups.com.

For more options, visit https://groups.google.com/d/optout.

Bhasha IME

unread,
Oct 24, 2017, 10:49:04 AM10/24/17
to sanskrit-p...@googlegroups.com
Can you send me a file (Docx/doc/rtf) generated with MSW of the file attached in the original post. Besides preserving info, there are issues like some texts loosing font info, breaking of text runs between syllables etc. It will be useful to see if the RTF is good in all respects so I can recommend them to others

regds



>>> For more options, visit https://groups.google.com/d/optout.
>>
>>
> --
> You received this message because you are subscribed to the Google Groups
> "sanskrit-programmers" group.
> To unsubscribe from this group and stop receiving emails from it, send an

> For more options, visit https://groups.google.com/d/optout.

--
You received this message because you are subscribed to the Google Groups "sanskrit-programmers" group.
To unsubscribe from this group and stop receiving emails from it, send an email to sanskrit-programmers+unsub...@googlegroups.com.

Anunad Singh

unread,
Oct 24, 2017, 11:56:44 AM10/24/17
to sanskrit-p...@googlegroups.com
Bhasha IME,

I mistook you as some sort of Microsoft representative by your name
'Bhasha IME' which resembles 'Bhasha India' by microsoft. Now I think
I was wrong.

If you want (Docx/doc/rtf) generated with MSW of the file attached in
the original post, I will do it tomorrow. You will have to see how
'correct' it is.<div id="DAB4FAD8-2DD7-40BB-A1B8-4E2AA1F9FDF2"><br />
<table style="border-top: 1px solid #D3D4DE;">
<tr>
<td style="width: 55px; padding-top: 13px;"><a
href="https://www.avast.com/sig-email?utm_medium=email&utm_source=link&utm_campaign=sig-email&utm_content=webmail"
target="_blank"><img
src="https://ipmcdn.avast.com/images/icons/icon-envelope-tick-round-orange-animated-no-repeat-v1.gif"
alt="" width="46" height="29" style="width: 46px; height: 29px;"
/></a></td>
<td style="width: 470px; padding-top: 12px; color: #41424e;
font-size: 13px; font-family: Arial, Helvetica, sans-serif;
line-height: 18px;">Virus-free. <a
href="https://www.avast.com/sig-email?utm_medium=email&utm_source=link&utm_campaign=sig-email&utm_content=webmail"
target="_blank" style="color: #4453ea;">www.avast.com</a>
</td>
</tr>
</table><a href="#DAB4FAD8-2DD7-40BB-A1B8-4E2AA1F9FDF2" width="1"
height="1"></a></div>
>> >>> email to sanskrit-program...@googlegroups.com.
>> >>> For more options, visit https://groups.google.com/d/optout.
>> >>
>> >>
>> > --
>> > You received this message because you are subscribed to the Google
>> > Groups
>> > "sanskrit-programmers" group.
>> > To unsubscribe from this group and stop receiving emails from it, send
>> > an
>> > email to sanskrit-program...@googlegroups.com.
>> > For more options, visit https://groups.google.com/d/optout.
>>
>> --
>> You received this message because you are subscribed to the Google Groups
>> "sanskrit-programmers" group.
>> To unsubscribe from this group and stop receiving emails from it, send an
>> email to sanskrit-program...@googlegroups.com.
>> For more options, visit https://groups.google.com/d/optout.
>
>
> --
> You received this message because you are subscribed to the Google Groups
> "sanskrit-programmers" group.
> To unsubscribe from this group and stop receiving emails from it, send an
> email to sanskrit-program...@googlegroups.com.

Pankajashree R

unread,
Oct 24, 2017, 1:26:58 PM10/24/17
to sanskrit-programmers


I copied one line of text from the webpage (http://www.wilbourhall.org/sansknet/vedanta/vedantadeepa/adhyaya1/pada1.html) and pasted in WPS. I get this error -  








On Tuesday, 24 October 2017 20:15:46 UTC+5:30, Bhasha IME wrote:
select, copy (HTML formatted text on CB), paste to WPS writer (free). (NB. MSW 2003 changes the font details of certain text. Also its RTF breaks text at unlikely places like between क and ि in कि, this prevents coalescing. However, later ver of MSW may be good. You can try them. For myself, have used WPS writer with good results)

Select, copy (RTF text on CB). Run IME ...

If you have multiple html files (as in this case), batch download them using eg. FDM, convert al to RTF using a utility like 'docto' (for %f in (*.htm) do <...>\docto -f %f -T wdFormatRTF).
Select all .rtf files in WinExplorer, press CTL+ALT+SHF+F2. Hold the CTL,ALT,SHF till a popo-up says it's batch processing 'n' files and release. Wait tille all files are convreted to <file>_U.rtf.

docto uses MSW. Do a trial with a html file and then proceed. Else you may have to use WPS writer.

regds


On Tue, Oct 24, 2017 at 7:28 PM, Pankajashree R <pankaj...@gmail.com> wrote:
Namaste,

Thanks a lot. I did as per your instructions for the PDF file and it works like a charm! Foxit reader is better than acrobat - for copy-pasting non-unicode fonts like the one found in the file I shared.

How do I use BhashaIME for the text found in the website I mentioned, such as this - http://www.wilbourhall.org/sansknet/vedanta/vedantadeepa/index.htm ?


On Tuesday, 24 October 2017 17:57:41 UTC+5:30, Bhasha IME wrote:
namaste

Extract PDF text using Foxit reader. This preserves font info. Convert using Bhashaime's "All non-Unicode -> Unicode" menu selection. This will automatically identify texts with different fonts and apply respective conversions.

regards
Venkatesh

On Tue, Oct 24, 2017 at 12:09 PM, Pankajashree R <pankaj...@gmail.com> wrote:
May I know how to convert the text corpus found in this website to Unicode? 

http://www.wilbourhall.org/sansknet/vedanta/index.htm

I obtained one text online (in PDF) which cited its source as the above link. 


I tried using TTY-Yogesh and other fonts mentioned in the Sansknet site but I got junk characters. There are so many valuable texts but all I can see is junk letters. Has anyone tried to convert them? 

--
You received this message because you are subscribed to the Google Groups "sanskrit-programmers" group.
To unsubscribe from this group and stop receiving emails from it, send an email to sanskrit-programmers+unsub...@googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

Bhasha IME

unread,
Oct 24, 2017, 1:55:20 PM10/24/17
to sanskrit-p...@googlegroups.com
1. Copy a few lines from html into WPS
2. again select all text in WPS and copy to CB
3. Invoke menu

Here's what i get for the entire text and the result pasted into MSW and saved (see attached)

Inline image 1

To unsubscribe from this group and stop receiving emails from it, send an email to sanskrit-programmers+unsubscrib...@googlegroups.com.

For more options, visit https://groups.google.com/d/optout.

--
You received this message because you are subscribed to the Google Groups "sanskrit-programmers" group.
To unsubscribe from this group and stop receiving emails from it, send an email to sanskrit-programmers+unsubscrib...@googlegroups.com.

For more options, visit https://groups.google.com/d/optout.
vd.doc

Pankajashree R

unread,
Oct 25, 2017, 1:10:02 PM10/25/17
to sanskrit-programmers


Screenshot of text pasted by you from html differs from what I see in the same html page in my system. I've pasted the first few lines to show the difference. As a result Im not getting proper transliteration. Why am I seeing different characters on the webpage?
To unsubscribe from this group and stop receiving emails from it, send an email to sanskrit-programmers+unsub...@googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

--
You received this message because you are subscribed to the Google Groups "sanskrit-programmers" group.
To unsubscribe from this group and stop receiving emails from it, send an email to sanskrit-programmers+unsub...@googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

Bhasha IME

unread,
Oct 25, 2017, 1:57:21 PM10/25/17
to sanskrit-p...@googlegroups.com
which browser ?


To unsubscribe from this group and stop receiving emails from it, send an email to sanskrit-programmers+unsubscrib...@googlegroups.com.

For more options, visit https://groups.google.com/d/optout.

--
You received this message because you are subscribed to the Google Groups "sanskrit-programmers" group.
To unsubscribe from this group and stop receiving emails from it, send an email to sanskrit-programmers+unsubscrib...@googlegroups.com.

For more options, visit https://groups.google.com/d/optout.

--
You received this message because you are subscribed to the Google Groups "sanskrit-programmers" group.
To unsubscribe from this group and stop receiving emails from it, send an email to sanskrit-programmers+unsubscrib...@googlegroups.com.

For more options, visit https://groups.google.com/d/optout.

Pankajashree R

unread,
Oct 25, 2017, 2:04:01 PM10/25/17
to sanskrit-programmers
chrome
which browser ?


To unsubscribe from this group and stop receiving emails from it, send an email to sanskrit-programmers+unsub...@googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

--
You received this message because you are subscribed to the Google Groups "sanskrit-programmers" group.
To unsubscribe from this group and stop receiving emails from it, send an email to sanskrit-programmers+unsub...@googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

--
You received this message because you are subscribed to the Google Groups "sanskrit-programmers" group.
To unsubscribe from this group and stop receiving emails from it, send an email to sanskrit-programmers+unsub...@googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

Bhasha IME

unread,
Oct 25, 2017, 2:04:45 PM10/25/17
to sanskrit-p...@googlegroups.com
paste the lines into Word/Wps-writer and send me the doc

To unsubscribe from this group and stop receiving emails from it, send an email to sanskrit-programmers+unsubscrib...@googlegroups.com.

For more options, visit https://groups.google.com/d/optout.

--
You received this message because you are subscribed to the Google Groups "sanskrit-programmers" group.
To unsubscribe from this group and stop receiving emails from it, send an email to sanskrit-programmers+unsubscrib...@googlegroups.com.

For more options, visit https://groups.google.com/d/optout.

--
You received this message because you are subscribed to the Google Groups "sanskrit-programmers" group.
To unsubscribe from this group and stop receiving emails from it, send an email to sanskrit-programmers+unsubscrib...@googlegroups.com.

For more options, visit https://groups.google.com/d/optout.

Bhasha IME

unread,
Oct 25, 2017, 2:06:37 PM10/25/17
to sanskrit-p...@googlegroups.com
if you have any of those fonts (DV-, SD- etc) installed on your system, uninstall and try

which browser ?


To unsubscribe from this group and stop receiving emails from it, send an email to sanskrit-programmers+unsubscrib...@googlegroups.com.

For more options, visit https://groups.google.com/d/optout.

--
You received this message because you are subscribed to the Google Groups "sanskrit-programmers" group.
To unsubscribe from this group and stop receiving emails from it, send an email to sanskrit-programmers+unsubscrib...@googlegroups.com.

For more options, visit https://groups.google.com/d/optout.

--
You received this message because you are subscribed to the Google Groups "sanskrit-programmers" group.
To unsubscribe from this group and stop receiving emails from it, send an email to sanskrit-programmers+unsubscrib...@googlegroups.com.

For more options, visit https://groups.google.com/d/optout.

--
You received this message because you are subscribed to the Google Groups "sanskrit-programmers" group.
To unsubscribe from this group and stop receiving emails from it, send an email to sanskrit-programmers+unsubscrib...@googlegroups.com.

For more options, visit https://groups.google.com/d/optout.

Pankajashree R

unread,
Oct 25, 2017, 2:13:35 PM10/25/17
to sanskrit-programmers
Interestingly, I just tried in Firefox and its displayed properly - 


From firefox I copied and converted and it worked.

which browser ?


To unsubscribe from this group and stop receiving emails from it, send an email to sanskrit-programmers+unsub...@googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

--
You received this message because you are subscribed to the Google Groups "sanskrit-programmers" group.
To unsubscribe from this group and stop receiving emails from it, send an email to sanskrit-programmers+unsub...@googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

--
You received this message because you are subscribed to the Google Groups "sanskrit-programmers" group.
To unsubscribe from this group and stop receiving emails from it, send an email to sanskrit-programmers+unsub...@googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

--
You received this message because you are subscribed to the Google Groups "sanskrit-programmers" group.
To unsubscribe from this group and stop receiving emails from it, send an email to sanskrit-programmers+unsub...@googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

Bhasha IME

unread,
Oct 25, 2017, 2:14:35 PM10/25/17
to sanskrit-p...@googlegroups.com
glad.

which browser ?


To unsubscribe from this group and stop receiving emails from it, send an email to sanskrit-programmers+unsubscrib...@googlegroups.com.

For more options, visit https://groups.google.com/d/optout.

--
You received this message because you are subscribed to the Google Groups "sanskrit-programmers" group.
To unsubscribe from this group and stop receiving emails from it, send an email to sanskrit-programmers+unsubscrib...@googlegroups.com.

For more options, visit https://groups.google.com/d/optout.

--
You received this message because you are subscribed to the Google Groups "sanskrit-programmers" group.
To unsubscribe from this group and stop receiving emails from it, send an email to sanskrit-programmers+unsubscrib...@googlegroups.com.

For more options, visit https://groups.google.com/d/optout.

--
You received this message because you are subscribed to the Google Groups "sanskrit-programmers" group.
To unsubscribe from this group and stop receiving emails from it, send an email to sanskrit-programmers+unsubscrib...@googlegroups.com.

For more options, visit https://groups.google.com/d/optout.

--
You received this message because you are subscribed to the Google Groups "sanskrit-programmers" group.
To unsubscribe from this group and stop receiving emails from it, send an email to sanskrit-programmers+unsubscrib...@googlegroups.com.

For more options, visit https://groups.google.com/d/optout.

Pankajashree R

unread,
Oct 25, 2017, 2:27:04 PM10/25/17
to sanskrit-programmers
glad.

which browser ?


To unsubscribe from this group and stop receiving emails from it, send an email to sanskrit-programmers+unsub...@googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

--
You received this message because you are subscribed to the Google Groups "sanskrit-programmers" group.
To unsubscribe from this group and stop receiving emails from it, send an email to sanskrit-programmers+unsub...@googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

--
You received this message because you are subscribed to the Google Groups "sanskrit-programmers" group.
To unsubscribe from this group and stop receiving emails from it, send an email to sanskrit-programmers+unsub...@googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

--
You received this message because you are subscribed to the Google Groups "sanskrit-programmers" group.
To unsubscribe from this group and stop receiving emails from it, send an email to sanskrit-programmers+unsub...@googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

--
You received this message because you are subscribed to the Google Groups "sanskrit-programmers" group.
To unsubscribe from this group and stop receiving emails from it, send an email to sanskrit-programmers+unsub...@googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

Bhasha IME

unread,
Oct 26, 2017, 5:29:16 AM10/26/17
to sanskrit-p...@googlegroups.com
same procedure; web->wps->cliboard->IME->MSWord

glad.

which browser ?


To unsubscribe from this group and stop receiving emails from it, send an email to sanskrit-programmers+unsubscrib...@googlegroups.com.

For more options, visit https://groups.google.com/d/optout.

--
You received this message because you are subscribed to the Google Groups "sanskrit-programmers" group.
To unsubscribe from this group and stop receiving emails from it, send an email to sanskrit-programmers+unsubscrib...@googlegroups.com.

For more options, visit https://groups.google.com/d/optout.

--
You received this message because you are subscribed to the Google Groups "sanskrit-programmers" group.
To unsubscribe from this group and stop receiving emails from it, send an email to sanskrit-programmers+unsubscrib...@googlegroups.com.

For more options, visit https://groups.google.com/d/optout.

--
You received this message because you are subscribed to the Google Groups "sanskrit-programmers" group.
To unsubscribe from this group and stop receiving emails from it, send an email to sanskrit-programmers+unsubscrib...@googlegroups.com.

For more options, visit https://groups.google.com/d/optout.

--
You received this message because you are subscribed to the Google Groups "sanskrit-programmers" group.
To unsubscribe from this group and stop receiving emails from it, send an email to sanskrit-programmers+unsubscrib...@googlegroups.com.

For more options, visit https://groups.google.com/d/optout.

--
You received this message because you are subscribed to the Google Groups "sanskrit-programmers" group.
To unsubscribe from this group and stop receiving emails from it, send an email to sanskrit-programmers+unsubscrib...@googlegroups.com.

For more options, visit https://groups.google.com/d/optout.
shrutaprakashika.7z

Anunad Singh

unread,
Oct 26, 2017, 8:49:38 AM10/26/17
to sanskrit-p...@googlegroups.com
Bhasha IME,

I have just tried Bhasha IME tool. It is simply GREAT. It is not a simple IME which I was thinkink of.

Such a free tool was in high demand for long, especially for keeping the formatting unchanged. It is much more than what people were seeking for. I will say it is a 'complete' tool for free.

But you did not introduce this tool here formally. Will you do it here and at other discussion fora like Scientific and Technical Hindi Group?

-- anunAda

Bhasha IME

unread,
Oct 26, 2017, 1:26:18 PM10/26/17
to sanskrit-p...@googlegroups.com
Thanks for kind words.

Did introduce here in the context of converting Vedic_heritage_Illustratred_dic_hindi.pdf to unicode. Also listed in Wiki and sanskritdocsuments.

Yes, the transliteration part is obfuscated in the IME & hence remains mostly unknown. Will separate them soon.

Mostly have left it to people who have used it to spread the word.😔.

regards
Venkatesh


विश्वासो वासुकिजः (Vishvas Vasuki)

unread,
Oct 26, 2017, 1:46:14 PM10/26/17
to sanskrit-programmers
Is it the same bhAShA IME which was once very popular for typing kannada and devanAgarI? I didn't realize that it is available for free!

kautUhalam - What language is it written in? Any thoughts about opensourcing it?
--
Vishvas /विश्वासः

Bhasha IME

unread,
Oct 26, 2017, 1:51:41 PM10/26/17
to sanskrit-p...@googlegroups.com
Bashaime has never been very popular. May be you are referring to Baraha which is completely different.

Written in Autoit script (a grave mistake. Should have gone for C++). Can provide source to any one; may not be necessary though, since keyman has become opensource and is cross-platform
Will eval keyman.

venkatesh


To unsubscribe from this group and stop receiving emails from it, send an email to sanskrit-programmers+unsub...@googlegroups.com.

विश्वासो वासुकिजः (Vishvas Vasuki)

unread,
Oct 26, 2017, 2:03:11 PM10/26/17
to sanskrit-programmers
On Thu, Oct 26, 2017 at 10:51 AM, Bhasha IME <bhas...@gmail.com> wrote:
Bashaime has never been very popular. May be you are referring to Baraha which is completely different.

​Ah - I mistakenly conflated the two.​

 
Written in Autoit script (a grave mistake. Should have gone for C++). Can provide source to any one; may not be necessary though, since keyman has become opensource and is cross-platform

​I see. ​It won't be useful to me, but it might be still be a good idea to just dump a snapshot to Github for future reference, perhaps with a note that it is likely to be deprecated.

Bhasha IME

unread,
Oct 26, 2017, 2:15:09 PM10/26/17
to sanskrit-p...@googlegroups.com
will not deprecate. Many features of it are unlikely to be available in keyman. I may think of adapting parts of keyman for future versions and migrating to c++. It is in this context that I meant present source may not be useful to anyone.


Dhaval Patel

unread,
Oct 26, 2017, 9:49:38 PM10/26/17
to sanskrit-p...@googlegroups.com
Try to dump on github. You can never be sure that no one would be interested in your code. 
Message has been deleted

Pankajashree R

unread,
Nov 4, 2017, 5:16:40 AM11/4/17
to sanskrit-programmers
But Bhasha IME helped a great deal to convert non Unicode to Unicode (especially from sansknet) that too for free. I've not heard of Keyman though.

I have used another free tool named Pramukh IME

विश्वासो वासुकिजः (Vishvas Vasuki)

unread,
Dec 25, 2018, 12:01:29 PM12/25/18
to sanskrit-programmers

On Tue, Oct 24, 2017 at 1:25 AM Anunad Singh <anu...@gmail.com> wrote:
DV-TT-Vedic to Unicode Converter_01.html

seems most appropriate for this pdf.

On Tue, Oct 24, 2017 at 12:14 PM, Narayan Prasad <hin...@gmail.com> wrote:
> https://sites.google.com/site/technicalhindi/home/converters

>
>
> On 24 October 2017 at 12:09, Pankajashree R <pankaj...@gmail.com> wrote:
>>
>> May I know how to convert the text corpus found in this website to
>> Unicode?
>>
>> http://www.wilbourhall.org/sansknet/vedanta/index.htm
>>
>> I obtained one text online (in PDF) which cited its source as the above
>> link.
>>
>> The PDF can be found here -
>> https://drive.google.com/open?id=0B1DiRvzvNxGaeVhlc2hnbC1sem8
>>
>> I tried using TTY-Yogesh and other fonts mentioned in the Sansknet site
>> but I got junk characters. There are so many valuable texts but all I can
>> see is junk letters. Has anyone tried to convert them?
>>
>> --
>> You received this message because you are subscribed to the Google Groups
>> "sanskrit-programmers" group.
>> To unsubscribe from this group and stop receiving emails from it, send an

>> For more options, visit https://groups.google.com/d/optout.
>
>
> --
> You received this message because you are subscribed to the Google Groups
> "sanskrit-programmers" group.
> To unsubscribe from this group and stop receiving emails from it, send an

> For more options, visit https://groups.google.com/d/optout.

--
You received this message because you are subscribed to the Google Groups "sanskrit-programmers" group.
To unsubscribe from this group and stop receiving emails from it, send an email to sanskrit-program...@googlegroups.com.

For more options, visit https://groups.google.com/d/optout.

ken p

unread,
Dec 27, 2018, 4:23:01 PM12/27/18
to sanskrit-programmers
You may play around here with various fonts given in left column to  convert to unicode. This tool needs auto font identifier function.

https://service.vishalon.net/pramukh-font-converter/hindi

Here is Gita Bhashya text matching fonts shiva ji 01

|ÉlÉĻÉÉä%vŠÉÉŠÉ& ĘuųiÉŌŠÉÉä%vŠÉÉŠÉ& iÉÞiÉŌŠÉÉä%vŠÉÉŠÉ& SÉiÉÖlÉÉæ%vŠÉÉŠÉ&
     {É\SÉĻÉÉä%vŠÉÉŠÉ& đÉđ`öÉä%vŠÉÉŠÉ& šÉ{iÉĻÉÉä%vŠÉÉŠÉ& +đ]õĻÉÉä%vŠÉÉŠÉ&
     xÉīÉĻÉÉä%vŠÉÉŠÉ&

ऽरुल्रुरुरुत्व्रुरुरुज्ञ ुरुिरुरुत्व्रुरुरुज्ञ रुिरुिरुरुत्व्रुरुरुज्ञ श्रुरुि_ल्रुरुत्व्रुरुरुज्ञ
     ठ्ठरु्‌श्रुरुरुत्व्रुरुरुज्ञ रु्ररुत्व्रुरुरुज्ञ ईरुठ्ठरुिरुरुत्व्रुरुरुज्ञ ट्टउरुरुत्व्रुरुरुज्ञ
     क्ष्रुरुरुरुत्व्रुरुरुज्ञ

ken p

unread,
Dec 27, 2018, 4:23:01 PM12/27/18
to sanskrit-programmers
Reply all
Reply to author
Forward
0 new messages