Re: About Sanskrit Spell check

67 views
Skip to first unread message

विश्वासो वासुकिजः (Vishvas Vasuki)

unread,
Apr 21, 2023, 8:02:57 AM4/21/23
to shantanu oak, sanskrit-programmers
Thanks! I use firefox, but I suppose it won't work if I don't use Google input?

Adding sanskrit-programmers in case someone there finds it useful.

(In the screenshot, अस्माकं उद्देश्यं should've been highlighted - अस्माकम् उद्देश्यं being correct.)


On Fri, 21 Apr 2023 at 15:14, shantanu oak <shanta...@gmail.com> wrote:
Hi,
I have developed a Sanskrit spellchecker for Firefox and Libreoffice. You can try them here...

https://addons.mozilla.org/en-US/firefox/addon/sanskrit-dictionary/

https://extensions.libreoffice.org/en/extensions/show/27509

I will like to know if you find it useful.

-- Shantanu



--
--
Vishvas /विश्वासः

Shreevatsa R

unread,
Apr 28, 2023, 9:53:19 AM4/28/23
to sanskrit-p...@googlegroups.com, shantanu oak
Interesting! Looks like the spell-checking is based on the relevant data here, in two files sa_IN.dic (3229 lines) and sa_IN.aff (1043 lines). What would you say is the quality of this spell checker, i.e. rate of false positives/negatives? I'm curious how much quality can be achieved with such little data; looks intriguing. Thanks for building these spell-checking tools.

BTW Vishvas I don't think this requires using Google input tools; it should work on any text input field in Firefox -- the second screenshot shows a Wikipedia edit page. And I was able to get this from the current featured article on sa.wikipedia.org:

image.png
(This example looks like a false positive; the underlined word is correct and all the suggested corrections make less sense AFAICT.)


--
You received this message because you are subscribed to the Google Groups "sanskrit-programmers" group.
To unsubscribe from this group and stop receiving emails from it, send an email to sanskrit-program...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/sanskrit-programmers/CAFY6qgFvZDSmbFsXc-zm457ZbzRyzbCc19%3DAVdyaqE8nxsb6tg%40mail.gmail.com.

shantanu oak

unread,
May 6, 2023, 12:37:29 AM5/6/23
to sanskrit-programmers
The files on Github were used in the first version. Since then, a lot of new words have been added. If you use a program like 7-zip, you can look at these new words by unzipping the extension file.

This is not 100% accurate spell checker like English. It is especially useful for correcting the spelling mistakes found in OCR text. A person who is good at both, "Ashtadhyayi" and "hunspell" can build a better one. :)

You have given an example of "रसायुर्वेदस्य" and I fully agree that this word should have been part of the wordlist. You could add that word to your personal dictionary by selecting the option "Add to Dictionary". Once enough words are collected, please send them to me so that I can add them in the source code. It's a small price to pay to build a better spell checker for sanskrit.

Firefox:
1) Select Help - More troubleshooting information.
2) Click on "Open Folder" button in the "Profile folder" section.
3) email the words from file "persdict.dat" to me.

Libre-office:
1) start – run - type %APPDATA%
2) email the words from file "standard.txt" found in user profile folder: \LibreOffice\4\user\wordbook

And do not forget to check the "Plus" extension for libreoffice...

https://extensions.libreoffice.org/en/extensions/show/27511

This will extract the unique mis-spelled words. It helps when you're checking a really long paper.

If you want to help, rate the add-on (preferably 5 stars)

-- Shantanu
Reply all
Reply to author
Forward
0 new messages