AI for proofreading


विश्वासो वासुकिजः (Vishvas Vasuki)

Jun 1, 2025, 2:22:29 AM
to sanskrit-programmers, Hindu-vidyA हिन्दुविद्या, hariH nArAyaNaH हरिनारायणः SF
Despite some false positives, Claude 4 seems somewhat useful for identifying errors:

image.png

Copilot, run on a pull request, wasn't as useful.


Let me know your experiences.

You can get this assistance from within the VS Code or IntelliJ editor.

intellij_AI_skt-typo_detection.png
vscode_AI_skt-typo_detection.png


--
Vishvas /विश्वासः

Hari Narayanan

Jun 1, 2025, 7:16:04 PM
to विश्वासो वासुकिजः (Vishvas Vasuki), sanskrit-programmers, Hindu-vidyA हिन्दुविद्या

Namaste mahodaya

Thank you for your quick responses. Last year, Dr. Ramasubramanyan-mahodaya from IIT Bombay was here for Kaveri camp. He mentioned that they have thousands of scanned pages from many works, but for lack of proofreaders they are not able to publish them for the benefit of others. They were paying Master's students ₹10 a page to proofread, and still few took it up. Their main problem was diacritical marks in the English transliteration. So we thought that maybe an AI engine could be trained to identify and correct most of these errors. Manual proofreading would then be needed for only a small percentage.

There were people willing to sponsor PhD students for this project.

Any ideas or experience you have will be much appreciated.

Many thanks,
हरिः

विश्वासो वासुकिजः (Vishvas Vasuki)

Jun 1, 2025, 7:16:18 PM
to Hari Narayanan, sanskrit-programmers, Hindu-vidyA हिन्दुविद्या
On Sun, 1 Jun 2025 at 22:43, Hari Narayanan <hna...@gmail.com> wrote:

> Their main problem was diacritical marks in the English transliteration.
>
> So we thought that maybe an AI engine could be trained to identify and correct most of these.


Rather than typo-correction software, I think that simply creating a better OCR model, trained specifically on text with diacritical marks, should solve the problem.
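
To make the kind of diacritic error under discussion concrete, here is a minimal Python sketch of a cheap check that could run on OCR output, whichever OCR model produces it. It assumes the transliteration scheme is IAST (the thread only says "English transliteration"), and the function name `suspicious_tokens` is made up for illustration. It flags only letters that fall outside the IAST alphabet, which is a common symptom of a diacritic being misread. It cannot detect a dropped diacritic such as "a" for "ā".

```python
import unicodedata

# Assumption: the text is IAST. These are the single letters IAST uses;
# digraphs such as "kh" or "ai" are built from the same letters.
IAST_LETTERS = set(unicodedata.normalize(
    "NFC", "aāiīuūṛṝḷḹeoṃḥkgṅcjñṭḍṇtdnpbmyrlvśṣsh"))


def suspicious_tokens(text):
    """Return whitespace-separated tokens containing a non-IAST letter."""
    flagged = []
    # NFC makes "s + combining dot below" compare equal to precomposed "ṣ".
    for token in unicodedata.normalize("NFC", text).split():
        letters = (ch for ch in token.lower() if ch.isalpha())
        if any(ch not in IAST_LETTERS for ch in letters):
            flagged.append(token)
    return flagged


if __name__ == "__main__":
    # "š" and "ḩ" stand in for typical misreadings of "ṣ" and "ḥ".
    sample = "dharmakṣetre kurukšetre samavetā yuyutsavaḩ"
    for token in suspicious_tokens(sample):
        print(token)
```

A filter like this would only narrow down which words a human proofreader looks at first. It is not a substitute for either a better OCR model or a trained correction model.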
