Hi Arun and everyone,
Following up on the note to Arun a few weeks ago, thanks to pointing out to GRETIL and
sanskritdocuments.org, both were useful, but sadly, they don't grant permission for commercial use.
Since then, I've spent time going through 100+ platforms trying to fill in gaps in my corpus: book publishers, university digitisation projects, Internet Archive, Ministry of Culture collections, foreign archives. My conclusion: Ambuda is the best for texts that are complete, proofread, machine readable, and open-licensed at the same time. Nothing else I found will fill all 4 bars at once.
A few sources I found that might be useful for Ambuda's expansion :
-
https://egangotri.org/-
https://www.sanskritebooks.org/-
https://guides.libraries.emory.edu/c.php?g=576070&p=3973627-
https://titus.uni-frankfurt.de-
https://sa.wikisource.org-
https://vedicreserve.miu.eduMy present corpus comprises the main Upanishads, Stotras, Ramcharitmanas, Ramayana, Mahabharata, Gita, and Vedas. The rest are the Puranas, the Shastras, and the Samhitas. Original Sanskrit verse only, no copyrighted translations needed. If anyone on the list knows of proofread, permissively licensed for commercial use sources for these, I'd love to hear about it.
Keshav