Samskruth Teachers job prospect and A.I . of Voice ? - With growing technology, can I recognize my own speech ?

BVK Sastry (G-S-Pop)

unread,

Jan 10, 2023, 10:50:13 PM1/10/23

to bvpar...@googlegroups.com

Namaste

Pointing to a news item which stakes the following claim-deliverable

Source: VALL-E (valle-demo.github.io) : Abstract. We introduce a language modelling approach for text to speech synthesis (TTS). Specifically, we train a neural codec language model (called VALL-E) using discrete codes derived from an off-the-shelf neural audio codec model, and regard TTS as a conditional language modelling task rather than continuous signal regression as in previous work. During the pre-training stage, we scale up the TTS training data to 60K hours of English speech which is hundreds of times larger than existing systems. VALL-E emerges in-context learning capabilities and can be used to synthesize high-quality personalized speech with only a 3-second enrolled recording of an unseen speaker as an acoustic prompt. Experiment results show that VALL-E significantly outperforms the state-of-the-art zero-shot TTS system in terms of speech naturalness and speaker similarity. In addition, we find VALL-E could preserve the speaker's emotion and acoustic environment of the acoustic prompt in synthesis.

Request: Would Samskruth scholars explore and advise the impact of this ‘Advanced A I Technology of Speech – Reporduction ( – the Mass –impact of this on language standardization should be transparent enough) for ‘ Spoken Samskruth’ (= Vyaavaharika – Sambhashana –Samskrutham) and ‘Standard Samskrutham’ ( Shista – Prayoga) ? pl. How will this impact class room Samskruth – Language- Teachers job futures ?

In other words, would a language learner need a ‘ human teacher’ or accessible ‘ paid - machine resource – service ’ ?? Is it Technology controlling language ?

Regards

BVK Sastry

Mathukumalli Vidyasagar

unread,

Jan 11, 2023, 5:38:27 AM1/11/23

to bvpar...@googlegroups.com

Sastry garu,

Technology would definitely control language. When I go abroad and use Google Maps with my Indian cell phone, the voice uses an "Indian accent." The same Maps produces an "American accent" on a local phone.

But why not turn that to our advantage? We can train interested parties to pronounce Sanskrutham properly. The methodology is quite open source -- only the training and the output are proprietary. It is also not very costly, unless done at massive scale (which is really not needed).

If someone is interested, they can contact me at m.vidy...@iith.ac.in . We have a program on Indian Knowledge Systems, under which this activity may perhaps be carried out.

Best wishes.

Sagar

--
You received this message because you are subscribed to the Google Groups "भारतीयविद्वत्परिषत्" group.
To unsubscribe from this group and stop receiving emails from it, send an email to bvparishat+...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/bvparishat/0ca301d92566%24810a5450%24831efcf0%24%40gmail.com.

kenp

unread,

Jan 11, 2023, 11:34:18 PM1/11/23

to भारतीयविद्वत्परिषत्

Pros and cons of Text to Speech / Speech to Tax

https://www.unf.edu/~tcavanau/presentations/NECC2002/text-to-speech_pres.htm

https://spsreviews.com/advantages-and-disadvantages-of-text-to-speech-software/

https://www.courselounge.com/best-voice-to-text-apps/

https://astaspeaks.wordpress.com/2013/05/14/things-to-consider-the-pros-and-cons-of-voice-recognition-software/

https://www.voxtab.com/transcription-blog/voice-recognition-software-pros-and-cons/