தமிழ்மழை - தமிழுக்கு ஒரு புதிய தரவு

17 views
Skip to first unread message

Muthu A

unread,
Aug 17, 2025, 9:58:11 PMAug 17
to கணித்தமிழ் ஆய்வுக் குழுமம், ThamiZha! - Free Tamil Computing(FTC), Neechal Karan, Vasu Renganathan, K Kalyanasundaram, Julien Malard, Malaikannan Sankarasubbu
வணக்கம்:
Latest drop தமிழ்மழை from 
a huge dataset of Tamil sentences and phrases for use with AI and LLMs. 
Can be found via link: huggingface.co/datasets/tamil 

 credit: Selvakumar Murugan, Tamil Arasan Bakthavatchalam and Malaikannan Sankarasubbu

Hope all young and talented folks can use this dataset to build/refine models and use it for AI/ML applications and other novel Tamil focused applications.

Congratulations to the team for developing this as well as sharing under open-source license!
 
-Muthu
Reply all
Reply to author
Forward
0 new messages