தமிழ்மழை - தமிழுக்கு ஒரு புதிய தரவு

3 views
Skip to first unread message

Muthu A

unread,
Aug 17, 2025, 9:58:07 PMAug 17
to கணித்தமிழ் ஆய்வுக் குழுமம், ThamiZha! - Free Tamil Computing(FTC), Neechal Karan, Vasu Renganathan, K Kalyanasundaram, Julien Malard, Malaikannan Sankarasubbu
வணக்கம்:
Latest drop தமிழ்மழை from 
a huge dataset of Tamil sentences and phrases for use with AI and LLMs. 
Can be found via link: huggingface.co/datasets/tamil 

 credit: Selvakumar Murugan, Tamil Arasan Bakthavatchalam and Malaikannan Sankarasubbu

Hope all young and talented folks can use this dataset to build/refine models and use it for AI/ML applications and other novel Tamil focused applications.

Congratulations to the team for developing this as well as sharing under open-source license!
 
-Muthu

Sathia Narayanan

unread,
Aug 17, 2025, 11:36:53 PMAug 17
to Muthu A, கணித்தமிழ் ஆய்வுக் குழுமம், ThamiZha! - Free Tamil Computing(FTC), Neechal Karan, Vasu Renganathan, K Kalyanasundaram, Julien Malard, Malaikannan Sankarasubbu
Very nice to hear this. Thank you !! 
--
You received this message because you are subscribed to the Google Groups "கணித்தமிழ் ஆய்வுக் குழுமம்" group.
To unsubscribe from this group and stop receiving emails from it, send an email to kanittamiz+...@googlegroups.com.
To view this discussion visit https://groups.google.com/d/msgid/kanittamiz/CAHwtB4Y2mvguE0gp38-QOFGwDrAtxCsPofEpAsOKPtEBkHUm1A%40mail.gmail.com.

Vijay Sundar Ram

unread,
Aug 18, 2025, 1:16:54 AMAug 18
to Muthu A, கணித்தமிழ் ஆய்வுக் குழுமம், ThamiZha! - Free Tamil Computing(FTC), Neechal Karan, Vasu Renganathan, K Kalyanasundaram, Julien Malard, Malaikannan Sankarasubbu
Huge Corpus is the need of the hour. Great work.

with thanks,
Vijay

--
You received this message because you are subscribed to the Google Groups "கணித்தமிழ் ஆய்வுக் குழுமம்" group.
To unsubscribe from this group and stop receiving emails from it, send an email to kanittamiz+...@googlegroups.com.
To view this discussion visit https://groups.google.com/d/msgid/kanittamiz/CAHwtB4Y2mvguE0gp38-QOFGwDrAtxCsPofEpAsOKPtEBkHUm1A%40mail.gmail.com.


--
R.Vijay Sundar Ram

Rama Suganthan

unread,
Aug 18, 2025, 1:33:46 AMAug 18
to Muthu A, கணித்தமிழ் ஆய்வுக் குழுமம், ThamiZha! - Free Tamil Computing(FTC), Neechal Karan, Vasu Renganathan, K Kalyanasundaram, Julien Malard, Malaikannan Sankarasubbu
very good work !!! congratulations ! would like to invite them to present in INFITT Conference in SRM university next month ! 

On Mon, Aug 18, 2025 at 7:28 AM Muthu A <ezhi...@gmail.com> wrote:
--
You received this message because you are subscribed to the Google Groups "கணித்தமிழ் ஆய்வுக் குழுமம்" group.
To unsubscribe from this group and stop receiving emails from it, send an email to kanittamiz+...@googlegroups.com.
To view this discussion visit https://groups.google.com/d/msgid/kanittamiz/CAHwtB4Y2mvguE0gp38-QOFGwDrAtxCsPofEpAsOKPtEBkHUm1A%40mail.gmail.com.


--
Rama Suganthan




The minute you think of giving up,
think of the reason why you held so long..


Save Nature !  “The best time to plant a tree was 20 years ago. The second best is now “
Please do not print this email unless it is absolutely necessary

recycling of a 3ft high stack of news paper will save a tree
Reply all
Reply to author
Forward
0 new messages