Keen on working for tasks related to Language Models and Information Extraction for IndicNLP

18 views
Skip to first unread message

Sinchani Chakraborty

unread,
Mar 27, 2019, 4:01:16 AM3/27/19
to indicnlp
Hi,

I am Sinchani. I have the experience of training language models, implementing deep learning modules for Text Classification. 

I am keen on contributing to this project. In the Github link , I found two tasks: 1.) Build a UlmFit model, 2.) Get translation data respectively that were unchecked. I can explore the first task i.e,Build a UlmFit model. Also, it will be really great if you could share other prior tasks that are to be done.

Thanks & Regards,
Sinchani

MS Research Scholar,
Dpartment of Computer Science and Engineering,
IIT Kharagpur

Soham Chatterjee

unread,
Mar 27, 2019, 5:35:18 AM3/27/19
to indicnlp
Hey Sinchani,

Thank you for reaching out!

The UlmFit model task is still pending. Feel free to give it a shot!

Current tasks:
1) Right now I am exploring ways to perform stemming and lemmatization for Bengali. Prof Sudeshna from IITK has done some work on it.
2) I have received access to TTS and STT data for Bengali. Will start work on that soon.
3) There are other scripts/dialects in Bengali and I was searching for datasets for the same.

It would be great if you could contribute to this! Feel free to reach out to me if you have any questions
Reply all
Reply to author
Forward
0 new messages