Re: GSoC Project: Better Sentence Aligner for Tibetan Text

Forest Rui Jiang

Mar 10, 2021, 9:51:55 PM
to Ashutosh Sharma, Elan Hourticolon-Retzler, Élie Roux, Claire Ho, tibetan-initi...@googlegroups.com
Hi Ashutosh,

We are interested in your background. Could you send me the link to your GSoC 2020 project? The link on your resume has expired.
Thanks!

Forest Rui Jiang | Software Engineer | for...@google.com | +1 650-862-0630


On Wed, Mar 10, 2021 at 8:16 AM Ashutosh Sharma <b18...@students.iitmandi.ac.in> wrote:
Dear Sir/Ma'am,
I am Ashutosh Sharma, currently a junior studying Computer Science and Engineering at Indian Institute of Technology Mandi, India. I am really interested in this project for the following reasons: my skill set closely complements the technologies used in the project, and I am currently learning more about heritage and old traditions. I am currently researching "the oral history used by people in India for sustainable development". I was also a Google Summer of Code student developer in 2020, and my work can be found on my GitHub project and contributions pages. Currently I am learning deep learning to grow my skill set, and I always like learning by doing. I am attaching my resume and my GitHub profile link for your reference.
Please let me know if I am a good fit, so that I may dive deep into the project and start working on my proposal. I am also open to switching projects and learning new technologies, as one grows only by working on challenging projects. :)
Thanks and Regards
Ashutosh Sharma

Forest Rui Jiang

Mar 12, 2021, 5:54:48 AM
to Ashutosh Sharma, tibetan-initi...@googlegroups.com
+tibetan-initi...@googlegroups.com please keep conversations in that group.

Hi Ashutosh,

I think you can be a good fit for this project. We have listed several projects in the idea list and they are all related to heritage preservation. Could you tell us a bit about why you chose the sentence aligner one?
Thanks,

Forest Rui Jiang | Software Engineer | for...@google.com | +1 650-862-0630


On Wed, Mar 10, 2021 at 7:21 PM Ashutosh Sharma <b18...@students.iitmandi.ac.in> wrote:
Hello Sir,
I am really sorry that the link did not work. GSoC 2020 has been moved to the archived section now, and I was not aware of that.
The project was mainly focused on web development, but I also have a good knowledge of data science, machine learning, and deep learning. Currently, I am exploring deep learning so that I can become confident as a developer.
Looking forward to hearing from you.
Thanks and Regards
Ashutosh Sharma

Ashutosh Sharma

Mar 12, 2021, 7:04:01 AM
to Forest Rui Jiang, tibetan-initi...@googlegroups.com
Dear Sir,
There were four projects mentioned in the document:
1. Improving Google Search on Buddhist Entities by Ingesting BDRC Database Into Wikidata.org
2. Better Computer-Assisted Translation (CAT) Tool For Non-Latin Scripts and Scholastic Needs
3. Better Sentence Aligner for Tibetan Text
4. Adding Tibetan Calendar in Unicode Common Locale Data Repository (CLDR) and International Components for Unicode (ICU) library
Sir, I read about each project very carefully. Though every project is really interesting to work on, I selected the project according to my current skill set. I am very comfortable with the technologies of the 1st and 3rd projects. But for the first project, knowledge of Buddhism was required, which I don't currently have. For the second and fourth projects, Java is required, in which I don't have much experience. I have been programming in C and C++ for the past 5 years, and I started working with Python 2.5 years ago when I started learning data science and machine learning in college. I have not worked with Java before, so I thought I should choose a project according to my current knowledge. If knowledge of Buddhism is not necessary for the first project, then I am really happy to work on it as well, because I am also good at databases. Currently, I am working as a teaching assistant in the database course offered by my college. Also, it would be really great if you could suggest a book on Buddhism for me to read, as I am always eager to gain new knowledge. Please let me know about the next steps.
Thanks and Regards
Ashutosh Sharma

Forest Rui Jiang

Mar 12, 2021, 1:29:38 PM
to Ashutosh Sharma, tibetan-initi...@googlegroups.com
Hi Ashutosh,

Thanks for your explanation! Sounds like you are a good fit for projects #1, #3, and #4. I am still asking ICU experts about #4. It's very likely you will be developing in Rust, which in most cases students need to learn anyway. Project #1 will most likely be using Python. Project #3 could be data oriented, so we would create a benchmark set and improve the metrics by adding new features in the code.
Though it would definitely be a plus, knowledge of Buddhism and/or Tibetan is not a prerequisite for any of these projects.

Happy to talk more, though we are still figuring out a better channel to communicate.
Best,

Forest Rui Jiang | Software Engineer | for...@google.com | +1 650-862-0630

Ashutosh Sharma

Mar 12, 2021, 2:03:29 PM
to Forest Rui Jiang, tibetan-initi...@googlegroups.com
Sir,

Shall we create a Slack workspace for the Tibetan Initiative? Slack is a really great communication channel, and we can use it for professional purposes as well. Please let me know about this.

Thanks
Ashutosh