Introduction | GSoC 2019

54 views
Skip to first unread message

游薪渝

unread,
Apr 4, 2019, 6:10:22 AM4/4/19
to Yellowbrick
Hi,
My name is Xinyu You, a second-year postgraduate student of national university of defense technology. 

After going through more than 100 projects in  CSoC 2019, I realize that I want to apply the Yellowbrick project under NumFOCUS.

Why I want to apply:
1. The idea "visual steering" in Yellowbrick surprise me. This is a more human way to improve our decision in data science's work. I believe it has good prospects and great potential.
2. I love data science. Different from software development, Data Science give me the ability to gain insides from data. It really makes me very fascinating.
3. Your group seems very happy being together in the picture : >

My ability:
I have a strong Computer Science background. I understand the basics of python, data science processing and have studied machine learning at school. Now, I really hope to apply what I have learned to Yellowbrick. 

I have great communication ability. I write a blog about data science (http://data2art.com) in Chinese. I have planned to write more posts in English.

For you who want to know more about me, please see my resume here: "http://resume.data2art.com".

My Question:
1. I found this Project a little late. So is it too late to apply now?
2. If I write a proposal now. After finishing my proposal, can I ask to advise from the mentor?

Looking forward to your reply. Thank You!

------------------
Sincerely,
Xinyu You


Saurabh Daalia

unread,
Apr 4, 2019, 2:46:36 PM4/4/19
to Yellowbrick
Hi Xinyu,

Welcome to the Yellowbrick Group. I am Saurabh Daalia, a fellow student and Google Summer of Code applicant this year. Well I agree with you that Yellowbrick can help get deep descriptive and diagnostic visual insights. 

IMO, the student applications are still open till 09/04/2019, so if you can manage to submit your proposal before that, it will get reviewed. 
Yes, Mentors are really supportive and are happy to discuss and give valuable answers to our questions. 

Meanwhile, Here are few of some links and answers which can help you write an awesome proposal. 

2) Some Key points to be included in the Proposal as mentioned by Benjamin 

And the most important go through the Yellowbrick Contribution Guide here, to get a good grasp on how to develop visualizer, which will definitely help you structure your GSOC Proposal.   

Wish you all the Best !
Regards
Saurabh Daalia
Message has been deleted
Message has been deleted

aem...@georgetown.edu

unread,
Apr 4, 2019, 3:08:12 PM4/4/19
to Yellowbrick
Hi Xinyu,
Thanks for your message and your interest in Yellowbrick.  Everything that Saurabh said above is accurate as far I can tell.  Thank you Saurabh for answering Xinyu's questions.

Xinyu, you make some great points.  Yes, we are all about people here at Yellowbrick and that visual element for the machine learning workflow is really important to us and something that we thought was missing that we could add value for data scientists, researchers and all of our users.  We too love data science and are firmly committed to open source software.  Yes, as we are about people, we do like to be together and with our users both in person and via the internet (Twitter etc).  

This is our first year participating in GSOC and as such we intend to keep to a small footprint.  Whether or not your application is accepted, we encourage you to check out our GitHub repo page.  There are other ways as well (i.e. blog posts highlighting Yellowbrick) for you to get involved.  You have a blog so we would love to put out and retweet a blog post on Twitter (in Chinese or English is fine!).  We are growing our user base in China.  

So what are the next steps? * Star us on our GitHub page * Follow us on Twitter (@scikit-yb) *check out our docs in detail *work through our quick start guide and tutorials *read through our contributor guide carefully--this especially contains essential information to how we do our work successfully in terms of format and conventions *look at our existing issues page on GitHub repo, and identify an existing issue that is interesting or meaningful to you for use in your proposal.  Note: be sure to submit your proposal to Google via the official GSOC portal instead of through our repo.  This will ensure that we officially receive it and can review it after the April 9th deadline.  Thanks again for your interest and we'll see you online.  
Thanks,
Adam

游薪渝

unread,
Apr 4, 2019, 10:02:47 PM4/4/19
to Yellowbrick
Hi Saurabh,

Thank you for your reply. Those links you give really help me!

Best Regards
Xinyu You 


在 2019年4月5日星期五 UTC+8上午2:46:36,Saurabh Daalia写道:

游薪渝

unread,
Apr 4, 2019, 10:17:52 PM4/4/19
to Yellowbrick
Hi Adam,
Thank you for your detailed reply. The steps you give is so specific. So now I can't wait to take a good look at the project guide, familiarize myself with the project and draft my proposal.

I have forked the project and followed the Twitter(@scikit-yb). I would love to participate in any way I can, and I will explain this project in detail in my blog in the future. I hope more developers and researchers will use it.

Best Regards
Xinyu You 

在 2019年4月5日星期五 UTC+8上午3:08:12,aem...@georgetown.edu写道:

aem...@georgetown.edu

unread,
Apr 5, 2019, 7:01:14 PM4/5/19
to Yellowbrick
Hi Xinyu,
You are most welcome.  I'm glad, we try to be specific as possible to help students know exactly what to do.  That's great news, have fun with it!  Don't forget to star us on github too if you haven't yet!  That would be great.  I also wanted to bring to your attention that we have a partial translation into chinese of our docs here https://www.scikit-yb.org/zh/latest/.  We welcome any contributions to that as well.
Stay well and thanks,
Adam

游薪渝

unread,
Apr 6, 2019, 5:43:01 AM4/6/19
to Yellowbrick
Hi Adam,

You mentioned that there is a Chinese version document. Wow, this is very friendly and important for the Chinese community. So, when I read the Chinese document, I found some mistakes in it and I fixed them. Then, I also translated the tutorial section.I open a Pull Request here. Hoping this is helpful for others. 

If possible, I hope I can translate these documents into Chinese later. You know, it can greatly help increase the influence of the project in the Chinese community. Can I include this plan in my proposal?

Besides, I have one questions: Can I choose two ideas from the Yellowbrick Ideas List? Or I can just choose one?

Best Regards
Xinyu You 

在 2019年4月6日星期六 UTC+8上午7:01:14,aem...@georgetown.edu写道:

游薪渝

unread,
Apr 6, 2019, 8:18:54 AM4/6/19
to Yellowbrick
Hi Adam,

Sorry to bother you again. Could you tell me do the ideas on the project idea list all have the same importance? Is there a high, middle or low priority?

By the way, there is a time difference between us, so I wonder could you tell me when it is the appropriate time to contact you.

Thank you and Best wishes
Xinyu You

在 2019年4月5日星期五 UTC+8上午10:17:52,游薪渝写道:

aem...@georgetown.edu

unread,
Apr 8, 2019, 10:54:51 AM4/8/19
to Yellowbrick
Thanks Xinyu!  I would focus on the one the speaks most to you for your proposal.  We want to see your code, thinking and vision for tackling these problems.  And yes as Dr. Bengfort responded we can connect you later with a contributor who also knows Chinese to work out your translation.
Thanks,
Adam
Reply all
Reply to author
Forward
0 new messages