Translation tools

327 views
Skip to first unread message

Pankaj Nathani

unread,
Jan 21, 2014, 5:53:29 AM1/21/14
to build...@googlegroups.com
Starting new thread for translation tools. Here's the link to the repo: https://github.com/BuildmLearn/Translation-tools

Rahul, please let us know when there's something ready. Ashmita can learn how to run and modify the Python scripts.

If anyone else wants to work on the translation tools, do post here :)

Regards,
Pankaj.

lorenzo lambertini

unread,
Feb 24, 2014, 3:32:17 PM2/24/14
to build...@googlegroups.com
Hi, i'm Lambertini Lorenzo from Bologna, Italy.

i'm a student of the third year at Computer Science and also i work part-time at Finmeccanica as Java Developer. 

I am interested in applying for GSoC 2014 under Buildmlearn and am interested in particular on  the project idea "Translations Tools".

i want to know more details about this project.

Regards,

Lorenzo

Nicu Badescu

unread,
Feb 24, 2014, 4:36:55 PM2/24/14
to build...@googlegroups.com
Hello,

My name is Nicu Badescu and I'm a sophomore at Politehnica University of Bucharest.

Last summer I worked at an Open Source Django project for my University and I am confident that
I have the required technical skills for the 'Translation Tools' projects.

Can you tell us what can we do in order to get started with this project?

Thanks,
Nicu Badescu

Ankit Agrawal

unread,
Feb 24, 2014, 4:53:40 PM2/24/14
to build...@googlegroups.com
Hi everyone,

              I would like to introduce myself again under the context of this thread. I am Ankit Agrawal, a 4th year Dual Degree student in Electrical Engineering with Master's specialization in Signal Processing at IIT Bombay. I am interested in applying for GSoC 2014 under Buildmlearn and am interested in the project idea - 'Translation Tools'.

              I have good experience of programming in Python and my contributions to some of the open source projects in Python can be seen on my github profile. I have completed coursework in Machine Learning, Natural Language Processing, Computer Vision, Speech Processing, Information Retrieval etc. I participated last year in GSoC 2013 under scikit-image, an image processing library in Python where I implemented some algorithms relating to Feature Detectors and Binary Feature Descriptors.

              I would like to discuss in detail more about the use-cases and features expected to be implemented this project. Please let me know how I should proceed further on this idea. Thank you.

Rahul Ahuja

unread,
Feb 26, 2014, 2:56:46 AM2/26/14
to build...@googlegroups.com
Thanks everyone for your interest in this project. Many people have approached me personally for this as well.

For this project, you will need to have basic understanding of Python scripting. It is beneficial if you have also worked on UI modules in Python. We would be creating a couple of translation tools, one of which is explained in detail on the ideas page. 

What you can do now is check out the details on the ideas page, see if you would be able to work on the steps given. Check out the translation APIs provided by various providers Bing, Yahoo, Google, others.
What would be easy for you? You can share your approach towards how you will develop this project, even share rough mockups to show the UI that you will create, etc.

As mentioned in the ideas page we would want the tool to work on Windows and Linux (Ubuntu), so it should be developed with that in mind.

rishabh roy

unread,
Feb 26, 2014, 6:49:35 AM2/26/14
to build...@googlegroups.com
hi iam rishabh roy  2nd year  btech computer science student 
i have a good programming experience of programming in python my Devoloper profile, i   like to design python GUI apps and participating in online programming competitons  , recently i made a media player for windows called Droy music player using python with   python modules like Tkinter and py2exe , and i have been contributing to opensource orgs like   Mozilla and i have successfully merged my patch to mozilla remo   so i have good understanding in using Git  too .


My idea about this project
===================
for me using microsoft's translation api would be much easy and efficient to use  i have been working on microsoft translation api  for some time so my idea of implementation  is based on using urllib2 and error handling urllib2 requests
Here’s a simple usage example from Japanese to English, in the Python REPL:

>>> import msmt
>>> token = msmt.get_access_token(MY_CLIENT_ID, MY_CLIENT_SECRET)
>>> msmt.translate(token, 'これはペンです', 'en', 'ja')
<string xmlns="http://schemas.microsoft.com/2003/10/Serialization/">This is a pen</string>


The API returns XML, so a final processing step for a real program would be to use something like lxml to parse out the translation result.

so  we could  get just the translated result out of the XML object returned by the API.

In the case of the example above, this is just the classic1 phrase:

This is a pen

Message has been deleted

Alex Mathew

unread,
Feb 26, 2014, 9:22:08 AM2/26/14
to build...@googlegroups.com
Hello !

I am Alex Mathew, currently doing the 3rd year of my B.Tech course in Computer Science & Engineering, from SASTRA University, Thanjavur, India.

I came across BuildmLearn's page on the GSoC organization list. On going through it, I was really intrigued by the possibility of working on a project here. What specifically caught my eye was the Translation tools idea. I think this is a brilliant idea, which can be extremely useful as a tool, and I also believe that this is something that I can work on and efficiently complete.

I have been working with Python for more than a year now, and I am comfortable with learning more about modules to use by reading code and its documentation. I can catch up and get up to speed with existing projects, and work on it.
I am familiar with the Flask microframework, and most recently, designed and released an online Cricket trump cards game, which uses updated statistics from the Cricinfo player pages. I used PostgreSQL as the back-end database, and used Heroku for hosting the game. The game can be found on : http://cricket-trumps.herokuapp.com/

I am familiar with using the Git version control system, and of late, have been putting my code on GitHub, the link to which is here : https://github.com/AlexMathew

I am also proficient with C and C++, which I have been using for over 5 years, and comfortable with using Java, HTML and CSS. Currently, I am working with JavaScript and jQuery, to expand my skill range.

I think my perseverance and my drive to get work done would make me an ideal person to work on this. Given my desire to prove my mettle on a grand stage like the GSoC, and this being my first time applying for the GSoC, I believe that I can make an impact with some well done work.

You can contact me by email : alexma...@gmail.com

Looking forward to hear more about this project so I can get a clearer picture of the target in sight, and get to work with my best foot forward.

Cheers !

Regards,
Alex.

Rahul Ahuja

unread,
Feb 26, 2014, 10:06:59 AM2/26/14
to build...@googlegroups.com
Hi Rishabh

Your snippet is fine. Microsoft's translation API is one way to do it. We would like to support various sources like Bing, YQL, Google, Microsoft, etc. for translation and provide a selection in the tool as to which service should be used. The project also includes creating a UI for the tool which would accept a file with strings and output a file with translated strings. The tool is expected to work on Windows and Linux.

Do consider the above points when you submit your proposal!

Rahul

Rahul Ahuja

unread,
Feb 26, 2014, 10:10:36 AM2/26/14
to build...@googlegroups.com
I am Alex Mathew, currently doing the 3rd year of my B.Tech course in Computer Science & Engineering, from SASTRA University, Thanjavur, India.

Thanks Alex for the detailed introduction. Most of the information about the Translation tools project I have already posted. The ideas page provides steps of what needs to be done.

Let me know if you have any specific questions. Looking forward to your proposal!


Sarwar Chahal

unread,
Feb 26, 2014, 12:49:22 PM2/26/14
to build...@googlegroups.com
Hello, I am interested in developing Translation Tools  as I have quite a bit knowledge of Python and Google Apis having worked with them.I also have experience in Qt.

Here is a sample translation script using GoogleAi.
detech.txt
py-googletranslate.py
translatehelp.txt

Sarwar Chahal

unread,
Feb 26, 2014, 12:56:50 PM2/26/14
to build...@googlegroups.com
Hi, i'm Sarwar Chahal from BITS Pilani,INDIA.

i'm a student of the second year at Computer Science

I am interested in applying for GSoC 2014 under Buildmlearn and am interested in particular on  the project idea "Translations Tools".
I have experience in Python and Qt and also I have worked with Google Api's.
PFA a sample python translator script.


On Tuesday, January 21, 2014 4:23:29 PM UTC+5:30, Pankaj Nathani wrote:
texttranslate.zip

Pankaj Nathani

unread,
Feb 26, 2014, 1:18:21 PM2/26/14
to

Umang Goel

unread,
Feb 26, 2014, 3:49:17 PM2/26/14
to build...@googlegroups.com
Hi,

I will be participating in GSOC'14 and wish to contribute to BuildmLearn. I'd gone through ideas page and I'm really interested in working on Translation Tools project as I see a lot of scope to learn while developing it.
I've been using python for over a year. I have also used wxpython in an application to provide gui.I have also worked on an application which involved the parsing of json file so I'm also comfortable in handling json files.
Regarding apis that you intend to use for translation , google translation api has been made a paid api so using it is out of question. Also with bing translation api, we get a free trail subscription using which we could only convert one million words per month. Considering the use of this application , I think this too wouldn't be effective but still it could be considered as one of the alternatives.
As far as the platform issues are concerned , I think using python would solve it because in general(except few specific commands) the same python code could be used in both linux and windows.
Waiting for your reviews on the same.

Thanks,
Umang.

Krzysztof Kaczmarek

unread,
Feb 26, 2014, 4:16:56 PM2/26/14
to build...@googlegroups.com
Hi everyone, 
My name is Christopher and I am from Poland
I am student of second year at Computer Science at University of Technology in Gdansk.
I am interested in the project idea named Translation Tools.
 
 
To be honest I am not very experienced in Python big projects.
Right now I have been working with Python for little more than half year, but i work hard to be possibly the best.
I can say that I feel really comfortable with coding in Python and this language helps me everyday.

You should know i have experience in other languages, mostly in C and C++, 
and I have been doing big project with my friends for our University for almost a year.

I am familiar with Git and UI basics so it will be a pleasure to learn more and to work on so cool project.
This is my first time applying for GSoC and I believe this is the right project for me.
I have confidence in my coding skills, so I know i can complete this project well.
 
I've already thought about some details and I have a few ideas. I think with Python it will be easy to do this
for both, Linux and Windows systems.

Best Regards,
Christopher

Rahul Ahuja

unread,
Feb 27, 2014, 12:41:31 AM2/27/14
to build...@googlegroups.com
Regarding apis that you intend to use for translation , google translation api has been made a paid api so using it is out of question. Also with bing translation api, we get a free trail subscription using which we could only convert one million words per month. Considering the use of this application , I think this too wouldn't be effective but still it could be considered as one of the alternatives.

Please check this post where I have clarified on the APIs: https://groups.google.com/d/msg/buildmlearn/1E-PcLeAl2I/mq3sxcG8RWkJ
 
As far as the platform issues are concerned , I think using python would solve it because in general(except few specific commands) the same python code could be used in both linux and windows.

Yes, Python is what we want to use.

Rahul Ahuja

unread,
Feb 27, 2014, 12:43:12 AM2/27/14
to build...@googlegroups.com
Thanks Christopher. Python is not that hard to learn on the job. If you feel confident that you'll be able to learn and complete the project, do consider submitting a proposal :)

Saurabh Patel

unread,
Feb 27, 2014, 1:36:22 AM2/27/14
to build...@googlegroups.com
Hi,
  I am Saurabh Patel currently pursuing B.Tech in Computer Science. I am interested in your project idea of translation tools and would like to contribute in it in upcoming gsoc-2014.

I have a fair bit of understanding of Python. Link to my Python projects: https://github.com/mrfinch

I also participated in gsoc-2013 under digiKam-KDE in its kipi-plugins part of digiKam. I made export plugins for dropbox and google drive. Link to both folders of dropbox and googledrive: https://projects.kde.org/projects/extragear/graphics/kipi-plugins/repository

Also apart from learning about various translation tools, do I need to solve a bug.

Thanks

Pankaj Nathani

unread,
Feb 27, 2014, 3:26:11 AM2/27/14
to build...@googlegroups.com
Hi Saurabh, it's always great to see students who have participated in prior GSoC programs. 

> Also apart from learning about various translation tools, do I need to solve a bug.

Well, these tools would be written from scratch as a new project so there are no bugs. What you can do is include mockups of UI and details of what APIs you will use and your experience with them in your proposals.

Regards,
Pankaj.
Message has been deleted

Krzysztof Kaczmarek

unread,
Mar 2, 2014, 4:43:44 AM3/2/14
to build...@googlegroups.com
Hello again,

Could you tell me which version of Python and GUI library do you prefer?
It should be helpful for all of us.

Thanks

Kevin Sebastian

unread,
Mar 2, 2014, 5:03:23 AM3/2/14
to build...@googlegroups.com
 am keen to work on this Translation tools as I had done a similar project before in Natural Language processing.
I was thinking about the user interface for entering the language text.I was thinking of using an open-source Python based text editor.I had already created a fully functional text editor in Python using PyQT and Tkinter frameworks.Using this we can manually add options like making proper nouns bold etc to  minimize error.
Since I have worked on NLP(natural language processing) before I can use these concepts to preprocess the sentences such as adding appropriate punctuation marks,separating subject predicate etc.This will aid the API in producing most accurate results.
Using machine learning concepts we can improve accuracy even more.
I am currently exploring the google translate API though in order to use it I have to pay a fee.
I would like to know if I am thinking in the right direction.


On Tuesday, January 21, 2014 4:23:29 PM UTC+5:30, Pankaj Nathani wrote:

Rahul Ahuja

unread,
Mar 2, 2014, 10:09:00 AM3/2/14
to build...@googlegroups.com
Hi Rishabh,
 
hi i  would like to propose an interface keeping things
i have added the new cs import file option and options for selections of various api's  like bing ,yahoo and google
this interface has a option to import  .csv file
it has option for user to select the source and destination language using a pointer /check box
Once the user provides the CSV / excel file, the script would translate the strings in the first column using a REST API (google / bing translate APIs).

Your interface provides all the required options, yes. But it seems very clutter.. can you refine it a bit before submitting it in your proposal?
 

Rahul Ahuja

unread,
Mar 2, 2014, 10:10:08 AM3/2/14
to build...@googlegroups.com
We use Python 2.7. As for GUI Libraries - any cross platform python libraries would do.

Rahul Ahuja

unread,
Mar 2, 2014, 10:11:40 AM3/2/14
to build...@googlegroups.com
Kevin, I replied to your post here: https://groups.google.com/forum/#!topic/buildmlearn/e6-gYyx0Sh4

Please avoid posting duplicates.
Message has been deleted

Krzysztof Kaczmarek

unread,
Mar 5, 2014, 4:39:00 PM3/5/14
to build...@googlegroups.com
Is it possible to take one or two weeks off for exams? I dont want to be disqualified at the start.


Pankaj Nathani

unread,
Mar 6, 2014, 12:15:06 AM3/6/14
to build...@googlegroups.com
Hi Krzysztof,


Is it possible to take one or two weeks off for exams? I dont want to be disqualified at the start.



  • Do you have other commitments that we should know about? If so, please suggest a way to compensate if it will take much time away from Summer of Code.

You may include the exams or timeoff that you should mandatorily take during GSoC period. Also, mention ways in which you can compensate for the lost time.. for example, you may work a few weekends, or you may try complete some milestones ahead of time, so that you are on track.

So mention all the above details in your proposal and we will consider these while evaluating. 

rishabh roy

unread,
Mar 14, 2014, 11:46:17 AM3/14/14
to build...@googlegroups.com
 mentors i would like to add a proposal for adding a text to speech api that will be helping users to learn to speak the translated words too . it would also add to a noble act   as  lot of people need a tool specially students who want to learn new language this will require to deal with speech apis from google and and other engines . so i feel i can implement this feature so required your suggestion about this
 this is a feature that will be influencing lot a of people using this tool  ;)

Pankaj Nathani

unread,
Mar 14, 2014, 1:21:03 PM3/14/14
to build...@googlegroups.com
Hi Rishabh
 
 mentors i would like to add a proposal for adding a text to speech api that will be helping users to learn to speak the translated words too . it would also add to a noble act   as  lot of people need a tool specially students who want to learn new language this will require to deal with speech apis from google and and other engines . so i feel i can implement this feature so required your suggestion about this
 this is a feature that will be influencing lot a of people using this tool  ;)

Our tool focuses on localisation. We don't need text to speech features in this tool.

rishabh roy

unread,
Mar 14, 2014, 1:42:29 PM3/14/14
to build...@googlegroups.com
ok thanks for feedback pankaj sir .
Reply all
Reply to author
Forward
0 new messages