Hello,
My name is Augusto Souza (http://github.com/augustorsouza) and I am a computer science master's student at the University of Campinas (Brazil). My area of research is Distributed Systems, and I have been trying to work with the Hadoop community to make some contributions to it.
I have been helping an Apache developer get the changes related to checkpointable preemption of jobs accepted (as you can see from my activity in Hadoop's Jira).
Since I am having problems getting changes committed into Hadoop, I have been looking into alternative distributed systems frameworks that could benefit from checkpointable preemption of mappers and reducers when a higher-priority job gets scheduled. I found Disco on GitHub, and that led me to this mailing list.
I am curious whether a checkpointing feature would be useful for this project. If so, I would like to contribute to it and measure some results to support my master's work at the university.
Does Disco have a scheduler or something similar? If not, I think I could try to write one based on Hadoop's and also add a checkpointing preemption feature based on the patch I have been working on for Hadoop.
I am also able to help with any other needs Disco might have, and I could carry out some research along the way as part of completing my master's.
Thank you in advance!
Best regards,
Augusto Souza