Hi folks,
Since our talk in April last year, Starfish has evolved a lot and new versions have been released. Our goal is to make Starfish fit the needs of real-world users, so we would love to hear any feedback you have. You are welcome to join us at
http://groups.google.com/group/hadoop-starfish.
For those of you who are not familiar with it, Starfish is a self-tuning system built on Hadoop that provides good performance automatically, without requiring users to understand and manipulate the many tuning knobs in Hadoop.
With Starfish, you can analyze the performance of your Hadoop job at a fine-grained level, e.g. the time spent on map processing, spilling, merging, shuffling, sorting, and reduce processing, so you can see which phase is the performance bottleneck.
You can also ask "what-if" questions, e.g. "What if I double io.sort.mb?", and Starfish will predict the new behaviour of the job, so you can better understand how these parameters affect it. In addition, you can simply let Starfish find the optimal configuration for you to achieve the best performance.
Feel free to let us know if you have any questions.
Thanks,
Jie
------------------------
Starfish Group, Duke University
Starfish Homepage:
www.cs.duke.edu/starfish/
Starfish Google Group:
http://groups.google.com/group/hadoop-starfish