Thanks for your note. This group has been pretty quiet. I would also love to hear how other folks are using SWIM!
Here at Cloudera we're doing a lot with SWIM. I wish I could say more, but performance and performance methodology really do give competitive advantages. Become a partner and you'll find out very quickly!
The
particular change you're proposing sounds super useful! We've been
thinking about something like that for a while but haven't got around to
building it. Please don't hesitate to submit patches. I'll look at it
in a timely manner.
Re tuning - Your best bet for a
proof-of-concept type deployment would be to deploy CDH using Cloudera
Manager, because that would give you sensible default configurations
based on our field experience, and verified using SWIM and other tools. Large-scale clusters benefit from
additional per-workload, per-hardware type tuning, but that's
customer-specific enough that our field/support folks handle it.
Re analyzing - In general, it often makes sense to compare per-workload metrics, such
as median job duration, 99th percentile job duration, or average/peak
task-slot utilization. I assume the
script included with SWIM is inadequate for you. Are you allowed to talk about why? If so I hope I can suggest something more specific.
To give you a flavor of some things we're doing with SWIM, see this (circa 2011)
post from one of our partners.