Hey,
There was a thoughtful article about benchmarking at High Scalability recently. It mentioned the TechEmpower Framework Benchmarks, and included a number of opinions on what a performance metric might look like if it included both client load and latency.
I think many of the author's suggestions are very good. It would be great if we could get reliable latency numbers from the load generation tool (any news there?).
What I found most useful was the visualization. A cumulative distribution plot with logarithmic scale of percentile along the x-axis, and linear latency scale on the y-axis. Here is an example:

You can add as many lines as you can visually parse. It can get hard to read, but grouping things by color can help.
It might be nice to pick up the conversation about latency _and_ throughput again. Should we consider adding a new type of constant, defined load test?
Just riding the pine over here,
Philip