On Sunday, June 10, 2012 9:29:12 PM UTC+1, L. David Baron wrote:
> I fundamentally disagree with this change; it makes tbpl display
> less useful state about whether the tree is currently healthy in
> favor of using it as an enforcement mechanism for starring of
> intermittent (or non-intermittent) test failures. While having
> knowledge of our intermittent test failures is an important aspect
> of keeping the tree healthy in the long run, I don't think it's
> important enough that we should stop displaying information about
> the current health of the tree in order to force people to annotate
> old intermittent failures.
Hi David
This was most definitely not done to force people to annotate old intermittent failures. There are only ~6 people who've regularly filed intermittent oranges over the last 3 months and I do not believe for a second this would change that.
What it does do however, is make my & any other sheriffs' lives a lot less repetitive when it comes to monitoring the trees - since prior to this change, I/we would have to routinely cycle through the ~10 TBPL tabs we had open, since the tab title could say "[0] Mozilla-Inbound" but be lying and there in fact still be unstarred failures that needing dealing with. After the change, I can actually trust the tab title, which I hope will save me in the order of 20-25 mins each and every day (very rough sums, but it's in that ballpark).
In addition, the failure squares/count was not actually an accurate indicator for the health of the tree:
* The "last job of each type" concept was broken, such that if a previous push's job finished after a newer one, then that result would be used instead, falsely telling you that the tip was green, when in fact it may not be.
* Now that PGO is not done per push (and given that the pgo-only failure rate has sadly been higher than predicted when the idea was proposed), having a green tip is still not an indicator of a healthy tree, or that it is ready to be merged, since we have to specifically look for a push that had PGO to merge from that.
* Same as above, but with coalescing of jobs. The 'tree heath' boxes may not be showing orange/red, but 75% of the tests haven't been run for 10 pushes (or there's just loads of pending), so the tree could actually be utterly busted.
Given that it wasn't accurate and yet people were treating it as such (as proven by your post), it hindered monitoring multiple TBPL tabs & completely busted the layout whenever there were over X failures, I believe removing it was the right thing to do. However, I agree that a useful addition to TBPL would be a "can push/don't push" light (with an algorithm that tries to at least take into account some of the points above), a la bug 568819.
If you have any further questions, please let me know :-)
Best wishes,
Ed