I'm not sure.The main regressions were a couple of tests wrt separate compilation that started failing spuriously.I am worried about those. Did the higher degree of concurrency elicit some kind of race condition?Is the test framework to blame? We'll never know since those tests moved to pending recently (unless we revert the move).Additionally, some benchmarking tests were failing because they were run concurrently and thus didn't finish fast enough.We shouldn't have tests like this in the run/ category.
The random aborts we've been seeing date from before the partest change, according to Lukas.Also, the abort occurs way after, as well as before, partest runs, so I don't see how that could be related.
Finally, Paul said he has some improvements in the pipeline.
I guess our main problem is a lack of data.How much time has been lost with spurious failures, how much was gained by faster builds?When we revert, we're sure to get slower builds, but not sure to get fewer spurious failures.So the rational thing to do is not obvious to me.
On Thu, Jul 19, 2012 at 10:09 AM, Grzegorz Kossakowski <grzegorz.k...@gmail.com> wrote:
On 19 July 2012 10:00, Adriaan Moors <adriaa...@epfl.ch> wrote:
I'm not sure.The main regressions were a couple of tests wrt separate compilation that started failing spuriously.I am worried about those. Did the higher degree of concurrency elicit some kind of race condition?Is the test framework to blame? We'll never know since those tests moved to pending recently (unless we revert the move).Additionally, some benchmarking tests were failing because they were run concurrently and thus didn't finish fast enough.We shouldn't have tests like this in the run/ category.I agree. However, it takes time to smooth everything and we don't have that luxury when dealing with 2.10.x branch. That's my point.The random aborts we've been seeing date from before the partest change, according to Lukas.Also, the abort occurs way after, as well as before, partest runs, so I don't see how that could be related.We all know that in complex systems there are no things that are impossible. I think it doesn't hurt to revert partest change and see if they are not related. If our problems with Jenkins do not go away then we have to look for other suspects.Finally, Paul said he has some improvements in the pipeline.Which I want to see (in master). Again, I didn't say we should block/drop his work. I'm just asking for baby steps. That's main reason why we have two branches.--
Grzegorz Kossakowski
java.lang.InterruptedException