Kind regards,
Daniel Hiller
He / Him / His
Principal Software Engineer, KubeVirt CI, OpenShift Virtualization
![]() |
Red Hat GmbH, Registered seat: Werner von Siemens Ring 12, D-85630 Grasbrunn, Germany Commercial register: Amtsgericht Muenchen/Munich, HRB 153243, Managing Directors: Ryan Barnhart, Charles Cachera, Avril Crosse O'Flaherty
--
You received this message because you are subscribed to the Google Groups "kubevirt-dev" group.
To unsubscribe from this group and stop receiving emails from it, send an email to kubevirt-dev...@googlegroups.com.
To view this discussion visit https://groups.google.com/d/msgid/kubevirt-dev/CAK%2BeyL6mRSiTfD_RMNbG3WVLOKt9Mg4zpU%2Bvt%3DrzXX9x3YhixQ%40mail.gmail.com.
To view this discussion visit https://groups.google.com/d/msgid/kubevirt-dev/CAJxgHm5mbVfB0bhLkF3_KKV%2BpRznddKw7T%2BD2J06vWYYLaQGVw%40mail.gmail.com.
Hey Dan,
We were discussing this in the last sig-ci meeting[1].
Generally speaking, we observed an increase in the clustered failures.

We discovered a correlation between this increase in serial failures and the making of 1.35 lanes to `always_run: true`.
The root cause is still under investigation, but I think that making the switch to the mandatory 1.35 provider, andthe deprecation of 1.32 provider lanes will help a lot in this direction.
Hey all,On Tue, Feb 17, 2026 at 5:16 PM Federico Fossemo <ffos...@redhat.com> wrote:Hey Dan,
We were discussing this in the last sig-ci meeting[1].
Generally speaking, we observed an increase in the clustered failures.To expand on that: what we are seeing is that all sig-compute periodic lanes show several clustered failures (ex 1.35 [2]), while the presubmit lanes do not show this. I believe there's a correlation of clustered failures on periodics caused by adding the additional 1.35 always_run lanes on top, however there's no strong indication when looking at the graphs.
Blue lines from left to right:* 1st marks the always_run: true for 1.35* 2nd marks the 1st occurrence of a cluster failure on 1.35 sig-compute* 3rd marks KubeVirtCI bump 1.35 to stableWe discovered a correlation between this increase in serial failures and the making of 1.35 lanes to `always_run: true`.
The root cause is still under investigation, but I think that making the switch to the mandatory 1.35 provider, andthe deprecation of 1.32 provider lanes will help a lot in this direction.I agree. Removing the old lanes will also give more capacity overall.
On Tue, Feb 17, 2026 at 6:35 PM Daniel Hiller <dhi...@redhat.com> wrote:Hey all,On Tue, Feb 17, 2026 at 5:16 PM Federico Fossemo <ffos...@redhat.com> wrote:Hey Dan,
We were discussing this in the last sig-ci meeting[1].
Generally speaking, we observed an increase in the clustered failures.To expand on that: what we are seeing is that all sig-compute periodic lanes show several clustered failures (ex 1.35 [2]), while the presubmit lanes do not show this. I believe there's a correlation of clustered failures on periodics caused by adding the additional 1.35 always_run lanes on top, however there's no strong indication when looking at the graphs.Correction: 1.34 and 1.35 serial presubmit lanes both show clustered failures [3]Blue lines from left to right:* 1st marks the always_run: true for 1.35* 2nd marks the 1st occurrence of a cluster failure on 1.35 sig-compute* 3rd marks KubeVirtCI bump 1.35 to stableWe discovered a correlation between this increase in serial failures and the making of 1.35 lanes to `always_run: true`.
The root cause is still under investigation, but I think that making the switch to the mandatory 1.35 provider, andthe deprecation of 1.32 provider lanes will help a lot in this direction.I agree. Removing the old lanes will also give more capacity overall.
On Tue, Feb 17, 2026 at 7:38 PM Daniel Hiller <dhi...@redhat.com> wrote:On Tue, Feb 17, 2026 at 6:35 PM Daniel Hiller <dhi...@redhat.com> wrote:Hey all,On Tue, Feb 17, 2026 at 5:16 PM Federico Fossemo <ffos...@redhat.com> wrote:Hey Dan,
We were discussing this in the last sig-ci meeting[1].
Generally speaking, we observed an increase in the clustered failures.To expand on that: what we are seeing is that all sig-compute periodic lanes show several clustered failures (ex 1.35 [2]), while the presubmit lanes do not show this. I believe there's a correlation of clustered failures on periodics caused by adding the additional 1.35 always_run lanes on top, however there's no strong indication when looking at the graphs.Correction: 1.34 and 1.35 serial presubmit lanes both show clustered failures [3]Blue lines from left to right:* 1st marks the always_run: true for 1.35* 2nd marks the 1st occurrence of a cluster failure on 1.35 sig-compute* 3rd marks KubeVirtCI bump 1.35 to stableWe discovered a correlation between this increase in serial failures and the making of 1.35 lanes to `always_run: true`.
The root cause is still under investigation, but I think that making the switch to the mandatory 1.35 provider, andthe deprecation of 1.32 provider lanes will help a lot in this direction.I agree. Removing the old lanes will also give more capacity overall.What holds us from stopping 1.32 on main right now?
To view this discussion visit https://groups.google.com/d/msgid/kubevirt-dev/CAHOEP55LUo8oA2T_MUvZeTUvtWghjx6vsY2MQ3-o0e-7q7DidA%40mail.gmail.com.