Chirp thinks your experiment is having an effect.
Analysis based on 3 days' data from 2026-03-25 to 2026-03-27
Blink.XMLParsing.NonXsltXmlParsingTime.Combined (0.5 quantile)
| Platform | Experiments | Quantile | Change | Effect size | Samples (approx.) | Test |
|---|---|---|---|---|---|---|
| M148 on Win (DEV) | Enabled_20260220 vs Control_20260220 | 50 | ⇗0.01 (0.15 +/- 0.26) ms | small | 358 M | T-test with jackknife resampling |
| M148 on Android (DEV) | Enabled_20260220 vs Control_20260220 | 50 | ⇗0.16 (0.50 +/- 0.39) ms | big | 87 M | T-test with jackknife resampling |
| M148 on Mac (DEV) | Enabled_20260220 vs Control_20260220 | 50 | ⇗0.02 (0.16 +/- 0.26) ms | medium | 10 M | T-test with jackknife resampling |
| M148 on Android (CANARY) | Enabled_20260220 vs Control_20260220 | 50 | ⇗0.14 (0.47 +/- 0.35) ms | big | 15 M | T-test with jackknife resampling |
Some subpopulations were not tested because one or more groups in each of them were either completely missing or had too few data points.
| Message | Metric (histogram) | Platform | Experiments | Effect size | Samples (approx.) | Test |
|---|---|---|---|---|---|---|
| We detected significant change(s) in WebVitals.CumulativeLayoutShift6, but we also detected sample count imbalance(s). You may want to investigate further if you intended the sample counts to be the same as those of the control group. See go/finch-analysis#mix-shift for more details. Technical problems (insufficient counts) prevented testing WebVitals.CumulativeLayoutShift6 in N=2 subpopulations. Groups defined in gcl_studies/XMLParsingRustNonXslt.gcl?plan.CANARY_DEV_BETA.canary_dev. Test run: T-test with jackknife resampling. | WebVitals.CumulativeLayoutShift6 | M148 on AndroidWebView (CANARY) | (Control_20260220), (Enabled_20260220) | None | nan | T-test with jackknife resampling |
| | | M148 on Mac (DEV) | Enabled_20260220 vs Control_20260220 | big | nan | T-test with jackknife resampling |
| Observed significant change of Blink.XMLParsing.NonXsltXmlParsingTime.Combined in N=4 subpopulations. Technical problems (insufficient counts) prevented testing Blink.XMLParsing.NonXsltXmlParsingTime.Combined in N=1 subpopulations. Groups defined in gcl_studies/XMLParsingRustNonXslt.gcl?plan.CANARY_DEV_BETA.canary_dev. Test run: T-test with jackknife resampling. | Blink.XMLParsing.NonXsltXmlParsingTime.Combined | M148 on Win (CANARY) | Enabled_20260220 | None | nan | T-test with jackknife resampling |
Statistical significance: p=1.11e-03 or lower across all results.
You can update this incident to associate it with a bug or snooze it.