Thanks for taking a look, Aleks!
I've run the reftests listed in my report locally and produced output that can be used with Mozilla's reftest-analyzer:
A screenshot of what that looks like:
Tip: use keyboard shortcuts 1, 2, d, p and n to quickly move through results.
As you can see, subtle differences in text rendering account for the failures, including those in contain-size-002.html and contain-size-003.html. css-flexbox-row-reverse-wrap-reverse.html actually passes for me locally, even when using `./wpt run --verify` to run it many times. It looks like it has been flaky on wpt.fyi, so I've filed
https://github.com/web-platform-tests/wpt/issues/13539.
Here are the Chrome-only reftest failures that still seem worth investigating:
/css/CSS2/box-display/block-in-inline-001.xht and -002.xht
/css/CSS2/box-display/box-generation-001.xht and -002.xht
/css/CSS2/linebox/inline-formatting-context-002.xht through -007.xht
/css/CSS2/normal-flow/block-in-inline-empty-001.xht and -004.xht
/css/CSS2/normal-flow/block-in-inline-insert-001e.xht
/css/CSS2/normal-flow/block-in-inline-insert-001h.xht
/css/CSS2/normal-flow/block-in-inline-insert-002e.xht
/css/CSS2/normal-flow/max-width-applies-to-013.xht
/css/css-fonts/font-features-across-space-1.html and -3.html
/css/css-transforms/transform3d-perspective-003.html through -005.html
/css/css-transforms/transform3d-sorting-004.html
/css/css-variables/variable-generated-content-dynamic-001.html
/svg/extensibility/foreignObject/foreign-object-size.html
So, it is quite apparent that a bit of infra work is still needed to capture screenshots, and then to
keep track of which Chrome-only failures we have already triaged.
Generating a new report today, I also saw a lot of new Chrome-only failures show up, which is explained by
widespread timeouts in Edge 17 (an infra issue), i.e. some random set of tests times out on each run.
Outside of that, I think it would still be valuable for folks to look through the original report, but I hope to have something a bit less noisy for the next round.
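
For illustration only, here is a rough sketch of how the two sources of noise mentioned above (failures already triaged, and unreliable results from another browser) could be filtered out of such a report. This is not how the actual report is generated. The input shape (a dict mapping test path to per-browser status), the function name, and the rule that a test only counts as a clean Chrome-only failure when every other browser reports PASS are all assumptions made for the example.

```python
from typing import Dict, Iterable, List, Set, Tuple

# Hypothetical input shape: {test_path: {browser_name: status}},
# where status is a string such as "PASS", "FAIL" or "TIMEOUT".
Results = Dict[str, Dict[str, str]]


def split_chrome_only(
    results: Results,
    triaged: Set[str],
    browsers: Iterable[str],
    target: str = "chrome",
) -> Tuple[List[str], List[str]]:
    """Return (needs_triage, suspect) lists of test paths.

    needs_triage: target browser FAILs, every other browser PASSes,
                  and the test has not been triaged yet.
    suspect:      target browser FAILs, but some other browser timed out
                  or has no result, so the entry may just be infra noise.
    """
    other_browsers = [b for b in browsers if b != target]
    needs_triage: List[str] = []
    suspect: List[str] = []
    for test in sorted(results):
        statuses = results[test]
        if test in triaged or statuses.get(target) != "FAIL":
            continue
        others = [statuses.get(b) for b in other_browsers]
        if all(s == "PASS" for s in others):
            needs_triage.append(test)
        elif any(s is None or s == "TIMEOUT" for s in others):
            suspect.append(test)
        # Anything else (e.g. another browser also FAILs) is not Chrome-only.
    return needs_triage, suspect


# Made-up data, only to show the call; these are not real results.
sample: Results = {
    "/example/a.html": {"chrome": "FAIL", "firefox": "PASS", "edge": "PASS"},
    "/example/b.html": {"chrome": "FAIL", "firefox": "PASS", "edge": "TIMEOUT"},
    "/example/c.html": {"chrome": "FAIL", "firefox": "FAIL", "edge": "PASS"},
    "/example/d.html": {"chrome": "FAIL", "firefox": "PASS", "edge": "PASS"},
}
todo, noisy = split_chrome_only(
    sample, triaged={"/example/d.html"}, browsers=["chrome", "firefox", "edge"]
)
print(todo, noisy)
```

Keeping the "suspect" entries in a separate list rather than dropping them means they can be rechecked once the timeouts are fixed.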