It "was" / could probably work with a few modifications.
Otherwise, for our AU tests, we have long been using local servers to serve the necessary target payloads.
If you need to select A->B testing, that is viable and the _simplest_ approach.
In this scenario, you would stage the necessary payloads locally on the DUT and target it for the update.
Alternatively, if you do not care about the target version,
you can simulate being any arbitrary signed board
and update to the actual live production matching rules served OTA today.
---
However, I still would like to understand what you are actually wanting to test for when you say: