Hi Paul,
Do you mind including your yaml? The scaling you're using (you say "default Module settings", but I'd like to make 100% sure about them). It's possible, with an average of 55,5 ms latency, that the system decided to spin up a new instance because it didn't want that number to go higher.
Automatic scaling does have a tendency to aggressively scale and spin up instances preventively. You have to understand that it assumes your app to grow in requests, so if it sees that all of a sudden, an instance receives 100 requests, the system will assume that your instance will continue to get MORE (so 100 in a minute means that, in 5 minutes, you'll be receiving 500 requests a minute), and spins up more instances to accommodate the load.
You can read more about this
here, and with your yaml, I may be able to give you more information as to what exactly happened, but if you're using the automatic scaling, chances are this is the culprit here.
Cheers!