Scaling parameters in app.yaml don't seem to affect ah-builtin-python-bundle at all. I configured the default module of the app as follows:
basic_scaling:
  max_instances: 10
  idle_timeout: 10s
In theory this should bring the number of instances of the default service down to 0 within 10 to 40 seconds after the last request. In practice, Google Cloud Console still shows some idle instances assigned to version ah-builtin-python-bundle. You can also see in the screenshot that the version is configured with autoscaled mode (see "Instances (autoscaled)"). Moreover, the console's instance graph is also wrong: it shows 0 total instances whereas there should be 6.
I'm bringing this issue up because it is affecting the monitoring for the app. While there may be 2-3 instances actually serving app requests, there are 6-7 idle instances associated with ah-builtin-python-bundle that just sit there. It is not clear whether they are counted in the billing. Our app has an alert that is triggered if there is a large number of instances for more than an hour, but the alert currently fires quite frequently because of the large and unpredictable number of idle instances. Ideally I'd like to configure ah-builtin-python-bundle to bring down its instances ASAP. What can we do about it?
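As a stopgap on the monitoring side, the alert could ignore the builtin version when counting instances. This is only a rough sketch: it assumes the instance list is already available as dictionaries with a "version" key, and the function name and data shape are hypothetical rather than any App Engine API.

```python
from collections import Counter

# Version whose idle instances should not count toward the alert.
BUILTIN_VERSION = "ah-builtin-python-bundle"


def count_serving_instances(instances, ignored_versions=(BUILTIN_VERSION,)):
    """Return (total, per_version) instance counts, skipping ignored versions.

    `instances` is a hypothetical list of dicts, each with a "version" key.
    """
    per_version = Counter(
        inst["version"]
        for inst in instances
        if inst["version"] not in ignored_versions
    )
    return sum(per_version.values()), dict(per_version)


# Example: 3 instances serving the app, 6 idle builtin instances.
sample = [{"version": "v1"}] * 3 + [{"version": BUILTIN_VERSION}] * 6
total, per_version = count_serving_instances(sample)
print(total, per_version)
```

This would only keep the alert quiet; it does not shut the idle instances down or answer the billing question.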