I am very frustrated with the scheduler recently. It seems to have way too many instances running at any time; i.e., I _wish_ it were more aggressive about killing instances, or better yet, not spinning up extras when it's not actually necessary.
The most pathological case is when you ask for 1 Min Idle Instance (so that you get warmup requests), but if that instance is actually idle, the scheduler sends the request to a new cold instance instead of allowing the warm one that's sitting there to handle it.
We have recently switched to Python 2.7, threadsafe: true, and F4 instances believing that we would see a fair decrease in the number of instances that are running at any given time. So far, not seeing any meaningful difference. Except for the additional cost...