Hi,
We have a gae flex project using docker.
The "problem" we're experiencing is that the number of instances spikes to 20 with single request (as far as I can tell).
In the example below I had a task that kept retrying to call the service. The service attempts to do some file processing but fails. I stopped the retrying task and the usage goes back to normal.
I would like to understand why so many instances would spin up each time, instead of just say twice the number it was using? I've included the configuration used for gunicorn and app.yaml
![Capture.PNG](https://groups.google.com/group/google-appengine/attach/4b05837c25b2c/Capture.PNG?part=0.1&view=1)
#gunicorn config:
import multiprocessing
workers = multiprocessing.cpu_count() * 2 + 1
worker_class = 'sync'
timeout = 120
graceful_timeout = 120
#-----------------------------
#app.yaml
runtime: custom
env: flex
entrypoint: gunicorn -b :$PORT main:app
resources:
cpu: 1
memory_gb: 4.0
service: conversion-service
automatic_scaling:
min_num_instances: 1
cool_down_period_sec: 120
Thanks
Rob