We have built an app using aiohttp & asyncio.. We want to run our app on single threaded env (since we have in-memory data, which needs to re-used across all requests) with asyncio.
We tried to run this on Appengine with flex env (manual scaling, with 1vCPU), all works fine except the requests are processed one after another.. When I add one more 1vCPU, then requests were able to process concurrently (2 requests concurrently). so GAE is not routing request to app based on the number of cores (tried to run direct python script and run via gunicorn)
is there a way to solve this with GAE? we have rate limit to handle concurrent requests.. I am presuming, its not possible with app engine, could you please confirm this?
Thanks in advance.