Thanks for the info
I was also thinking of a custom sleep+retry routine as you mentioned, it works on most of the appengine error scenarios as you mentioned
But I'm also concerned whether the issue will increase as my app grows
I'm potentially running parallel lease tasks from the same queue, that has different tags, however each tag is leased discretely, so there aren't parallel lease's going on from the same tag
I have a hunch there is a hidden performance limit to the pull queues, like max 50 leases in a 10 second window etc. regardless of the tags
Ideally, I want to be able to run 1000+ lease_tasks's in parallel that each pulls tasks with a unique tag