Please try again in 30 seconds.
Log files are showing latency of 10 seconds on failed calls.
This started happening after March 1, and code or configuration weren't modified weeks before that.
Any ideas?
Thanks in advance for your help!
The error you’re sharing could be caused by many different reasons. Please check Logs Viewer in Google Cloud, as shown in this Using Logs Viewer documentation, and share what you see for further troubleshooting. You could also troubleshoot the latencies from Cloud Trace.
Thank you for sharing your experience with us. You are saying that your machine workload does not match the machine type requirement and that you didn’t expect to upgrade to a class F2 instance. I will try to help.
There are some ways to mitigate this error message [1] such as:
Predict spikes and preemptively load instances. Warmup requests [2] are designed specifically to combat situations that involve predictable frequent sudden spikes. Warmup requests would "know" when they expect spikes so it preemptively load up instances for you to avoid cold booting during spikes.
Make cold boots faster [3]. You can make cold boot loading faster by having less complex code with less libraries that need to be loaded. There is an interesting article [4] on how to improve the loading performance.
Provisioning more resources is one of the easier solutions such as idle instances to avoid cold boots, however this might not be ideal as it could increase your costs.
Retry strategy if your app can accept x amount of transient failures (within our SLO), then you can simply catch those failures with a retry and your app can function without any issue.
Lastly, I would like to concur David’s suggestion to raise an official case with the GCP support [5] as the error message [1] can be due to multiple reasons (e.g sudden spiky traffic, backend issues, etc) and we do have the tools to diagnose such issues. With the right diagnosis of your case, we can determine the recommendations which might avoid resorting to changing the machine type and increasing your costs.
[1] "logMessage": "Request was aborted after waiting too long to attempt to service your request."
[2] https://cloud.google.com/appengine/docs/standard/go111/configuring-warmup-requests
[3] https://cloud.google.com/appengine/docs/standard/go/how-instances-are-managed#loading_requests
[5] https://cloud.google.com/support-hub