SUMMARY:
On Tuesday 17 June 2014, some Google App Engine applications experienced elevated latency from the URL Fetch API for a duration of 162 minutes. If your service or application was affected, we apologize — this is not the level of quality and reliability we strive to offer you, and we have taken and are taking immediate steps to improve the platform’s performance and availability.
DETAILED DESCRIPTION OF IMPACT:
On Tuesday 17 June 2014 from 04:14 to 06:56 US/Pacific, 5% of App Engine applications hosted in US datacenters experienced elevated latency from the URL Fetch API. Latency at the median was unchanged, but latency at the 90th percentile increased by 87% when aggregated over all applications that were affected. The incident did not cause a significant increase in error rates. URL Fetch calls to Google APIs (except Cloud Storage) and appspot.com URLs were not affected.
ROOT CAUSE:
An increase in load caused a software component on some machines to shut down. This resulted in a temporary reduction in available capacity in one of the systems used by the URL Fetch API.
REMEDIATION AND PREVENTION:
Google’s monitoring detected the problem at 04:47. We identified the source of the increase in load. We were able to successfully bring the affected machines back into operation after the load was reduced. To prevent recurrence, our engineers are investigating why the software component experienced issues under high load.