Description: We are experiencing an issue with Google Compute Engine beginning in 2020-08. A firmware rollout is being created that should address the issue.
The rollout is currently expected to complete next week, but mitigation efforts are still ongoing.
We will provide more information by Tuesday, 2020-11-10 16:30 US/Pacific.
Diagnosis: Affected customers will experience elevated frequency of Host Maintenance events on GCE instances with an attached GPU(s) and SSD(s).
Workaround: Temporarily switch to use V100 GPU's which are unaffected by this issue.
https://cloud.google.com/compute/docs/gpus