Hello Matthew,
To investigate the issue further, I need your project number and instance ID. Please also provide the date and time of the issue, along with the associated logs.
To protect your private information, you can reply by private email using the drop-down menu of the "reply" command at the top right of the edit window.
Hi Matthew,
Thank you for providing the data. It seems that there was an issue on Google's end that caused the problem. The root cause has been mitigated, so there should be no further impact. In the long term, we are focusing on automation to prevent this kind of issue from recurring.
--
© 2017 Google Inc. 1600 Amphitheatre Parkway, Mountain View, CA 94043
Email preferences: You received this email because you signed up for the Google Compute Engine Discussion Google Group (gce-discussion@googlegroups.com) to participate in discussions with other members of the Google Compute Engine community and the Google Compute Engine Team.
Hello Matthew,
The issue would have manifested in two ways:
1. Slow snapshot operations (which may then time out and fail). Because snapshots read from the same place as the primary data path, a disk that is unavailable can sometimes also be impossible to snapshot.
2. Slow (or hanging) reads, which can then time out and produce the reported SCSI error in dmesg. Most filesystems are not written with network-attached storage in mind, so they can sometimes corrupt data when they encounter high latency like this.
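If you want to check for slow reads like those in item 2 yourself, here is a minimal sketch in Python. It is only an illustration, not an official diagnostic: the function names `timed_read` and `is_slow` and the 1.0 second threshold are assumptions made up for this example, and reads served from the page cache will look fast even when the underlying disk is slow.

```python
import os
import tempfile
import time


def timed_read(path, size=4096, offset=0):
    """Read up to `size` bytes at `offset`; return (data, elapsed_seconds)."""
    start = time.monotonic()
    with open(path, "rb") as f:
        f.seek(offset)
        data = f.read(size)
    return data, time.monotonic() - start


def is_slow(elapsed_seconds, threshold_seconds=1.0):
    """True when a read exceeded the threshold (1.0 s is an arbitrary assumption)."""
    return elapsed_seconds > threshold_seconds


if __name__ == "__main__":
    # Demo against a throwaway temp file; point `path` at a file on the
    # disk you want to probe instead.
    fd, path = tempfile.mkstemp()
    try:
        with os.fdopen(fd, "wb") as f:
            f.write(b"x" * 8192)
        data, elapsed = timed_read(path)
        print(f"read {len(data)} bytes in {elapsed:.6f}s, slow={is_slow(elapsed)}")
    finally:
        os.remove(path)
```

A read that hangs completely will block this script as well, so it is only useful for spotting elevated latency, not for recovering from a stuck disk.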
Hope this answers your query.
So is that the reason the drive failed, the reason the snapshot failed, or both?
Hello Brent,
I have not found any similar recent reports of this particular error. Since this platform is meant for general discussions, I suggest opening a private issue tracker report so that we can investigate it. When opening the report, please include the project ID and the instance name/ID, and we will be happy to assist.
Hello Brent,
Sorry about that. The link provided earlier was wrong. Here is the correct one (I have already edited the thread above with the correct link). For further instructions on opening issue tracker reports, you may also check this guide.