Kubernetes - What are the common failures in a real world k8s cluster?

2 views
Skip to first unread message
Assigned to acce...@gmail.com by me

Accela Zhao

unread,
Aug 19, 2015, 2:37:56 AM8/19/15
to google-c...@googlegroups.com
Hi,

Since kubernetes are growing increasingly popular and more and more people are run it in production or evaluating, I'm curious what are the common failures / errors that one meets in real world k8s. I found this doc mentioned some (failure modes), https://github.com/kubernetes/kubernetes/blob/37f0368ba26f5f503df2407f3241a3ad62cc8e59/docs/availability.md. But it would be definitely good if there were more details and real cases. Thanks a lot if anyone would like to share some experience!

Best,
Accela

Accela Zhao

unread,
Aug 21, 2015, 6:01:26 AM8/21/15
to Containers at Google

Found these two may be of some help

    Google ClusterData 2011:

    The Computer Failure Data Repository (CFDR) 


Reply all
Reply to author
Forward
0 new messages