@diogo-reis: Reiterating the mentions to trigger a notification:
@kubernetes/sig-storage-bugs
/sig openstack
@diogo-reis What's your k8s version?
My k8s version is v1.7.5.
k8s version 1.8, on AWS:
Events:
Type Reason Age From Message
---- ------ ---- ---- -------
Warning FailedScheduling 3m (x2 over 3m) default-scheduler PersistentVolumeClaim is not bound: "www-web-0" (repeated 9 times)
Normal Scheduled 2m default-scheduler Successfully assigned web-0 to ip-10-222-38-161.eu-west-1.compute.internal
Normal SuccessfulMountVolume 2m kubelet, ip-10-222-38-161.eu-west-1.compute.internal MountVolume.SetUp succeeded for volume "default-token-td6qx"
Warning FailedMount 2m attachdetach AttachVolume.Attach failed for volume "pvc-37038357-ad9c-11e7-9798-0a5f89d13502" : Error attaching EBS volume "vol-0c159c62175f227b6" to instance "i-0af5130e8d06ffb5a": "IncorrectState: vol-0c159c62175f227b6 is not 'available'.\n\tstatus code: 400, request id: 53305145-566b-457e-b1d2-83272f8fd889"
Warning FailedMount 2m attachdetach AttachVolume.Attach failed for volume "pvc-37038357-ad9c-11e7-9798-0a5f89d13502" : Error attaching EBS volume "vol-0c159c62175f227b6" to instance "i-0af5130e8d06ffb5a": "IncorrectState: vol-0c159c62175f227b6 is not 'available'.\n\tstatus code: 400, request id: cd609dca-2e74-48bd-a309-c90cf98df2a7"
Normal SuccessfulMountVolume 2m kubelet, ip-10-222-38-161.eu-west-1.compute.internal MountVolume.SetUp succeeded for volume "pvc-37038357-ad9c-11e7-9798-0a5f89d13502"
I have seen the same on OpenStack; it takes quite a few minutes to get it working. I have kube 1.8.
Got the same error.
We modified the resource request/limits for one statefulset with 3 replicas.
K8s moved one of the replicas to another node, which has enough resources, but the volume was still attached to the old node.
K8s version: v1.8.1+coreos.0
Running on AWS
Warning FailedAttachVolume 7m (x2987 over 12m) attachdetach Multi-Attach error for volume "pvc-4fe430e8-db4d-11e7-9931-02138f142c30" Volume is already exclusively attached to one node and can't be attached to another
@diogo-reis, sorry for the late reply. You mentioned that you see this when you move a Pod with a "nodeSelector:" expression to another node. Could you please confirm that the pod is first killed on node 1 and then started on node 2? Is node 1 still running?
We also got this "Multi-Attach error" on Azure, in v1.9.6. We found that the volume in node.volumesInUse is not removed even after the pod using that volume has been moved off the node for a very long time. I filed another issue here: #62282
I'm getting the same "Multi-Attach error" issue on Azure as well, in v1.9.6.
Hi @SuperMarioo, this "Multi-Attach error" issue on Azure is fixed in v1.9.7, PR: #62467.
Please follow the link below to mitigate:
https://github.com/andyzhangx/demo/blob/master/issues/azuredisk-issues.md#5-azure-disk-pvc-multi-attach-error-makes-disk-mount-very-slow-or-mount-failure-forever
What is the status of this on AWS/EBS? I have the same problem on AWS with v1.9.3.
IMO this "bug" exists in all volume types. If you have a pod with a PVC (any type, RWX types excluded) running on node1 and you shut down node1, the pod will start again on some other node, but failing the volumes over (it will return that multi-attach error) takes 6-10 minutes, because it waits for the force detach.
Yes, it seems this is general.
@zetaab that's correct. On Azure, the time cost of detaching a disk and attaching it to another node is around 1 min, so a Multi-Attach error within that 1 min is expected. However, we found an issue specific to containerized kubelet where the UnmountDevice process always fails, which means the disk detach from the node never succeeds; in that case, we hit the Multi-Attach error for hours...
This is not a bug; this is expected behavior. You must not double mount with the ReadWriteOnce policy; this is what Kubernetes is trying to avoid. However, if the node which is down does not respond within 6 minutes (the default), the volumes will be forcibly attached to the replacement pod.
Reference to this 'knowledge'.
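For anyone debugging this, a small sketch of how to see whether the old node still claims the volume, using the node status fields mentioned earlier in the thread and the VolumeAttachment objects mentioned further down; the node and PV names are placeholders, not values from this issue:

# Placeholders: <old-node> and <pv-name> are examples only.
# Volumes the kubelet on the old node still reports as in use:
kubectl get node <old-node> -o jsonpath='{.status.volumesInUse}{"\n"}'
# Volumes the attach/detach controller still considers attached to that node:
kubectl get node <old-node> -o jsonpath='{.status.volumesAttached}{"\n"}'
# The attachment objects themselves:
kubectl get volumeattachments.storage.k8s.io | grep <pv-name>

If the volume never disappears from these lists, the detach from the old node is what is stuck.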
@dElogics I think there is a subproblem on AWS with provisioning EBS volumes and switching nodes: if a volume is provisioned as /dev/ and then reattached to a node type that exposes it under /dev/nvme, like M5, I think it will fail even if it is attached correctly to the node.
I'm experiencing the same issue on GKE 1.8.10-gke.0
@AmazingTurtle can you describe in more detail what happened to the old node to cause your pod to be rescheduled? Did the node become NotReady, or was it upgraded, terminated or repaired?
In my case it became NotReady when I stopped the Docker service. I'm using Rancher 2.0 so everything (including kubelet) is containerized. It runs on 3 bare-metal Ubuntu 16.04 nodes, latest Docker CE and Kubernetes 1.10.1. I have a Deployment with 1 replica that mounts a Ceph RBD PVC in RWO mode.
I've got the same issue with k8s 1.11.0 and ceph using dynamic provisioning.
This issue also occurs when I do a
kubectl apply -f deployment.yml
As such, it's not possible to modify something without waiting 6 min... :(
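One workaround discussed further down in this thread, for Deployments that can tolerate a brief outage, is to switch the update strategy to Recreate so the old pod terminates and releases the RWO volume before the replacement pod is created. A minimal sketch; the deployment name is a placeholder:

# Placeholder deployment name. Clearing rollingUpdate is required when switching
# the strategy type away from RollingUpdate.
kubectl patch deployment <my-deployment> --type merge \
  -p '{"spec":{"strategy":{"type":"Recreate","rollingUpdate":null}}}'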
I think I just experienced the same issue on AWS ...
We are facing the same issue with k8s 1.9.8 and RBD volumes. But in our case the pod was just redeployed on another node due to changes via kubectl edit deployment ...
@dakleine could you please describe what changes you made when editing the deployment? What is the status of the old node? Thanks!
@jingxu97 we only changed the image version of our PostgreSQL deployment. The old node is Ready.
This is happening to me on AWS as well. I have a pod right now that has been stuck in "ContainerCreating" for 15 minutes. Any ideas what to do?
Warning FailedMount 7s (x7 over 13m) kubelet, ip-x-x-x-x.x-x-x.compute.internal Unable to mount volumes for pod "gitlab02-runner-678ffc74f4-m2w8m_build(7a473b7f-d23e-11e8-8cfe-0688ae24c2fe)": timeout expired waiting for volumes to attach/mount for pod "build"/"gitlab02-runner-678ffc74f4-m2w8m". list of unattached/unmounted volumes=[data-volume]
@christensen143, it looks like your data-volume is not attached. If you can access the kube-controller-manager log on the master, it should print out a detailed message about why it failed to attach the volume.
Just experienced the same on Digital Ocean. The pod is still in ContainerCreating after 13 mins...
postgres-deployment-77c874df64-k4hn9 0/1 ContainerCreating 0 13m
Same issue on AWS. Is there a way to "fix" the volume?
@pittar you can try restarting all master controllers (one by one, so you do not have downtime).
@zetaab , thanks for the suggestion. We are actually on OpenShift Dedicated (managed service), so I guess I'll put in a support ticket.
@pittar could you please provide more details about your issue? In what situation your pod is stuck in "ContainerCreating"? Are you trying to delete your pod and then start it on another node? Thanks!
Hi @jingxu97 , since it's a managed service, I've submitted a support ticket, but I'll try to explain what happened here in case it benefits others.
Last night was a scheduled upgrade of our OpenShift cluster from 3.9 (k8s 1.9) to 3.11 (k8s 1.11). During the upgrade, pods would have been evacuated and re-created, so my guess is certain pods (like postgres) tried to re-attach to a PV (aws ebs) before the old pod had actually shut down. This seems to have left things in a strange state.
This morning, I tried killing the pods and restarting. The first error I would get from the pods was that the volumes were already attached to a different container (AWS EBS volumes are not RWX). After killing them again and trying to restart, I would get a timeout error similar to @christensen143's last comment.
One of the pods eventually re-attached (after I killed it and waited a good 10min before starting it again). Another that was in the same state still hasn't been able to start properly (still getting timeout trying to attach/mount the pv).
@rootfs @jsafrane @thockin do you guys have any idea how we could improve this situation? This volume mount problem has been a problem for a long time. I have tried to solve this twice, but the storage or node SIGs always say that my solution is incorrect.
We have a customer who runs cronjobs every 5 minutes, and they have a volume in them as well. You can imagine what happens when you ask volumes to mount every 5 minutes while the force-detach time is 6 minutes. I think we can modify the force-detach time in the cluster, but that still does not remove the problem. It seems this volume mount problem exists in all cloud providers; sometimes it takes 5-20 minutes to get the volume in place. 20 minutes is quite a long time if your application is running in production.
I compared Kubernetes 1.9 and 1.13 with this cronjob-with-volume scenario. In 1.13 volume mounts work as they should; 1.9 does not work correctly. So if you see problems, I would say please update your cluster first. Then the remaining problem is when a node is shut down or similar; that other ticket will hopefully solve that.
I have a similar issue on DigitalOcean. If a pod is scheduled for deployment on another node it will break, as the current node and pod are already linked and the old pod's volume will not detach before the new one tries to attach.
FIX attempt 1: Add RollingUpdate with
maxUnavailable: 100%
--> FAILED
FIX attempt 2: FIX 1 + add affinity to deploy the pod only to one node --> SUCCESS (see the sketch below)
This means that you will have the service offline for a few seconds, and you will not be able to use the rest of the cluster or scale the service to other nodes.
DigitalOcean volumes, like many others, support only ReadWriteOnce. That means we need to find a better solution, because deploying to one node and accepting downtime is not what Kubernetes is about, and it heavily undermines the entire idea of persistent volumes.
Version:
Server Version: version.Info{Major:"1", Minor:"13", GitVersion:"v1.13.1", GitCommit:"eec55b9ba98609a46fee712359c7b5b365bdd920", GitTreeState:"clean", BuildDate:"2018-12-13T10:31:33Z", GoVersion:"go1.11.2", Compiler:"gc", Platform:"linux/amd64"}
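A minimal sketch of the node pin from FIX attempt 2, using a nodeSelector instead of a full affinity block (same effect, less YAML). The label key, node name and deployment name are illustrative placeholders, not values from this thread:

# Label one node and pin the deployment's pod template to it, so the RWO volume
# never has to move between nodes. All names below are placeholders.
kubectl label node <node-name> example.com/pvc-pin=true
kubectl patch deployment <my-deployment> --type merge \
  -p '{"spec":{"template":{"spec":{"nodeSelector":{"example.com/pvc-pin":"true"}}}}}'

This trades away scheduling flexibility: if that node goes down, the pod cannot run anywhere else.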
Issues go stale after 90d of inactivity.
Mark the issue as fresh with /remove-lifecycle stale.
Stale issues rot after an additional 30d of inactivity and eventually close.
If this issue is safe to close now please do so with /close.
Send feedback to sig-testing, kubernetes/test-infra and/or fejta.
/lifecycle stale
Stale issues rot after 30d of inactivity.
Mark the issue as fresh with /remove-lifecycle rotten.
Rotten issues close after an additional 30d of inactivity.
If this issue is safe to close now please do so with /close.
Send feedback to sig-testing, kubernetes/test-infra and/or fejta.
/lifecycle rotten
Rotten issues close after 30d of inactivity.
Reopen the issue with /reopen.
Mark the issue as fresh with /remove-lifecycle rotten.
Send feedback to sig-testing, kubernetes/test-infra and/or fejta.
/close
@fejta-bot: Closing this issue.
Closed #53059.
I am still having this exact issue on v1.12.8 on Google Kubernetes Engine.
It happens when I run kubectl apply -f app.yaml and make a pod recreate itself.
How is this still not fixed? Am I using Kubernetes incorrectly?
/reopen
/remove-lifecycle rotten
@jonstelly: You can't reopen an issue/PR unless you authored it or you are a collaborator.
(Quoting the earlier "this is not a bug, this is expected behavior" comment above about the 6-minute force detach.)
Unfortunately, in the case of iSCSI, the replacement pod always remains in the ContainerCreating state, which is problematic.
Hi,
I'm writing a CSI plugin.
I've found the 6-minute timeout here: https://github.com/kubernetes/kubernetes/blob/master/pkg/controller/volume/attachdetach/attach_detach_controller.go#L89
I still have a problem: if a node is down, the application will be down for 6 minutes.
I do not think this is an acceptable timeout for applications.
I want Kubernetes to detach the volume faster.
I do not want to change this timeout (even if it were configurable without recompiling the source), as it changes the timeout for every unmount.
What I do want is to tell k8s that the volume is detached once I know the node is down.
Since I'm the one managing the plugin, I believe this is a valid request.
Is there a way to do so?
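There is no supported way for a CSI driver to declare its own volume detached, but as a manual recovery, later comments in this thread force-remove the stuck pod and its VolumeAttachment once the node is known to be dead. A rough sketch with placeholder names; only do this if you are certain the node really is down, because it bypasses the protection against double mounts:

# All names are placeholders.
kubectl delete pod <stuck-pod> --grace-period=0 --force
kubectl get volumeattachments.storage.k8s.io | grep <dead-node>
kubectl delete volumeattachment <attachment-name>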
This also affects me; I'm using the linstor-csi plugin.
Steps to reproduce:
1. kubectl get pod -o wide | grep <statefulsetname> to find the node where it is running
2. kubectl cordon <node>
3. kubectl get pod -o wide | grep <statefulsetname> to find the new node it is running on; the pod gets stuck in the ContainerCreating state
4. You can also check kubectl get volumeattachments.storage.k8s.io to track the detaching/attaching process
This is a new bug, reported here: #86281
Has this issue been resolved?
I have the same issue on k8s 1.17.2 with rook-ceph as storage. When one worker node gets turned off, the pod tries to be evicted after 5 minutes but cannot start because "Volume is exclusively used ...by the old pod". The old pod gets stuck in "Terminating". Workaround: kill the old pod, kill the new pod, wait while the new pod is still unable to start, kill the new pod again. Pretty weak for a cluster solution.
same on scaleway kapsule
Same on OKD 3.11 with Ceph RBD storage.
The Taint-based Evictions feature should do the trick - it has been GA since k8s 1.18.
The lesson learned here: don't use Kubernetes for databases.
Same issue here on k8s 1.15 on AWS, provisioned with kOps.
After setting up Cluster Autoscaler together with the Horizontal Pod Autoscaler, this has become a huge problem, as nodes come and go all the time based on demand.
I've been forced to add safe-to-evict: false to all deployments that use pods with PVs, which defeats some of the purpose of autoscaling (see the sketch below).
Is this a problem with EBS attach/detach timeouts/limits or with K8s?
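For reference, a sketch of the annotation mentioned above; it goes on the pod template so Cluster Autoscaler sees it on the pods it would otherwise evict. The deployment name is a placeholder:

# Placeholder deployment name; cluster-autoscaler.kubernetes.io/safe-to-evict is
# the annotation Cluster Autoscaler checks before evicting a pod.
kubectl patch deployment <my-deployment> --type merge \
  -p '{"spec":{"template":{"metadata":{"annotations":{"cluster-autoscaler.kubernetes.io/safe-to-evict":"false"}}}}}'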
(Quotes @MichaelOrtho's earlier DigitalOcean comment above about the RollingUpdate maxUnavailable: 100% and single-node affinity workarounds.)
Having the same issue on DigitalOcean. There are two things involved:
1. RollingUpdate vs Recreate. Obviously, for zero downtime, RollingUpdate is preferred: it keeps the old pod until the new pod is ready. Here comes the problem: the new pod will fail saying "Multi-Attach error for volume "pvc-xxx" Volume is already used by pod(s) xxx". Changing to Recreate seems to eliminate this error, which makes sense: it destroys the old pod first, leaving some downtime, but it ensures the volume is completely detached before the new pod is scheduled and attaches the volume. Not sure if @MichaelOrtho's FIX1 equals Recreate. But as @MichaelOrtho said, this defeats one of k8s's main purposes, zero downtime. Ideally, with RollingUpdate, k8s should be able to transfer the volume attachment from the old pod to the new pod. Is this a bug, or is it just not possible and an expected limitation of k8s's RollingUpdate?
2. ReadWriteOnce only allows the volume to be mounted on one node. This error occurs even if your update strategy is Recreate. The current workaround is, as @MichaelOrtho mentioned, to add affinity to ensure scheduling on the same node. The question is: is this a bug in k8s? At least for Recreate, can k8s detach the volume from one node/old pod and attach it to another node/new pod?
@adipascu and other people on StackOverflow mentioned StatefulSet for stateful apps; I haven't tried it yet.
If the above are not considered k8s bugs, then from a user/developer experience perspective, I really think we should either disable PVC support on Deployments completely, or, if a PVC is used, configure Recreate and affinity for the user by default, or at least highlight this in the Deployment documentation and guide people toward StatefulSet, since users will absolutely hit this wall when using a PVC with a Deployment.
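For completeness, a minimal sketch of the StatefulSet alternative mentioned above; the image, size and all names are illustrative only, and the headless Service it references is omitted. A StatefulSet with volumeClaimTemplates gives each replica its own stable PVC, although it does not by itself avoid the multi-attach wait when a node dies:

# Illustrative manifest only; names, image and size are placeholders.
kubectl apply -f - <<'EOF'
apiVersion: apps/v1
kind: StatefulSet
metadata:
  name: example-db
spec:
  serviceName: example-db
  replicas: 1
  selector:
    matchLabels:
      app: example-db
  template:
    metadata:
      labels:
        app: example-db
    spec:
      containers:
      - name: db
        image: postgres:13
        volumeMounts:
        - name: data
          mountPath: /var/lib/postgresql/data
  volumeClaimTemplates:
  - metadata:
      name: data
    spec:
      accessModes: ["ReadWriteOnce"]
      resources:
        requests:
          storage: 10Gi
EOF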
IIRC, while working through this issue in 2018, the codebase for handling this situation was not "common" and had a custom implementation in each storage driver. Not sure if that info helps at all.
It seems the HPE driver has something called the pod monitor, which forcefully deletes the pod and the VolumeAttachment. Here is an article on it: https://datamattsson.tumblr.com/post/622329432678514688/better-automatic-recovery-for-csi-drivers-on
I am not sure this is a very graceful solution.
Still having this issue on AKS v1.27.
The delayed volume detachment added a 6-minute delay to our database pods coming back up on a new VM. My workaround is to explicitly delete the VolumeAttachments for the node set for removal, after it has been drained. This allows the pod to start up within 1 minute, which is probably as good as can be expected. This only works for planned maintenance operations where K8s nodes are being explicitly shut down.
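A sketch of that planned-maintenance workaround with placeholder names, assuming jq is available: drain the node first, then delete the VolumeAttachments that still reference it so replacement pods do not sit out the 6-minute force-detach:

NODE=<node-being-removed>   # placeholder
kubectl drain "$NODE" --ignore-daemonsets --delete-emptydir-data
# Remove any VolumeAttachments still pointing at the drained node:
kubectl get volumeattachments.storage.k8s.io -o json \
  | jq -r --arg node "$NODE" '.items[] | select(.spec.nodeName == $node) | .metadata.name' \
  | xargs -r kubectl delete volumeattachment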
Same with EKS 1.21..24 and aws-ebs-csi-driver:v1.5.1