Description
The spec does not describe any duration requirements for `ControllerPublishVolume` and `NodePublishVolume`. In my experience these operations can take quite a long time, especially when things go wrong and start timing out. I've seen an EBS volume stuck in the "Attaching" state for several days on AWS.
How long should the CO wait for a response to these calls? The CSI spec should define this and limit it to minutes (or to a timeout set by the CO).
Alternatively, the CO could monitor the workloads it needs to run and cancel a `*PublishVolume` call when the corresponding workload is either rescheduled to a different node or withdrawn completely. The `*UnpublishVolume` call could be the cancellation signal: it says the CO no longer wants the volume attached / mounted to a node, regardless of the current state of attaching / mounting the volume. (Note that this breaks the assumption that there is no more than one call “in-flight” per volume at a given time.)
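
To make the proposal concrete, here is a rough sketch of what a CO-side timeout followed by an unpublish could look like. Everything in it is an assumption for illustration: `SlowPlugin`, `publish_with_deadline`, and the method names are hypothetical stand-ins, not the real CSI gRPC API, and the timeout values are arbitrary.

```python
import asyncio


class SlowPlugin:
    """Hypothetical stand-in for a CSI plugin; not the real CSI gRPC API."""

    def __init__(self, attach_seconds):
        self.attach_seconds = attach_seconds
        self.calls = []  # records the order of calls, for illustration only

    async def controller_publish_volume(self, volume_id, node_id):
        self.calls.append("publish")
        await asyncio.sleep(self.attach_seconds)  # simulates a slow attach

    async def controller_unpublish_volume(self, volume_id, node_id):
        self.calls.append("unpublish")


async def publish_with_deadline(plugin, volume_id, node_id, timeout):
    """Return True if publish finished in time; otherwise unpublish and return False."""
    try:
        await asyncio.wait_for(
            plugin.controller_publish_volume(volume_id, node_id), timeout
        )
        return True
    except asyncio.TimeoutError:
        # The CO gives up and signals that it no longer wants the volume here.
        await plugin.controller_unpublish_volume(volume_id, node_id)
        return False
```

Note that this sketch cancels the pending publish call on the CO side before sending the unpublish, so the CO itself never has two calls outstanding. A real plugin may still be working on the attach when the unpublish arrives, which is exactly the "in-flight" question raised above.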