kubernetes

mirror of https://github.com/optim-enterprises-bv/kubernetes.git synced 2025-11-28 20:33:54 +00:00

Author	SHA1	Message	Date
Kubernetes Prow Robot	854af6aba6	Merge pull request #122411 from huww98/lift-mountedByNode ad controller: lift nodeAttachedTo.mountedByNode	2024-04-18 00:00:14 -07:00
Kubernetes Prow Robot	ef2c682635	Merge pull request #122082 from carlory/remove-keep-terminated-pod-volumes keep-terminated-pod-volumes flag on kubelet is removed	2024-04-17 23:59:54 -07:00
huweiwen	3a71fe57f7	ad controller: lift nodeAttachedTo.mountedByNode optimize adc.nodeUpdate(). Time complexity reduced from O(n) to O(1), where n is the number of nodes. Data stored in nodeAttachedTo.mountedByNode is now at actualStateOfWorld.inUseVolumes. This refactor also ensures that we can record the state update even if the volume is not present in ASW yet. The added BenchmarkNodeUpdate result is reduced from 28076923 to 16030 ns/op. The previous BenchmarkPopulateActualStateOfWorld result is also reduced from 13s to 8s.	2024-04-11 15:35:17 +08:00
Kubernetes Prow Robot	611dbaa055	Merge pull request #122790 from carlory/fix-121696 Fix flaky test: Test_Run_OneVolumeDetachFailNodeWithReadWriteOnce	2024-03-10 19:23:40 -07:00
carlory	b47c73ee26	keep-terminated-pod-volumes flag on kubelet is removed	2024-03-01 18:42:15 +08:00
Rohit Singh	13dddca6a2	Add "disable-force-detach-on-timeout" flag to kube-controller-manager	2024-02-22 18:31:52 +00:00
carlory	0cd9d40a65	Test_Run_OneVolumeDetachFailNodeWithReadWriteOnce use waitForAttachStateToNode instead of time.Sleep	2024-02-04 16:57:55 +08:00
Jan Safranek	7fc11f47ff	Mark a volume as uncertain-attached after detach error Volume that failed Detach() should not be marked as attached, CSI external-attacher is probably still trying to detach it. Mark it uncertain instead and wait for Detach() to succeed.	2023-09-13 10:03:28 +02:00
carlory	f443c458af	move non-graceful node shutdown to GA	2023-07-11 13:51:51 +08:00
xing-yang	cca6601106	Add reason to force detach metric	2023-07-10 06:30:05 +00:00
guangli.bao	931cc96b8d	remote windows condition when #116693 is merged Signed-off-by: guangli.bao <guangli.bao@daocloud.io>	2023-05-24 14:54:22 +08:00
Claudiu Belu	0979d55443	unit tests: Skip flaky tests on Windows (part 2) Some of the unit tests are currently flaky on Windows. This commit skips them until they are resolved.	2023-04-13 12:07:18 +00:00
Paco Xu	8e36e948ce	verifyVolumeNoStatusUpdateNeeded may cause flake and so only keep the last ones	2023-03-30 10:44:45 +08:00
Paco Xu	c14068c202	deflake: Add retry with timeout to wait for final conditions	2023-03-22 11:24:09 +08:00
杨军10092085	361e4ff0fa	volume: use contextual logging	2023-03-14 08:37:30 +08:00
Kubernetes Prow Robot	d7bff1c809	Merge pull request #111577 from brianpursley/troubleshoot-unit-test-flake Add logging for reconciler unit test	2022-11-11 00:44:09 -08:00
Humble Chirammal	4bafd53a02	Correct typos in pkg/controller/volume Signed-off-by: Humble Chirammal <hchiramm@redhat.com>	2022-09-16 16:50:20 +05:30
ZhangKe10140699	186ddce07b	Fix problem in updating VolumeAttached in node status	2022-08-02 19:01:57 +08:00
Brian Pursley	a29fb9abae	Add logging for reconciler unit test	2022-07-30 10:33:27 -04:00
Jan Safranek	3b94ac228a	Don't force detach volume from healthy nodes 6 minute force-deatch timeout should be used only for nodes that are not healthy. In case a CSI driver is being upgraded or it's simply slow, NodeUnstage can take more than 6 minutes. In that case, Pod is already deleted from the API server and thus A/D controller will force-detach a mounted volume, possibly corrupting the volume and breaking CSI - a CSI driver expects NodeUnstage to succeed before Kubernetes can call ControllerUnpublish.	2022-06-24 12:51:41 +02:00
Ashutosh Kumar	c00975370a	Handle Non-graceful Node Shutdown (#108486 ) Signed-off-by: Ashutosh Kumar <sonasingh46@gmail.com> Co-authored-by: Ashutosh Kumar <sonasingh46@gmail.com> Co-authored-by: xing-yang <xingyang105@gmail.com>	2022-03-26 09:23:21 -07:00
Mengjiao Liu	beda4cafb6	kubelet: Remove the deprecated flag `--experimental-check-node-capabilities-before-mount`	2022-01-06 11:47:11 +08:00
Jing Xu	69b9f9b1f0	Fix issue in node status updating VolumeAttached list During volume detach, the following might happen in reconciler 1. Pod is deleting 2. remove volume from reportedAsAttached, so node status updater will update volumeAttached list 3. detach failed due to some issue 4. volume is added back in reportedAsAttached 5. reconciler loops again the volume, remove volume from reportedAsAttached 6. detach will not be trigged because exponential back off, detach call will fail with exponential backoff error 7. another pod is added which using the same volume on the same node 8. reconciler loops and it will NOT try to tigger detach anymore At this point, volume is still attached and in actual state, but volumeAttached list in node status does not has this volume anymore, and will block volume mount from kubelet. The fix in first round is to add volume back into the volume list that need to reported as attached at step 6 when detach call failed with error (exponentical backoff). However this might has some performance issue if detach fail for a while. During this time, volume will be keep removing/adding back to node status which will cause a surge of API calls. So we changed to logic to check first whether operation is safe to retry which means no pending operation or it is not in exponentical backoff time period before calling detach. This way we can avoid keep removing/adding volume from node status. Change-Id: I5d4e760c880d72937d34b9d3e904ecad125f802e	2021-10-05 09:44:35 -07:00
caodonghui	f435e24403	Remove deadcode	2021-02-23 17:58:47 +08:00
Cheng Xing	d9a629fe3a	IsVolumeAttachedToNode() renamed to GetAttachState(), and returns 3 states instead of combining "uncertain" and "detached" into "false"	2020-10-29 13:24:51 -07:00
Cheng Xing	a61743b125	Fixes Attach Detach Controller reconciler race reading ActualStateOfWorld and operation pending states; fixes reconciler_test mock detach to account for multiple attaches on a node	2020-10-27 23:51:55 -07:00
taesun_lee	79680b5d9b	Fix pkg/controller typos in some error messages, comments etc - applied review results by LuisSanchez - Co-Authored-By: Luis Sanchez <sanchezl@redhat.com> genernal -> general iniital -> initial initalObjects -> initialObjects intentionaly -> intentionally inforer -> informer anotother -> another triger -> trigger mutli -> multi Verifyies -> Verifies valume -> volume unexpect -> unexpected unfulfiled -> unfulfilled implenets -> implements assignement -> assignment expectataions -> expectations nexpected -> unexpected boundSatsified -> boundSatisfied externel -> external calcuates -> calculates workes -> workers unitialized -> uninitialized afater -> after Espected -> Expected nodeMontiorGracePeriod -> NodeMonitorGracePeriod estimateGrracefulTermination -> estimateGracefulTermination secondrary -> secondary ShouldRunDaemonPodOnUnscheduableNode -> ShouldRunDaemonPodOnUnschedulableNode rrror -> error expectatitons -> expectations foud -> found epackage -> package succesfulJobs -> successfulJobs namesapce -> namespace ConfigMapResynce -> ConfigMapResync	2020-02-27 00:15:33 +09:00
danielqsj	657a1a1a34	change import alias of utils/strings	2019-01-30 10:44:09 +08:00
danielqsj	093328e57f	migrate to k8s.io/utils/strings	2019-01-30 10:24:00 +08:00
Jing Xu	7bac6ca73a	Address comments This commit addressed the comment and also add a unit test.	2019-01-11 10:57:37 -08:00
Jing Xu	562d0fea53	Handle failed attach operation leave uncertain volume attach state This commit adds the unit tests for the PR. It also includes some files that are affected by the function name changes.	2018-11-19 17:21:49 -08:00
Jan Safranek	e46c886bf3	Add list of pods that use a volume to multiattach events So users knows what pods are blocking a volume and can realize their error.	2018-01-24 13:22:03 +01:00
mtanino	8903e8cd85	BlockVolumesSupport: CRI, VolumeManager and OperationExecutor changes This patch contains following changes. - container runtime changes for adding block devices - volumemanager changes - operationexecutor changes	2017-11-20 14:10:26 -05:00
Hemant Kumar	68d417d7d8	Fix possibly flake in multiattach unit test It is possible that by the time we check for multiattach error on node, the reconciler loop may not have processed second volume and hence we are going to retry for multiattach error on node before giving up and marking the test as failed.	2017-10-12 16:27:54 -04:00
Hemant Kumar	67d4c40849	Fix spam of multiattach errors in event logs We should be careful while generating multiattach errors. We seem to be generating too many of them because old code had minor bug.	2017-10-03 15:45:06 -04:00
Hemant Kumar	8edae9b3fc	Always populate volume status from node	2017-09-12 09:03:42 -04:00
Jacob Simpson	29c1b81d4c	Scripted migration from clientset_generated to client-go.	2017-07-17 15:05:37 -07:00
Alexander Block	61275ad8d4	Fix flaky test Test_Run_OneVolumeAttachAndDetachMultipleNodesWithReadWriteMany Only relying on the NewAttacher/Detacher call counts is not enough as they happen in parallel to the testing/verification code and thus the actual attaching/detaching may not be done yet, resulting in flaky test results. Fixes #46244	2017-07-11 18:21:50 +02:00
Chao Xu	60604f8818	run hack/update-all	2017-06-22 11:31:03 -07:00
Chao Xu	f4989a45a5	run root-rewrite-v1-..., compile	2017-06-22 10:25:57 -07:00
NickrenREN	add091b1fb	fix regression in UX experience for double attach volume send event when volume is not allowed to multi-attach	2017-05-25 09:27:24 +08:00
Alexander Block	06baeb33b2	Don't try to attach volumes which are already attached to other nodes	2017-05-18 06:56:30 +02:00
Hemant Kumar	951a36aac7	Add Keepterminatedpodvolumes as a annotation on node and lets make sure that controller respects it and doesn't detaches mounted volumes.	2017-05-11 22:31:14 -04:00
Tomas Smetana	852c44ae59	Fix issue #34242 : Attach/detach should recover from a crash When the attach/detach controller crashes and a pod with attached PV is deleted afterwards the controller will never detach the pod's attached volumes. To prevent this the controller should try to recover the state from the nodes status.	2017-04-20 13:04:50 +02:00
deads2k	fd34b11e13	react to informer updates	2017-02-13 09:18:32 -05:00
Andy Goldstein	70c6087600	Replace hand-written informers with generated ones Replace existing uses of hand-written informers with generated ones. Follow-up commits will switch the use of one-off informers to shared informers.	2017-02-06 13:49:27 -05:00
deads2k	8a12000402	move client/record	2017-01-31 19:14:13 -05:00
deads2k	6a4d5cd7cc	start the apimachinery repo	2017-01-11 09:09:48 -05:00
chrislovecnm	a973c38c7d	The capability to control duration via controller-manager flags, and the option to shut off reconciliation.	2017-01-09 16:47:13 -07:00
rkouj	e7e3c55ad7	Add unit tests for MountVolume() of operation executor	2016-12-27 16:07:06 -08:00

1 2

60 Commits