kubernetes

mirror of https://github.com/optim-enterprises-bv/kubernetes.git synced 2025-11-02 11:18:16 +00:00

Author	SHA1	Message	Date
Anish Shah	d4f05fdda5	Introduce a metric to track kubelet admission failure.	2024-11-06 00:07:17 -08:00
Kubernetes Prow Robot	8c5472ce66	Merge pull request #128189 from zylxjtu/bug Fix the incorrect metrics setting/naming in nodeshutdown manager	2024-11-06 02:29:29 +00:00
zylxjtu	459952a067	Windows node graceful shutdown	2024-11-05 17:46:22 +00:00
zylxjtu	04b1378804	Fix the incorrect metrics setting/naming in nodeshutdown manager	2024-10-25 10:16:47 -07:00
Maksym Pavlenko	449f86b0ba	Refactor node shutdown manager Signed-off-by: Maksym Pavlenko <pavlenko.maksym@gmail.com>	2024-10-23 17:36:22 -07:00
Kubernetes Prow Robot	e2c17c09a4	Merge pull request #125070 from torredil/kublet-vm-race Ensure volumes are unmounted during graceful node shutdown	2024-10-02 00:33:48 +01:00
Abhishek Kr Srivastav	95860cff1c	Fix Go vet errors for master golang Co-authored-by: Rajalakshmi-Girish <rajalakshmi.girish1@ibm.com> Co-authored-by: Abhishek Kr Srivastav <Abhishek.kr.srivastav@ibm.com>	2024-09-20 12:36:38 +05:30
torredil	b4a21a3e43	Wait for volume teardown during graceful node shutdown Signed-off-by: torredil <torredil@amazon.com>	2024-09-19 20:39:33 +00:00
carlory	850bc09e9b	clean up codes after PodDisruptionConditions was promoted to GA and locked to default	2024-07-11 10:40:21 +08:00
Kubernetes Prow Robot	4a214f6ad9	Merge pull request #125461 from mimowo/pod-disruption-conditions-ga Graduate PodDisruptionConditions to stable	2024-07-09 11:08:13 -07:00
Matthieu MOREL	0cde5f1e28	fix: enable bool-compare rule from testifylint linter (#125135 ) * fix: enable bool-compare rule from testifylint linter Signed-off-by: Matthieu MOREL <matthieu.morel35@gmail.com> * Update hack/golangci.yaml.in Co-authored-by: Patrick Ohly <patrick.ohly@intel.com> * Update golangci.yaml.in * Update golangci-strict.yaml * Update golangci.yaml.in * Update golangci.yaml.in * Update golangci.yaml.in * Update golangci.yaml.in * Update golangci.yaml * Update golangci-hints.yaml * Update golangci-strict.yaml * Update golangci.yaml.in * Update golangci.yaml * Update mux_test.go --------- Signed-off-by: Matthieu MOREL <matthieu.morel35@gmail.com> Co-authored-by: Patrick Ohly <patrick.ohly@intel.com>	2024-06-28 10:58:05 -07:00
Michal Wozniak	bf0c9885a4	Graduate PodDisruptionConditions to stable	2024-06-28 16:36:51 +02:00
Kubernetes Prow Robot	ba19ecb8c9	Merge pull request #123298 from henry118/spell Fix func name typo	2024-06-24 21:01:40 -07:00
Marek Siarkowicz	3ee8178768	Cleanup defer from SetFeatureGateDuringTest function call	2024-04-24 20:25:29 +02:00
Henry Wang	c4b88d72ea	Fix func name typo Signed-off-by: Henry Wang <henwang@amazon.com>	2024-02-22 16:10:35 +00:00
Qian Xiao	0944c00778	Fix some typo in kubelet component source code	2023-08-03 23:56:50 -07:00
Mike Miranda	feba08a694	Create local copy to avoid potential race condition	2023-03-29 19:50:06 +00:00
Patrick Ohly	961819a4d0	dependencies: update klog v2.90.1 This improves performance of the text formatting and ktesting. Because ktesting no longer buffers messages by default, one unit test needs to ask for that explicitly.	2023-03-01 19:03:50 +01:00
Michal Wozniak	c803892bd8	Enable the feature into beta	2022-11-09 09:02:40 +01:00
Michal Wozniak	52cd6755eb	Add pod disruption conditions for kubelet initiated failures	2022-11-07 11:23:22 +01:00
inosato	3b95d3b076	Remove ioutil in kubelet and its tests Signed-off-by: inosato <si17_21@yahoo.co.jp>	2022-07-30 12:35:26 +09:00
Patrick Ohly	7f55a0bae0	kubelet: avoid manipulating global logger during unit test The code as it stands now works, but it is still complicated and previous versions had race conditions (https://github.com/kubernetes/kubernetes/issues/108040). Now the test works without modifying global state. The individual test cases could run in parallel, this just isn't done because they complete quickly already (2 seconds).	2022-06-24 11:27:40 +02:00
Patrick Ohly	65385fec20	kubelet: convert node shutdown manager to contextual logging This will make output checking easier (done in a separate commit). kubelet itself still uses the global logger.	2022-06-24 11:20:34 +02:00
Clayton Coleman	1d518adb76	kubelet: Pod probes should be handled by pod worker The pod worker is the owner of when a container is running or not, and the start and stop of the probes for a given pod should be handled during the pod sync loop. This ensures that probes do not continue running even after eviction. Because the pod semantics allow lifecycle probes to shorten grace period, the probe is removed after the containers in a pod are terminated successfully. As an optimization, if the pod will have a very short grace period (0 or 1 seconds) we stop the probes immediately to reduce resource usage during eviction slightly. After this change, the probe manager is only called by the pod worker or by the reconcile loop.	2022-06-06 17:00:54 -05:00
Kubernetes Prow Robot	d796dd7d0f	Merge pull request #108193 from utkarsh348/myfeature Fixed race condition in test manager shutdown	2022-03-27 05:55:21 -07:00
Hemant Kumar	13b34d9c77	Use tempdir for shutdown tests	2022-03-24 11:58:49 -04:00
Shiming Zhang	ced991cb00	Emit Metrics in the shutdown process	2022-03-16 10:14:55 +08:00
Shiming Zhang	a1fadab4b0	Atomic write status file	2022-03-11 17:50:33 +08:00
Shiming Zhang	4aed18935e	Add test for storage	2022-03-11 17:31:10 +08:00
Shiming Zhang	5eb3e88f6b	Support metrics for node shutdown	2022-03-11 17:31:10 +08:00
utkarsh348	eaee96efd3	Fixed race condition test manager shutdown	2022-02-18 11:20:02 +05:30
calvin	d9ab5e18d3	fix: data race when hijack klog Signed-off-by: calvin <wen.chen@daocloud.io>	2022-01-24 15:01:49 +08:00
Kubernetes Prow Robot	09fccc3533	Merge pull request #106796 from jonyhy96/fix-timer kubelet: use newtimer instead in nodeshutdown manager	2022-01-06 11:47:12 -08:00
haoyun	92fa957dd1	feat: use clock instead Signed-off-by: haoyun <yun.hao@daocloud.io>	2021-12-10 13:59:12 +08:00
David Porter	95264a418d	kubelet: set failed phase during graceful shutdown Revert to previous behavior in 1.21/1.20 of setting pod phase to failed during graceful node shutdown. Setting pods to failed phase will ensure that external controllers that manage pods like deployments will create new pods to replace those that are shutdown. Many customers have taken a dependency on this behavior and it was breaking change in 1.22, so this change reverts back to the previous behavior. Signed-off-by: David Porter <david@porter.me>	2021-12-09 13:17:40 -08:00
Shiming Zhang	545313bdc7	Implement graceful shutdown based on Pod priority	2021-11-17 11:47:12 +08:00
Shiming Zhang	e47c78a354	Add log for creating node shutdown manager	2021-10-15 11:16:21 +08:00
Shiming Zhang	b468c24e85	Refactor to use structure to pass parameters	2021-10-15 11:16:21 +08:00
Ryan Phillips	e2e938066d	kubelet: add probe termination to graceful shutdowns	2021-09-22 14:13:25 -05:00
Kubernetes Prow Robot	7c71e06cd1	Merge pull request #104959 from calvin0327/issue-test-dataRace fix the test issue of node shutdown manager	2021-09-21 11:56:30 -07:00
calvin0327	db82e282fc	fix the test issue of data race to node shutdown manager	2021-09-13 18:12:19 +08:00
wojtekt	53ce79a18a	Migrate to k8s.io/utils/clock in pkg/kubelet	2021-09-10 12:20:09 +02:00
Stephen Augustus	481cf6fbe7	generated: Run hack/update-gofmt.sh Signed-off-by: Stephen Augustus <foo@auggie.dev>	2021-08-24 15:47:49 -04:00
Kubernetes Prow Robot	8dbc33d649	Merge pull request #101081 from rphillips/add_graceful_shutdown_event kubelet: add graceful shutdown events	2021-08-17 22:08:08 -07:00
Kubernetes Prow Robot	d7c1663556	Merge pull request #103137 from wzshiming/fix/expected_inhibit_delay Allow the actual inhibit delay to be greater than the expected inhibit delay	2021-08-17 11:41:49 -07:00
Kubernetes Prow Robot	a6c2cd7d18	Merge pull request #103291 from wzshiming/fix/nodeshutdown-restart Fix Data Race in nodeshutdown restart	2021-07-09 08:43:14 -07:00
Clayton Coleman	3eadd1a9ea	Keep pod worker running until pod is truly complete A number of race conditions exist when pods are terminated early in their lifecycle because components in the kubelet need to know "no running containers" or "containers can't be started from now on" but were relying on outdated state. Only the pod worker knows whether containers are being started for a given pod, which is required to know when a pod is "terminated" (no running containers, none coming). Move that responsibility and podKiller function into the pod workers, and have everything that was killing the pod go into the UpdatePod loop. Split syncPod into three phases - setup, terminate containers, and cleanup pod - and have transitions between those methods be visible to other components. After this change, to kill a pod you tell the pod worker to UpdatePod({UpdateType: SyncPodKill, Pod: pod}). Several places in the kubelet were incorrect about whether they were handling terminating (should stop running, might have containers) or terminated (no running containers) pods. The pod worker exposes methods that allow other loops to know when to set up or tear down resources based on the state of the pod - these methods remove the possibility of race conditions by ensuring a single component is responsible for knowing each pod's allowed state and other components simply delegate to checking whether they are in the window by UID. Removing containers now no longer blocks final pod deletion in the API server and are handled as background cleanup. Node shutdown no longer marks pods as failed as they can be restarted in the next step. See https://docs.google.com/document/d/1Pic5TPntdJnYfIpBeZndDelM-AbS4FN9H2GTLFhoJ04/edit# for details	2021-07-06 15:55:22 -04:00
Shiming Zhang	212ce7c287	Shorten test time	2021-06-30 09:48:26 +08:00
Shiming Zhang	a42c066af7	Fix Data Race in nodeshutdown restart	2021-06-29 16:23:45 +08:00
Shiming Zhang	97bcfbd674	Allow the actual inhibit delay to be greater than the expected inhibit delay	2021-06-24 14:11:58 +08:00

1 2

75 Commits