kubernetes

mirror of https://github.com/optim-enterprises-bv/kubernetes.git synced 2025-11-03 03:38:15 +00:00

Author	SHA1	Message	Date
Francesco Romani	2a99bfc3d1	node: cm: don't share containerMap instances between managers Since the GA graduation of memory manager in https://github.com/kubernetes/kubernetes/pull/128517 we are sharing the initial container map across managers. The intention of this sharing was not to actually share a data structure, but 1. save the relatively expensive relisting from runtime 2. have all the managers share a consistent view - even though the chance for misalignement tend to be tiny. The unwanted side effect though is now all the managers race to modify a data shared, not thread safe data structure. The fix is to clone (deepcopy) the computed map when passing it to each manager. This restores the old semantic of the code. This issue brings the topic of possibly managers go out of sync since each of them maintain a private view of the world. This risk is real, yet this is how the code worked for most of the lifetime, so the plan is to look at this and evaluate possible improvements later on. Signed-off-by: Francesco Romani <fromani@redhat.com>	2024-11-07 16:02:55 +01:00
zhangzhifei16	1381e41f28	feat: Integrate device plugin registration gRPC server health checks.	2024-11-05 19:59:56 +08:00
Kubernetes Prow Robot	4932adf80d	Merge pull request #125296 from jsturtevant/windows-numa-support Support CPU and Topology manager on Windows	2024-11-05 00:33:28 +00:00
Kubernetes Prow Robot	b4d91d1b8a	Merge pull request #128517 from Tal-or/memorymanager-ga-features node: kubelet: memmgr: Memory Manager to GA	2024-11-04 17:19:36 +00:00
Talor Itzhak	a56fde3694	lint: fix linter's complaints Signed-off-by: Talor Itzhak <titzhak@redhat.com>	2024-11-04 09:59:39 +02:00
Talor Itzhak	37bcd38785	memorymanager: FG is locked to ON by default We can remove the `if()` guards, since the feature is always available. Signed-off-by: Talor Itzhak <titzhak@redhat.com>	2024-11-04 09:57:21 +02:00
Sergey Kanzhelev	1297d0cdd1	converge DRA and Device Plugin plugins registration	2024-10-30 16:58:13 +00:00
James Sturtevant	4d25c25eb0	Support CPU and Memory manager on Windows Signed-off-by: James Sturtevant <jsturtevant@gmail.com>	2024-10-25 09:14:53 -07:00
Kir Kolyshkin	54d43ecaed	pkg/kubelet/user/userns: remove, use moby/sys/userns The code from github.com/opencontainers/runc/libcontainer/userns package was moved into github.com/moby/sys/user and github.com/moby/sys/userns (see [1]), and the runc package is now deprecated in favor of moby/sys (see [2]). In addition, moby/sys/userns now has a non-Linux implementation, so pkg/kubelet/user/userns package (introduced in commit `2e999ff` to make a non-Linux implementation) is not really needed anymore. Let's switch to moby/sys/userns, and remove the package. [1]: https://github.com/moby/sys/releases/tag/userns%2Fv0.1.0 [2]: https://github.com/opencontainers/runc/pull/4350 Signed-off-by: Kir Kolyshkin <kolyshkin@gmail.com>	2024-10-22 14:36:14 -07:00
Ed Bartosh	e1bc8defac	kubelet: Migrate DRA Manager to contextual logging Co-authored-by: Patrick Ohly <patrick.ohly@intel.com>	2024-08-22 11:12:41 +03:00
Sergey Kanzhelev	62f96d2748	set AllocatedResourcesStatus in the Pod Status	2024-07-24 00:29:35 +00:00
Patrick Ohly	877829aeaa	DRA kubelet: adapt to v1alpha3 API This adds the ability to select specific requests inside a claim for a container. NodePrepareResources is always called, even if the claim is not used by any container. This could be useful for drivers where that call has some effect other than injecting CDI device IDs into containers. It also ensures that drivers can validate configs. The pod resource API can no longer report a class for each claim because there is no such 1:1 relationship anymore. Instead, that API reports claim, API devices (with driver/pool/device as ID) and CDI device IDs. The kubelet itself doesn't extract that information from the claim. Instead, it relies on drivers to report this information when the claim gets prepared. This isolates the kubelet from API changes. Because of a faulty E2E test, kubelet was told to contact the wrong driver for a claim. This was not visible in the kubelet log output. Now changes to the claim info cache are getting logged. While at it, naming of variables and some existing log output gets harmonized. Co-authored-by: Oksana Baranova <oksana.baranova@intel.com> Co-authored-by: Ed Bartosh <eduard.bartosh@intel.com>	2024-07-22 18:09:34 +02:00
Harshal Patil	68d317a8d1	Add a warning log, event and metric for cgroup version 1 Signed-off-by: Harshal Patil <harpatil@redhat.com>	2024-07-09 11:34:46 -04:00
Itamar Holder	a6b971f14b	Use kubelet owned directories for mounting rather than /tmp Signed-off-by: Itamar Holder <iholder@redhat.com>	2024-05-21 13:18:16 +03:00
Itamar Holder	74f29880bd	Replace log entry by a warning event Signed-off-by: Itamar Holder <iholder@redhat.com>	2024-05-21 13:18:16 +03:00
Itamar Holder	29535c0463	Warn of swap is enabled on the OS and tmpfs noswap is not supported When --fail-swap-on=false kubelet CLI argument is provided, but tmpfs noswap is not supported by the kernel, warn about the risks of memory-backed volumes being swapped into disk Signed-off-by: Itamar Holder <iholder@redhat.com>	2024-05-21 13:18:16 +03:00
Kevin Klues	639e887631	kubelet: DRA: add a reconcile loop to unprepare claims for deleted pods Signed-off-by: Kevin Klues <kklues@nvidia.com>	2024-05-03 13:23:29 +00:00
Kevin Klues	a8931c6c25	kubelet: DRA: update locking/checkpoint semantics of the claimInfo cache Signed-off-by: Kevin Klues <kklues@nvidia.com>	2024-05-03 13:23:27 +00:00
Patrick Ohly	d59676a545	dra kubelet: publish NodeResourceSlices The information is received from the DRA driver plugin through a new gRPC streaming interface. This is backwards compatible with old DRA driver kubelet plugins, their gRPC server will return "not implemented" and that can be handled by kubelet. Therefore no API break is needed. However, DRA drivers need to be updated because the Go API changed. They can return status.New(codes.Unimplemented, "no node resource support").Err() if they don't support the new ListAndWatchResources method and structured parameters. The controller in kubelet then synchronizes this information from the driver with NodeResourceSlice objects, creating, updating and deleting them as needed.	2024-03-07 22:22:13 +01:00
Akihiro Suda	2e999fff02	Fix compiling e2e.test on macOS Fix issue 122650 (regression in PR 122552) ``` $ make WHAT=test/e2e/e2e.test +++ [0109 10:06:53] Building go targets for darwin/amd64 k8s.io/kubernetes/test/e2e/e2e.test (test) package k8s.io/kubernetes/test/e2e imports k8s.io/kubernetes/test/e2e/common imports k8s.io/kubernetes/test/e2e/common/node imports k8s.io/kubernetes/pkg/kubelet imports github.com/opencontainers/runc/libcontainer/userns: C source files not allowed when not using cgo or SWIG: userns_maps.c !!! [0109 10:06:54] Call tree: !!! [0109 10:06:54] 1: /Users/suda/gopath/src/k8s.io/kubernetes/hack/lib/golang.sh:948 kube::golang::build_binaries_for_platform(...) !!! [0109 10:06:54] 2: hack/make-rules/build.sh:27 kube::golang::build_binaries(...) !!! [0109 10:06:54] Call tree: !!! [0109 10:06:54] 1: hack/make-rules/build.sh:27 kube::golang::build_binaries(...) !!! [0109 10:06:54] Call tree: !!! [0109 10:06:54] 1: hack/make-rules/build.sh:27 kube::golang::build_binaries(...) make: *** [all] Error 1 ``` Signed-off-by: Akihiro Suda <akihiro.suda.cz@hco.ntt.co.jp>	2024-01-09 10:42:20 +09:00
Gunju Kim	a0610a97b3	pkg/kubelet/cm: Remove deprecated sets.String and sets.Int This removes deprecated sets.String and sets.Int - replace sets.String with sets.Set[string] - replace sets.Int with sets.Set[int] - replace sets.NewString with sets.New[string] - replace sets.NewInt with sets.New[int] - replace sets.(OLD).List with sets.List(NEW)	2023-09-27 22:02:15 +09:00
Kubernetes Prow Robot	bdcf812c95	Merge pull request #118254 from elezar/4009/add-cdi-devices-to-device-plugin Add CDI devices to device plugin API	2023-07-17 05:21:08 -07:00
Evan Lezar	b57c7e2fe4	Add CDI devices to device plugin API This change adds CDI device IDs to the ContainerAllocateResponse in the device plugin API. This allows a device plugin to specify CDI devices by their unique fully-qualified CDI device names using the related field in the CRI specification. Signed-off-by: Evan Lezar <elezar@nvidia.com>	2023-07-17 11:53:09 +02:00
Kubernetes Prow Robot	900237fada	Merge pull request #118635 from ffromani/devmgr-check-pod-running kubelet: devices: skip allocation for running pods	2023-07-15 05:43:16 -07:00
Francesco Romani	3bcf4220ec	kubelet: devices: skip allocation for running pods When kubelet initializes, runs admission for pods and possibly allocated requested resources. We need to distinguish between node reboot (no containers running) versus kubelet restart (containers potentially running). Running pods should always survive kubelet restart. This means that device allocation on admission should not be attempted, because if a container requires devices and is still running when kubelet is restarting, that container already has devices allocated and working. Thus, we need to properly detect this scenario in the allocation step and handle it explicitely. We need to inform the devicemanager about which pods are already running. Note that if container runtime is down when kubelet restarts, the approach implemented here won't work. In this scenario, so on kubelet restart containers will again fail admission, hitting https://github.com/kubernetes/kubernetes/issues/118559 again. This scenario should however be pretty rare. Signed-off-by: Francesco Romani <fromani@redhat.com>	2023-07-12 13:25:36 +02:00
PiotrProkop	f855a23b45	topologymanager: promote TopologyManagerPolicyOptions feature to beta * Promote TopologyManagerPolicyOptions feature to beta * Promote PreferClosestNUMANodes TopologyManagerPolicyOption to beta Signed-off-by: PiotrProkop <pprokop@nvidia.com>	2023-07-11 15:06:57 +02:00
Moshe Levi	04ad946e8f	kubelet dra: lock before getting claimInfo CDIDevices and annotations fields Currently claimInfo CDIDevices and annotations access directly without RLock. This can lead to concurrent read write error. To avoid it we added RLock all before getting the CDIDevices and annotations Signed-off-by: Moshe Levi <moshele@nvidia.com>	2023-05-01 15:09:43 +03:00
Moshe Levi	2a568bcfc8	kubelet podresources: extend List to support Dynamic Resources and implement Get API Signed-off-by: Moshe Levi <moshele@nvidia.com>	2023-03-14 19:33:04 +02:00
Kubernetes Prow Robot	a4a0fd44d8	Merge pull request #115912 from moshe010/dra-checkpoint kubelet DRA: Add checkpointing mechanism in the DRA Manager	2023-03-12 12:20:40 -07:00
Moshe Levi	e7256e08d3	kubelet dra: add checkpointing mechanism in the DRA Manager The checkpointing mechanism will repopulate DRA Manager in-memory cache on kubelet restart. This will ensure that the information needed by the PodResources API is available across a kubelet restart. The ClaimInfoState struct represent the DRA Manager in-memory cache state in checkpoint. It is embedd in the ClaimInfo which also include the annotation field. The separation between the in-memory cache and the cache state in the checkpoint is so we won't be tied to the in-memory cache struct which may change in the future. In the ClaimInfoState we save the minimal required fields to restore the in-memory cache. Signed-off-by: Moshe Levi <moshele@nvidia.com>	2023-03-10 12:22:15 +02:00
Kubernetes Prow Robot	f6564d33ba	Merge pull request #114357 from dengyufeng2206/1208pull Log spelling formatting	2023-03-09 21:33:22 -08:00
Swati Sehgal	937d330393	node: topologymgr: Remove ResourceAllocator as TM is always enabled With Topology Manager enabled by default, we no longer need `resourceAllocator` as Topology Manager serves as the main PodAdmitHandler completely responsible for admission check based on hints received from the hintProviders and the subsequent allocation of the corresponding resources to a pod as can be seen here: https://github.com/kubernetes/kubernetes/blob/v1.26.0/pkg/kubelet/cm/topologymanager/scope.go#L150 With regard to DRA, the passing of `cm.draManager` into resourceAllocator seems redundant as no admission checks (and allocation of resources handled by DRA) is taking place in `Admit` method of resourceAllocator. DRA has a completely different model to the rest of the resource managers where pod is only scheduled on a node once resources are reserved for it. Because of this, admission checks or waiting for resources to be provisioned after the pod has been scheduled on the node is not required. Before making the above change, it was verified that DRA Manager is instantiated in `NewContainerManager`: https://github.com/kubernetes/kubernetes/blob/v1.26.0/pkg/kubelet/cm/container_manager_linux.go#L318 Signed-off-by: Swati Sehgal <swsehgal@redhat.com>	2023-03-06 12:51:11 +00:00
Swati Sehgal	6a62f0236a	node: topologymgr: trivial internal variable renaming Since Topology manager is graduating to GA, we remove internal configuration variable names with `Experimental` prefix. There is no expected change in behavior, only trival variable renaming. Signed-off-by: Swati Sehgal <swsehgal@redhat.com>	2023-03-06 12:51:11 +00:00
Swati Sehgal	d536a342b4	node: topologymgr: GA graduation implies Feature Gate is ON by default Signed-off-by: Swati Sehgal <swsehgal@redhat.com>	2023-03-06 12:51:05 +00:00
Sergey Kanzhelev	04189b1fc4	rename ExperimentalPodPidsLimit to PodPidsLimit	2023-03-04 01:48:16 +00:00
Ed Bartosh	5a86895070	DRA: pass CDI devices through CRI CDIDevice field	2023-02-28 19:21:20 +02:00
Ed Bartosh	4f88332ab4	kubelet: prepare DRA resources before CNI setup	2023-02-06 20:40:11 +02:00
Ian K. Coolidge	e5143d16c2	cpuset: Make 'ToSlice*' methods look like 'set' methods In 'set', conversions to slice are done also, but with different names: ToSliceNoSort() -> UnsortedList() ToSlice() -> List() Reimplement List() in terms of UnsortedList to save some duplication.	2023-01-06 23:32:51 +00:00
Ian K. Coolidge	a0c989b99a	cpuset: Remove *Int64 methods These are rarely used and can be accommodated with a trivial helper.	2023-01-06 23:32:51 +00:00
dengyufeng2206	8525cfab02	Log spelling formatting	2022-12-08 15:02:19 +08:00
Ed Bartosh	abcb56defb	kubelet: do not enter termination status if pod might need to unprepare resources	2022-11-11 21:58:03 +01:00
Ed Bartosh	ae0f38437c	kubelet: add support for dynamic resource allocation Dependencies need to be updated to use github.com/container-orchestrated-devices/container-device-interface. It's not decided yet whether we will implement Topology support for DRA or not. Not having any toppology-related code will help to avoid wrong impression that DRA is used as a hint provider for the Topology Manager.	2022-11-11 21:58:03 +01:00
Kubernetes Prow Robot	243ba086e7	Merge pull request #112914 from PiotrProkop/topology-manager-policies-flag node: topologymanager: Improved multi-numa alignment in Topology Manager	2022-11-07 16:00:51 -08:00
David Ashpole	64af1adace	Second attempt: Plumb context to Kubelet CRI calls (#113591 ) * plumb context from CRI calls through kubelet * clean up extra timeouts * try fixing incorrectly cancelled context	2022-11-05 06:02:13 -07:00
PiotrProkop	75bb437a6b	Improved multi-numa alignment in Topology Manager: implement closest numa policy Signed-off-by: PiotrProkop <pprokop@nvidia.com>	2022-11-03 10:45:25 +01:00
Kubernetes Prow Robot	433787d25b	Merge pull request #113018 from fromanirh/cpumanager-ga-features node: kubelet: cpumgr: CPU Manager to GA	2022-11-02 14:41:01 -07:00
Francesco Romani	a6b928d90c	kubelet: cpumgr: internal variable trivial rename CPUManager is going GA, thus it makes little sense to keep the names of the internal configuration variables `Experimental*`. Trivial rename only. Signed-off-by: Francesco Romani <fromani@redhat.com>	2022-11-02 18:41:42 +01:00
Francesco Romani	ff44dc1932	cpumanager: the FG is locked to default (ON) hence we can remove the if() guards, the feature is always available. Signed-off-by: Francesco Romani <fromani@redhat.com>	2022-11-02 18:41:41 +01:00
Antonio Ojea	9c2b333925	Revert "plumb context from CRI calls through kubelet" This reverts commit `f43b4f1b95`.	2022-11-02 13:37:23 +00:00
Swati Sehgal	8b29eded52	node: devicemgr: Remove `devicePluginEnabled` field from container mgr With graduation of device plugins to GA in 1.26, the feature gate is enabled by default so `devicePluginEnabled` field no longer needs to be passed at the time of Container Manager creation. In addition to that, we remove the `ManagerStub` as it is no longer needed. Signed-off-by: Swati Sehgal <swsehgal@redhat.com>	2022-11-02 11:05:20 +00:00

274 Commits