kubernetes

mirror of https://github.com/optim-enterprises-bv/kubernetes.git synced 2025-11-02 11:18:16 +00:00

Author	SHA1	Message	Date
Morten Torkildsen	36d8a44b9c	DRA: Update controller for Prioritized Alternatives in Device Requests	2025-02-28 19:32:59 +00:00
Patrick Ohly	4638ba9716	client-go/tools/cache: add APIs with context parameter The context is used for cancellation and to support contextual logging. In most cases, alternative *WithContext APIs get added, except for NewIntegerResourceVersionMutationCache where code searches indicate that the API is not used downstream. An API break around SharedInformer couldn't be avoided because the alternative (keeping the interface unchanged and adding a second one with the new method) would have been worse. controller-runtime needs to be updated because it implements that interface in a test package. Downstream consumers of controller-runtime will work unless they use those test package. Converting Kubernetes to use the other new alternatives will follow. In the meantime, usage of the new alternatives cannot be enforced via logcheck yet (see https://github.com/kubernetes/kubernetes/issues/126379 for the process). Passing context through and checking it for cancellation is tricky for event handlers. A better approach is to map the context cancellation to the normal removal of an event handler via a helper goroutine. Thanks to the new HandleErrorWithLogr and HandleCrashWithLogr, remembering the logger is sufficient for handling problems at runtime.	2024-12-18 18:45:02 +01:00
Patrick Ohly	33ea278c51	DRA: use v1beta1 API No code is left which depends on the v1alpha3, except of course the code implementing that version.	2024-11-06 13:03:19 +01:00
Davanum Srinivas	2b0592ee77	Use k8s.io/utils/lru instead of github.com/golang/groupcache/lru Signed-off-by: Davanum Srinivas <davanum@gmail.com>	2024-11-04 10:51:13 -05:00
Kubernetes Prow Robot	daef8c2419	Merge pull request #127266 from pohly/dra-admin-access-in-status DRA API: AdminAccess in DeviceRequestAllocationResult + DRAAdminAccess feature gate	2024-10-30 03:41:25 +00:00
Kubernetes Prow Robot	c5ccf59974	Merge pull request #128379 from pohly/dra-owners-wg-label DRA: add wg/device-management label automatically	2024-10-29 15:24:57 +00:00
Patrick Ohly	4419568259	DRA: treat AdminAccess as a new feature gated field Using the "normal" logic for a feature gated field simplifies the implementation of the feature gate. There is one (entirely theoretic!) problem with updating from 1.31: if a claim was allocated in 1.31 with admin access, the status field was not set because it didn't exist yet. If a driver now follows the current definition of "unset = off", then it will not grant admin access even though it should. This is theoretic because drivers are starting to support admin access with 1.32, so there shouldn't be any claim where this problem could occur.	2024-10-29 10:22:31 +01:00
Patrick Ohly	9a7e4ccab2	DRA admin access: add feature gate The new DRAAdminAccess feature gate has the following effects: - If disabled in the apiserver, the spec.devices.requests[*].adminAccess field gets cleared. Same in the status. In both cases the scenario that it was already set and a claim or claim template get updated is special: in those cases, the field is not cleared. Also, allocating a claim with admin access is allowed regardless of the feature gate and the field is not cleared. In practice, the scheduler will not do that. - If disabled in the resource claim controller, creating ResourceClaims with the field set gets rejected. This prevents running workloads which depend on admin access. - If disabled in the scheduler, claims with admin access don't get allocated. The effect is the same. The alternative would have been to ignore the fields in claim controller and scheduler. This is bad because a monitoring workload then runs, blocking resources that probably were meant for production workloads.	2024-10-29 09:50:11 +01:00
Patrick Ohly	9d1b0654e0	DRA: add wg/device-management label automatically This makes PRs show up automatically in the WG's project board (https://github.com/orgs/kubernetes/projects/95/views/1).	2024-10-28 16:36:04 +01:00
Patrick Ohly	c2524cbf9b	DRA resourceclaims: maintain metric of total and allocated claims These metrics can provide insights into ResourceClaim usage. The total count is redundant because the apiserver also provides count of resources, but having it in the same sub-system next to the count of allocated claims might be more discoverable and helps monitor the controller itself.	2024-10-18 09:13:42 +02:00
Kubernetes Prow Robot	b1b4e5d397	Merge pull request #128003 from pohly/dra-classic-dra-removal DRA: remove "classic DRA"	2024-10-18 00:55:17 +01:00
Patrick Ohly	d572df2493	DRA resource claim controller: improve log messages Some code paths didn't log anything. One log message about "claim got deleted" was incorrect.	2024-10-17 18:28:55 +02:00
Patrick Ohly	f84eb5ecf8	DRA: remove "classic DRA" This removes the DRAControlPlaneController feature gate, the fields controlled by it (claim.spec.controller, claim.status.deallocationRequested, claim.status.allocation.controller, class.spec.suitableNodes), the PodSchedulingContext type, and all code related to the feature. The feature gets removed because there is no path towards beta and GA and DRA with "structured parameters" should be able to replace it.	2024-10-16 23:09:50 +02:00
Kevin Hannon	03da672159	remove 1.27 deterministic support for resource claims	2024-09-18 08:25:06 -04:00
Patrick Ohly	0fc78b9bcc	DRA resource claim controller: update test The resource claim controller is completely agnostic to the claim spec. It doesn't care about classes or devices, therefore it needs no changes in 1.31 besides the v1alpha2 -> v1alpha3 renaming from a previous commit.	2024-07-22 18:09:34 +02:00
Patrick Ohly	8a629b9f15	DRA: remove "sharable" from claim allocation result Now all claims are shareable up to the limit imposed by the size of the "reserverFor" array. This is one of the agreed simplifications for 1.31.	2024-07-21 17:28:14 +02:00
Patrick Ohly	de5742ae83	DRA: remove immediate allocation As agreed in https://github.com/kubernetes/enhancements/pull/4709, immediate allocation is one of those features which can be removed because it makes no sense for structured parameters and the justification for classic DRA is weak.	2024-07-21 17:28:14 +02:00
Patrick Ohly	b51d68bb87	DRA: bump API v1alpha2 -> v1alpha3 This is in preparation for revamping the resource.k8s.io completely. Because there will be no support for transitioning from v1alpha2 to v1alpha3, the roundtrip test data for that API in 1.29 and 1.30 gets removed. Repeating the version in the import name of the API packages is not really required. It was done for a while to support simpler grepping for usage of alpha APIs, but there are better ways for that now. So during this transition, "resourceapi" gets used instead of "resourcev1alpha3" and the version gets dropped from informer and lister imports. The advantage is that the next bump to v1beta1 will affect fewer source code lines. Only source code where the version really matters (like API registration) retains the versioned import.	2024-07-21 17:28:13 +02:00
Kubernetes Prow Robot	ac9aec9f9b	Merge pull request #125116 from pohly/dra-one-of-source DRA: remove "source" indirection from v1 Pod API	2024-06-28 12:46:45 -07:00
Patrick Ohly	bde9b64cdf	DRA: remove "source" indirection from v1 Pod API This makes the API nicer: resourceClaims: - name: with-template resourceClaimTemplateName: test-inline-claim-template - name: with-claim resourceClaimName: test-shared-claim Previously, this was: resourceClaims: - name: with-template source: resourceClaimTemplateName: test-inline-claim-template - name: with-claim source: resourceClaimName: test-shared-claim A more long-term benefit is that other, future alternatives might not make sense under the "source" umbrella. This is a breaking change. It's justified because DRA is still alpha and will have several other API breaks in 1.31.	2024-06-27 17:53:24 +02:00
Kubernetes Prow Robot	92e0db2bbf	Merge pull request #125640 from googs1025/resourceclaim_controller_log_fix1 added resourceclaim_controller log info	2024-06-27 03:20:10 -07:00
googs1025	5f8fb17652	added resourceclaim_controller log info Signed-off-by: googs1025 <googs1025@gmail.com>	2024-06-26 18:38:11 +08:00
Patrick Ohly	2da9e660e3	resourceclaim controller: add missing log output The logging was fairly complete about not doing something, but the actual ResourceClaim creation was not logged.	2024-06-25 16:12:31 +02:00
liyuerich	8e97c0ff7d	drop deprecated pointer package in controller Signed-off-by: liyuerich <yue.li@daocloud.io> Update job_controller.go Signed-off-by: liyuerich <yue.li@daocloud.io>	2024-05-09 11:34:25 +08:00
Kubernetes Prow Robot	1dc30bf90f	Merge pull request #124600 from alvaroaleman/typed-wq Use the generic/typed workqueue throughout	2024-05-06 16:18:31 -07:00
carlory	76aa289608	bugfix: resourceclaim forgot to wait for podSchedulingSynced and templatesSynced	2024-05-06 16:56:16 +08:00
Alvaro Aleman	6d0ac8c561	Use the generic/typed workqueue throughout This change makes us use the generic workqueue throughout the project in order to improve type safety and readability of the code.	2024-05-04 14:33:12 -04:00
Kubernetes Prow Robot	eb2a59e8d8	Merge pull request #124214 from Monokaix/dev fix wrong comments of dra	2024-04-18 03:24:28 -07:00
Xuzheng Chang	3e08030d53	fix wrong comments of dra Signed-off-by: Xuzheng Chang <changxuzheng@huawei.com>	2024-04-09 09:41:25 +08:00
Patrick Ohly	4126e37f08	dra controller: unit tests	2024-03-22 10:03:22 +01:00
Patrick Ohly	3de376ecf6	dra controller: support structured parameters When allocation was done by the scheduler, the controller needs to do the deallocation because there is no control-plane controller which could react to "DeallocationRequested".	2024-03-07 22:22:13 +01:00
Mengjiao Liu	b584b87a94	kube-controller-manager: readjust log verbosity - Increase the global level for broadcaster's logging to 3 so that users can ignore event messages by lowering the logging level. It reduces information noise. - Making sure the context is properly injected into the broadcaster, this will allow the -v flag value to be used also in that broadcaster, rather than the above global value. - test: use cancellation from ktesting - golangci-hints: checked error return value	2024-02-26 14:51:56 +08:00
Patrick Ohly	3c2cfd9a4f	resource claim controller: separate generated suffix from base When the resource claim name inside the pod had some suffix like "1a" in "resource-1a", the generated name suffix got added directly after that, leading to "my-pod-resource-1ax6zgt". Adding another hyphen makes the result more readable: "my-pod-resource-1a-x6zgt".	2023-09-04 09:45:25 +02:00
Patrick Ohly	80ab8f0542	dra: handle scheduled pods in kube-controller-manager When someone decides that a Pod should definitely run on a specific node, they can create the Pod with spec.nodeName already set. Some custom scheduler might do that. Then kubelet starts to check the pod and (if DRA is enabled) will refuse to run it, either because the claims are still waiting for the first consumer or the pod wasn't added to reservedFor. Both are things the scheduler normally does. Also, if a pod got scheduled while the DRA feature was off in the kube-scheduler, a pod can reach the same state. The resource claim controller can handle these two cases by taking over for the kube-scheduler when nodeName is set. Triggering an allocation is simpler than in the scheduler because all it takes is creating the right PodSchedulingContext with spec.selectedNode set. There's no need to list nodes because that choice was already made, permanently. Adding the pod to reservedFor also isn't hard. What's currently missing is triggering de-allocation of claims to re-allocate them for the desired node. This is not important for claims that get created for the pod from a template and then only get used once, but it might be worthwhile to add de-allocation in the future.	2023-07-13 21:27:11 +02:00
Patrick Ohly	cffbb1f1b2	dra controller: enhance testing The allocation mode is relevant when clearing the reservedFor: for delayed allocation, deallocation gets requested, for immediate allocation not. Both should get tested. All pre-defined claims now use delayed allocation, just as they would if created normally.	2023-07-13 21:27:11 +02:00
Patrick Ohly	5cec6d798c	dra: revamp event handlers in kube-controller-manager Enabling logging is useful to track what the code is doing. There are some functional changes: - The pod handler checks for existence of claims. This avoids adding pods to the work queue in more cases when nothing needs to be done, at the cost of making the event handlers a bit slower. This will become more important when adding more work to the controller - The handler for deleted ResourceClaim did not check for cache.DeletedFinalStateUnknown.	2023-07-13 21:27:11 +02:00
Patrick Ohly	98ba89d31d	resourceclaim controller: avoid caching deleted pod unnecessarily We don't need to remember that a pod got deleted when it had no resource claims because the code which checks the cached UIDs only checks for pods which have resource claims.	2023-07-12 16:57:17 +02:00
Patrick Ohly	fec25785ee	dra: store generated ResourceClaims in cache This addresses the following bad sequence of events: - controller creates ResourceClaim - updating pod status fails - pod gets retried before the informer receives the created ResourceClaim - another ResourceClaim gets created Storing the generated ResourceClaim in a MutationCache ensures that the controller knows about it during the retry. A positive side effect is that ResourceClaims now get index by pod owner and thus iterating over existing ones becomes a bit more efficient.	2023-07-11 14:23:49 +02:00
Patrick Ohly	444d23bd2f	dra: generated name for ResourceClaim from template Generating the name avoids all potential name collisions. It's not clear how much of a problem that was because users can avoid them and the deterministic names for generic ephemeral volumes have not led to reports from users. But using generated names is not too hard either. What makes it relatively easy is that the new pod.status.resourceClaimStatus map stores the generated name for kubelet and node authorizer, i.e. the information in the pod is sufficient to determine the name of the ResourceClaim. The resource claim controller becomes a bit more complex and now needs permission to modify the pod status. The new failure scenario of "ResourceClaim created, updating pod status fails" is handled with the help of a new special "resource.kubernetes.io/pod-claim-name" annotation that together with the owner reference identifies exactly for what a ResourceClaim was generated, so updating the pod status can be retried for existing ResourceClaims. The transition from deterministic names is handled with a special case for that recovery code path: a ResourceClaim with no annotation and a name that follows the Kubernetes <= 1.27 naming pattern is assumed to be generated for that pod claim and gets added to the pod status. There's no immediate need for it, but just in case that it may become relevant, the name of the generated ResourceClaim may also be left unset to record that no claim was needed. Components processing such a pod can skip whatever they normally would do for the claim. To ensure that they do and also cover other cases properly ("no known field is set", "must check ownership"), resourceclaim.Name gets extended.	2023-07-11 14:23:48 +02:00
Kubernetes Prow Robot	6f9d1d38d8	Merge pull request #118817 from pohly/dra-delete-claims DRA: improve handling of completed pods	2023-07-06 10:15:15 -07:00
Patrick Ohly	a514f40131	dra resourceclaim controller: delete generated claims when pod is done When a pod is done, but not getting removed yet for while, then a claim that got generated for that pod can be deleted already. This then also triggers deallocation.	2023-07-05 16:10:20 +02:00
Patrick Ohly	e8a0c42212	dra resourceclaim controller: remove reservation for completed pods When a pod is known to never run (again), the reservation for it also can be removed. This is relevant in particular for the job controller.	2023-07-05 16:10:20 +02:00
Patrick Ohly	7f5a02fc7e	dra resourceclaim controller: enhance logging Adding logging to event handlers makes it more obvious why (or why not) claims and pods need to be processed.	2023-07-05 16:10:20 +02:00
Patrick Ohly	d1ba893ad8	dra resourceclaim controller: refactor isPodDone This covers pods that get deleted before running and will be used more than once soon.	2023-07-05 16:09:41 +02:00
Patrick Ohly	1b47e6433b	dra delayed allocation: deallocate when a pod is done This releases the underlying resource sooner and ensures that another consumer can get scheduled without being influenced by a decision that was made for the previous consumer. An alternative would have been to have the apiserver trigger the deallocation whenever it sees the `status.reservedFor` getting reduced to zero. But that then also triggers deallocation when kube-scheduler removes the last reservation after a failed scheduling cycle. In that case we want to keep the claim allocated and let the kube-scheduler decide on a case-by-case basis which claim should get deallocated.	2023-06-29 09:47:30 +02:00
Patrick Ohly	99151c39b7	kube-controller-manager: convert to structured logging Most of the individual controllers were already converted earlier. Some log calls were missed or added and then not updated during a rebase. Some of those get updated here to fill those gaps. Adding of the name to the logger used by each controller gets consolidated in this commit. By using the name under which the controller is registered we ensure that the names in the log are consistent.	2023-03-14 19:16:32 +01:00
Kubernetes Prow Robot	49649c89ea	Merge pull request #113584 from yangjunmyfm192085/volume-contextual-logging volume: use contextual logging	2023-03-14 10:40:16 -07:00
Patrick Ohly	29941b8d3e	api: resource.k8s.io v1alpha1 -> v1alpha2 For Kubernetes 1.27, we intend to make some breaking API changes: - rename PodScheduling -> PodSchedulingHints (https://github.com/kubernetes/kubernetes/issues/114283) - extend ResourceClaimStatus (https://github.com/kubernetes/enhancements/pull/3802) We need to switch from v1alpha1 to v1alpha2 for that.	2023-03-14 07:52:03 +01:00
杨军10092085	361e4ff0fa	volume: use contextual logging	2023-03-14 08:37:30 +08:00
Patrick Ohly	0e1139d027	dra: avoid goroutine leaks from event broadcaster When using these controllers in test/integration/scheduler_perf, the goroutine leak check there pointed out that broadcaster.Shutdown function wasn't called and thus goroutines leaked during a test.	2023-02-15 15:14:27 +01:00

1 2

52 Commits