This removes the DRAControlPlaneController feature gate, the fields controlled
by it (claim.spec.controller, claim.status.deallocationRequested,
claim.status.allocation.controller, class.spec.suitableNodes), the
PodSchedulingContext type, and all code related to the feature.
The feature is being removed because there is no path towards beta and GA, and
DRA with "structured parameters" should be able to replace it.
A dedicated ActionType that is only used when the scheduler itself detects a
change in a pod's list of generated ResourceClaims avoids calling the DRA
plugin for unrelated Pod changes.
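A minimal sketch of such an event registration in the scheduler framework; the
constant used for the dedicated ActionType is an assumed name, not necessarily
the exact identifier:
```
package example

import (
	"k8s.io/kubernetes/pkg/scheduler/framework"
)

// draEvents sketches the registration: the DRA plugin only asks to be woken up
// for pod updates carrying the dedicated action type, so unrelated pod changes
// never reach it. UpdatePodGeneratedResourceClaim is an assumed constant name.
func draEvents() []framework.ClusterEventWithHint {
	return []framework.ClusterEventWithHint{
		{Event: framework.ClusterEvent{
			Resource:   framework.Pod,
			ActionType: framework.UpdatePodGeneratedResourceClaim,
		}},
	}
}
```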
This is a complete revamp of the original API. Some of the key differences
(see the sketch below):
- refocused on structured parameters and allocating devices
- support for constraints across devices
- support for allocating "all" or a fixed amount of similar devices in a single request
- no class for ResourceClaims; instead, individual device requests are associated with a mandatory DeviceClass
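A minimal sketch of a claim in the new shape, assuming the v1alpha3 Go field
names shown here (requests, constraints, allocation mode); the class and
attribute names are made up for illustration:
```
package example

import (
	resourceapi "k8s.io/api/resource/v1alpha3"
	metav1 "k8s.io/apimachinery/pkg/apis/meta/v1"
	"k8s.io/utils/ptr"
)

// exampleClaim asks for two similar devices from a mandatory DeviceClass and
// constrains them to share an attribute value.
var exampleClaim = &resourceapi.ResourceClaim{
	ObjectMeta: metav1.ObjectMeta{Name: "gpu-claim"},
	Spec: resourceapi.ResourceClaimSpec{
		Devices: resourceapi.DeviceClaim{
			Requests: []resourceapi.DeviceRequest{{
				Name:            "gpus",
				DeviceClassName: "example.com-gpu",
				// DeviceAllocationModeAll would request all matching devices instead.
				AllocationMode: resourceapi.DeviceAllocationModeExactCount,
				Count:          2,
			}},
			Constraints: []resourceapi.DeviceConstraint{{
				// All devices of the "gpus" request must share this attribute value.
				Requests:       []string{"gpus"},
				MatchAttribute: ptr.To(resourceapi.FullyQualifiedName("example.com/pcieRoot")),
			}},
		},
	},
}
```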
For the sake of simplicity, optional basic types (ints, strings) where the zero
value is the default are represented as values in the API types. This makes the
Go code simpler because consumers don't have to check for nil and producers can
set values directly. The effect is that in protobuf, these fields always get
encoded because `opt` only has an effect for pointers.
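An illustrative (made-up) type showing the two styles side by side; these are
not fields of the real API:
```
package example

// ExampleSpec is not a real API type; it only contrasts the two conventions.
type ExampleSpec struct {
	// Value form: the zero value "" is the documented default, so consumers
	// never nil-check and producers assign directly; protobuf encodes the
	// field on every object despite "opt".
	Driver string `json:"driver,omitempty" protobuf:"bytes,1,opt,name=driver"`

	// Pointer form: omitted from the protobuf encoding when nil, but every
	// consumer must check for nil before dereferencing.
	NodeName *string `json:"nodeName,omitempty" protobuf:"bytes,2,opt,name=nodeName"`
}
```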
The roundtrip test data for v1.29.0 and v1.30.0 changes because of the new
"request" field. This is considered acceptable because the entire `claims`
field in the pod spec is still alpha.
The implementation is complete enough to bring up the apiserver.
Adapting other components follows.
This is in preparation for revamping the resource.k8s.io API completely. Because
there will be no support for transitioning from v1alpha2 to v1alpha3, the
roundtrip test data for that API in 1.29 and 1.30 gets removed.
Repeating the version in the import name of the API packages is not really
required. It was done for a while to support simpler grepping for usage of
alpha APIs, but there are better ways for that now. So during this transition,
"resourceapi" gets used instead of "resourcev1alpha3" and the version gets
dropped from informer and lister imports. The advantage is that the next bump
to v1beta1 will affect fewer source code lines.
Only source code where the version really matters (like API registration)
retains the versioned import.
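For example (the package and variable are illustrative, the import path is the
one used during this transition):
```
package example

import (
	// Version-less alias: when the API graduates to v1beta1, only the import
	// path changes, not every reference in the file.
	resourceapi "k8s.io/api/resource/v1alpha3"
)

var emptyClaim resourceapi.ResourceClaim
```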
If the node authorizer is active, RBAC rules are not needed. But if it's
disabled, kubelet needs to get permission through RBAC. In contrast to the
authorizer code, which is a bit more flexible and not directly tied to the
current kubelet implementation (it also allows list+delete instead of just
deletecollection), the RBAC entry only covers what the current kubelet does,
because RBAC is easier to change later.
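A sketch of the kind of rule meant here, expressed with the rbacv1 types; the
other verbs the kubelet needs and the wiring into the bootstrap node role are
omitted:
```
package example

import (
	rbacv1 "k8s.io/api/rbac/v1"
)

// resourceSliceCleanupRule grants exactly the bulk-delete operation the current
// kubelet uses to clean up its own ResourceSlices (combined with a field
// selector on the node name in the actual request).
var resourceSliceCleanupRule = rbacv1.PolicyRule{
	APIGroups: []string{"resource.k8s.io"},
	Resources: []string{"resourceslices"},
	Verbs:     []string{"deletecollection"},
}
```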
When a resource gets deleted during migration, the SVM SSA patch
calls are interpreted as a logical create request. Since the object
from storage is nil, the merged result is just a type meta object,
which lacks a name in the body. This fails when the API server
checks that the name from the request URL and the body are the same.
Note that a create request is something that the SVM controller should
never do.
Once the UID is set on the patch, the API server will fail the
request at a slightly earlier point with an "uid mismatch" conflict
error, which the SVM controller can handle gracefully.
Setting UID by itself is not sufficient. When a resource gets
deleted and recreated, if RV is not set but UID is set, we would get
an immutable field validation error for attempting to update the
UID. To address this, we set the resource version on the SSA patch
as well. This will cause that update request to also fail with a
conflict error.
Added the create verb on all resources for the SVM controller RBAC because
otherwise the API server would reject the request before it fails with a
conflict error.
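A minimal sketch of the apply call this describes, assuming a dynamic client
and that the controller still has the object it read; the function and
field-manager names are made up:
```
package example

import (
	"context"
	"encoding/json"

	metav1 "k8s.io/apimachinery/pkg/apis/meta/v1"
	"k8s.io/apimachinery/pkg/apis/meta/v1/unstructured"
	"k8s.io/apimachinery/pkg/runtime/schema"
	"k8s.io/apimachinery/pkg/types"
	"k8s.io/client-go/dynamic"
)

// applyMigration sends an SSA patch that carries the UID and resourceVersion of
// the object being migrated. If the object was deleted (and possibly recreated)
// in the meantime, the apply fails with a conflict instead of being treated as
// a logical create, and the controller can retry gracefully.
func applyMigration(ctx context.Context, client dynamic.Interface, gvr schema.GroupVersionResource, obj *unstructured.Unstructured) error {
	patch := &unstructured.Unstructured{}
	patch.SetAPIVersion(obj.GetAPIVersion())
	patch.SetKind(obj.GetKind())
	patch.SetName(obj.GetName())
	patch.SetNamespace(obj.GetNamespace())
	patch.SetUID(obj.GetUID())                         // deleted object => "uid mismatch" conflict
	patch.SetResourceVersion(obj.GetResourceVersion()) // recreated object => conflict instead of UID update

	data, err := json.Marshal(patch.Object)
	if err != nil {
		return err
	}
	_, err = client.Resource(gvr).Namespace(obj.GetNamespace()).Patch(
		ctx, obj.GetName(), types.ApplyPatchType, data,
		metav1.PatchOptions{FieldManager: "storage-version-migrator"},
	)
	return err
}
```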
The change addresses a host of other issues with the SVM controller:
1. Include failure message in SVM resource
2. Do not block forever on unsynced GC monitor
3. Do not immediately fail on GC monitor being missing, allow for
a grace period since discovery may be out of sync
4. Set higher QPS and burst to handle large migrations
Test changes:
2. Clean up CRD webhook converter logs
2. Allow SVM tests to be run multiple times to make finding flakes easier
3. Create and delete CRs during CRD test to force out any flakes
4. Add a stress test with multiple parallel migrations
5. Enable RBAC on KAS
6. Run KCM directly to exercise wiring and RBAC
7. Better logs during CRD migration
8. Scan audit logs to confirm that the SVM controller never issues create requests
Signed-off-by: Monis Khan <mok@microsoft.com>
Commit 2e34e187c9 enabled kubelet to make List and Watch requests, with the
caveat that kubelet should use a field selector (which it does). The same is now
also needed for DeleteCollection because kubelet will use that to clean up in
one operation instead of several.
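A sketch of the call in question, assuming the typed client and that the
spec.nodeName field selector is the same one the kubelet's list/watch already
uses:
```
package example

import (
	"context"

	metav1 "k8s.io/apimachinery/pkg/apis/meta/v1"
	"k8s.io/client-go/kubernetes"
)

// cleanupResourceSlices removes all ResourceSlices published for this node in a
// single request instead of listing and deleting them one by one. The field
// selector name is an assumption.
func cleanupResourceSlices(ctx context.Context, client kubernetes.Interface, nodeName string) error {
	return client.ResourceV1alpha3().ResourceSlices().DeleteCollection(ctx,
		metav1.DeleteOptions{},
		metav1.ListOptions{FieldSelector: "spec.nodeName=" + nodeName},
	)
}
```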
This makes the API nicer:
  resourceClaims:
  - name: with-template
    resourceClaimTemplateName: test-inline-claim-template
  - name: with-claim
    resourceClaimName: test-shared-claim
Previously, this was:
  resourceClaims:
  - name: with-template
    source:
      resourceClaimTemplateName: test-inline-claim-template
  - name: with-claim
    source:
      resourceClaimName: test-shared-claim
A longer-term benefit is that other, future alternatives
might not make sense under the "source" umbrella.
This is a breaking change. It's justified because DRA is still
alpha and will have several other API breaks in 1.31.
While currently those objects only get published by the kubelet for node-local
resources, this could change once we also support network-attached
resources. Dropping the "Node" prefix enables such a future extension.
The NodeName in ResourceSlice and StructuredResourceHandle then becomes
optional. The kubelet still needs to provide one and it must match its own node
name, otherwise it doesn't have permission to access ResourceSlice objects.
When allocation was done by the scheduler, the controller needs to do the
deallocation because there is no control-plane controller which could react to
"DeallocationRequested".
When a claim uses structured parameters, as indicated by the resource class
flag, the scheduler is responsible for allocating it. To do this it needs to
gather information about available node resources by watching
NodeResourceSlices and then match the in-tree claim parameters against those
resources.
The kubelet running on one node should not be allowed to access
NodeResourceSlice objects belonging to some other node, as defined by the
NodeResourceSlice.NodeName field.
When DynamicResourceAllocation is enabled, the DynamicResources plugin may
fail during scheduling with:
```
E0212 08:57:53.817268 1 framework.go:1323] "Plugin failed" err="podschedulingcontexts.resource.k8s.io \"pod\" is forbidden: cannot set blockOwnerDeletion if an ownerReference refers to a resource you can't set finalizers on: , <nil>" plugin="DynamicResources" pod="gpu-test2/pod"
```
The nftables proxy will now drop traffic directed to unallocated
ClusterIPs and reject traffic directed to invalid ports of
ClusterIPs.
Signed-off-by: Daman Arora <aroradaman@gmail.com>
Controls the lifecycle of ServiceCIDRs: adds finalizers and sets the Ready
condition in status when they are created, and removes the finalizers once it
is safe to do so (no orphaned IPAddresses).
An IPAddress is orphaned if there is no ServiceCIDR containing it.
Change-Id: Icbe31e1ed8525fa04df3b741c8a817e5f2a49e80
* Add slash-ended URLs for service-account-issuer-discovery to match the API in swagger
* update the comment for adding slash-ended URLs
Co-authored-by: Jordan Liggitt <jordan@liggitt.net>
---------
Co-authored-by: Jordan Liggitt <jordan@liggitt.net>
* [API REVIEW] ValidatingAdmissionPolicyStatusController config: worker count.
* ValidatingAdmissionPolicyStatus controller.
* remove CEL typechecking from API server.
* fix initializer tests.
* remove type checking integration tests
from API server integration tests.
* validatingadmissionpolicy-status options.
* grant access to VAP controller.
* add defaulting unit test.
* generated: ./hack/update-codegen.sh
* add OWNERS for VAP status controller.
* type checking test case.
When someone decides that a Pod should definitely run on a specific node, they
can create the Pod with spec.nodeName already set. Some custom scheduler might
do that. Then kubelet starts to check the pod and (if DRA is enabled) will
refuse to run it, either because the claims are still waiting for the first
consumer or the pod wasn't added to reservedFor. Both are things the scheduler
normally does.
Also, a pod can reach the same state if it got scheduled while the DRA feature
was off in the kube-scheduler.
The resource claim controller can handle these two cases by taking over for the
kube-scheduler when nodeName is set. Triggering an allocation is simpler than
in the scheduler because all it takes is creating the right
PodSchedulingContext with spec.selectedNode set. There's no need to list nodes
because that choice was already made, permanently. Adding the pod to
reservedFor also isn't hard.
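A minimal sketch of what "taking over" amounts to, assuming the v1alpha2 types
of that release: a PodSchedulingContext owned by the pod with only the selected
node filled in.
```
package example

import (
	v1 "k8s.io/api/core/v1"
	resourcev1alpha2 "k8s.io/api/resource/v1alpha2"
	metav1 "k8s.io/apimachinery/pkg/apis/meta/v1"
)

// schedulingContextFor builds the PodSchedulingContext that triggers allocation
// for a pod whose node was already chosen up front via spec.nodeName.
func schedulingContextFor(pod *v1.Pod) *resourcev1alpha2.PodSchedulingContext {
	return &resourcev1alpha2.PodSchedulingContext{
		ObjectMeta: metav1.ObjectMeta{
			Name:      pod.Name,
			Namespace: pod.Namespace,
			OwnerReferences: []metav1.OwnerReference{
				*metav1.NewControllerRef(pod, v1.SchemeGroupVersion.WithKind("Pod")),
			},
		},
		Spec: resourcev1alpha2.PodSchedulingContextSpec{
			// The node choice was already made, permanently, so there is no
			// need to list potential nodes.
			SelectedNode: pod.Spec.NodeName,
		},
	}
}
```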
What's currently missing is triggering de-allocation of claims to re-allocate
them for the desired node. This is not important for claims that get created
for the pod from a template and then only get used once, but it might be
worthwhile to add de-allocation in the future.
As discussed during the KEP and code review of dynamic resource allocation for
Kubernetes 1.26, to increase security kubelet should only get read access to
those ResourceClaim objects that are needed by a pod on the node.
This can be enforced easily by the node authorizer:
- the names of the objects are defined by the pod status
- there is no indirection as with volumes (pod -> pvc -> pv)
Normally the graph only gets updated when a pod is not scheduled yet.
Resource claim status changes can still happen after that, so they
also must trigger an update.
Generating the name avoids all potential name collisions. It's not clear how
much of a problem that was because users can avoid them and the deterministic
names for generic ephemeral volumes have not led to reports from users. But
using generated names is not too hard either.
What makes it relatively easy is that the new pod.status.resourceClaimStatus
map stores the generated name for kubelet and node authorizer, i.e. the
information in the pod is sufficient to determine the name of the
ResourceClaim.
The resource claim controller becomes a bit more complex and now needs
permission to modify the pod status. The new failure scenario of "ResourceClaim
created, updating pod status fails" is handled with the help of a new special
"resource.kubernetes.io/pod-claim-name" annotation that together with the owner
reference identifies exactly for what a ResourceClaim was generated, so
updating the pod status can be retried for existing ResourceClaims.
The transition from deterministic names is handled with a special case for that
recovery code path: a ResourceClaim with no annotation and a name that follows
the Kubernetes <= 1.27 naming pattern is assumed to be generated for that pod
claim and gets added to the pod status.
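A sketch of the creation path described above; the generate-name pattern is
illustrative, while the annotation key is the one named in this message:
```
package example

import (
	v1 "k8s.io/api/core/v1"
	resourcev1alpha2 "k8s.io/api/resource/v1alpha2"
	metav1 "k8s.io/apimachinery/pkg/apis/meta/v1"
)

// newClaimForPod builds the ResourceClaim for one entry of pod.spec.resourceClaims.
// The generated name avoids collisions; the owner reference plus the annotation
// identify exactly which pod claim it was generated for, so a failed pod status
// update can be retried against the existing object.
func newClaimForPod(pod *v1.Pod, podClaim v1.PodResourceClaim, spec resourcev1alpha2.ResourceClaimSpec) *resourcev1alpha2.ResourceClaim {
	return &resourcev1alpha2.ResourceClaim{
		ObjectMeta: metav1.ObjectMeta{
			Namespace:    pod.Namespace,
			GenerateName: pod.Name + "-" + podClaim.Name + "-",
			Annotations: map[string]string{
				"resource.kubernetes.io/pod-claim-name": podClaim.Name,
			},
			OwnerReferences: []metav1.OwnerReference{
				*metav1.NewControllerRef(pod, v1.SchemeGroupVersion.WithKind("Pod")),
			},
		},
		Spec: spec, // copied from the ResourceClaimTemplate
	}
}
```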
There's no immediate need for it, but in case it becomes relevant,
the name of the generated ResourceClaim may also be left unset to record that
no claim was needed. Components processing such a pod can skip whatever they
normally would do for the claim. To ensure that they do and also cover other
cases properly ("no known field is set", "must check ownership"),
resourceclaim.Name gets extended.
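A hedged sketch of how a consumer would use the extended helper, assuming a
signature along the lines described above (a nil name when no claim was
generated, plus a flag telling the caller to verify ownership); the lister
parameter and the IsForPod ownership check are assumptions:
```
package example

import (
	v1 "k8s.io/api/core/v1"
	resourcev1alpha2 "k8s.io/api/resource/v1alpha2"
	resourcev1alpha2listers "k8s.io/client-go/listers/resource/v1alpha2"

	"k8s.io/dynamic-resource-allocation/resourceclaim"
)

// claimForPodClaim resolves the ResourceClaim behind one pod claim entry, or
// returns nil if the pod status records that no claim was needed.
func claimForPodClaim(pod *v1.Pod, podClaim v1.PodResourceClaim, lister resourcev1alpha2listers.ResourceClaimLister) (*resourcev1alpha2.ResourceClaim, error) {
	claimName, mustCheckOwner, err := resourceclaim.Name(pod, &podClaim)
	if err != nil {
		// For example: the pod status has no entry for this claim yet.
		return nil, err
	}
	if claimName == nil {
		// Recorded as "no claim was needed": skip whatever would normally be
		// done for the claim.
		return nil, nil
	}
	claim, err := lister.ResourceClaims(pod.Namespace).Get(*claimName)
	if err != nil {
		return nil, err
	}
	if mustCheckOwner {
		// The name followed the old deterministic pattern, so ownership must
		// be verified before using the claim (assumed helper).
		if err := resourceclaim.IsForPod(pod, claim); err != nil {
			return nil, err
		}
	}
	return claim, nil
}
```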