131921 Commits

Author SHA1 Message Date
Ivan Valdes
b562335cb3 Build etcd v3.6.4 image 2025-07-25 13:51:25 -07:00
Kubernetes Prow Robot
a493bafd02 Merge pull request #133156 from ritazh/draadminaccess-update-flake
DRAAdminAccess: move metrics test from e2e to integration
2025-07-25 13:40:27 -07:00
Maciej Skoczeń
17d733e243 KEP-5229: Send API calls through dispatcher and cache 2025-07-25 15:35:36 +00:00
Kubernetes Prow Robot
f9ed14bf9b Merge pull request #133214 from pohly/dra-e2e-stress-test-removal
DRA E2E: remove "supports reusing resources" stress test
2025-07-25 06:40:29 -07:00
Patrick Ohly
40a90df3b3 DRA E2E: remove stress test
The test needs to schedule 256 pods at once, which only works with three
nodes (default limit is 100, but could also be lower). It's also a stress test
which flaked recently.

For now it gets removed without a replacement. A similar integration test is in
development, but too big (needs some infrastructure changes in
test/integration/dra) to add during code freeze.
2025-07-25 12:45:01 +02:00
utam0k
b956484c25 KEP-5229: Add metrics for async API dispatcher
Signed-off-by: utam0k <k0ma@utam0k.jp>
2025-07-25 19:29:14 +09:00
Kubernetes Prow Robot
b93bcbfba1 Merge pull request #133195 from nojnhuh/dra-ga-comment
Update DRA Pod spec comments
2025-07-25 00:32:35 -07:00
Kubernetes Prow Robot
6d4ca967f7 Merge pull request #132824 from roycaihw/psi-pressure-test
Extend E2E test coverage for PSI metrics under pressure
2025-07-25 00:32:27 -07:00
Kubernetes Prow Robot
1e04b170e4 Merge pull request #133203 from pacoxu/revert-133167-preemption-conor-case
Revert "fix: handle corner cases in the async preemption"
2025-07-24 22:18:26 -07:00
Kubernetes Prow Robot
e72c318f72 Merge pull request #133192 from yongruilin/golangci-lint-config
fix(golangci-lint): skip config verification when -c none is used
2025-07-24 20:14:41 -07:00
Kubernetes Prow Robot
9409af4b80 Merge pull request #133180 from ylink-lfs/chore/ptr_cast_replace
chore: replace ptr caster with unified ptr.To
2025-07-24 20:14:34 -07:00
Kubernetes Prow Robot
72f9a9260a Merge pull request #130606 from Jpsassine/dra_device_health_status
Expose DRA device health in PodStatus
2025-07-24 20:14:27 -07:00
Paco Xu
006d7620a8 Revert "fix: handle corner cases in the async preemption" 2025-07-25 10:38:34 +08:00
Kubernetes Prow Robot
b09f1bfe12 Merge pull request #132902 from haircommander/userns-metrics
KEP-127: kubelet: add metrics for userns pods
2025-07-24 19:08:41 -07:00
Kubernetes Prow Robot
fdefb8a9e6 Merge pull request #132822 from roycaihw/psi-metrics-beta
Promote PSI metrics feature to beta
2025-07-24 19:08:33 -07:00
Kubernetes Prow Robot
3fd1251165 Merge pull request #131089 from KevinTMtz/pod-level-hugepage-cgroups
[PodLevelResources] Propagate Pod level hugepage cgroup to containers
2025-07-24 19:08:26 -07:00
Kubernetes Prow Robot
2d5f58a0ed Merge pull request #133176 from liggitt/psa-host-probe-emulation
Make PSA host enforcement honor emulation version
2025-07-24 17:59:27 -07:00
Kubernetes Prow Robot
802198e2da Merge pull request #133113 from alaypatel07/resource-claim-concurrent-syncs
bump resource claim controller concurrent syncs to 50
2025-07-24 17:59:20 -07:00
Kubernetes Prow Robot
7c2cc0fa0d Merge pull request #133062 from zylxjtu/shutdown
Promote windows graceful shutdown to Beta
2025-07-24 17:59:13 -07:00
Kubernetes Prow Robot
1451dd1b08 Merge pull request #132942 from thockin/kyaml
Add KYAML support to kubectl
2025-07-24 17:59:06 -07:00
Kubernetes Prow Robot
ba9ba5f92a Merge pull request #132940 from guptaNswati/kubelet-podresources-dra-beta
KEP-3695: add kubeletPodResources feature gate to Beta
2025-07-24 17:59:00 -07:00
Eddie
727a6e6db5 Reject pod when attachment limit is exceeded (#132933)
* Reject pod when attachment limit is exceeded

Signed-off-by: Eddie Torres <torredil@amazon.com>

* Record admission rejection

Signed-off-by: Eddie Torres <torredil@amazon.com>

* Fix pull-kubernetes-linter-hints

Signed-off-by: Eddie Torres <torredil@amazon.com>

* Fix AD Controller unit test failure

Signed-off-by: Eddie Torres <torredil@amazon.com>

* Consolidate error handling logic in WaitForAttachAndMount

Signed-off-by: Eddie Torres <torredil@amazon.com>

* Improve error context

Signed-off-by: Eddie Torres <torredil@amazon.com>

* Update admissionRejectionReasons to include VolumeAttachmentLimitExceededReason

Signed-off-by: Eddie Torres <torredil@amazon.com>

* Update status message

Signed-off-by: Eddie Torres <torredil@amazon.com>

* Add TestWaitForAttachAndMountVolumeAttachLimitExceededError unit test

Signed-off-by: Eddie Torres <torredil@amazon.com>

* Add e2e test

Signed-off-by: Eddie Torres <torredil@amazon.com>

* Fix pull-kubernetes-linter-hints

Signed-off-by: Eddie Torres <torredil@amazon.com>

---------

Signed-off-by: Eddie Torres <torredil@amazon.com>
2025-07-24 17:58:54 -07:00
Kubernetes Prow Robot
a5d8ab60ef Merge pull request #132632 from sdowell/gc-rv-race
fix: add RV check on GC delete calls
2025-07-24 17:58:47 -07:00
Kubernetes Prow Robot
8a85a750c1 Merge pull request #132515 from gemmahou/ccm-33.0.0
Update CCM image to v33.1.1
2025-07-24 17:58:40 -07:00
Tiffany Pei
a652896307 Allow white-spaced CABundle during webhook client creation and validation (#132514)
* apiextensions: Treat whitespace-only caBundle as empty for webhook client config and validation

- Updates webhookClientConfigForCRD to treat caBundle values containing only whitespace as empty, ensuring system trust roots are used in this case.
- Updates ValidateCABundle to treat whitespace-only caBundle as valid, consistent with empty or nil values.
- Adds/updates unit tests to verify that whitespace-only caBundle is handled equivalently to empty or nil.
- Ensures consistent and user-friendly handling of caBundle across CRD conversion webhook configuration and validation.

* Revert validation logic

* Add integration test for webhook bypass

* Fix linting
2025-07-24 17:58:33 -07:00
Kubernetes Prow Robot
d12fae55d0 Merge pull request #131001 from Chulong-Li/winstats-contextual-logging
Migrate pkg/kubelet/winstats to contextual logging
2025-07-24 17:58:26 -07:00
Jon Huhn
f1737db166 Update DRA Pod spec comments 2025-07-24 18:58:24 -05:00
Kubernetes Prow Robot
26045b2fab Merge pull request #132642 from yuanwang04/restart-rules
Implement container restart policy rules
2025-07-24 16:44:51 -07:00
Kubernetes Prow Robot
bd7fb738bd Merge pull request #132605 from toVersus/feat/downward-api-plresources
[PodLevelResources] Update Downward API defaulting for resource limits
2025-07-24 16:44:42 -07:00
Kubernetes Prow Robot
63011fe547 Merge pull request #132277 from KevinTMtz/pod-level-resources-eviction-manager
[PodLevelResources] Pod Level Resources Eviction Manager
2025-07-24 16:44:34 -07:00
Kubernetes Prow Robot
7912e5fd67 Merge pull request #131549 from carlory/KEP-3751-GA
[Kep-3751] Promote VolumeAttributesClass to GA
2025-07-24 16:44:27 -07:00
John-Paul Sassine
b7de71f9ce feat(kubelet): Add ResourceHealthStatus for DRA pods
This change introduces the ability for the Kubelet to monitor and report
the health of devices allocated via Dynamic Resource Allocation (DRA).
This addresses a key part of KEP-4680 by providing visibility into
device failures, which helps users and controllers diagnose pod failures.

The implementation includes:
- A new `v1alpha1.NodeHealth` gRPC service with a `WatchResources`
  stream that DRA plugins can optionally implement.
- A health information cache within the Kubelet's DRA manager to track
  the last known health of each device and handle plugin disconnections.
- An asynchronous update mechanism that triggers a pod sync when a
  device's health changes.
- A new `allocatedResourcesStatus` field in `v1.ContainerStatus` to
  expose the device health information to users via the Pod API.

Update vendor

KEP-4680: Fix lint, boilerplate, and codegen issues

Add another e2e test, add TODO for KEP4680 & update test infra helpers

Add Feature Gate e2e test

Fixing presubmits

Fix var names, feature gating, and nits

Fix DRA Health gRPC API according to review feedback
2025-07-24 23:23:18 +00:00
yongruilin
d4a83b9a52 fix(golangci-lint): skip config verification when -c none is used
Prevents failure when no config file is specified by only
verifying config when golangci_config is non-empty.
2025-07-24 21:41:28 +00:00
Haowei Cai
252513a1b9 Add WithFeature and WithSerial, also check if cgroup v2 is used in test 2025-07-24 21:40:08 +00:00
Kevin Torres
1bc995c195 Generated files 2025-07-24 21:29:04 +00:00
Kevin Torres
f925e55548 E2E tests for container hugepage resources immutability
Pod level hugepage resources are not propagated to the containers, only pod level cgroup values are propagated to the containers when they do not specify hugepage resources.
2025-07-24 21:29:04 +00:00
Kevin Torres
9f5b09eb7b Unit test pod level hugepage Default and Validation logic 2025-07-24 21:29:04 +00:00
Kevin Torres
845e94d370 Validation logic and Defaulting update for pod level hugepages
The hugepage aggregated container limits cannot be greater than pod-level limits.

This was already enforced with the defaulted requests from the specfied
limits, however it did not make it clear about both hugepage requests and limits.
2025-07-24 21:29:01 +00:00
Rita Zhang
c15a54f8c0 draadminaccess: move metrics test from e2e to integration
Signed-off-by: Rita Zhang <rita.z.zhang@gmail.com>
2025-07-24 14:08:14 -07:00
Kubernetes Prow Robot
5be5fd0229 Merge pull request #133167 from sanposhiho/preemption-conor-case
fix: handle corner cases in the async preemption
2025-07-24 13:05:18 -07:00
Kubernetes Prow Robot
d9538c2c8c Merge pull request #133163 from pohly/revert-133110-DRAAdminAccess_upgradedowngradetest
Revert "DRAAdminAccess: add upgrade downgrade test"
2025-07-24 13:05:11 -07:00
Kubernetes Prow Robot
ebbebe8be6 Merge pull request #133157 from haircommander/cgroup-driver-cri-ga
KEP 4033: Add metric for out of support CRI and bump feature to GA
2025-07-24 13:05:04 -07:00
Kubernetes Prow Robot
1ad815cbe2 Merge pull request #133152 from bart0sh/PR185-Kubelet-DRA-close-connection-on-unregistration
kubelet: DRA: Close gRPC connection when removing DRA plugin
2025-07-24 13:04:56 -07:00
Kubernetes Prow Robot
dfc0998baa Merge pull request #132935 from benluddy/cbor-bump-custom-marshalers
KEP-4222: Adopt text and JSON transcoding support for CBOR.
2025-07-24 13:04:49 -07:00
Kubernetes Prow Robot
e4e13c1e80 Merge pull request #132818 from ffromani/e2e-node-cpumanager-cgroupv1-compat
e2e: node: cpumanager cgroup v1 compatibility
2025-07-24 13:04:41 -07:00
Kubernetes Prow Robot
1bc48d707c Merge pull request #132773 from saschagrunert/protoc-plugin
Convert `k8s.io/kubelet/pkg/apis/pluginregistration` from gogo to protoc
2025-07-24 13:04:34 -07:00
Kubernetes Prow Robot
1b3be1fde6 Merge pull request #132767 from mortent/PrioritizedListBeta
[KEP-4816] DRAPrioritizedList to Beta
2025-07-24 13:04:27 -07:00
Tim Hockin
d176808d3a Rename hack/*-yamlfmt to -owners-fmt 2025-07-24 12:12:59 -07:00
Tim Hockin
7adcd21148 Replace cmd/yamlfmt with k-sigs/yaml/yamlfmt
And run it.
2025-07-24 12:11:04 -07:00
Swati Gupta
79833578f6 KEP-3695: add kubeletPodResources feature gate to Beta
Signed-off-by: Swati Gupta <swatig@nvidia.com>
2025-07-24 18:52:34 +00:00