Commit Graph

1739 Commits

Author SHA1 Message Date
googs1025
938b5f8262 chore(scheduler): add more log info when no preemption candidates is found in preemption 2025-02-11 13:53:32 +08:00
Kensei Nakada
3701e39327 fix: remove the mention that DRA uses Pending 2025-02-04 06:45:05 +09:00
Kubernetes Prow Robot
1b7a059187 Merge pull request #128999 from macsko/improve_goroutines_metric_writes_in_parallelizer_until
Improve Goroutines metric calls in parallelizer.Until
2025-02-03 07:30:57 -08:00
Kubernetes Prow Robot
fc268ecd09 Merge pull request #129823 from googs1025/chore/log_improve
fix(dra plugin): when there is no resourceclaim, return directly
2025-02-02 16:28:56 -08:00
googs1025
ed826dddfe fix(dra plugin): when there is no resourceclaim, return directly 2025-01-29 08:47:52 +08:00
Maciej Skoczeń
bd8dee9637 Improve Goroutines metric calls in parallelizer.Until 2025-01-27 12:01:17 +00:00
Davanum Srinivas
4e05bc20db Linter to ensure go-cmp/cmp is used ONLY in tests
Signed-off-by: Davanum Srinivas <davanum@gmail.com>
2025-01-24 20:49:14 -05:00
Kubernetes Prow Robot
2056dbca18 Merge pull request #129697 from saza-ku/remove-pvc-not-found
chore: remove duplicated test case of `pvc not found`
2025-01-24 09:35:21 -08:00
googs1025
27ec5de863 chore(scheduler): improve log output for podtopologyspread filter plugin 2025-01-24 19:29:23 +08:00
Kubernetes Prow Robot
a716ea756d Merge pull request #129635 from macsko/cache_calculateresource_result_when_adding_pod_to_nodeinfo
Use cached calculateResource result when removing pod from NodeInfo in preemption
2025-01-21 04:16:37 -08:00
Maciej Skoczeń
0452ae402a Use cached calculateResource result when removing pod from NodeInfo in preemption 2025-01-21 10:02:57 +00:00
saza-ku
e26fb1c393 remove duplicated test case of pvc not found 2025-01-20 01:52:35 +09:00
Kubernetes Prow Robot
0b789d7cca Merge pull request #129427 from macsko/improve_map_in_interpodaffinity_prefilter
Improve topologyToMatchedTermCount map in InterPodAffinity PreFilter
2025-01-10 10:40:33 -08:00
Maciej Skoczeń
2d82687114 Improve topologyToMatchedTermCount map in InterPodAffinity PreFilter 2025-01-10 10:55:49 +00:00
Paco Xu
2653caa248 fix dra test lint 2025-01-09 10:42:40 +08:00
googs1025
77eae7c34f feature(scheduler): remove dra plugin resourceslice QueueingHintFn 2025-01-08 16:24:28 +08:00
Kubernetes Prow Robot
1c2b2cce10 Merge pull request #129119 from macsko/fix_podtopologyspread_for_multiple_constraints_with_the_same_key
Fix PodTopologySpread matching pods counts for constraints with the same topologyKey
2025-01-01 11:04:14 +01:00
Maciej Skoczeń
c3a54926a4 Fix PodTopologySpread matching pods counts for constraints with the same topologyKey 2024-12-30 09:35:24 +00:00
Kubernetes Prow Robot
078664b424 Merge pull request #129023 from zhifei92/cleanup-actiontype
scheduler:  Rename UpdatePodTolerations for code style consistency
2024-12-12 05:28:52 +00:00
zhifei92
27608fa25d refactor(scheduler): Rename UpdatePodTolerations for code style consistency. 2024-11-29 13:13:09 +08:00
googs1025
c725e18e07 feature(scheduler): more fine-grained QHints for interpodaffinity plugin 2024-11-14 20:00:38 +08:00
Kubernetes Prow Robot
8fe10dc378 Merge pull request #128691 from sanposhiho/flaky-candidate
fix: flaky TestPrepareCandidate
2024-11-09 19:38:44 +00:00
Kensei Nakada
00f7b95b7b fix: flaky TestPrepareCandidate 2024-11-09 12:07:16 +09:00
ndixita
6db40446de Scheduler changes:
1. Use pod-level resource when feature is enabled and resources are set at pod-level
2. Edge case handling: When a pod defines only CPU or memory limits at pod-level (but not both), and container-level requests/limits are unset, the pod-level requests stay empty for the resource without a pod-limit. The container's request for that resource is then set to the default request value from schedutil.
2024-11-08 03:00:54 +00:00
Kensei Nakada
e601eb7c5a fix: run activate() only when fail 2024-11-07 14:09:35 +09:00
Kensei Nakada
9b5928f764 fix: refactor TestPrepareCandidate 2024-11-07 14:09:35 +09:00
Kensei Nakada
d4d91d4ace fix: use set methods 2024-11-07 14:09:35 +09:00
Kensei Nakada
a95b8b5085 fix: use Activate always 2024-11-07 14:09:35 +09:00
Kensei Nakada
8be51d3f32 chore: fix based on reviews 2024-11-07 14:09:35 +09:00
Kensei Nakada
677792663f fix: register Pod/Delete event at the preemption plugin 2024-11-07 14:09:35 +09:00
Kensei Nakada
3124c5bd56 fix: avoid errors from the race condition detector 2024-11-07 14:09:35 +09:00
Kensei Nakada
c322294883 implement PodActivator to activate when preemption fails 2024-11-07 14:09:35 +09:00
Kensei Nakada
8f2243fe74 chore: tiny wording on the comment 2024-11-07 14:09:35 +09:00
Kensei Nakada
1e2511031e feat: support metrics 2024-11-07 14:09:35 +09:00
Kensei Nakada
fe3119fa69 make sure DefaultPreemption implements PreEnqueuePlugin 2024-11-07 14:09:35 +09:00
Kensei Nakada
98bdbdcf18 fix: refactor TestPrepareCandidate to test async feature too 2024-11-07 14:09:34 +09:00
Kensei Nakada
02eabc7d8e fix: modify newly added preemption test 2024-11-07 14:09:34 +09:00
Kensei Nakada
68aeced18e fix: fixed some tests 2024-11-07 14:09:34 +09:00
Kensei Nakada
e40f3f40bd fix: address reviews 2024-11-07 14:09:34 +09:00
Kensei Nakada
69a8d0ec0b feature(KEP-4832): asynchronous preemption 2024-11-07 14:09:34 +09:00
Patrick Ohly
33ea278c51 DRA: use v1beta1 API
No code is left which depends on the v1alpha3, except of course the code
implementing that version.
2024-11-06 13:03:19 +01:00
Kubernetes Prow Robot
0fad78930f Merge pull request #127904 from towca/jtuznik/dra-autoscaling
DRA: allow Cluster Autoscaler to integrate with DRA scheduler plugin
2024-11-06 10:01:29 +00:00
Kubernetes Prow Robot
f81a68f488 Merge pull request #128377 from tallclair/allocated-status-2
[FG:InPlacePodVerticalScaling] Implement AllocatedResources status changes for Beta
2024-11-05 23:21:49 +00:00
Kuba Tużnik
3d22a7f414 scheduler/autoscaler_contract: expand the Cluster Autoscaler contract
SharedDRAManager is added to the contract, as well as RunReservePluginsReserve
that will be run by Cluster Autoscaler to obtain claim allocations.
2024-11-05 14:11:08 +01:00
Kuba Tużnik
8d489425aa scheduler/dynamicresources: extract obtaining and tracking in-memory modifications of DRA objects
All logic related to obtaining DRA objects and tracking modifications
to ResourceClaims in-memory is extracted to DefaultDRAManager, which
implements framework.SharedDRAManager.

This is intended to be a no-op in terms of the DRA plugin behavior.
2024-11-05 14:11:04 +01:00
Kuba Tużnik
87cd496a29 scheduler/framework: introduce pluggable SharedDRAManager
SharedDRAManager will be used by the DRA plugin to obtain DRA
objects, and to track modifications to them in-memory. The current
DRA plugin behavior will be the default implementation of
SharedDRAManager.

Plugging a different implementation will allow Cluster Autoscaler
to provide a simulated state of DRA objects to the DRA plugin when
making scheduling simulations, as well as obtain the modifications
to DRA objects from the plugin.
2024-11-05 13:52:57 +01:00
Patrick Ohly
7863d9a381 DRA scheduler: refactor CEL compilation cache
A better place is the cel package because a) the name can become shorter
and b) it is tightly coupled with the compiler there.

Moving the compilation into the cache simplifies the callers.
2024-11-05 08:34:42 +01:00
Tim Allclair
81df195819 Stop using status.AllocatedResources to aggregate resources 2024-11-01 14:02:58 -07:00
Patrick Ohly
6f07fa3a5e DRA scheduler: update some stale comments 2024-11-01 13:23:42 +01:00
Patrick Ohly
ae6b5522ea DRA scheduler: rename variable
"Allocated devices" are the ones which can be observed from the informer. "All
allocated devices" also includes those which are in flight and haven't been
written back to the apiserver.
2024-11-01 13:23:42 +01:00