Automatic merge from submit-queue (batch tested with PRs 43844, 44284)

Add a retry to cluster-autoscaler e2e

This should fix https://github.com/kubernetes/kubernetes/issues/44268.

The flake was caused by the following sequence of events:

1. The cluster was at its minimum size (3) and one node had been unneeded for a while.
2. The setup for some tests (scale-down, failure) would increase the node group size (to 5) and wait for the new nodes to come up.
3. As soon as a new node came up (cluster size 4), CA would scale down the old unneeded node, setting the node group size to 4.
4. The node group would never reach size 5 (as the target was now 4), and the test would time out and fail.

This PR makes the test setup watch for this scenario and re-set the target node group size if it happens.
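
The retry described above can be sketched roughly as follows. This is a minimal illustration, not the code from this PR: the nodeGroup interface, setSizeAndWait, and the in-memory fakeGroup are hypothetical names, and the real e2e test talks to the cloud provider instead. The fake reproduces the flake by lowering the target to 4 once, as soon as the fourth node is ready.

```go
package main

import (
	"errors"
	"time"
)

// nodeGroup is a hypothetical abstraction over a resizable node group.
type nodeGroup interface {
	TargetSize() (int, error)
	SetTargetSize(size int) error
	ReadyNodes() (int, error)
}

var errTimeout = errors.New("timed out waiting for node group to reach size")

// setSizeAndWait sets the target size and polls until that many nodes are
// ready. If something else (for example the autoscaler) changes the target
// in the meantime, the target is set again instead of waiting forever.
func setSizeAndWait(g nodeGroup, want int, interval, timeout time.Duration) error {
	if err := g.SetTargetSize(want); err != nil {
		return err
	}
	deadline := time.Now().Add(timeout)
	for {
		target, err := g.TargetSize()
		if err != nil {
			return err
		}
		if target != want {
			// The target was changed behind our back: re-set it.
			if err := g.SetTargetSize(want); err != nil {
				return err
			}
		}
		ready, err := g.ReadyNodes()
		if err != nil {
			return err
		}
		if ready == want {
			return nil
		}
		if time.Now().After(deadline) {
			return errTimeout
		}
		time.Sleep(interval)
	}
}

// fakeGroup is a toy in-memory node group used only to exercise the sketch.
type fakeGroup struct {
	target, ready int
	interfered    bool // whether the simulated scale-down already happened
	sets          int  // number of SetTargetSize calls
}

func (f *fakeGroup) TargetSize() (int, error) { return f.target, nil }

func (f *fakeGroup) SetTargetSize(size int) error {
	f.target = size
	f.sets++
	return nil
}

// ReadyNodes moves the ready count one step toward the target per call and,
// the first time 4 nodes are ready, simulates a scale-down to target 4.
func (f *fakeGroup) ReadyNodes() (int, error) {
	if f.ready < f.target {
		f.ready++
	} else if f.ready > f.target {
		f.ready--
	}
	if f.ready == 4 && !f.interfered {
		f.target = 4
		f.interfered = true
	}
	return f.ready, nil
}
```

With a fakeGroup starting at target 3 and 3 ready nodes, calling setSizeAndWait with a desired size of 5 sets the target twice: once up front and once after the simulated scale-down. Without the re-set inside the loop, the same fake would stay at 4 nodes until the timeout.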