* Several v6 SKU types come with ephemeral OS disks with NVMe, so
you get faster local storage and avoid managed disk costs
* Ensure `worker_disk_size` is set to a size appropriate for the
SKU's ephemeral storage, since you pay for that capacity either way
(as sketched below)
* Requires https://github.com/hashicorp/terraform-provider-azurerm/pull/30044
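A minimal sketch of opting a worker pool into ephemeral local disks. The SKU name and disk size are illustrative (check your SKU's actual ephemeral capacity), and the `worker_ephemeral_disk` / `worker_disk_size` variables are the ones introduced later in this changelog:
```
module "cluster" {
  # ...

  # illustrative v6 SKU with local NVMe storage
  worker_type = "Standard_D4ds_v6"

  # Place the worker OS disk on the SKU's ephemeral local storage and
  # size it to fit within that capacity (paid for as part of the SKU)
  worker_ephemeral_disk = true
  worker_disk_size      = 100
}
```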
* Set a rolling upgrade policy so that changes to the worker node
pool are rolled out gradually. Previously, the VMSS model could
change, but instances would not receive it until manually replaced
* Align Azure node pool behaviors more closely with AWS and GCP:
* On AWS, worker instance template changes trigger an instance refresh
* On GCP, worker instance template changes roll out via a proactive
update policy
* Define Azure automatic instance repair using Application Health
Extension probes to port 10256 (kube-proxy healthz, or its Cilium
equivalent) to match the strategy used on Google Cloud
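For orientation, a hedged sketch of what these settings look like in `azurerm` terms. It uses the uniform `azurerm_linux_virtual_machine_scale_set` resource for brevity with illustrative thresholds; the module's actual resource (flexible orchestration) and values may differ:
```
resource "azurerm_linux_virtual_machine_scale_set" "workers" {
  # ... name, resource group, sku, image, and networking omitted ...

  # Roll out scale set model changes to instances gradually
  upgrade_mode = "Rolling"
  rolling_upgrade_policy {
    max_batch_instance_percent              = 20
    max_unhealthy_instance_percent          = 20
    max_unhealthy_upgraded_instance_percent = 20
    pause_time_between_batches              = "PT1M"
  }

  # Replace instances whose health probe fails
  automatic_instance_repair {
    enabled      = true
    grace_period = "PT30M"
  }

  # Application Health Extension probing the node's kube-proxy
  # (or Cilium replacement) health port
  extension {
    name                 = "ApplicationHealthLinux"
    publisher            = "Microsoft.ManagedServices"
    type                 = "ApplicationHealthLinux"
    type_handler_version = "1.0"
    settings = jsonencode({
      protocol = "tcp"
      port     = 10256
    })
  }
}
```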
* Azure Load Balancers charge by load balancer rules (5 included),
so it's useful to provide ways to stay under that number, either
by dropping support for port 80 (HTTP) traffic or for IPv6 traffic.
When using global proxies, you can usually serve IPv6 or HTTP->HTTPS
redirects separately anyway
* When using spot instances, deleting an instance actually lowers
the desired number of nodes in the VMSS, so the node is not replaced
* Restore the auto-scale setting needed to maintain a consistent
desired number of workers while spot instances come and go. This
was mistakenly removed in a refactor
* Azure Load Balancers include 5 rules (3 LB rules, 2 outbound) whether used or not
* [#1468](https://github.com/poseidon/typhoon/pull/1468) added 3 LB rules to support IPv6 load balancing,
raising the rule count from 5 to 8 and adding ~$21/mo to the cost of the load balancer. If you use an edge
(e.g. Cloudflare), a cluster does not need to load balance IPv6, so this additional cost can be avoided
* I noticed this because my load balancing costs were up for the last
few months. The gotcha is that outbound rules count toward the 5 rules
included with the base cost of the LB (~$18/mo)
Docs: https://azure.microsoft.com/en-us/pricing/details/load-balancer/
* flannel and Cilium default to UDP 8472 for VXLAN traffic to
avoid conflicts with other VXLAN usage (e.g. Open vSwitch)
* Aligning flannel and Cilium on the same VXLAN port makes
firewall rules and security policies simpler across clouds
Rel: https://github.com/poseidon/terraform-render-bootstrap/pull/403
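As an illustration of the simplification, a single NSG rule can cover VXLAN for either CNI. The resource group, NSG name, priority, and prefixes below are assumptions for the example, not the module's actual rule:
```
resource "azurerm_network_security_rule" "worker-vxlan" {
  # names, priority, and prefixes are illustrative
  resource_group_name         = "my-cluster"
  network_security_group_name = "my-cluster-worker"

  name                       = "allow-vxlan"
  priority                   = 2020
  access                     = "Allow"
  direction                  = "Inbound"
  protocol                   = "Udp"
  source_port_range          = "*"
  destination_port_range     = "8472"
  source_address_prefix      = "10.0.0.0/16"
  destination_address_prefix = "10.0.0.0/16"
}
```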
* Cilium has been the default for about 3 years and is the de facto
standard CNI choice. flannel is supported as a simple alternative
* Remove various historical options that were specific to Calico
* By default, the Kubelet pulls container images one by one
(in series), a default that mostly stems from Docker-era bugs
with parallel image pulls. These days we use containerd, so
parallel pulls should be fine
* Serial image pulls are undesirable because one slow registry
or image can cause other image pulls to wait. Parallel image
pulls ensure only large images / slow registries see that impact
Docs: https://kubernetes.io/docs/reference/config-api/kubelet-config.v1beta1/
* Change the default Pod CIDR from 10.2.0.0/16 to 10.20.0.0/14
(10.20.0.0 - 10.23.255.255) to support 1024 nodes by default
* Most CNI providers divide the Pod CIDR so that each node has
a /24 to allocate to local pods (256 addresses). The previous `10.2.0.0/16`
default only fits 256 /24s, so only 256 nodes were supported without
customizing `pod_cidr`
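The new default can still be overridden; a sketch of setting `pod_cidr` explicitly, with the node math as a comment:
```
module "cluster" {
  # ...

  # a /14 contains 2^(24-14) = 1024 per-node /24 ranges
  pod_cidr = "10.20.0.0/14"
}
```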
* When using the Cilium component, disable bootstrapping the
kube-proxy DaemonSet. Instead, configure Cilium to provide its
kube-proxy replacement with BPF
* Update the self-managed Cilium component to use kube-proxy
replacement as well
* Set reasonable values and remove some variable clutter
* `enable_reporting` is only used with Calico and we can just default
to false; I doubt anyone uses Calico and cares much about reporting
metrics to upstream Calico
* Drop support for `cluster_domain_suffix` customization and
always use `cluster.local`. Many components in the Kubernetes
ecosystem assume this default suffix and it's very rare to set
a special value here these days
* Cleanup a few variables that are seldom used
* On platforms that support ARM64 instances, configure controller
and worker node host architectures separately
* For example, you can run arm64 controllers and amd64 workers
* Add `controller_arch` and `worker_arch` variables
* Remove `arch` variable
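A sketch of mixing architectures with the new variables; the "arm64"/"amd64" values assume those are the accepted strings:
```
module "cluster" {
  # ...

  # arm64 controllers with amd64 workers
  controller_arch = "arm64"
  worker_arch     = "amd64"
}
```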
* Use flexible orchestration mode. Azure has started to recommend this
mode because it allows interacting with VMSS instances like regular VMs
via the CLI or via the Azure Portal
* Add options to allow worker nodes to use ephemeral local disks
* Add `controller_disk_type` and `controller_disk_size` variables
* Add `worker_disk_type`, `worker_disk_size`, and `worker_ephemeral_disk` variables
* Consolidate load balancer frontend IPs to just the minimal IPv4
and IPv6 addresses that are needed per load balancer. The apiserver and
ingress use separate ports, so there is no real need for a separate
public IPv4 address just for the apiserver
* Some might prefer a separate IP just because it slightly hides the
apiserver, but these are public hosted endpoints that can be discovered
* Reduce the cost of an Azure cluster since IPv4 public IPs are billed
($3.60/mo/cluster)
* Rename the `region` variable to `location` to align with Azure
platform conventions, where resources are created within an
Azure location, and locations are themselves part of broader
geographical regions
* Define a dual-stack virtual network with both IPv4 and IPv6 private
address space. Change the `host_cidr` variable (string) to a `network_cidr`
variable (object) with "ipv4" and "ipv6" fields that list CIDR strings
(see the sketch after the IPv6 checklist below).
* Define dual-stack controller and worker subnets. Disable Azure
default outbound access (a deprecated fallback mechanism)
* Enable dual-stack load balancing to Kubernetes Ingress by adding
a public IPv6 frontend IP and LB rule to the load balancer.
* Enable worker outbound IPv6 connectivity through load balancer
SNAT by adding an IPv6 frontend IP and outbound rule
* Configure controller nodes with a public IPv6 address to provide
direct outbound IPv6 connectivity
* Add an IPv6 worker backend pool. Azure requires separate IPv4 and
IPv6 backend pools, though the health probe can be shared
* Extend network security group rules for IPv6 source/destinations
Checklist:
Access to controller and worker nodes via IPv6 addresses:
* SSH access to controller nodes via public IPv6 address
* SSH access to worker nodes via (private) IPv6 address (via
controller)
Outbound IPv6 connectivity from controller and worker nodes:
```
nc -6 -zv ipv6.google.com 80
Ncat: Version 7.94 ( https://nmap.org/ncat )
Ncat: Connected to [2607:f8b0:4001:c16::66]:80.
Ncat: 0 bytes sent, 0 bytes received in 0.02 seconds.
```
Serving Ingress traffic via IPv4 or IPv6 just requires setting
up A and AAAA records and running the ingress controller with
`hostNetwork: true`, since hostPort only forwards IPv4 traffic
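A sketch of the dual-stack inputs referenced above; the `location` value and CIDR ranges are illustrative (the IPv6 range is just an example ULA prefix):
```
module "cluster" {
  # ...

  # Azure location (formerly the region variable)
  location = "centralus"

  # replaces the old host_cidr string
  network_cidr = {
    ipv4 = ["10.0.0.0/16"]
    ipv6 = ["fd9b:1a2b:3c4d::/48"]
  }
}
```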
* Previously: Typhoon provisions clusters with kube-system components
like CoreDNS, kube-proxy, and a chosen CNI provider (among flannel,
Calico, or Cilium) pre-installed. This is convenient since clusters
come with "batteries included". But it also means upgrading these
components is generally done in lock-step, by upgrading to a new
Typhoon / Kubernetes release
* It can be valuable to manage these components with a separate
plan/apply process or through automations and deploy systems. For
example, this allows managing CoreDNS separately from the cluster's
lifecycle.
* These "components" will continue to be pre-installed by default,
but a new `components` variable allows them to be disabled and
managed as "addons", components you apply after cluster creation
and manage on a rolling basis. For some of these, we may provide
Terraform modules to aid in managing these components.
```
module "cluster" {
# defaults
components = {
enable = true
coredns = {
enable = true
}
kube_proxy = {
enable = true
}
# Only the CNI set in var.networking will be installed
flannel = {
enable = true
}
calico = {
enable = true
}
cilium = {
enable = true
}
}
}
```
An earlier variable `install_container_networking = true/false` has
been removed, since the same result can now be achieved with this more
extensible and general components mechanism by setting the chosen
networking provider's enable field to false.
* Output the network security group name and address prefixes
for controller nodes, to allow adding custom network security
rules that apply specifically to controller nodes
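A hedged sketch of consuming such outputs to attach a custom controller rule; the output names (`controller_security_group_name`, `controller_address_prefixes`), resource group, port, and priority are hypothetical stand-ins for illustration:
```
resource "azurerm_network_security_rule" "controller-metrics" {
  resource_group_name = "my-cluster"
  # hypothetical output name for the controller NSG
  network_security_group_name = module.cluster.controller_security_group_name

  name                   = "allow-custom-metrics"
  priority               = 2100
  access                 = "Allow"
  direction              = "Inbound"
  protocol               = "Tcp"
  source_port_range      = "*"
  destination_port_range = "9100"
  # hypothetical output name for controller address prefixes
  source_address_prefixes      = module.cluster.controller_address_prefixes
  destination_address_prefixes = module.cluster.controller_address_prefixes
}
```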
* Add firewall or security rules to allow node-to-node traffic
on ports 9962-9965 for Cilium and Hubble metrics. Cilium runs
with the host network, so these require cloud firewall changes