terraform-render-bootstrap


mirror of https://github.com/outbackdingo/terraform-render-bootstrap.git synced 2026-01-27 18:20:40 +00:00

Author	SHA1	Message	Date
Dalton Hubble	990286021a	Organize CoreDNS and kube-proxy manifests so they're optional * Add a `coredns` variable to configure the CoreDNS manifests, with an `enable` field to determine whether CoreDNS manifests are applied to the cluster during provisioning (default true) * Add a `kube-proxy` variable to configure kube-proxy manifests, with an `enable` field to determine whether the kube-proxy DaemonSet is applied to the cluster during provisioning (default true) * These options allow provisioning clusters without CoreDNS or kube-proxy, so these components can be customized or managed through separate plan/apply processes or automation	2024-05-12 18:05:55 -07:00
Dalton Hubble	720adbeb43	Configure Cilium agents to connect to apiserver explicitly * Cilium v1.14 seems to have problems reliably accessing the apiserver via default in-cluster service discovery (relies on kube-proxy instead of DNS) after some time * Configure Cilium agents to use the DNS name resolving to the cluster's load balanced apiserver and port. Regrettably, this relies on external DNS rather than being self-contained, but it's what Cilium pushes users towards	2023-10-29 16:08:21 -07:00
Dalton Hubble	5b2fbbef84	Allow Kubelet kubeconfig to drain nodes * Allow the Kubelet kubeconfig to get/list workloads and evict pods to perform drain operations, via the kubelet-delete ClusterRole bound to the system:nodes group * Previously, the ClusterRole only allowed node deletion	2022-10-23 21:49:38 -07:00
Dalton Hubble	8add7022d1	Normalize CA certs mounts in static Pods and kube-proxy * Mount both /etc/ssl/certs and /etc/pki into control plane static pods and kube-proxy, rather than choosing one based on a variable (set based on Flatcar Linux or Fedora CoreOS) * Remove `trusted_certs_dir` variable * Remove deprecated `--port` from `kube-scheduler` static Pod	2021-12-09 09:26:28 -08:00
Dalton Hubble	362f42a7a2	Update CoreDNS from v1.8.0 to v1.8.4 * https://coredns.io/2021/01/20/coredns-1.8.1-release/ * https://coredns.io/2021/02/23/coredns-1.8.2-release/ * https://coredns.io/2021/02/24/coredns-1.8.3-release/ * https://coredns.io/2021/05/28/coredns-1.8.4-release/	2021-06-23 23:26:27 -07:00
Nesc58	016d4ebd0c	Mount /run/xtables.lock in flannel Daemonset * Mount xtables.lock (like Calico and Cilium) since iptables may be called by other processes (kube-proxy)	2020-09-16 19:01:42 -07:00
Dalton Hubble	f2dd897d67	Change seccomp annotations to Pod seccompProfile * seccomp graduated to GA in Kubernetes v1.19. Support for seccomp alpha annotations will be removed in v1.22 * Replace seccomp annotations with the GA seccompProfile field in the PodTemplate securityContext * Switch profile from `docker/default` to `runtime/default` (no effective change, since docker is the runtime) * Verify with docker inspect SecurityOpt. Without the profile, you'd see `seccomp=unconfined` Related: * https://github.com/kubernetes/kubernetes/blob/master/CHANGELOG/CHANGELOG-1.19.md#seccomp-graduates-to-general-availability	2020-09-10 00:28:58 -07:00
Dalton Hubble	e75697ce35	Rename controller node label and NoSchedule taint * Use node label `node.kubernetes.io/controller` to select controller nodes (action required) * Tolerate node taint `node-role.kubernetes.io/controller` for workloads that should run on controller nodes. Don't tolerate `node-role.kubernetes.io/master` (action required)	2020-06-17 22:46:35 -07:00
Dalton Hubble	a83ddbb30e	Add CoreDNS "soft" nodeAffinity for controller nodes * Add nodeAffinity to CoreDNS deployment PodSpec to prefer running CoreDNS pods on controllers, while relying on podAntiAffinity for spreading. * For single master clusters, running two CoreDNS pods on the master or running one pod on a worker is permissible. * Note: It's still _possible_ to end up with CoreDNS pods all running on workers since we only express scheduling preference ("soft"), but unlikely. Plus the motivating scenario (below) is also rare. Background: * CoreDNS replicas are set to the higher of 2 or the number of control plane nodes to (at a minimum) support Deployment updates or pod restarts and match the cluster size (e.g. 5 master/controller nodes likely means a larger cluster, so run 5 CoreDNS replicas) * In the past (before v1.14), we required kube-dns (the CoreDNS predecessor) pods to run on master nodes. With CoreDNS this node selection was relaxed. We'd like a gentler form of it now. Motivation: * On clusters using 100% preemptible/spot workers, it is possible that CoreDNS pods schedule to workers that are all preempted at the same time, causing a loss of cluster internal DNS service until a CoreDNS pod reschedules (1 min). We'd like CoreDNS to prefer controller/master nodes (which aren't preempted) to reduce the possibility of control plane disruption	2020-05-09 22:48:56 -07:00
Dalton Hubble	924beb4b0c	Enable Kubelet TLS bootstrap and NodeRestriction * Enable bootstrap token authentication on kube-apiserver * Generate the bootstrap.kubernetes.io/token Secret that may be used as a bootstrap token * Generate a bootstrap kubeconfig (with a bootstrap token) to be securely distributed to nodes. Each Kubelet will use the bootstrap kubeconfig to authenticate to kube-apiserver as `system:bootstrappers` and send a node-unique CSR for kube-controller-manager to automatically approve to issue a Kubelet certificate and kubeconfig (expires in 72 hours) * Add ClusterRoleBinding for bootstrap token subjects (`system:bootstrappers`) to have the `system:node-bootstrapper` ClusterRole * Add ClusterRoleBinding for bootstrap token subjects (`system:bootstrappers`) to have the csr nodeclient ClusterRole * Add ClusterRoleBinding for bootstrap token subjects (`system:bootstrappers`) to have the csr selfnodeclient ClusterRole * Enable NodeRestriction admission controller to limit the scope of Node or Pod objects a Kubelet can modify to those of the node itself * Ability for a Kubelet to delete its Node object is retained as preemptible nodes or those in auto-scaling instance groups need to be able to remove themselves on shutdown. This need continues to have precedence over any risk of a node deleting itself maliciously Security notes: 1. Issued Kubelet certificates authenticate as user `system:node:NAME` and group `system:nodes` and are limited in their authorization to perform API operations by Node authorization and NodeRestriction admission. Previously, a Kubelet's authorization was broader. This is the primary security motivation. 2. The bootstrap kubeconfig credential has the same sensitivity as the previous generated TLS client-certificate kubeconfig. It must be distributed securely to nodes. Its compromise still allows an attacker to obtain a Kubelet kubeconfig 3. Bootstrapping Kubelet kubeconfig's with a limited lifetime offers a slight security improvement. * An attacker who obtains the kubeconfig can likely obtain the bootstrap kubeconfig as well, to obtain the ability to renew their access * A compromised bootstrap kubeconfig could plausibly be handled by replacing the bootstrap token Secret, distributing the token to new nodes, and expiration. Whereas a compromised TLS-client certificate kubeconfig can't be revoked (no CRL). However, replacing a bootstrap token can be impractical in real cluster environments, so the limited lifetime is mostly a theoretical benefit. * Cluster CSR objects are visible via kubectl which is nice 4. Bootstrapping node-unique Kubelet kubeconfigs means Kubelet clients have more identity information, which can improve the utility of audits and future features Rel: https://kubernetes.io/docs/reference/command-line-tools-reference/kubelet-tls-bootstrapping/	2020-04-25 19:38:56 -07:00
Dalton Hubble	42723d13a6	Change default kube-system DaemonSet tolerations * Change kube-proxy, flannel, and calico-node DaemonSet tolerations to tolerate `node.kubernetes.io/not-ready` and `node-role.kubernetes.io/master` (i.e. controllers) explicitly, rather than tolerating all taints * kube-system DaemonSets will no longer tolerate custom node taints by default. Instead, custom node taints must be enumerated to opt-in to scheduling/executing the kube-system DaemonSets. Background: Tolerating all taints ruled out use-cases where certain nodes might legitimately need to keep kube-proxy or CNI networking disabled	2020-03-25 22:43:50 -07:00
Dalton Hubble	e76f0a09fa	Switch from upstream hyperkube to component images * Kubernetes plans to stop releasing the hyperkube image in the future. * Upstream will continue releasing container images for `kube-apiserver`, `kube-controller-manager`, `kube-proxy`, and `kube-scheduler`. Typhoon will use these images * Upstream will release the kubelet as a binary for distros to package, either as a traditional DEB/RPM or as a container image for container-optimized operating systems. Typhoon will take on the packaging of Kubelet and its dependencies as a new container image (alongside kubectl) Rel: https://github.com/kubernetes/kubernetes/pull/88676 See: https://github.com/poseidon/kubelet	2020-03-17 22:13:42 -07:00
Dalton Hubble	ac4b7af570	Configure kube-proxy to serve /metrics on 0.0.0.0:10249 * Set kube-proxy --metrics-bind-address to 0.0.0.0 (default 127.0.0.1) so Prometheus metrics can be scraped * Add pod port list (informational only) * Require node firewall rules to be updated before scrapes can succeed	2019-12-29 11:56:52 -08:00
Dalton Hubble	43e1230c55	Update CoreDNS from v1.6.2 to v1.6.5 * Add health `lameduck` option 5s. Before CoreDNS shuts down, it will wait and report unhealthy for 5s to allow time for plugins to shutdown cleanly * Minor bug fixes over a few releases * https://coredns.io/2019/08/31/coredns-1.6.3-release/ * https://coredns.io/2019/09/27/coredns-1.6.4-release/ * https://coredns.io/2019/11/05/coredns-1.6.5-release/	2019-11-13 14:33:50 -08:00
Dalton Hubble	e09d6bef33	Switch kube-proxy from iptables mode to ipvs mode * Kubernetes v1.11 considered kube-proxy IPVS mode GA * Many problems were found https://github.com/poseidon/typhoon/pull/321 * Since then, major blockers seem to have been addressed	2019-10-15 22:55:17 -07:00
Dalton Hubble	6e59af7113	Migrate from a self-hosted to static pod control plane * Run kube-apiserver, kube-scheduler, and kube-controller-manager as static pods on each controller node * Bootstrap a minimal control plane by copying `static-manifests` to the Kubelet `--pod-manifest-path` and tls/auth secrets to `/etc/kubernetes/bootstrap-secrets`. Then, kubectl apply Kubernetes manifests. * Discontinue using bootkube to bootstrap and pivot to a self-hosted control plane. * Remove bootkube self-hosted kube-apiserver DaemonSet and kube-scheduler and kube-controller-manager Deployments * Remove pod-checkpointer manifests (no longer needed) Advantages: * Reduce control plane bootstrapping complexity. Self-hosted pivot and pod checkpointing worked well, but in-place edits to kube-apiserver, kube-controller-manager, or kube-scheduler are infrequently used. The concept was originally geared toward continuously in-place upgrading clusters, a goal Typhoon doesn't take on (rec. blue/green clusters). As such, the value-add isn't justifying the extra components for this particular project. * Static pods still provide kubectl visibility and log access Drawbacks: * In-place edits to kube-apiserver, kube-controller-manager, and kube-scheduler are not possible via kubectl (non-goal) * Assets must be copied to each controller (not just one) * Static pods must load credentials via hostPath, which is less clean compared with the former Kubernetes secrets and service accounts	2019-09-02 20:52:46 -07:00
Dalton Hubble	4caca47776	Run kube-apiserver as non-root user (nobody)	2019-07-06 13:51:54 -07:00
Dalton Hubble	3bfd1253ec	Always run kube-apiserver on port 6443 (internally) * Require bootstrap-kube-apiserver and kube-apiserver components listen on port 6443 (internally) to allow kube-apiserver pods to run with lower user privilege * Remove variable `apiserver_port`. The kube-apiserver listen port is no longer customizable. * Add variable `external_apiserver_port` to allow architectures where a load balancer fronts kube-apiserver 6443 backends, but listens on a different port externally. For example, Google Cloud TCP Proxy load balancers cannot listen on 6443	2019-07-06 13:50:22 -07:00
Dalton Hubble	62df9ad69c	Update Kubernetes from v1.14.3 to v1.15.0 * https://github.com/kubernetes/kubernetes/blob/master/CHANGELOG-1.15.md#v1150	2019-06-23 13:04:13 -07:00
Dalton Hubble	efd1cfd9bf	Update CoreDNS from v1.3.1 to v1.5.0 * Add `ready` plugin and change the readinessProbe to check default port 8181 to ensure all plugins are ready * `upstream [ADDRESS]` defines upstream resolvers for external services. If no address is given, resolution is against CoreDNS itself, which is the default. So `upstream` can be removed	2019-05-27 00:07:59 -07:00
Dalton Hubble	b9bef14a0b	Add enable_aggregation option (defaults to false) * Add an `enable_aggregation` variable to enable the kube-apiserver aggregation layer for adding extension apiservers to clusters * Aggregation is disabled by default. Typhoon recommends you not enable aggregation. Consider whether less invasive ways to achieve your goals are possible and whether those goals are well-founded * Enabling aggregation and extension apiservers increases the attack surface of a cluster and makes extensions a part of the control plane. Admins must scrutinize and trust any extension apiserver used. * Passing a v1.14 CNCF conformance test requires aggregation be enabled. Having an option for aggregation keeps compliance, but retains the stricter security posture on default clusters	2019-04-07 02:27:40 -07:00
Dalton Hubble	1528266595	Resolve in-addr.arpa and ip6.arpa zones with CoreDNS kubernetes plugin * Resolve in-addr.arpa and ip6.arpa DNS PTR requests for Kubernetes service IPs and pod IPs * Previously, CoreDNS was configured to resolve in-addr.arpa PTR records for service IPs (but not pod IPs)	2019-03-04 22:33:21 -08:00
Dalton Hubble	593f0e3655	Add a readinessProbe to CoreDNS * https://github.com/kubernetes/kubernetes/pull/74137	2019-02-23 13:11:19 -08:00
Dalton Hubble	c5f5aacce9	Assign Pod Priority Classes to control plane components * Priority Admission Controller has been enabled since Typhoon v1.11.1 * Assign cluster and node components a builtin priorityClassName (higher is higher priority) to inform scheduler preemption, scheduling order, and node out-of-resource eviction order	2019-02-17 17:12:46 -08:00
Dalton Hubble	7dc8f8bf8c	Switch CoreDNS to use the forward plugin instead of proxy * Use the forward plugin to forward to upstream resolvers, instead of the proxy plugin. The forward plugin is reported to be a faster alternative since it can re-use open sockets * https://coredns.io/explugins/forward/ * https://coredns.io/plugins/proxy/ * https://github.com/kubernetes/kubernetes/issues/73254	2019-01-30 22:19:13 -08:00
Dalton Hubble	7b06557b7a	Reduce kube-controller-manager --pod-eviction-timeout to 1m * Pods on preempted nodes should be moved to healthy nodes more quickly (1 min instead of 5 minutes)	2019-01-27 16:20:01 -08:00
Dalton Hubble	e892e291b5	Restore Kubelet authorization to delete nodes * Fix a regression caused by lowering the Kubelet TLS client certificate to system:nodes group (#100) since dropping cluster-admin dropped the Kubelet's ability to delete nodes. * On clouds where workers can scale down (manual terraform apply, AWS spot termination, Azure low priority deletion), worker shutdown runs the delete-node.service to remove a node to prevent NotReady nodes from accumulating * Allow Kubelets to delete cluster nodes via system:nodes group. Kubelets acting with system:node and kubelet-delete ClusterRoles is still an improvement over acting as cluster-admin	2019-01-14 23:26:41 -08:00
Dalton Hubble	f1e69f1d93	Re-enable kube-scheduler and kube-controller-manager HTTP ports * Fix regression added in `48730c0f12`, allow Prometheus to scrape metrics from kube-scheduler and kube-controller-manager	2019-01-11 23:52:57 -08:00
Dalton Hubble	48730c0f12	Probe kube-scheduler and kube-controller-manager HTTPS ports * Disable kube-scheduler and kube-controller-manager HTTP ports	2019-01-09 20:50:57 -08:00
Dalton Hubble	0e65e3567e	Enable certificates.k8s.io API certificate issuance * Allow kube-controller-manager to sign Approved CSRs using the cluster CA private key to issue cluster certificates * System components that need to use certificates signed by the cluster CA can submit a CSR to the apiserver, have an admin inspect and manually approve it, and be issued a certificate * Admins should inspect CSRs very carefully to ensure their origin and authorization level are appropriate * https://kubernetes.io/docs/tasks/tls/managing-tls-in-a-cluster/#approving-certificate-signing-requests	2019-01-06 17:17:03 -08:00
Dalton Hubble	ea30087577	Structure control plane manifests neatly	2019-01-05 21:47:30 -08:00
Dalton Hubble	f5ea389e8c	Update CoreDNS from v1.2.6 to v1.3.0 * https://coredns.io/2018/12/15/coredns-1.3.0-release/ * Limit log plugin to just log error class	2019-01-05 13:21:10 -08:00
Dalton Hubble	a7bd306679	Add admin kubeconfig and limit Kubelet cert to system:nodes group * Change Kubelet TLS client certificate to belong to the system:nodes group instead of the system:masters group (more limited) * Bind the system:node ClusterRole to the system:nodes group (yes, the ClusterRole is singular) * Generate separate admin.crt and admin.key files (which do still use system:masters). Output kubeconfig-kubelet and kubeconfig-admin values from the module * Remove the kubeconfig output to force users to pick the correct kubeconfig, depending on how the output is used (action required!) Related: * https://kubernetes.io/docs/reference/access-authn-authz/rbac/#core-component-roles Note, NodeAuthorizer/NodeRestriction would be an enhancement, but to work across platforms it effectively requires TLS bootstrapping which doesn't have a viable attestation strategy and clashes with CCM. This change improves Kubelet limitations, but intentionally doesn't aim to steer toward NodeAuthorizer/NodeRestriction	2019-01-02 23:08:09 -08:00
Dalton Hubble	7bcca25043	Use a kube-apiserver ServiceAccount and ClusterRoleBinding * Switch kube-apiserver from using the kube-system default ServiceAccount (with cluster-admin) to using a kube-apiserver ServiceAccount bound to cluster-admin (as before) * Remove the default-sa ClusterRoleBinding that allowed kube-apiserver and kube-scheduler (or other 3rd-party components added to kube-system) to use the kube-system default ServiceAccount for cluster-admin * Require all future components in kube-system define their own ServiceAccount	2019-01-01 17:30:28 -08:00
Dalton Hubble	fa4c2d8a68	Use a kube-scheduler ServiceAccount and ClusterRoleBinding * Switch kube-scheduler from using the kube-system default ServiceAccount (with cluster-admin) to using a kube-scheduler ServiceAccount bound to the builtin system:kube-scheduler and system:volume-scheduler (required for StorageClass) ClusterRoles * https://kubernetes.io/docs/reference/access-authn-authz/rbac/#core-component-roles	2019-01-01 17:29:36 -08:00
Dalton Hubble	cff13f9248	Update hyperkube from v1.12.3 to v1.13.0 * Remove controller-manager empty dir mount added for v1.12 https://github.com/kubernetes/kubernetes/issues/68973 * No longer required https://github.com/kubernetes/kubernetes/pull/69884	2018-12-03 20:42:14 -08:00
Dalton Hubble	bffb5d5d23	Update pod-checkpointer image to query Kubelet secure api * Updates pod-checkpointer to prefer the Kubelet secure API (before falling back to the Kubelet read-only API that is disabled on Typhoon clusters since https://github.com/poseidon/typhoon/pull/324) * Previously, pod-checkpointer checkpointed an initial set of pods during bootstrapping so recovery from power cycling clusters was unaffected, but logs were noisy * https://github.com/kubernetes-incubator/bootkube/pull/1027 * https://github.com/kubernetes-incubator/bootkube/pull/1025	2018-11-26 20:11:01 -08:00
Dalton Hubble	3f3ab6b5c0	Enable CoreDNS loop and loadbalance plugins * loop sends an initial query to detect infinite forwarding loops in configured upstream DNS servers and fast exit with an error (it's a fatal misconfiguration on the network that will otherwise cause resolvers to consume memory/CPU until crashing, masking the problem) * https://github.com/coredns/coredns/tree/master/plugin/loop * loadbalance randomizes the ordering of A, AAAA, and MX records in responses to provide round-robin load balancing (as usual, clients may still cache responses though) * https://github.com/coredns/coredns/tree/master/plugin/loadbalance	2018-11-10 17:33:30 -08:00
Dalton Hubble	365d089610	Set kube-apiserver's kubelet preferred address types * Prefer InternalIP and ExternalIP over the node's hostname, to match upstream behavior and kubeadm * Previously, hostname-override was used to set node names to internal IPs to work around some cloud providers not resolving hostnames for instances (e.g. DO droplets)	2018-11-03 14:58:30 -07:00
Dalton Hubble	6a77775e52	Update CoreDNS from v1.2.2 to v1.2.4 * https://coredns.io/2018/10/17/coredns-1.2.4-release/ * https://coredns.io/2018/10/16/coredns-1.2.3-release/	2018-10-27 15:35:21 -07:00
Dalton Hubble	79065baa8c	Fix CoreDNS AntiAffinity to prefer spreading pods	2018-10-17 22:15:53 -07:00
Dalton Hubble	81f19507fa	Update Kubernetes from v1.11.3 to v1.12.1 * Mount an empty dir for the controller-manager to work around https://github.com/kubernetes/kubernetes/issues/68973 * Update coreos/pod-checkpointer to strip affinity from checkpointed pod manifests. Kubernetes v1.12.0-rc.1 introduced a default affinity that appears on checkpointed manifests; but it prevented scheduling and checkpointed pods should not have an affinity, they're run directly by the Kubelet on the local node * https://github.com/kubernetes-incubator/bootkube/issues/1001 * https://github.com/kubernetes/kubernetes/pull/68173	2018-10-16 20:03:04 -07:00
Dalton Hubble	2437023c10	Add docker/default seccomp profile to control plane pods * By default, Kubernetes starts containers without the Docker runtime's default seccomp profile (e.g. seccomp=unconfined) * https://docs.docker.com/engine/security/seccomp/#pass-a-profile-for-a-container	2018-10-13 18:06:34 -07:00
Dalton Hubble	4e0ad77f96	Add livenessProbe to kube-proxy DaemonSet	2018-10-13 17:59:44 -07:00
Dalton Hubble	f7c2f8d590	Raise CoreDNS replica count to at least 2 * Run at least two replicas of CoreDNS to better support rolling updates (previously, kube-dns had a pod nanny) * On multi-master clusters, set the CoreDNS replica count to match the number of masters (e.g. a 3-master cluster previously used replicas:1, now replicas:3) * Add AntiAffinity preferred rule to favor distributing CoreDNS pods across nodes	2018-10-13 17:19:02 -07:00
Dalton Hubble	7797377d50	Raise scheduler/controller-manager replicas in multi-master * Continue to ensure scheduler and controller-manager run at least two replicas to support performing kubectl edits on single-master clusters (no change) * For multi-master clusters, set scheduler / controller-manager replica count to the number of masters (e.g. a 3-master cluster previously used replicas:2, now replicas:3)	2018-10-13 15:43:31 -07:00
Dalton Hubble	9e6fc7e697	Update hyperkube from v1.11.0 to v1.11.1 * Kubernetes v1.11.1 defaults to enabling the Priority admission controller. List the Priority admission controller explicitly for readability	2018-07-20 00:27:31 -07:00
Dalton Hubble	81ba300e71	Switch from kube-dns to CoreDNS * Add system:coredns ClusterRole and binding * Annotate CoreDNS service for Prometheus metrics scraping * Remove kube-dns deployment, service, service account, and variables * Deprecate kube_dns_service_ip module output, use cluster_dns_service_ip instead	2018-07-01 16:17:04 -07:00
Dalton Hubble	eb2dfa64de	Explicitly disable apiserver 127.0.0.1 insecure port * Although the --insecure-port flag is deprecated, apiserver continues to default to listening on 127.0.0.1:8080 * Explicitly disable insecure local listener since it's unused * https://github.com/kubernetes/kubernetes/pull/59018#discussion_r177849954 * `5f3546b66f`	2018-06-27 22:30:29 -07:00
Dalton Hubble	2bcf61b2b5	Change apiserver port from 443 to 6443 * Requires updating load balancers, firewall rules, security groups, and potentially routers/balancers * Temporarily allow apiserver_port override to accommodate edge cases or migration * https://github.com/kubernetes-incubator/bootkube/pull/789	2018-06-19 23:40:09 -07:00
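
Several commits above (`f7c2f8d590`, `a83ddbb30e`, and `7797377d50` for the formerly self-hosted scheduler and controller-manager) describe the same sizing rule: run the higher of 2 replicas or the number of controller nodes. The following is a minimal Python sketch of that rule only. The module itself is Terraform, and the function name `replica_count` and its parameters are hypothetical, chosen here for illustration rather than taken from the repository.

```python
def replica_count(controller_count: int, minimum: int = 2) -> int:
    """Return the higher of `minimum` and the number of controller nodes.

    Hypothetical helper mirroring the rule stated in the commit messages:
    at least two replicas, or one per controller on multi-controller clusters.
    """
    if controller_count < 1:
        raise ValueError("a cluster needs at least one controller node")
    return max(minimum, controller_count)


if __name__ == "__main__":
    # Show the replica count for a few example cluster sizes.
    for controllers in (1, 3, 5):
        print(f"{controllers} controller(s): {replica_count(controllers)} replicas")
```

With this rule a single-controller cluster still gets two replicas, which is what lets a rolling update or pod restart proceed without a gap in service, while larger control planes scale the replica count with the cluster.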


89 Commits