117 Commits

Author SHA1 Message Date
Gage Hugo
d200c97464 Update several default ubuntu release defaults
Several roles and scripts in the osh-infra repo have defaults for
ubuntu that are still on xenial when we mostly run bionic or newer.

This change updates these references to default to focal.

Change-Id: I84bca2d685cf1c67b10eee3bb7c05deb2cc4670b
2022-04-27 11:29:27 -05:00
Gage Hugo
79d75267ea Move osh-infra jobs to use helm3
This change updates many of the deployment scripts to properly
handle deploying each service via helm 3 and updates each job
to use the helm v3 install script.

Change-Id: I90a7b59231376b9179439c2554e46449d59b9c15
2022-03-24 13:05:42 -05:00
Anderson, Craig (ca846m)
feeab3291c Add DNS sanity checks to k8s deploy script
Check that k8s DNS is working, and terminate at the beginning if this is
not the case.

Change-Id: I30867671f39dd9d80f46f5a4381adc9d34df7ab7
2022-02-18 00:15:59 -08:00
jayonlau
4df5e23c06 Remove helm status from deployment scripts for common
With the move to helm v3, helm status requires a namespace to be specified, but doing so breaks helm v2 compatability. This change removes the usage of helm serve in openstack-helm-infra's deployment scripts.

Change-Id: I8e035d70dd652d5253f534ad6b28042347158ff4
2021-10-13 12:12:22 -04:00
Tin Lam
b72f750e87 fix(script): removes replacement overrides
This removes the functionality to perform envsubst in the feature
gate script to prevent users with specific env set running into
unexpected error. This feature will be re-visited in the future to
be made more robust.

Signed-off-by: Tin Lam <tin@irrational.io>
Change-Id: I6dcfd4dad138573294a9222e4e7af80c9bff4ac0
2021-03-19 01:14:09 -05:00
Gupta, Sangeet (sg774j)
d64c5fea64 TLS gates from infra monitoring
Change-Id: If41dae31763abac8c9c65b1650556438a9b6b0df
2021-03-02 16:10:24 +00:00
jinyuanliu
5c6d281b62 Remove zookeeper residue
About zookeeper chart,It's been removed,But there are still some related scripts that have not been completely deleted,we should remove them.

Change-Id: Iae20717482ad6c7a40f54174eef120d094abbd59
2021-03-01 14:31:03 +08:00
Xiaoguang(William) Zhang
567e4703e9 Remove Alerta from openstack-helm-infra repository
There is no significant value been added from Alerta base on current user story.

Change-Id: I274263e3dfefd7b9ec8ff84d03504d194225d693
2021-02-18 12:02:18 -05:00
astebenkova
a589db04e9 Add possibility to use overrides for some charts
The deal is that all the jobs' scripts include extra arguments
when deploying helm charts, except these ones in the commit.
It would be useful to use override files in these charts.

+ Fix typo in apparmor.yaml for node-exporter
+ Amend apparmor.yaml for openstack-exporter since
those values are already by default in values.yaml

Change-Id: Ibe8b38977216e618dccba7e8443b3cc05a772de5
2021-01-27 18:45:19 +02:00
Zuul
02368a4d99 Merge "[ceph] Make sure loopback devices persistent across reboots" 2020-11-23 22:49:04 +00:00
Andrii Ostapenko
13315e57a7 Fix openvswitch gate issue with systemd 237-3ubuntu10.43
New systemd 237-3ubuntu10.43 bumps memlock limit from 16 to 64 MB [1]
which seems to cause issues with eBPF related operations in containers
run with root [2] as a possible root cause.

Here we have an option to downgrade systemd to previous available
version or to set previous default memlock limit to systemd defaults or
docker unit. Setting systemd DefaultLimitMEMLOCK in this commit.

[1] https://launchpad.net/ubuntu/+source/systemd/237-3ubuntu10.43
[2] https://bugs.launchpad.net/ubuntu/+source/systemd/+bug/1837580/comments/9

Change-Id: I55d14ffa47a7a29d059f2f3b502bb38be0a5dd3d
Signed-off-by: Andrii Ostapenko <andrii.ostapenko@att.com>
2020-11-22 14:04:15 +00:00
Chinasubbareddy Mallavarapu
515d31f9ae [ceph] Make sure loopback devices persistent across reboots
Change-Id: I50ddfcf0903fe00fc020c819e784ea289d5baae6
2020-11-09 21:23:03 +00:00
Andrii Ostapenko
9d7d4e4ba6 Add conntrack required for k8s 1.18.x
Change-Id: I8c85e11ad984fa283139b3d71d52bc7c3fb8e900
Signed-off-by: Andrii Ostapenko <andrii.ostapenko@att.com>
2020-10-07 18:34:34 +00:00
Andrii Ostapenko
7a0558bd78 Override images provided as artifacts from buildset registry
Switch from using images from defined in docker_images to provided as
zuul artifacts. Currently to be used in conjunction with [0] in
openstack-helm-images pipelines.

[0] https://review.opendev.org/741551

Change-Id: I43dbd38906e8854c87a361f2e5e479f57850252f
Signed-off-by: Andrii Ostapenko <andrii.ostapenko@att.com>
2020-09-28 17:38:29 +00:00
anthony.bellino
96369491cb Patroni exclusion for Postgres
This PS removes the previously put in place HA clustering support
Patroni provided.

Change-Id: I03ed11282413a454062ab34b8594ba60ac2175aa
2020-08-31 18:02:37 +00:00
Zuul
754d8e93b4 Merge "Add Alerta feature to osh-infra" 2020-08-19 21:19:18 +00:00
Xiaoguang(William) Zhang
83a55fd19e Add Alerta feature to osh-infra
Change-Id: Id8dc3f86b8d6754df4ba3c0c720a78731e3f54d5
2020-08-19 13:35:40 +00:00
Zuul
622bc4d972 Merge "Remove remaining test pods before new test run" 2020-08-18 16:21:00 +00:00
Gayathri Devi Kathiri
a57190fd8a Remove remaining test pods before new test run
If the test pod still exists, then the new test run
fails with ERROR: pods "abc-test" already exists

So, Removing remaining test pods before new test run

Change-Id: I3b3ed5ceaf420aa39a669b4a50a838ad154b1fdd
Closes-Bug: #1882030
2020-08-13 18:12:25 +00:00
Vipul Jain
628186f05e prometheus exporter to monitor health check for web URL
1) A separate prometheus job need to provide target and scrap metrics
2) it is based on https://github.com/prometheus/blackbox_exporter

Adding script file for deployment and job under zuul.d
Resolving conflict
Change-Id: Ia15ab7d8ef882886fe0e37cc2599e6815d7bcc6c
2020-08-11 22:39:54 +05:30
Zuul
8cd4e26a42 Merge "Fluentd: Allow for Multiple Config Files" 2020-08-05 14:48:13 +00:00
Zuul
e292b356b5 Merge "Use precreated minikube-aio image" 2020-08-04 22:01:04 +00:00
Zuul
3d76931e55 Merge "Fluentd: Remove Deployment Option" 2020-08-04 21:06:26 +00:00
Steven Fitzpatrick
4e73195bc2 Fluentd: Allow for Multiple Config Files
Change-Id: I2eb6ff2a599ee7bc479f9fc4955016b18902c879
2020-08-04 20:14:09 +00:00
Andrii Ostapenko
10019004da Use precreated minikube-aio image
With this commit minikube is installed using contents of precreated
minikube-aio image containing installation script, all required binaries
and images inside. Pulling a single image from dockerhub via opendev
dockerhub proxy and loading images allows to save up to 6 minutes in
minikube installation.

Change-Id: I5936f440eb0567b8dcba2fdae614e4c5e88a7b9a
Signed-off-by: Andrii Ostapenko <andrii.ostapenko@att.com>
2020-08-04 19:46:43 +00:00
Steven Fitzpatrick
959417f321 Fluentd: Remove Deployment Option
This chart could deploy fluentd either as a Deployment
or a Daemonset. Both options would use the deployment-fluentd
template with various sections toggled off based on values.yaml

I'd like to know - Does anyone run this chart as a Deployment?
We can simplify the chart, and zuul gates, by changing the chart
to deploy a Daemonset specifically.

Change-Id: Ie88ceadbf5113fc60e5bb0ddef09e18fe07a192c
2020-08-04 19:06:37 +00:00
Chinasubbareddy Mallavarapu
4358251073 [CEPH] OSH-INFRA: Update ceph scripts to create loopback devices
This is to update ceph scripts to create loopback devices
in single script and also to update gate scripts.

Change-Id: Id6e3c09dca20d98fcbcc434e65f790c06b6272e8
2020-07-29 10:05:37 -05:00
Andrii Ostapenko
d103da6c06 Fix tiller metrics port exposure issue for minikube
Along with fixing the bug, with this we'll decrease build time for
all jobs using minikube and collecting tiller metrics for more than
2 minutes.

Change-Id: Ia166584eae48c643248f977b959aa6336e3a327e
Signed-off-by: Andrii Ostapenko <andrii.ostapenko@att.com>
2020-07-23 23:39:35 -05:00
Zuul
af5a742a17 Merge "Add generate openAPIV3Schema schema for DaemonJob CRD." 2020-07-01 21:56:31 +00:00
Luna Das
64c744d756 Add generate openAPIV3Schema schema for DaemonJob CRD.
change docker image to point to the latest metacontroller image.
change python image to point to version 3.7
add updateStrategy to CompositeController.
add replicas config to DaemonJobController via zuul gate.

Change-Id: I2a48bc6472017802267980fe474d81886113fcda
2020-06-30 01:13:41 +05:30
Chinasubbareddy Mallavarapu
3bde9f5b90 [CEPH] OSH-INFRA: use loopback devices for ceph osds
- This is to make use of loopback devices for ceph osds since
support for directory backed osds going to deprecate.

- Move to bluestore from filestore for ceph-osds.
- Seperate DB and WAL partitions from data so that gates will validate
  the scenario where we will have fast storage disk for DB and WAL.

Change-Id: Ief6de17c53d6cb57ef604895fdc66dc6c604fd89
2020-06-29 14:09:32 +00:00
Steve Wilkerson
a31bb2b049 Add node-problem-detector chart
This adds a chart for the node problem detector. This chart
will help provide additional insight into the status of the
underlying infrastructure of a deployment.

Updated the chart with new yamllint checks.

Change-Id: I21a24b67b121388107b20ab38ac7703c7a33f1c1
Signed-off-by: Steve Wilkerson <sw5822@att.com>
2020-06-22 13:00:55 -05:00
KHIYANI, RAHUL (rk0850)
d88b553727 Adding python3-pip package to installation script
python3-pip package is required to support openstack-client

Change-Id: I285a32a766dacf6c0e956216144ca1fcf300da77
2020-06-12 11:23:35 -05:00
Zuul
e53d28718d Merge "Remove OSH Authors copyright" 2020-05-12 20:00:38 +00:00
Steven Fitzpatrick
2039805314 Fluentd: Render Config as Template
This change updates the fluentd configmap-etc to render
.Values.conf.fluentd.template as a template, allowing for greater
flexibility in configuration.

Change-Id: I8809767c679c377e319ecc53960c55ae18e1b558
2020-05-11 10:13:44 -05:00
Gage Hugo
d14d826b26 Remove OSH Authors copyright
The current copyright refers to a non-existent group
"openstack helm authors" with often out-of-date references that
are confusing when adding a new file to the repo.

This change removes all references to this copyright by the
non-existent group and any blank lines underneath.

Change-Id: I1882738cf9757c5350a8533876fd37b5920b5235
2020-05-07 02:11:15 +00:00
Pete Birley
19a1fbf8f7 Ceph: Dont mount log directories to host
This PS updates the bind mounts for ceph logs directorys to be
emptydirs. This ensures we do not polute the hosts permanantly
with ceph logs, which should be directed to stdout.

Change-Id: I6d72c0864b9ecc493cd62564e0e0450d90cfcf00
Signed-off-by: Pete Birley <pete@port.direct>
2020-04-07 18:05:04 +00:00
Steven Fitzpatrick
9d2e08f1a4 Fluentd: Switch to Native Metrics Plugin
This change updates the fluentd chart to use the native
fluent-plugin-prometheus for metric production. This plugin
provides more detailed metrics about fluentd's operations,
specifically regarding input and output statistics.

https://github.com/fluent/fluent-plugin-prometheus

Using the plugin, each fluentd pod produces metrics, so
scape annotations have been added to the pod spec. The
zuul check on metric producers has been updated to account
for this.

Depends-On: https://review.opendev.org/714167
Change-Id: I809356d92b0cff1e31cb2062102bbedefd4843fd
2020-03-30 00:44:58 -05:00
diwakar thyagaraj
c0bec2c744 Enable Apparmor to fluentd
Change-Id: I21640c263cbf7871319d2710160e37a9dddf0eb2
Signed-off-by: diwakar thyagaraj <diwakar.chitoor.thyagaraj@att.com>
2020-03-25 21:50:53 +00:00
Zuul
965fd345c3 Merge "Revert "Enable Apparmor for fluentd"" 2020-03-25 00:01:34 +00:00
Steve Wilkerson
d898a65a2d Revert "Enable Apparmor for fluentd"
This reverts commit a3110abd66.

Change-Id: I90180d5caa6cd7873220fcc91570c92ae7a234e5
2020-03-24 19:46:19 +00:00
Zuul
eb5bccc4be Merge "Fix Kibana Selenium tests" 2020-03-23 16:18:10 +00:00
Luna Das
55beab680f Add more fields to daemonjob crd spec.
Change-Id: I1690035c7e35887245bbdcfdc97b19409fd6ab8a
2020-03-20 04:03:13 +00:00
dt241s@att.com
a3110abd66 Enable Apparmor for fluentd
Change-Id: I6b759beff9fd3166a9868a4d71319836f272bc25
2020-03-18 21:26:48 +00:00
dmyrhorodskyi
49b50d632b Fix Kibana Selenium tests
XPath to expected element was changed after
Kibana upgrade, this commit changes XPath
according chnges in new Kibana.

Change-Id: I501de225e1226991db9c263cedf38397cda7b51f
2020-03-17 17:33:37 +02:00
Luna Das
9b9913d64a Add Helm Charts for MetaController and DaemonJobController.
These charts bootstraps a metacontroller on a
Kubernetes cluster using the Helm package manager.
This enables you to deploy custom controllers as
service + deployment pairs. A DaemonJobController
chart bootstraps the CompositeController and register
DaemonJob CRD, the daemonjob controller executes
DaemonJob(CR's) in kubernetes Cluster.

Change-Id: Ic946f564ea1cf07e89c90a598e59230dc240950c
2020-03-10 01:04:42 +05:30
diwakar thyagaraj
344c0543fa Enable runtime Apparmor default for All Ceph Components
Change-Id: Id62fe453846ffe6ab01198177d5d8046378d61bf
Signed-off-by: diwakar thyagaraj <diwakar.chitoor.thyagaraj@att.com>
2020-02-24 15:52:11 +00:00
Steve Wilkerson
781e65ac5d Fluentd: Update kernel and auth inputs to use systemd
This updates the overrides provided for deploying fluentd as a
daemonset to get kernel messages from the journal instead of
/var/log/kern.log directly, and also uses the journal to get
messages associated with logging to auth.log (syslog facility
10). This provides additional metadata and
a cleaner interface for gathering these logs via fluentd

Change-Id: I8e832db276095771d6a869e998d7a69795dfee37
Signed-off-by: Steve Wilkerson <sw5822@att.com>
2020-01-16 13:38:52 +00:00
Stephen Taylor
016b56e586 Ceph Nautilus compatibility
This change updates the Ceph charts to use Ceph Nautilus images
built on Ubuntu Bionic instead of Xenial. The mirror that hosts
Ceph packages only provides Nautilus packages for Bionic at
present, so this is necessary for Nautilus deployment.

There are also several configuration and scripting changes
included to provide compatibility with Ceph Nautilus. Most of
these simply allow existing logic to execute for Nautilus
deployments, but some logical changes are required to support
Nautilus as well.

NOTE: The cephfs test has been disabled because it was failing
the gate. This test has passed in multiple dev environments, and
since cephfs isn't used by any openstack-helm-infra components we
don't want this to block getting this change merged. The gate
issue will be investigated and addressed in a subsequent patch
set.

Change-Id: Id2d9d7b35d4dc66e93a0aacc9ea514e85ae13467
2019-12-17 18:47:24 +00:00
Steve Wilkerson
edd6ffd712 Reduce osh-infra-logging job scope
This updates the osh-infra-logging single node job to omit the
fluentbit deployment step, as having multiple logging daemonsets
deployed to the single node jobs is causing IO issues. Also, it
was noted that the fluentd-deployment step was missing the
overrides to move the fluentd-deployment release from utilizing a
daemonset to a deployment. This resulted in 3 logging daemons
being deployed to a single host

Change-Id: I4a0c5550e6ea6a331aab0082a975f161e65704bf
Signed-off-by: Steve Wilkerson <sw5822@att.com>
2019-12-17 12:43:12 -06:00