Commit Graph

36 Commits

Author SHA1 Message Date
Steve Wilkerson
6e2ea01ae0 Mariadb: Use correct credentials for exporter in secret
This updates the mariadb chart to use the correct auth values for
the mariadb prometheus exporter. The correct credentials to use
are the credentials in the oslo_db endpoint

Change-Id: I2d325167d7ffdf911a56fe97b879cb13b0d4c195
2019-02-04 06:23:33 -06:00
Zuul
6ef3f58fb8 Merge "Add pre-fixes to the Selenium jobs and remove "|| true"" 2019-01-31 20:39:40 +00:00
Zuul
b30012a616 Merge "[CEPH] Fixes for the OSD defrag cronjob" 2019-01-31 16:05:14 +00:00
Matthew Heler
fc76091261 [CEPH] Fixes for the OSD defrag cronjob
Fix a naming issue with the cronjob's binary, and schedule the cron
job to run every 15 minutes for the gates. Additonally check to
to ensure we are only running on block devices. Also update the
script to work with ceph-volume created devices.

Change-Id: I8aedab0ac41c191ef39a08034fff3278027d7520
2019-01-31 06:13:05 -06:00
Chris Wedgwood
b7b7c5ea44 [alertmanager] default to 1 replica, multinode gate uses 3
Change-Id: Ifb1420f8dcf7237349a79f1f97aea5e547bafeab
2019-01-30 08:43:18 +00:00
Meg Heisler
98fbc9a1e2 Add pre-fixes to the Selenium jobs and remove "|| true"
This adds xxx-job name prefixes to the Selenium jobs for consistency

This will also remove the "|| true" suffix that was added temporarily to
ensure the Kibana selenium job did not error. The fix for the issue
was merged so the quick fix is no longer needed and may prevent an
error when an issue actually occurs.
Change-Id: I16881974cbf618b31813964b17c090dbfe33fe51
2019-01-29 20:24:57 -06:00
Steve Wilkerson
1e40765d88 OSH-Infra: Update multinode and aio-monitoring/logging jobs
This proposes moving the multinode job to a periodic job to
match the approach used in the openstack-helm repo.

This also adds the openstack-exporter to the aio monitoring job as
it was previously missing.

This also proposes moving the aio-logging and aio-monitoring jobs
to voting

Change-Id: Idcd4544e03facdcd2430683b66bd80c79e73a372
2019-01-23 08:49:48 -06:00
Zuul
958127477d Merge "Additional Selenium tests for Kibana dashboard" 2019-01-17 23:46:14 +00:00
Meg Heisler
9289cd0987 Additional Selenium tests for Kibana dashboard
This helps verify Kibana is working properly by using
Selenium Webdriver to navigate to different index dashboards
and takes screenshot of each one. It also add the scripts to
the gates for single and multinode deployments.

Change-Id: Ic2c91734d1eaac0ea4e7985bf69082942166715d
2019-01-17 11:24:19 -06:00
Steve Wilkerson
00b40480a3 Nagios: Fix elasticsearch query clause volume mount
This fixes the Nagios volume mount for the Elasticsearch query
file. Previously, the check for adding the volumemount to the
pod definition was incorrect. This fixes the conditional check,
and also adds the same conditional check to the configuration
secret

This adds a simple check to the monitoring and multinode jobs to
validate the resulting json gets mounted into the pod successfully

Change-Id: I2af289ccc4e1cff1669cb5e6e829514781b14dd3
2019-01-15 16:18:01 -06:00
Meg Heisler
c3bef9e88f Selenium Tests for OSH Infra
This adds scripts using Selenium Webdriver to verify
the dashboards for Gafana, Nagios, and Prometheus are
reachable and functioning as expected. The scripts
create screenshots of each dashboard as well as
pages that can be navigated to.

It also adds the scripts to the gates for the single
and multinode deployments.

Change-Id: I1699e0ba8ff82ce8f59342cc71aad10cff7d2516
2019-01-07 15:59:42 -06:00
Zuul
2a0b183613 Merge "Helm tests for Ceph-OSD and Ceph-Client charts" 2018-12-03 05:03:31 +00:00
Jean-Charles Lopez
f7e03d4763 Helm Tests for Ceph-RGW chart
Co-Authored-By: Renis Makadia <renis.makadia@att.com>

Change-Id: I81cc0cb498b2ca911d5b7bfa7c3bd9b8552e0e2b
2018-12-01 08:08:28 +00:00
Renis Makadia
b1005b23b4 Helm tests for Ceph-OSD and Ceph-Client charts
Change-Id: If4a846f0593b8679558662205a8560aa3cbb18ae
2018-12-01 08:08:00 +00:00
Zuul
2c9ff8bee8 Merge "Fix the checkPGs cronjob" 2018-11-09 22:57:50 +00:00
Meg Heisler
774e0cb654 RGW: Fix multinode deploy for ceph rgw
Change deployment script for rgw to not use the docker
bridge for public and cluster network overrides. Instead,
calculate network values in same way as other ceph multinodes
deployment steps

Change-Id: I2bacd1af1cc331d76a5d61f3b589ca6ef80b1b2e
2018-11-08 11:39:23 -06:00
Matthew Heler
e1c82f3465 Fix the checkPGs cronjob
Currently the cronjob is broken due to syntax and
permission issues.

Additionally move the cronjob from once a month to
every 15 minutes, and automatically disable the job
unless explicitly enabled.

Change-Id: Id72bdb286c805ccb0ea4e9fcf65fabca94a180dd
2018-11-06 19:39:23 -06:00
Steve Wilkerson
4c29bafcbc Gates: Update fluent-logging/elasticsearch configurations
This updates the fluentd buffer output configurations to account
for the restraints of the jobs deploying fluentd. This also
renames the fluentd configuration key from td_agent to fluentd to
reflect the fact we're no longer deploying td-agent

This also updates the Elasticsearch default replicas and overrides
the replica counts in each Elasticsearch deployment to account for
resource constraints

Change-Id: I55dee410eced99c3e1645f7452e4306ad646e601
2018-10-19 17:30:08 +00:00
Steve Wilkerson
538d51e991 Organize aio gates by function
This organizes the single node gates for osh-infra by function.
This organization aims to improve the single node gates in the
following ways:

1. Reduce number of services deployed in single node jobs
2. Only deploy Ceph for logging job, as Elasticsearch requires
   RGW for snapshot repositories.
3. Use NFS for storage for monitoring job, as Ceph is not a
   requirement for any of the services here.
4. Remove duplicate services deployed to multiple single node jobs
5. Remove storage from openstack-support job, as the only service
   requiring storage is rabbitmq. Rabbitmq is deployed with
   storage enabled in the openstack-helm checks/gates.

This also removes the documentation for the single node deployments,
as those deployments do not make sense with this change. This should
be revisited as a follow-on once we have a clear path forward for
the larger gate refactoring work

Change-Id: I46951f76904fa2ab245a202d55f76019b7503362
2018-10-19 12:28:18 -05:00
Gupta, Sangeet (sg774j)
a34a7d8e50 Adding Falco
This commit adds falco daemonset of the node for behavioral activity
monitor designed to detect anomalous activity.

Change-Id: I783a2acc03592471c81a8a54e1dc0df140b34a42
2018-10-05 16:18:51 +00:00
Steve Wilkerson
fa09705867 Fluentbit: Add kernel, kubelet, and dockerd logs
This adds inputs for kernel logs on the host, as well as dockerd
and kubelet logs via the systemd plugin. This also adds a filter
for adding the hostname to the kernel log events, for renaming the
fields for systemd logs as kibana can not visualize fields that
begin with an underscore, and adds elasticsearch indexes for both
kernel and systemd logs

Change-Id: I026470dd45a971047f1e5bd1cd49bd0889589d12
2018-10-01 11:56:58 +00:00
Steve Wilkerson
a084769410 Elasticsearch S3 repo
This ps adds the ability to use the ceph radosgw s3 api for
snapshot repositories. It removes the ability to use a RWM pvc, as
the radosgw solution provides a more robust approach for storing
index snapshots

Change-Id: Ie56ac41ccdc61bfadcac52b400cceb35403e9fae
2018-09-19 15:53:21 -05:00
Steve Wilkerson
bc6e22f392 Gate: Trim dev-deploy gates
This attempts to trim down the dev-deploy gates until further
gate refactoring is complete. This disables the elasticsearch and
fluentd exporters and removes the openstack exporter from the
single node deployment gates to ease the load on nodepool vms

Change-Id: If211511e8f52fe39d293966abbd7e62b45b65970
2018-09-17 13:56:51 +00:00
Zuul
40e0ddba33 Merge "Gate: Add process exporter to gate deployments and docs" 2018-09-05 20:52:24 +00:00
Steve Wilkerson
93630ac6e3 MariaDB: Move chart to openstack-helm-infra
This moves the mariadb chart to openstack-helm-infra as part of
the effort to move charts to the appropriate repositories

Change-Id: Ife56e28de46c536108cebb4f4cdf6bad2a415289
Story: 2002204
Task: 21725
2018-09-04 18:57:53 -05:00
Steve Wilkerson
0aae608aa0 Gate: Add process exporter to gate deployments and docs
This adds the process exporter to both the developer and multinode
gates, along with adding the relevant deployment steps to the docs

Change-Id: I85d5c398fbbb62145c9bb4e3a885e9a774725e5a
2018-09-04 15:54:25 -05:00
Steve Wilkerson
8037bf4ca9 Add Ceph to osh-infra gates
This adds a ceph developer gate to openstack-helm-infra, which
depends on ceph moving to openstack-helm-infra. This also replaces
the NFS backed storage for the multinode gate with ceph instead

Change-Id: I11268463aa037a2e037217a2dbc89c7432c0d277
2018-08-28 15:39:03 -05:00
Jean-Philippe Evrard
bf069b2311 Revert "Update OSH Author copyrights to OSF"
This reverts commit 178aa271a4.

Change-Id: I38a52d866527dfff2689b618e055f439bc248c13
2018-08-28 17:25:54 +00:00
Matt McEuen
178aa271a4 Update OSH Author copyrights to OSF
This PS updates the "Openstack-Helm Authors" copyright attribution
to be the "OpenStack Foundation", as decided in the 2018-03-20
team meeting:
http://eavesdrop.openstack.org/meetings/openstack_helm/2018/openstack_helm.2018-03-20-15.00.log.html

No other copyright attributions were changed.

Change-Id: I1137dee2ae5728771835f4b33fcaff60fcc22ca9
2018-08-26 17:17:06 -05:00
Steve Wilkerson
9fd4cd0917 Deploy ingress in osh-infra dev and multinode gates
This deploys the ingress chart in the openstack-helm-infra dev
and multinode gates, which allows for enabling ingresses in the
charts where defined

Change-Id: I055c7b02d9af68f6e3c5eda33d69dd0b8b1b70ca
2018-07-30 07:14:00 -05:00
Zuul
fbf2464042 Merge "Add ldap support to grafana, update version, add helm tests" 2018-05-15 07:41:37 +00:00
Steve Wilkerson
e081c19fe8 Add ldap support to grafana, update version, add helm tests
This adds ldap support to the grafana chart. This required updating
the version of Grafana to 5.0, as this version allows for using
configuration files to bootstrap the datasources and dashboards
instead of using the grafana http api. This was a necessary change
as using ldap for grafana presented issues trying to create the
datasource via the http api

This also adds a basic helm test for grafana. This test simply
verifies whether the prometheus datasource configured exists and
whether the number of dashboards reported by the admin api matches
the number of dashboards expected

Change-Id: I2e987cb425adba9f909722ffdb25b83f82710c4d
2018-05-15 01:42:04 +00:00
Steve Wilkerson
3c692abd6e Add ldap support in elasticsearch/kibana apache proxies
This adds required configuration for enabling LDAP through
the apache proxy in the elasticsearch and kibana charts by
default

Change-Id: Iaff8f328ff50944ddad94ec86b1134ca73750176
2018-05-14 13:53:30 +00:00
Steve Wilkerson
85208fe98a LDAP: Move ldap chart to openstack-helm-infra
This moves the ldap chart to openstack-helm-infra from
openstack-helm, allowing for ldap to provide an authentication
mechanism for components of the LMA stack, and can still be used for
keystone in openstack-helm, as openstack-helm-infra is a required
project

Change-Id: I211bc47c7a3ae875614102c8f64daa1099f702e8
2018-05-14 08:53:00 -05:00
Pete Birley
80db87033b Gate: stop pulling default images
This PS stops pulling the charts default images, as the make file
target cannot target over-rides - resulting in longer gate runs with
twice as many images pulled than required in some cases.

Change-Id: I04c1d43d6ac9d2f509604709139583b085b406e3
See: https://review.openstack.org/#/c/566738/
2018-05-13 09:40:05 -05:00
Steve Wilkerson
b492ee54c3 OSH-infra multinode gate rework
This moves to using bash scripts for deploying the charts for the
multinode gates in osh-infra as opposed to using the ansible
chart builder

Change-Id: I2951ccf57250a5e6e272872f2d6f7a81bd11f184
2018-05-10 00:06:04 +00:00