Commit Graph

2684 Commits

Author SHA1 Message Date
Zuul
c19ee4ab94 Merge "[ceph-client] Fix crush weight comparison in reweight_osds()" 2020-08-13 20:40:46 +00:00
Zuul
592153734a Merge "[ceph] Check for osds deployed with zero crush weight" 2020-08-13 20:03:31 +00:00
Taylor, Stephen (st053q)
f66f9fe560 [ceph-client] Fix crush weight comparison in reweight_osds()
The recently-added crush weight comparison in reweight_osds() that
checks weights for zero isn't working correctly because the
expected weight is being calculated to two decimal places and then
compared against "0" as a string. This updates the comparison
string to "0.00" to match the calculation.

Change-Id: I29387a597a21180bb7fba974b4daeadf6ffc182d
2020-08-13 12:00:32 -06:00
Chinasubbareddy Mallavarapu
64b423cee0 [ceph] Check for osds deployed with zero crush weight
This is to check for osds deployed with zero crush weight from
helm tests.

Change-Id: Ie8d9c65b33bf7a026a342d1d7e81ec37cb981db3
2020-08-13 14:39:38 +00:00
Zuul
8631c79548 Merge "Add security context template for keystone-webhook container" 2020-08-12 22:51:41 +00:00
Kabanov, Dmitrii
787052a975 [ceph-osd] update post apply job
The PS updates post apply job and moves execution of the command outside
of if statement. The output of the command stored in a variable
which will be checked in if statement. Added "-z" to correct comparison
of the length of the string (variable). It was accidentally missed in
the initial PS.

Change-Id: I907f75d0a9e5ef27fba5306ddb86199e94b01b3b
2020-08-12 15:49:20 +00:00
Zuul
dbd465c4c6 Merge "prometheus exporter to monitor health check for web URL" 2020-08-12 00:51:52 +00:00
Zuul
8bf7505396 Merge "Run node-problem-detector from 0 user" 2020-08-11 22:35:41 +00:00
KHIYANI, RAHUL (rk0850)
117fae68e6 Run node-problem-detector from 0 user
This adds the runAsUser flag to node-problem-detector at pod level

Change-Id: I0db38599e037ac38b24e50bbc2228ef1fd264c4e
2020-08-11 19:36:15 +00:00
Vipul Jain
628186f05e prometheus exporter to monitor health check for web URL
1) A separate prometheus job need to provide target and scrap metrics
2) it is based on https://github.com/prometheus/blackbox_exporter

Adding script file for deployment and job under zuul.d
Resolving conflict
Change-Id: Ia15ab7d8ef882886fe0e37cc2599e6815d7bcc6c
2020-08-11 22:39:54 +05:30
Taylor, Stephen (st053q)
f1e9a6ba83 [ceph-client] Refrain from reweighting OSDs to 0
If circumstances are such that the reweight function believes
OSD disks have zero size, refrain from reweighting OSDs to 0.
This can happen if OSDs are deployed with the noup flag set.

Also move the setting and unsetting of flags above this
calculation as an additional precautionary measure.

Change-Id: Ibc23494e0e75cfdd7654f5c0d3b6048b146280f7
2020-08-11 09:48:53 -06:00
KHIYANI, RAHUL (rk0850)
a58a78ff83 Add security context template for keystone-webhook container
This implements security context override at pod level and adds
readOnly-fs to keystone-webhook container

Change-Id: Ia67947b7323e41363a5ee379c0dfb001936b5107
2020-08-11 09:45:08 -05:00
Zuul
10fd77b6e4 Merge "Update alertmanager include snmp_notifier function" 2020-08-11 06:16:10 +00:00
Zuul
b05c64033f Merge "Add missing security context template to mariadb-backup pod" 2020-08-11 05:28:51 +00:00
Zuul
d2d3b05273 Merge "Prometheus: Render Rules as Templates" 2020-08-10 20:31:45 +00:00
Steven Fitzpatrick
fb7fc87d23 Prometheus: Render Rules as Templates
This change allows us to substitute values into our rules files.

Example:

- alert: my_region_is_down
  expr: up{region="{{ $my_region }}"} == 0
  
To support this change, rule annotations that used the expansion
{{ $labels.foo }} had to be surrounded with "{{` ... `}}" to render
correctly.

Change-Id: Ia7ac891de8261acca62105a3e2636bd747a5fbea
2020-08-10 18:16:35 +00:00
Zuul
87d7ddaf56 Merge "Add Application Armor to elastic-apm" 2020-08-07 19:35:35 +00:00
Kabanov, Dmitrii
f99444f37a [ceph-osd] update post_apply job
The PS updates wait_for_pods function and adds query to filter the pods
which are not in Running or Succeeded state.
Also the PS reduces the amount of 'kubectl get' requests.

Change-Id: Ie2abdaf0a87ca377f5ce287a3de9e87d1ca6c0d4
2020-08-07 17:16:53 +00:00
Xiaoguang(William) Zhang
7c94deae43 Update alertmanager include snmp_notifier function
Change-Id: I5aedbdcdbba397a9fddde19a0898cb91de08553a
2020-08-07 12:25:33 -04:00
dt241s@att.com
25d03950b0 Add Application Armor to Elastic FileBeat
Change-Id: Ifa6eb48e56e1a92d090ea9dbaaaf5890f95ae032
2020-08-07 14:34:37 +00:00
dt241s@att.com
564cada4ad Add Application Armor to elastic-apm
Change-Id: Id1e6b70db03f71b87539f6e3e466f39d8440b773
2020-08-07 14:34:27 +00:00
Andrii Ostapenko
817efe966c Pass parameter to skip helm tests for component in the same script
Pass parameter from job allowing to parallelize helm tests using
separate scripts.

Change-Id: I3e06c5590d51c75448dc5ff5978dc7fc90daca6f
Signed-off-by: Andrii Ostapenko <andrii.ostapenko@att.com>
2020-08-06 00:51:15 +00:00
Andrii Ostapenko
fdcbd03784 Allow parallelization in gate runner
Change-Id: I393a22cfcaecb00d14d8ac643bd4b7ffbba03b12
Signed-off-by: Andrii Ostapenko <andrii.ostapenko@att.com>
2020-08-06 00:51:11 +00:00
Zuul
a6a9146de4 Merge "Add missing security context template to nagios init container" 2020-08-05 19:06:21 +00:00
Zuul
fde1d53cd4 Merge "[Ceph-OSD] Refactor the code of OSD init script" 2020-08-05 18:59:12 +00:00
KHIYANI, RAHUL (rk0850)
cf131bacb2 Add missing security context template to nagios init container
This change adds security context template at container level
to implement readOnly-fs flag

Change-Id: Ibd4f8a916bcd74c1d89aa360e89d4477cd01d367
2020-08-05 10:29:30 -05:00
Zuul
8cd4e26a42 Merge "Fluentd: Allow for Multiple Config Files" 2020-08-05 14:48:13 +00:00
Kabanov, Dmitrii
cbc8001255 [Ceph-OSD] Refactor the code of OSD init script
The PS adds the changes which simplifies the code and
removes unnecessary steps.

Change-Id: I66e2b661e4d2cbc4a08d7690514321f9f3127e19
2020-08-05 14:45:50 +00:00
Zuul
e292b356b5 Merge "Use precreated minikube-aio image" 2020-08-04 22:01:04 +00:00
Zuul
3d76931e55 Merge "Fluentd: Remove Deployment Option" 2020-08-04 21:06:26 +00:00
Steven Fitzpatrick
4e73195bc2 Fluentd: Allow for Multiple Config Files
Change-Id: I2eb6ff2a599ee7bc479f9fc4955016b18902c879
2020-08-04 20:14:09 +00:00
Andrii Ostapenko
10019004da Use precreated minikube-aio image
With this commit minikube is installed using contents of precreated
minikube-aio image containing installation script, all required binaries
and images inside. Pulling a single image from dockerhub via opendev
dockerhub proxy and loading images allows to save up to 6 minutes in
minikube installation.

Change-Id: I5936f440eb0567b8dcba2fdae614e4c5e88a7b9a
Signed-off-by: Andrii Ostapenko <andrii.ostapenko@att.com>
2020-08-04 19:46:43 +00:00
Steven Fitzpatrick
959417f321 Fluentd: Remove Deployment Option
This chart could deploy fluentd either as a Deployment
or a Daemonset. Both options would use the deployment-fluentd
template with various sections toggled off based on values.yaml

I'd like to know - Does anyone run this chart as a Deployment?
We can simplify the chart, and zuul gates, by changing the chart
to deploy a Daemonset specifically.

Change-Id: Ie88ceadbf5113fc60e5bb0ddef09e18fe07a192c
2020-08-04 19:06:37 +00:00
Zuul
3fa84d655f Merge "Add Application Armor to Ceph-Provisioners-config test" 2020-08-04 17:56:27 +00:00
Zuul
8d8d53c65c Merge "feat(tls): add tls to prometheus-openstack-exporter" 2020-08-04 14:54:26 +00:00
Zuul
8b09a07423 Merge "Fix overrides diff" 2020-08-03 23:05:18 +00:00
Zuul
262fa219d0 Merge "Remove updateStrategy of childresources of DaemonJobController." 2020-08-03 22:43:40 +00:00
Gupta, Sangeet (sg774j)
4d512f6eff feat(tls): add tls to prometheus-openstack-exporter
This patchset enables passing of tls certificate to
openstack.

Change-Id: I370d69d8747ce894684dbff87b3580b6d1e82647
2020-08-03 22:20:34 +00:00
Zuul
9ed951aa32 Merge "[Ceph-client] Add check of target osd value" 2020-08-03 21:31:09 +00:00
Zuul
c0b86523a7 Merge "[ceph-client] update logic of inactive pgs check" 2020-08-03 20:12:06 +00:00
Frank Ritchie
5909bcbdef Use hostPID for ceph-mgr deployment
This change is to address a memory leak in the ceph-mgr deployment.
The leak has also been noted in:

https://review.opendev.org/#/c/711085

Without this change memory usage for the active ceph-mgr pod will
steadily increase by roughly 100MiB per hour until all available
memory has been exhausted. Reset messages will also be seen in the
active and standby ceph-mgr pod logs.

Sample messages:

---

0 client.0 ms_handle_reset on v2:10.0.0.226:6808/1
0 client.0 ms_handle_reset on v2:10.0.0.226:6808/1
0 client.0 ms_handle_reset on v2:10.0.0.226:6808/1

---

The root cause of the resets and associated memory leak appears to
be due to multiple ceph pods sharing the same IP address (due to
hostNetwork being true) and PID (due to hostPID being false).
In the messages above the "1" at the end of the line is the PID.
Ceph appears to use the Version:IP:Port/PID (v2:10.0.0.226:6808/1)
tuple as a unique identifier. When hostPID is false conflicts arise.

Setting hostPID to true stops the reset messages and memory leak.

Change-Id: I9821637e75e8f89b59cf39842a6eb7e66518fa2c
2020-08-03 17:35:51 +00:00
dt241s@att.com
4c46b2662a Add Application Armor to Ceph-Provisioners-config test
1) Added  to service account name insted of traditional pod name
   to resolve for dynamic release names.

Change-Id: Ibf4c69415e69a7baca2e3b96bcb23851e68d07d8
2020-08-03 16:42:53 +00:00
Kabanov, Dmitrii
f6d6ae051d [ceph-client] update logic of inactive pgs check
The PS updates wait_for_inactive_pgs function:
- Changed the name of the function to wait_for_pgs
- Added a query for getting status of pgs
- All pgs should be in "active+" state at least three times in a row

Change-Id: Iecc79ebbdfaa74886bca989b23f7741a1c3dca16
2020-08-03 08:42:58 -07:00
Kabanov, Dmitrii
47ce52a5cf [Ceph-client] Add check of target osd value
The PS adds the check of target osd value. The expected amount of OSDs
should be always more or equal to existing OSDs. If there is more OSDs
than expected it means that the value is not correct.

Change-Id: I117a189a18dbb740585b343db9ac9b596a34b929
2020-08-03 15:38:14 +00:00
Zuul
3ce0170da8 Merge "Prometheus: Allow input of TLS client creds in values.yaml" 2020-08-01 22:47:56 +00:00
KHIYANI, RAHUL (rk0850)
4397ed6152 Add missing security context template to mariadb-backup pod
Values are already overridden in values.yaml
https://github.com/openstack/openstack-helm-infra/blob/master/mariadb/values.yaml#L112-L122

Change-Id: I2931eb1408d1d8145b476a76ff8cfac36e9f2bfe
2020-08-01 20:56:38 +00:00
Andrii Ostapenko
cf90f32e8b Fix overrides diff
Check proper path.

Change-Id: Icd3d0711fb530b77d049227b09904c433e26dc78
Signed-off-by: Andrii Ostapenko <andrii.ostapenko@att.com>
2020-07-31 21:52:59 -05:00
Zuul
f79704a8f0 Merge "[ceph-client] Fix a helm test issue and disable PG autoscaler" 2020-07-31 22:09:17 +00:00
Zuul
3f9006ccae Merge "Enable Read-Only for Node-Problem Detector" 2020-07-31 21:39:06 +00:00
Zuul
b5250bb517 Merge "Fix postgresql backup cronjob deployment issues" 2020-07-31 20:13:21 +00:00