patroni

mirror of https://github.com/outbackdingo/patroni.git synced 2026-01-27 18:20:05 +00:00

Author	SHA1	Message	Date
Polina Bungina	71ccf91e36	Don't filter out contradictory nofailover tag (#2992 ) * Ensure that nofailover will always be used if both nofailover and failover_priority tags are provided * Call _validate_failover_tags from reload_local_configuration() as well * Properly check values in the _validate_failover_tags(): nofailover value should be casted to boolean like it is done when accessed in other places	2024-01-02 09:30:18 +01:00
Alexander Kukushkin	7e89583ec7	Please new flake8 (#2789 ) it stopped liking lack of space character between `,` and `\` ```python foo,\ bar ```	2023-07-31 09:08:46 +02:00
Alexander Kukushkin	2ea0357854	DCS failsafe mode (#2379 ) If enabled it will allow Patroni to cope with DCS outages. In case of a DCS outage the leader tries to call all remaining members in the cluster via API and if all of them respond with success the leader will not be demoted. The failsafe_mode could be enabled by running ```sh patronictl edit-config -s failsafe_mode=true ``` or by calling the `/config` REST API endpoint. Co-authored-by: Polina Bungina <bungina@gmail.com>	2023-01-13 13:35:05 +01:00
Alexander Kukushkin	5bbb5dceeb	Improve /(a)sync checks in behave tests (#2521 ) They are frequently failing because sometimes replicas are a bit slow realizing that they are synchronous. Instead of instroducing more sleeps we will poll for required http status code with some timeout.	2023-01-12 08:23:59 +01:00
Alexander Kukushkin	49f1ccf874	Enable SSL in REST API and Postgres if possible when running behave (#2498 ) If openssl binary is available use it to generate a self-signed certificate. Use it to protect Patroni REST API (`verify_client: required`). In case if Postgres is compiled with SSL support enable it in the configuration and configure pg_hba.conf to check client certificates (`verify-ca`) in addition to passwords. Also configure superuser/replication/rewind users to use client certificates and verify server certificate (`verify-ca`)	2022-12-21 10:20:30 +01:00
Alexander Kukushkin	4d77b444dc	Enforce search_path=pg_catalog for non-replication connections (#2496 ) There is a known [vector of attact](https://pganalyze.com/blog/5mins-postgres-security-patch-releases-pgspot-pghostile) by creating functions and/or operators in a public scheme with the same name and signature as corresponding objects in `pg_catalog`. Since Patroni is heavily relying on superuser connections we want to mitigate it by enforcing `search_path=pg_catalog` for all connections created by Patroni (except replication connections). It is achieved by introducing a new function, that wraps psycopg.connect() and appends ` -c search_path=pg_catalog` to `options` parameter. In addition to that, we set connection.autocommit to True before returning it.	2022-12-20 09:56:14 +01:00
Alexander Kukushkin	580530b30f	Behave tests on Windows (#2432 ) Windows doesn't support `SIGTERM`, but our behave tests in majority of cases relying on Patroni graceful shutdown. In order to emulate the behaviour we introduced the new REST API endpoint `POST /sigterm`. The endpoint works only on Windows and when `BEHAVE_DEBUG` environment variable is set. Besides that some minor adjustments in behave tests were done. Mainly related to backslash-slash handling. In addition to that improve test coverage on Windows by properly mocking access to filesystem and avoiding calling `subprocess.call()`. Specifically, symlink creation on Windows requires Admin privileges and there is no `true.exe`.	2022-10-21 12:24:24 +02:00
Alexander Kukushkin	d3e3b4e16f	Minor tuning of tests (#2201 ) - Reduce verbosity for unit tests - Refactor GH actions config and try again macos behave tests	2022-02-10 15:38:16 +01:00
Alexander Kukushkin	90a4208390	Get rid from requests module (#1296 ) It wasn't used for anything critical anyway, so it doesn't make a lot of sense to keep it as an explicit dependency.	2019-11-22 15:31:55 +01:00
Michael Banck	073074f83e	Run coverage as python -m coverage (#968 ) Depending on the platform the coverage binary might not always be available under the standard name.	2019-02-13 16:02:12 +01:00
Alexander Kukushkin	18786464a1	Rename failover to switchover and make new failover work without leader (#588 ) In addition to that implement /switchover endpoint as an alias to /failover endpoint and implement more checks like: * candidate must be provided for a failover * switchover can't be scheduled in a pause state * and so on Fixes https://github.com/zalando/patroni/issues/585 Fixes https://github.com/zalando/patroni/issues/520	2018-01-05 15:17:56 +01:00
Alexander Kukushkin	4328c15010	Make Patroni Kubernetes native (#500 ) * Use ConfigMaps or Endpoins for leader elections and to keep cluster state * Label pods with a postgres role * change behavior of pip install. From now on it will not install all dependencies, you have to specify explicitly DCS you want to use Patroni with: `pip install patroni[etcd,zookeeper,kubernetes]`	2017-12-08 16:55:00 +01:00
Alexander Kukushkin	acc6d7c2c2	Watchdog unit-tests, bugfixes and questions (#449 ) Implement missing unit-tests for and drop unused code	2017-07-11 10:00:30 +02:00
Ants Aasma	a70b46ef13	Add watchdog support on Linux (#343 ) Ensures that system gets rebooted before TTL runs out. Initial version. Open questions: Do we want to disable watchdog while we are not master?	2017-06-01 16:53:46 +02:00
Alexander Kukushkin	37b020e7a3	Various bugfixes and improvements: (#346 ) * Replace pytz.UTC with dateutil.tz.tzutc, it helps to reduce memory by more than 4Mb... * fix check of python version: 0x0300000 => 0x3000000 * Update leader key before restart and demote	2016-11-04 18:42:56 +02:00
Ants Aasma	7e53a604d4	Add synchronous replication support. (#314 ) Adds a new configuration variable synchronous_mode. When enabled Patroni will manage synchronous_standby_names to enable synchronous replication whenever there are healthy standbys available. With synchronous mode enabled Patroni will automatically fail over only to a standby that was synchronously replicating at the time of master failure. This effectively means zero lost user visible transactions. To enforce the synchronous failover guarantee Patroni stores current synchronous replication state in the DCS, using strict ordering, first enable synchronous replication, then publish the information. Standby can use this to verify that it was indeed a synchronous standby before master failed and is allowed to fail over. We can't enable multiple standbys as synchronous, allowing PostreSQL to pick one because we can't know which one was actually set to be synchronous on the master when it failed. This means that on standby failure commits will be blocked on the master until next run_cycle iteration. TODO: figure out a way to poke Patroni to run sooner or allow for PostgreSQL to pick one without the possibility of lost transactions. On graceful shutdown standbys will disable themselves by setting a nosync tag for themselves and waiting for the master to notice and pick another standby. This adds a new mechanism for Ha to publish dynamic tags to the DCS. When the synchronous standby goes away or disconnects a new one is picked and Patroni switches master over to the new one. If no synchronous standby exists Patroni disables synchronous replication (synchronous_standby_names=''), but not synchronous_mode. In this case, only the node that was previously master is allowed to acquire the leader lock. Added acceptance tests and documentation. Implementation by @ants with extensive review by @CyberDem0n.	2016-10-19 16:12:51 +02:00
Alexander Kukushkin	4594bc98da	Increase timeouts when running AT on travis (#324 ) * Increase timeouts two times when running AT on travis * Make up to 3 attempts to download DCS * Get rid from hard-coded names	2016-09-28 15:13:09 +02:00
Oleksii Kliukin	c91eda8d78	Merge branch 'master' into feature/scheduled_restarts	2016-07-11 12:56:24 +02:00
Oleksii Kliukin	7a1e2e0c72	Fix the assert message.	2016-06-28 17:11:13 +02:00
Oleksii Kliukin	d2832ee43b	Address the code review. Fix return value in the should_run_scheduled_action and the comments. Correct the json composition in the scheduled_restart test. Fix the delete in case there is no scheduled restart. Fix the usage of format in the logger output. Fix the indentation in the evaluate_scheduled_restart. Fix the condition related to the body_is_optional in the do_POST_restart. Fix a few typos in the error messages. Fix the _read_json_content Make the scheduled restart unit-tests a bit less ugly	2016-06-28 16:54:20 +02:00
Oleksii Kliukin	29845dd383	Restart the node according to the schedule. The scheduled restart data structures are now independent of those used by the normal restarts. This would be fixed in subsequent commits. Add the behave tests, that cover the POST /restart (but not DELETE).	2016-06-23 10:43:54 +02:00
Alexander Kukushkin	27bdc65e46	Fix acceptance tests with python3	2016-06-16 15:27:41 +02:00
Alexander Kukushkin	fcde17583c	Acceptance tests for patronictl Call patronictl.py when it's possible instead of doing REST API calls.	2016-06-16 15:06:18 +02:00
Alexander Kukushkin	f7912991a8	Reshuffle acceptance tests one more time	2016-05-30 12:37:14 +02:00
Alexander Kukushkin	e085c866dc	Reshuffle acceptance tests Move dynamic config tests from basic_replication to patroni_api	2016-05-30 11:30:41 +02:00
Alexander Kukushkin	073ef3784f	Implement PATCH /config	2016-05-27 16:29:33 +02:00
Alexander Kukushkin	24a2ea6cef	Refactor acceptance tests to make them work against ZooKeeper and make it easier to implement controllers for new DCS, i.e. consul	2016-04-10 10:37:43 +02:00
Alexander Kukushkin	5f6beae22f	Enforce data-type checks for step matcher and increase default timeout for patroni start	2016-03-11 14:46:14 +01:00
Alexander Kukushkin	30d3982d25	Acceptance tests with behave	2016-03-11 12:56:29 +01:00

29 Commits