patroni

mirror of https://github.com/optim-enterprises-bv/patroni.git synced 2026-01-04 13:51:30 +00:00

Author	SHA1	Message	Date
Alexander Kukushkin	93eb4edbe6	Reformat imports with isort (#3123 ) Besides that: 1. Introduce `setup.py isort` for quick check 2. Introduce GH actions to check imports	2024-08-13 17:53:59 +02:00
Polina Bungina	6e1f9f7a6e	Prepare repo migration (#3085 )	2024-06-17 09:04:43 +02:00
Polina Bungina	14a44e14ba	Re-enable SSL for MacOS GH action runners (#3005 )	2024-06-12 13:28:01 +02:00
Polina Bungina	ae53260030	Extend behave tests with nostream feature (#3036 ) Check state and permanent logical replication slots behaviour	2024-03-29 12:54:40 +01:00
Alexander Kukushkin	a4e0a2220d	Disable SSL for MacOS GH action runners (#2976 ) Latest runners release (20231127.1) somehow broke our tests. Connections to postgres somehow failing with strange error: ``` could not accept SSL connection: Socket operation on non-socket ```	2023-12-06 15:28:03 +01:00
Israel	bb90feb393	Add support for additional parameters on custom bootstrap (#2927 ) Previous to this commit, if a user would ever like to add parameters to the custom bootstrap script call, they would need to configure Patroni like this: ``` bootstrap: method: custom_method_name custom_method_name: command: /path/to/my/custom_script --arg1=value1 --arg2=value2 ... ``` This commit extends that so we achieve a similar behavior that is seen when using `create_replica_methods`, i.e., we also allow the following syntax: ``` bootstrap: method: custom_method_name custom_method_name: command: /path/to/my/custom_script arg1: value1 arg2: value2 ``` All keys in the mapping which are not recognized by Patroni, will be dealt with as if they were additional named arguments to be passed down to the `command` call. References: PAT-218.	2023-10-25 15:01:08 +02:00
Alexander Kukushkin	c5fffb3c97	Further work on permanent physical slots (#2891 ) - Fixed issues with has_permanent_slots() method. It didn't took into account the case of permanent physical slots for members, falsely concluding that there are no permanent slots. - Write to the status key only LSNs for permanent slots (not just for slots that exist on the primary). - Include pg_current_wal_flush_lsn() to slots feedback, so that slots on standby nodes could be advanced - Improved behave tests: - Verify that permanent slots are properly created on standby nodes - Verify that permanent slots are properly advanced, including DCS failsafe mode - Verify that only permanent slots are written to the `/status`	2023-10-23 08:24:28 +02:00
Alexander Kukushkin	aa3ebe0af8	Don't cache anything in Zookeeper implementation (#2909 ) Cache creates a lot of problems and prevents implementing a feature of automatic retention of physical replication slots for members with configurable retention policy. Just read the entire cluster from Zookeeper instead and use watchers only for the `/leader` and `/config` keys.	2023-10-17 08:56:31 +02:00
Alexander Kukushkin	42976df86f	Make it easier to debug callbacks (#2902 ) 1. Introduce DEBUG logs for callbacks 2. Configure log format in behave tests to include filename, line, and method name that triggered the callback and enable DEBUG logs for `patroni.postgresql.callback_executor` module. P.S. unfortunately it works only starting from python 3.8, but it should be good enough for debug purpose because 3.7 is already EOL.	2023-10-16 08:55:07 +02:00
Alexander Kukushkin	f77073c8e1	Speed up dcs failsafe behave tests (#2890 ) - get rid from sleeps - reduce retry_timeout - avoid graceful Patroni shut down while DCS is "paused", just kill Patroni and after that gracefully stop postgres - don't try to delete Pod when Patroni is killed. If K8s API is paused it takes ages The run time on my laptop is reduced from 2m to 1m28s.	2023-09-28 10:44:11 +02:00
Alexander Kukushkin	7e89583ec7	Please new flake8 (#2789 ) it stopped liking lack of space character between `,` and `\` ```python foo,\ bar ```	2023-07-31 09:08:46 +02:00
Alexander Kukushkin	06db296612	Fixes in `patroni.request` (#2768 ) 1. Take client certificates only from the `ctl` section. Motivation: sometimes there are server-only certificates that can't be used as client certificates. As a result neither Patroni not patronictl work correctly even if `--insecure` option is used. 2. Document that if `restapi.verify_client` is set to `required` then client certificates must be provided in the `ctl` section. 3. Add support for `ctl.authentication` and prefer to use it over `restapi.authentication`. 4. Silence annoying InsecureRequestWarning when `patronictl -k` is used, so that behavior becomes is similar to `curl -k`.	2023-07-25 08:48:18 +02:00
Alexander Kukushkin	0a8fb0860e	Skip flaky scenario when running with Raft (#2771 ) Sometimes Patroni doesn't see the latest Raft data on start.	2023-07-21 16:09:34 +02:00
Mark Pekala	412c51ddf1	Prevent splitbrain from duplicate names in configuration (#2724 ) When starting check if node with the same is registered in DCS and try to query it's REST API. If REST API is accessible exit with the error. Close #1804	2023-07-11 07:43:57 +02:00
Alexander Kukushkin	af318b2473	Fix kubernetes behave tests (#2707 ) Starting from 1.27 there is containerd process, which also uses k3s binary and being detected by pidof. Therefore we will search for "k3s server" string in the process list instead of just "k3s".	2023-06-01 13:28:29 +02:00
Polina Bungina	ab9fea7d6b	Fix openssl certificate generation in behave tests (#2672 ) --addext -> -addext (doesn't work on macOS) set keyfile permissions to 600 (to avoid "private key file has group or world access")	2023-05-12 10:42:53 +02:00
Alexander Kukushkin	4d35f85b87	Fix behave tests (#2656 ) 1. specify `subjectAltName=IP:127.0.0.1` when generating certificate 2. run more behave tests with psycopg2	2023-04-27 12:18:44 +02:00
Polina Bungina	3fe2a7868a	Ignore D401 in flake8-docstrings (#2627 ) * Ignore D401 in flake8-docstrings * Fix newly reported flake8 issues, ignore the old W503 rule * rely on concatenation of adjecent strings * Format behave scripts * Reformat ha.py according to new rules Co-authored-by: Alexander Kukushkin <cyberdemn@gmail.com>	2023-04-03 09:52:22 +02:00
Alexander Kukushkin	c1bfb0e6d6	Remove python 2.7 support (#2571 ) - get rid from 2.7 specific modules: `six`, `ipaddress` - use Python3 unpacking operator - use `shutil.which()` instead of `find_executable()`	2023-03-13 17:00:04 +01:00
Polina Bungina	b85f155dbe	Pass 'master' role to a callback script instead of 'promoted' (#2554 ) Co-authored-by: Alexander Kukushkin <cyberdemn@gmail.com>	2023-02-08 14:09:51 +01:00
Alexander Kukushkin	4c3af2d1a0	Change master->primary/leader/member (#2541 ) keep as much backward compatibility as possible. Following changes were made: 1. All internal checks are performed as `role in ('master', 'primary')` 2. All internal variables/functions/methods are renamed 3. `GET /metrics` endpoint returns `patroni_primary` in addition to `patroni_master`. 4. Logs are changed to use leader/primary/member/remote depending on the context 5. Unit-tests are using only role = 'primary' instead of 'master' to verify that 1 works. 6. patronictl still supports old syntax, but also accepts `--leader` and `--primary`. 7. `master_(start\|stop)_timeout` is automatically translated to `primary_(start\|stop)_timeout` if the last one is not set. 8. updated the documentation and some examples Future plan: in the next major release switch role name from `master` to `primary` and maybe drop `master` altogether. The Kubernetes implementation will require more work and keep two labels in parallel. Label values should probably be configurable as described in https://github.com/zalando/patroni/issues/2495.	2023-01-27 07:40:24 +01:00
Alexander Kukushkin	79458688d1	Check unexpected exceptions in Patroni logs after behave (#2538 ) and make behave fail if there are anything unexpected found. In addition to that fix globing rule when uploading artifacts with logs.	2023-01-25 11:02:52 +01:00
Alexander Kukushkin	4872ac51e0	Citus integration (#2504 ) Citus cluster (coordinator and workers) will be stored in DCS as a fleet of Patroni logically grouped together: ``` /service/batman/ /service/batman/0/ /service/batman/0/initialize /service/batman/0/leader /service/batman/0/members/ /service/batman/0/members/m1 /service/batman/0/members/m2 /service/batman/ /service/batman/1/ /service/batman/1/initialize /service/batman/1/leader /service/batman/1/members/ /service/batman/1/members/m1 /service/batman/1/members/m2 ... ``` Where 0 is a Citus group for coordinator and 1, 2, etc are worker groups. Such hierarchy allows reading the entire Citus cluster with a single call to DCS (except Zookeeper). The get_cluster() method will be reading the entire Citus cluster on the coordinator because it needs to discover workers. For the worker cluster it will be reading the subtree of its own group. Besides that we introduce a new method get_citus_coordinator(). It will be used only by worker clusters. Since there is no hierarchical structures on K8s we will use the citus group suffix on all objects that Patroni creates. E.g. ``` batman-0-leader # the leader config map for the coordinator batman-0-config # the config map holding initialize, config, and history "keys" ... batman-1-leader # the leader config map for worker group 1 batman-1-config ... ``` Citus integration is enabled from patroni.yaml: ```yaml citus: database: citus group: 0 # 0 is for coordinator, 1, 2, etc are for workers ``` If enabled, Patroni will create the database, citus extension in it, and INSERTs INTO `pg_dist_authinfo` information required for Citus nodes to communicate between each other, i.e. 'password', 'sslcert', 'sslkey' for superuser if they are defined in the Patroni configuration file. When the new Citus coordinator/worker is bootstrapped, Patroni adds `synchronous_mode: on` to the `bootstrap.dcs` section. Besides that, Patroni takes over management of some Postgres GUCs: - `shared_preload_libraries` - Patroni ensures that the "citus" is added to the first place - `max_prepared_transactions` - if not set or set to 0, Patroni changes the value to `max_connections*2` - wal_level - automatically set to logical. It is used by Citus to move/split shards. Under the hood Citus is creating/removing replication slots and they are automatically added by Patroni to the `ignore_slots` configuration to avoid accidental removal. The coordinator primary actively discovers worker primary nodes and registers/updates them in the `pg_dist_node` table using citus_add_node() and citus_update_node() functions. Patroni running on the coordinator provides the new REST API endpoint: `POST /citus`. It is used by workers to facilitate controlled switchovers and restarts of worker primaries. When the worker primary needs to shut down Postgres because of restart or switchover, it calls the `POST /citus` endpoint on the coordinator and the Patroni on the coordinator starts a transaction and calls `citus_update_node(nodeid, 'host-demoted', port)` in order to pause client connections that work with the given worker. Once the new leader is elected or postgres started back, they perform another call to the `POST/citus` endpoint, that does another `citus_update_node()` call with actual hostname and port and commits a transaction. After transaction is committed, coordinator reestablishes connections to the worker node and client connections are unblocked. If clients don't run long transaction the operation finishes without client visible errors, but only a short latency spike. All operations on the `pg_dist_node` are serialized by Patroni on the coordinator. It allows to have more control and ROLLBACK transaction in progress if its lifetime exceeding a certain threshold and there are other worker nodes should be updated.	2023-01-24 16:14:58 +01:00
Alexander Kukushkin	40d16443f9	Fixes and improvements in failsafe (#2532 ) 1. Fix problem with logical slots not advancing when only the primary lost access to DCS 2. Don't let Patroni to join as a raft voting member when running failsafe behave tests. It allows to test exactly the same conditions as for other DCS 3. Speed up dcs_failsafe_mode behave tests by getting rid from long sleeps, slight reshuffling of places when we start/stop outage, and by killing Patroni/Postgres to avoid long shutdown due to the leader key removal attempts.	2023-01-24 14:07:31 +01:00
Alexander Kukushkin	2ea0357854	DCS failsafe mode (#2379 ) If enabled it will allow Patroni to cope with DCS outages. In case of a DCS outage the leader tries to call all remaining members in the cluster via API and if all of them respond with success the leader will not be demoted. The failsafe_mode could be enabled by running ```sh patronictl edit-config -s failsafe_mode=true ``` or by calling the `/config` REST API endpoint. Co-authored-by: Polina Bungina <bungina@gmail.com>	2023-01-13 13:35:05 +01:00
Michael Banck	e3e4ad0ada	Start etcd with V2 API enabled for V2 etcd acceptance tests (#2509 ) Otherwise, the etcd (not etcd3) behave tests fail to connect: ``` Jan 02 09:56:18 HOOK-ERROR in before_all: AssertionError: etcd instance is not available for queries after 5 seconds ```	2023-01-03 15:39:30 +01:00
Alexander Kukushkin	49f1ccf874	Enable SSL in REST API and Postgres if possible when running behave (#2498 ) If openssl binary is available use it to generate a self-signed certificate. Use it to protect Patroni REST API (`verify_client: required`). In case if Postgres is compiled with SSL support enable it in the configuration and configure pg_hba.conf to check client certificates (`verify-ca`) in addition to passwords. Also configure superuser/replication/rewind users to use client certificates and verify server certificate (`verify-ca`)	2022-12-21 10:20:30 +01:00
Alexander Kukushkin	4d77b444dc	Enforce search_path=pg_catalog for non-replication connections (#2496 ) There is a known [vector of attact](https://pganalyze.com/blog/5mins-postgres-security-patch-releases-pgspot-pghostile) by creating functions and/or operators in a public scheme with the same name and signature as corresponding objects in `pg_catalog`. Since Patroni is heavily relying on superuser connections we want to mitigate it by enforcing `search_path=pg_catalog` for all connections created by Patroni (except replication connections). It is achieved by introducing a new function, that wraps psycopg.connect() and appends ` -c search_path=pg_catalog` to `options` parameter. In addition to that, we set connection.autocommit to True before returning it.	2022-12-20 09:56:14 +01:00
Alexander Kukushkin	c7a925a238	Switch from localkube to kind and/or k3d (#2465 ) The only advantage of localkube was being a low weight. Anything else started creating only problems: 1. It is not properly maintained for many years. 2. It effectively worked only on Linux, but stopped on modern version due to changes in iptables. Instead, we will use widely adoped tools like kind or k3s. The "kind-kind" is the default K8s context (see ~/.kube/config), but it could be overriden using `PATRONI_KUBERNETES_CONTEXT` environment variable. When executed from GH actions the context is set to k3d-k3s-default, because K3s is much faster to start.	2022-12-06 13:15:56 +01:00
Alexander Kukushkin	580530b30f	Behave tests on Windows (#2432 ) Windows doesn't support `SIGTERM`, but our behave tests in majority of cases relying on Patroni graceful shutdown. In order to emulate the behaviour we introduced the new REST API endpoint `POST /sigterm`. The endpoint works only on Windows and when `BEHAVE_DEBUG` environment variable is set. Besides that some minor adjustments in behave tests were done. Mainly related to backslash-slash handling. In addition to that improve test coverage on Windows by properly mocking access to filesystem and avoiding calling `subprocess.call()`. Specifically, symlink creation on Windows requires Admin privileges and there is no `true.exe`.	2022-10-21 12:24:24 +02:00
Alexander Kukushkin	ead798d9ac	Speed up behave tests by always using loop_wait=2 (#2361 ) run time is reduced from ~5m30s to ~5m	2022-07-18 15:23:55 +02:00
Alexander Kukushkin	4c5cce5efd	Automatically skip some behave tests on legacy Postgres (#2358 ) previously behave had to be started with `--tags=-skip` argument.	2022-07-13 12:13:36 +02:00
Alexander Kukushkin	729f1dddc8	Compatibility with PostgreSQL 15 beta1 (#2299 ) * update postgresql/validator.py * pg_rewind doesn't like if there are unix sockets in PGDATA * pg_rewind now supports --config-file option	2022-05-19 15:36:09 +02:00
Alexander Kukushkin	d3e3b4e16f	Minor tuning of tests (#2201 ) - Reduce verbosity for unit tests - Refactor GH actions config and try again macos behave tests	2022-02-10 15:38:16 +01:00
Alexander Kukushkin	fce889cd04	Compatibility with psycopg 3.0 (#2088 ) By default `psycopg2` is preferred. The `psycopg>=3.0` will be used only if `psycopg2` is not available or its version is too old.	2021-11-19 14:32:54 +01:00
Christian Clauss	75e52226a8	Fix typos discovered by codespell (#1997 )	2021-07-06 10:01:30 +02:00
Alexander Kukushkin	99626a07f2	Fix issues with raft traffic encryption (#1919 ) and run raft behave tests with encryption enabled. Using the new `pysyncobj` release allowed us to get rid of a lot of hacks with accessing private properties and methods of the parent class and reduce the size of the `raft.py`. Close https://github.com/zalando/patroni/issues/1746	2021-04-30 11:28:41 +02:00
melrifa	6d6b504cb8	Add support for patroni replication user socket connection (#1865 ) Close #1866	2021-04-20 09:43:05 +02:00
krishna	b3dc765e6d	Choose synchronous nodes based on replication lag (#1786 ) This commit makes it possible to configure the maximum lag (`maximum_lag_on_syncnode`) after which Patroni will "demote" the node from synchronous and replace it with another node. The previous implementation always tried to stick to the same synchronous nodes (even if they are not optimal ones).	2021-02-02 15:45:02 +01:00
Alexander Kukushkin	e3ef9ac306	Fix issues with zookeeper (#1792 ) 1. The `ttl` was incorrectly returned 1000 times higher then it should 2. The `watch()` method must return True if the parent method returned True. Not doing so resulted in the incorrect calculation of sleep time. 3. Move mock of exhibitor api to the features/environment.py. It simplifies testing with behave.	2020-12-14 15:12:57 +01:00
Alexander Kukushkin	1530ed0b9c	Switch to GH actions (#1778 ) it allows up to 20 parallel builds	2020-12-04 21:52:34 +01:00
Alexander Kukushkin	23dcfaab49	Make it possible to bypass kubernetes service (#1614 ) When running on K8s Patroni is communicating with API via the `kubernetes` service, which is address is exposed via the `KUBERNETES_SERVICE_HOST` environment variable. Like any other service, the `kubernetes` service is handled by `kube-proxy`, that depending on configuration is either relying on userspace program or `iptables` for traffic routing. During K8s upgrade, when master nodes are replaced, it is possible that `kube-proxy` doesn't update the service configuration in time and as a result Patroni fails to update the leader lock and demotes postgres. In order to improve the user experience and get more control on the problem we make it possible to bypass the `kubernetes` service and connect directly to API nodes. The strategy is very simple: 1. Resolve list IPs of API nodes from the kubernetes endpoint on every iteration of HA loop. 2. Stick to one of these IPs for API requests 3. Switch to a different IP if connected to IP is not from the list 4. If the request fails, switch to another IP and retry Such a strategy is already used for Etcd and proven to work quite well. In order to enable the feature, you need either to set to `true` `kubernetes.bypass_api_service` in the Patroni configuration file or `PATRONI_KUBERNETES_BYPASS_API_SERVICE` environment variable. If for some reason `GET /default/endpoints/kubernetes` isn't allowed Patroni will disable the feature.	2020-08-14 12:39:47 +02:00
Alexander Kukushkin	3341c898ff	Add Etcd v3 protocol support via api gRPC-gateway (#1162 ) The only python-etcd3 client working directly via gRPC still supports only a single endpoint, which is not very nice for high-availability. Since Patroni is already using a heavily hacked version of python-etcd with smart retries and auto-discovery out-of-the-box, I decided to enhance the existing code with limited support of v3 protocol via gRPC-gateway. Unfortunately, watches via gRPC-gateway requires us to open and keep the second connection to the etcd. Known limitations: * The very minimal supported version is 3.0.4. On earlier versions transactions don't work due to bugs in grpc-gateway. Without transactions we can't do atomic operations, i.e. leader locks. * Watches work only starting from 3.1.0 * Authentication works only starting from 3.3.0 * gRPC-gateway does not support authentication using TLS Common Name. This is because gRPC-proxy terminates TLS from its client so all the clients share a cert of the proxy: https://github.com/etcd-io/etcd/blob/master/Documentation/op-guide/authentication.md#using-tls-common-name	2020-07-31 14:33:40 +02:00
Alexander Kukushkin	bfbc4860d5	PoC: Patroni on pure RAFT (#375 ) * new node can join the cluster dynamically and become a part of consensus * it is also possible to join only Patroni cluster (without adding the node to the raft), just comment or remove `raft.self_addr` for that * when the node joins the cluster it is using values from `raft.partner_addrs` only for initial discovery. * It is possible to run Patroni and Postgres on two nodes plus one node with `patroni_raft_controller` (without Patroni and Postgres). In such setup one can temporarily lose one node without affecting the primary.	2020-07-29 15:34:44 +02:00
Alexander Kukushkin	a68692a3e4	Get rid of kubernetes python module (#1586 ) The official python kubernetes client contains a lot of auto-generated code and therefore very heavy, but we need only a little fraction of it. The naive implementation, that covers all API methods we use, takes about 250 LoC, and about half of it is responsible for the handling of configuration files. Disadvantage: If somebody was using the `patronictl` outside of the pod (on his machine), it might not work anymore (depending on the environment).	2020-07-17 08:31:58 +02:00
Alexander Kukushkin	902411239f	More compatibility with windows (#1367 ) * unix-domain sockets are not yet supported * signal.SIGQUIT doesn't exists	2020-01-24 12:52:55 +01:00
Igor Yanchenko	2174d66f97	Rewriten shell scripts in python to make them compatible with windows (#1326 )	2019-12-11 12:07:05 +01:00
Pavlo Golub	919e9c54d2	Make `dest` argument default value of `backup()` cross platform (#1324 ) Fixes #1325	2019-12-11 11:25:41 +01:00
Alexander Kukushkin	a5ff38a034	Improve behave tests (#1313 ) Hopefully, make them less flaky	2019-12-02 10:33:44 +01:00
Alexander Kukushkin	183adb7848	Housekeeping (#1284 ) * Implement proper tests for `multiprocessing.set_start_method()` * Exclude some watchdog code from coverage (it is used only for behave tests) * properly use os.path.join for windows compatibility * import DCS modules in `features/environment.py` on demand. It allows to run behave tests against chosen DCS without installing all dependencies. * remove some unused behave code * fix some minor issues in the dcs.kubernetes module	2019-11-21 13:27:55 +01:00

1 2

89 Commits