patroni

mirror of https://github.com/outbackdingo/patroni.git synced 2026-01-27 18:20:05 +00:00

Author	SHA1	Message	Date
Alexander Kukushkin	f8f928420d	Release 1.5.2 (#875 ) * Update release notes * Bump version v1.5.2	2018-11-26 10:31:14 +01:00
Shea Stewart	6519a192b1	add openshift customizations, templates, and test (#871 ) - It modifies the Dockerfile and entrypoint slightly to allow for OpenShift SCCs to operate correctly - It adds 2 template examples that can be easily modified by changing parameters Fixes #572	2018-11-21 18:01:39 +01:00
Alexander Kukushkin	8c7e1892ee	touch_member method should not raise exceptions (#859 ) Fixes https://github.com/zalando/patroni/issues/853	2018-11-21 12:51:52 +01:00
Lardière Sébastien	a1ba2cdca7	Add confd template for pgbouncer (#844 )	2018-11-21 12:03:22 +01:00
Kostiantyn Nemchenko	ce9c7bdadc	Describe Consul registration parameters (#870 ) Mention Consul related parameters introduced by #802	2018-11-21 12:00:40 +01:00
Alexander Kukushkin	fb01aaebc5	Compatibility with kazoo-2.6.0 (#872 ) Recently 2.6.0 was release which changes the way how create_connection method is called. Before it was passing two arguments, and in the new version all argument names are specified explicitly.	2018-11-19 14:26:20 +01:00
alago197	a13cc8b847	Update SETTINGS.rst with connect_address clarifications (#858 )	2018-11-12 16:56:51 +01:00
lwsbox	bd9f30372e	fix bug: change pip to pip3 (#851 ) should use python3 to install in postgres10 image.	2018-11-05 14:17:22 +01:00
Josh Berkus	d247b5ae17	Fix several build problems with the Docker image for testing Kubernet… (#735 ) 1. Update to Postgres 10 2. Install most of the python modules from deb packages 3. Do not upgrade pip	2018-11-04 08:31:11 +01:00
Alexander Kukushkin	b4f35ecca0	Release 1.5.1 (#846 ) * Bump version * Update release notes v1.5.1	2018-11-01 16:18:41 +01:00
Alexander Kukushkin	0f666e69f3	Prefix system tables, views and functions with pg_catalog (#845 ) and implement missing unit tests	2018-11-01 16:17:40 +01:00
Alexander Kukushkin	2efd97baab	Permanent replication slots (#819 ) Permanent replication slots are preserved on failover/switchover, that is Patroni on the new primary will create configured replication slots right after doing promote. Slots could be configured with the help of `patronictl edit-config`. The initial configuration could be also done in the `bootstrap.dcs` ```yaml slots: permanent_physical_1: type: physical permanent_logical_1: type: logical database: foo plugin: pgoutput ``` It is the responsibility of the operator to make sure that there are no clashes in names between replication slots automatically created by Patroni for members and permanent replication slots. Closes https://github.com/zalando/patroni/issues/656	2018-10-31 11:37:42 +01:00
Alexander Kukushkin	c65c4f1ffe	Round-robing across all masters in pause mode if DCS is not accessible (#842 ) Regression was introduced in `90cf930036` Fixes https://github.com/zalando/patroni/issues/841	2018-10-30 16:48:53 +01:00
Srikanth	2264190e79	Add pre-command to start watchdog (#835 ) Add `ExecStartPre` commands to systemd unit to help start watchdog devices.	2018-10-26 07:55:01 +01:00
Alexander Kukushkin	f70edefc65	A few bugfixes in the "standby cluster" workflow (#823 ) * Always run `pg_rewind` against the remote master * Always use the remote master as the source when "recovering" stopped standby leader * Use remote master as the source when "recovering" the node in the unhealthy cluster * Use the local dbname as the fallback when doing `pg_rewind` from the remote master * `no_replication_slot` is the allowed key in the `RemoteMember` object * Make it possible to "bootstrap" the new `standby_cluster` with existing (and valid) data directory. There is one prerequisite though, there should be no `patroni.dynamic.json` file in it!	2018-10-09 13:30:48 +02:00
Yogesh Sharma	6567f509b1	Add pgbackrest support (#1 ) (#822 ) * pgbackrest support pgBackrest can restore in existing $PGDATA folder, this allows speedy restore as files which have not changed since last backup are skipped, to support this feature new param keep_data has been introduced. When keep_data=True, cleanup of $PGDATA will be skipped. Patroni passes some extra parameters to custom_replica_methods when calling, this causes an error due to pgbackrest strict parameter checking. New param no_params=True can be set to skip parameters passing. Fixes https://github.com/zalando/patroni/issues/625	2018-10-08 19:00:30 +02:00
Alexander Kukushkin	534829d617	Release 1.5.0 (#809 ) Update release notes and bump version v1.5.0	2018-09-20 16:29:00 +02:00
Alexander Kukushkin	76d1b4cfd8	Minor fixes (#808 ) * Use `shutil.move` instead of `os.replace`, which is available only from 3.3 * Introduce standby-leader health-check and consul service * Improve unit tests, some lines were not covered * rename `assertEquals` -> `assertEqual`, due to deprecation warning	2018-09-19 16:32:33 +02:00
Pavel Golub	3d76a013a7	Support for Windows (#799 ) Postgres on Windows is using different signals, backslashes as file separators and some of the functions and syscalls are not available there.	2018-09-19 13:50:36 +02:00
Pavel Kirillov	2e9cb412e4	Register service in consul (#802 ) Кegister service 'scope_name' with tag 'master' or 'replica' example with scope 'pgsql-pgpi' ```[root@pgpi1 ~]# host -t SRV pgsql-pgpi.service.consul. 127.0.0.1 Using domain server: Name: 127.0.0.1 Address: 127.0.0.1#53 Aliases: pgsql-pgpi.service.consul has SRV record 1 1 5432 pgpi1.node.dc.consul. pgsql-pgpi.service.consul has SRV record 1 1 5432 pgpi2.node.dc.consul. [root@pgpi1 ~]# host -t SRV master.pgsql-pgpi.service.consul. 127.0.0.1 Using domain server: Name: 127.0.0.1 Address: 127.0.0.1#53 Aliases: master.pgsql-pgpi.service.consul has SRV record 1 1 5432 pgpi2.node.dc.consul. [root@pgpi1 ~]# host -t SRV replica.pgsql-pgpi.service.consul. 127.0.0.1 Using domain server: Name: 127.0.0.1 Address: 127.0.0.1#53 Aliases: replica.pgsql-pgpi.service.consul has SRV record 1 1 5432 pgpi1.node.dc.consul.``` Fixes: https://github.com/zalando/patroni/issues/771	2018-09-07 15:17:56 +02:00
Dmitry Dolgov	dd7c3c349f	[WIP] Standby cluster implementation (#679 ) Implementation of "standby cluster" described in #657. Standby cluster consists of a "standby leader", that replicates from a "remote master" (which is not a part of current patroni cluster and can be anywhere), and cascade replicas, that replicate from the corresponding standby leader. "Standby leader" behaves pretty much like a regular leader, which means that it holds a leader lock in DSC, in case if disappears there will be an election of a new "standby leader". One can define such a cluster using the section "standby_cluster" in patroni config file. This section provides parameters for standby cluster, that will be applied only once during bootstrap and can be changed only through DSC.	2018-09-07 10:10:56 +02:00
Alexander Kukushkin	4ca8a6e506	Make retries of calls to DCS consistent across implementations (#805 ) in addition to that do a small refactoring of zookeeper and consul and try to improve the stability of AT	2018-09-06 08:37:26 +02:00
wilfriedroset	0136f252ab	Add patronictl -k/--insecure flag and suport for restapi cert (#790 ) Fixes https://github.com/zalando/patroni/issues/785	2018-08-29 16:08:13 +02:00
anikin-aa	0e13677880	exclude members with nofailover tag (#798 ) Exclude members with nofailover tag from `patronictl switchover/failover` output. Fixes https://github.com/zalando/patroni/issues/769	2018-08-29 16:07:01 +02:00
anikin-aa	68c0d87d42	check if output lines of controldata are possible to split (#797 ) Otherwise it fails with scary stacktrace	2018-08-29 16:06:06 +02:00
Alexander Kukushkin	90cf930036	Refactor REST API health-checks (#779 ) Make it more readable and easy to understand. Mostly it is needed to implement https://github.com/zalando/patroni/issues/772	2018-08-29 11:35:22 +02:00
Jan Mussler	2b87ae0cd0	Add member name to error message. (#792 ) Analog to success message.	2018-08-29 11:30:51 +02:00
Alexander Kukushkin	5ca5dacaa9	Immediately reserve LSN on upon creation of replication slot (#783 ) This feature is available starting from 9.6	2018-08-29 11:30:01 +02:00
Alexander Kukushkin	87e9aab04c	Improve tests (#778 ) * Implement missing unit-tests * Add acceptance tests for ISSUE #776 * Update list of classifiers, keywords and authors	2018-08-29 11:29:37 +02:00
Alexander Kukushkin	518df2bc49	Search new sync candidate amoung potential and async standbys (#794 ) In synchronos_mode_strict we put '*' into synchronos_standby_names, what makes one connection 'sync' and other connections 'potential'. The code picking up the correct sync standby didn't consider 'potential' as a good candidate. Fixes: https://github.com/zalando/patroni/issues/789	2018-08-29 11:28:46 +02:00
Alexander Kukushkin	a513a7bb68	Improve stability of acceptance tests (#780 ) last time tests were failing due to postgres/patroni slowness in picking sync standby	2018-08-29 11:13:18 +02:00
Alexander Kukushkin	715caaddf3	Remove .zappr.yaml (#795 ) and switch to github approvals	2018-08-29 11:09:35 +02:00
Oleksii Kliukin	b165183503	Reset is_leader status on demote (#777 ) Make sure demoted cluster member stops responding with code 200 on the /master API call. Issue a new minor release. Fixes https://github.com/zalando/patroni/issues/776 v1.4.6	2018-08-14 17:08:08 +02:00
Dmitry Dolgov	b282a0f254	Add "cluster_unlocked" field (#764 ) Add a field to an api to figure out if a master is there from patroni point of view. It can be useful, when you have an alert, based on Auto Scaling Groups, and then ASG decided to shutdown the current master, spin up a new instance but the current master shutdown is stuck. In this situation the current master is no longer a part of ASG, but patroni and Postgres are still alive on the instance, which means a new replica will not be promoted yet - this will lead to a false alert, saying that your cluster doesn't have any master node.	2018-08-13 14:02:01 +02:00
Oleksii Kliukin	5e7345a2ca	Release notes 1.4.5 (#762 ) bump version update release notes v1.4.5	2018-08-03 17:02:11 +02:00
Don Seiler	502094ee79	Log config change or not (#731 ) This adds INFO log messages that clearly state if configuration values were seen as changed by Patroni after SIGHUP/reload and warrant reloading (or if nothing was changed an no reloading is necessary). This ended up being a lot simpler than I had imagined once I found postgresql.py:reload_config(). I add a log line in config.py:reload_local_configuration() since that function will short-circuit the process early if the local config wasn't changed. But the final determination of whether or not values have changed and need reloading is in postgresql.py:reload_config().	2018-08-03 17:00:57 +02:00
Alexander Kukushkin	0c1ae6fbeb	Respond 200 to master health-check only if update_lock was successful (#713 ) If Patroni gets partitioned it starts receiving stale information from DCS. We can't use this information to determine that we have the leader key. Instead, we will record in Ha object the actual state of acquire/update lock and report as a leader only if it was successful. P.S. despite responding with 200 on `GET /master` postgres was still running read-only.	2018-08-03 17:00:01 +02:00
Alexander Kukushkin	2fd2556050	Set role to demoted if postgres isn't running and no recovery.conf (#757 ) In really rare cases it was causing following behavior: ``` 2018-07-31 10:35:30,302 INFO: starting as a secondary 2018-07-31 10:35:30,309 INFO: Lock owner: postgresql0; I am postgresql1 2018-07-31 10:35:30,310 INFO: Demoting master during restarting after failure 2018-07-31 10:35:30,381 INFO: postmaster pid=17709 2018-07-31 10:35:30,386 INFO: lost leader lock during restarting after failure 2018-07-31 10:35:30,388 ERROR: Exception during CHECKPOINT ```	2018-08-03 16:59:04 +02:00
Oleksii Kliukin	d47049ce0e	Fix condition for the replica start due to pg_rewind in paused state. (#754 ) Avoid starting the replica that had already executed pg_rewind before. Fixes in #753	2018-08-03 16:45:33 +02:00
Henning Jacobs	2f7c53031c	Python 3.6 and 3.7 are now supported, too (#752 )	2018-07-24 10:51:25 +02:00
Christoph Berg	a2c6ed5504	async is a keyword in python3.7 (#751 ) * async is a keyword in python3.7 Setting up patroni (1.4.4-1) ... File "/usr/lib/python3/dist-packages/patroni/ha.py", line 610 'offline': dict(stop='fast', checkpoint=False, release=False, offline=True, async=False), ^ SyntaxError: invalid syntax Fix #750 by replacing dict member "async" with "async_req". * requirements.txt: Update for new kubernetes version compatible with 3.7	2018-07-23 20:42:33 +02:00
Oleksii Kliukin	00c2e1c2d0	Grant delete on endpoints and configmaps in RBAC. (#749 ) 'patronictl remove' deletes the cluster configuration (stored either in configmaps or endpoints) and cannot be run from the postgres pod w/o 'delete' on those objects being granted to the pod service account.	2018-07-23 20:39:46 +03:00
Don Seiler	f5927bad70	Add EnvironmentFile directive (#746 ) Add an EnvironmentFile directive to read in a configuration file with environment variables. The "-" prefix means it can proceed if the file doesn't exist. This would allow users to keep sensitive information like the SUPERUSER/REPLICATION passwords in the config file separate from a YAML file that might be deployed from source control.	2018-07-23 20:31:47 +03:00
Alexander Kukushkin	26466237b9	Update docker-compose example to postgres 10 (#737 ) Some other changes are related to the new version of confd, which now requires specifying etcd url instead of etcd host.	2018-07-23 16:41:17 +02:00
Tony Sorrentino	c8f9199988	Added setting state to "stopped" when a member is stopped in Ha.shutdown (#733 ) Changes by @tonys66, review by @CyberDem0n	2018-07-23 14:59:39 +02:00
Alexander Kukushkin	2356af679b	Convert query params from list to dict (#744 ) Patroni is relying on params to determinte timeout and amount of retries when executing api requests to consul. Starting from v1.1.0 python-consul changed internal API and started using `list` instead of `dict` to pass query parameters. Such change broke "watch" functionality. Fixes https://github.com/zalando/patroni/issues/742 and https://github.com/zalando/patroni/issues/734	2018-07-23 14:56:51 +02:00
Ants Aasma	3b633abd91	Improve logging when stale postmaster.pid matches running process (#738 ) Currently the informational message logged is beyond confusing. This improves the logging so there is some indication what this message is about and that it is somewhat normal. Changes by @ants	2018-07-17 16:46:22 +02:00
alago197	936a4238fb	Update some descriptions for the REST API endpoints (#729 ) * Update some descriptions for the REST API endpoints By @alago197	2018-07-10 15:40:53 +02:00
Don Seiler	50a8114d0b	Use enforced minimums in postgresX.yml files (#730 ) Fix the discrepancy for the values of max_wal_senders and max_replication_slots between the sample postgres.yml files and hard-coded defaults in Patroni, bumping the former to 10. Contributed by @dtseiler	2018-07-04 10:08:54 +02:00
Don Seiler	4e8709b266	Adding reload functionality (#726 ) This allows the config to be reloaded via `systemctl reload patroni`, sending SIGHUP to the patroni process. Tested on CentOS.	2018-06-30 23:16:42 +02:00

1 2 3 4 5 ...

1575 Commits