Patroni was falsely assuming that the timelines had diverged.
For pg_rewind this didn't cause any problem, but if pg_rewind is not allowed and `remove_data_directory_on_diverged_timelines` is set, it resulted in reinitializing the former leader.
Close https://github.com/zalando/patroni/issues/2220
When `restore_command` is configured, Postgres tries to fetch and apply all available WAL segments, and also fetches history files in order to select the correct timeline. This can result in a situation where a new history file is missing some timelines.
Example:
- node1 demotes/crashes on timeline 1
- node2 promotes to timeline 2, archives `00000002.history`, and crashes
- node1 recovers as a replica, "replays" `00000002.history` and promotes to timeline 3
As a result, `00000003.history` will not contain the line for timeline 2, because node1 never replayed any WAL segment from it.
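For illustration, the history files in this scenario could look roughly like this (the LSNs are made up; each line records the timeline that was switched from and the switchpoint):

```
00000002.history (archived by node2):
1    0/5000060    no recovery target specified

00000003.history (created by node1; note the missing line for timeline 2):
1    0/6000028    no recovery target specified
```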
The `pg_rewind` tool is supposed to handle such a case correctly when rewinding node2 from node1, but Patroni, when deciding whether a rewind should happen, was searching for the exact timeline in the history file from the new primary.
The solution is to assume that a rewind is required if the current replica timeline is missing from the primary's history file.
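A minimal sketch of the adjusted decision, assuming the primary's history file has already been parsed into `(timeline, switchpoint, reason)` tuples (names are illustrative, not Patroni's actual code):

```python
def rewind_needed(primary_history, replica_timeline, replayed_lsn):
    for timeline, switchpoint, _reason in primary_history:
        if timeline == replica_timeline:
            # The replica diverged only if it replayed past the switchpoint.
            return replayed_lsn > switchpoint
    # The replica timeline is missing from the primary's history file:
    # we can't prove the timelines didn't diverge, so assume they did.
    return True
```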
In addition, this PR makes sure that the primary isn't running in recovery before starting the rewind check.
Close https://github.com/zalando/patroni/issues/2118 and https://github.com/zalando/patroni/issues/2124
1. Avoid doing a CHECKPOINT if `pg_control` is already updated (see the sketch below).
2. Explicitly call `ensure_checkpoint_after_promote()` right after the bootstrap finishes successfully.
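A rough sketch of the `pg_control` check from point 1, based on parsing `pg_controldata` output (an illustration, not Patroni's actual code):

```python
import subprocess

def pg_control_timeline(data_dir):
    # Parse "Latest checkpoint's TimeLineID" from pg_controldata output.
    out = subprocess.check_output(['pg_controldata', data_dir]).decode()
    for line in out.splitlines():
        if line.startswith("Latest checkpoint's TimeLineID:"):
            return int(line.split(':')[1])

def checkpoint_needed(data_dir, current_timeline):
    # Skip the CHECKPOINT when pg_control already points at the current
    # timeline; if parsing failed (None), checkpoint to be safe.
    return pg_control_timeline(data_dir) != current_timeline
```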
Effectively, this PR consists of a few changes:
1. The easy part:
If permanent logical slots are defined in the global configuration, Patroni on the primary will not only create them, but also periodically update DCS with the current value of `confirmed_flush_lsn` for each of these slots.
In order to reduce the number of interactions with DCS, a new `/status` key was introduced. It contains a JSON object with `optime` and `slots` keys. For backward compatibility, `/optime/leader` will still be updated if there are members running an old Patroni version in the cluster.
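The value of the new key could look roughly like this (the slot name and the LSN values are illustrative; LSNs are stored as integers):

```
{"optime": 67358097816, "slots": {"my_slot": 67358096192}}
```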
2. The tricky part:
On replicas that are eligible for a failover, Patroni creates the logical replication slot by copying the slot file from the primary and restarting the replica. In order to copy the slot file, Patroni opens a connection to the primary with `rewind` or `superuser` credentials and calls the `pg_read_binary_file()` function.
When the logical slot already exists on the replica, Patroni periodically calls the `pg_replication_slot_advance()` function, which moves the slot forward.
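Both operations could be sketched as follows, assuming DB-API (e.g. psycopg2) connections; error handling and the replica restart are omitted, and this is an illustration rather than Patroni's actual code:

```python
import os

def copy_slot_file(primary_conn, data_dir, slot_name):
    # Read the slot state file from the primary (requires superuser or
    # equivalent privileges) and write it into the local pg_replslot.
    with primary_conn.cursor() as cur:
        cur.execute("SELECT pg_read_binary_file(%s)",
                    ('pg_replslot/' + slot_name + '/state',))
        state = cur.fetchone()[0]
    slot_dir = os.path.join(data_dir, 'pg_replslot', slot_name)
    os.makedirs(slot_dir, exist_ok=True)
    with open(os.path.join(slot_dir, 'state'), 'wb') as f:
        f.write(state)
    # Postgres picks the new slot up only after the replica is restarted.

def advance_slot(replica_conn, slot_name, confirmed_flush_lsn):
    # Periodically move the existing slot forward (PostgreSQL 11+).
    with replica_conn.cursor() as cur:
        cur.execute("SELECT pg_replication_slot_advance(%s, %s)",
                    (slot_name, confirmed_flush_lsn))
```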
3. Additional requirements:
In order to ensure that the primary doesn't clean up tuples from `pg_catalog` that are required for logical decoding, Patroni enables `hot_standby_feedback` on replicas with logical slots, and on cascading replicas if they are used for streaming by replicas with logical slots.
4. When logical slots are copied from the primary to the replica, there is a time window during which it may not be safe to use them after promotion. Right now there is no protection against promoting such a replica, but Patroni will show a warning with the names of the slots that might not be safe to use.
Compatibility.
The `pg_replication_slot_advance()` function is only available starting from PostgreSQL 11. For older Postgres versions, Patroni will refuse to create logical slots on the primary.
The old "permanent slots" feature, which creates logical slots right after promotion and before allowing connections, was removed.
Close https://github.com/zalando/patroni/issues/1749
1. If the superuser name is different from `postgres`, pg_rewind in a standby cluster was failing because the connection string didn't contain the database name.
2. Provide output if single-user mode recovery fails.
Close https://github.com/zalando/patroni/pull/1736
It could happen that a WAL segment required by `pg_rewind` no longer exists in `pg_wal`, and therefore `pg_rewind` can't find the checkpoint location preceding the point of divergence.
Starting from PostgreSQL 13, `pg_rewind` can use the `restore_command` for fetching missing WALs, but we can do better than that.
On older PostgreSQL versions, Patroni will parse the stdout and stderr of the failed rewind attempt, try to fetch the missing WAL segment by calling the `restore_command`, and retry the rewind.
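A sketch of the retry, assuming `restore_command` uses the standard `%f`/`%p` placeholders; the exact error message parsed here and the function names are illustrative, not Patroni's actual code:

```python
import os
import re
import subprocess

def fetch_missing_wal(rewind_output, restore_command, wal_dir):
    # Look for the name of a missing WAL segment (24 hex characters)
    # in the output of the failed pg_rewind attempt.
    match = re.search(r'could not open file ".*?([0-9A-F]{24})"',
                      rewind_output)
    if not match:
        return False
    filename = match.group(1)
    cmd = (restore_command.replace('%f', filename)
                          .replace('%p', os.path.join(wal_dir, filename)))
    return subprocess.call(cmd, shell=True) == 0

# If pg_rewind failed and fetch_missing_wal() returned True,
# we simply run pg_rewind again.
```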
We don't need to rewind when:
1. the replayed location of the former replica is not ahead of the switchpoint
2. the end of the checkpoint record of the former primary is the same as the switchpoint
In order to get the end of the checkpoint record we run `pg_waldump` and parse its output.
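A rough sketch of that computation (the output format matched here is simplified, and WAL-page boundary corner cases are ignored):

```python
import re
import subprocess

def parse_lsn(lsn):
    hi, lo = lsn.split('/')
    return (int(hi, 16) << 32) + int(lo, 16)

def checkpoint_end(wal_dir, timeline, checkpoint_lsn):
    # Dump just the checkpoint record and compute where it ends:
    # lsn + total record length, rounded up to 8 bytes (MAXALIGN).
    out = subprocess.check_output(
        ['pg_waldump', '-p', wal_dir, '-t', str(timeline),
         '-s', checkpoint_lsn, '-n', '1'],
        stderr=subprocess.STDOUT).decode()
    match = re.search(r'len \(rec/tot\):\s*\d+/\s*(\d+).*'
                      r'lsn: ([0-9A-F]+/[0-9A-F]+).*CHECKPOINT', out)
    if match:
        total = (int(match.group(1)) + 7) & ~7
        return parse_lsn(match.group(2)) + total
```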
Close https://github.com/zalando/patroni/issues/1493
When Patroni is trying to figure out whether pg_rewind is necessary, it could write the content of the history file from the primary into the log. The history file grows with every failover/switchover and eventually starts taking up too many lines in the log, most of which are not very useful.
Instead of showing the raw data, we will show only 3 lines before the current replica timeline and 2 lines after.
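A minimal sketch of the trimming, assuming `lines` holds the history file content split into lines, each starting with a timeline number:

```python
def trim_history(lines, replica_timeline):
    # Find the line describing the switch from the current replica timeline.
    i = next((n for n, line in enumerate(lines)
              if int(line.split()[0]) == replica_timeline), len(lines))
    # 3 lines before the current replica timeline, the line itself, 2 after.
    return lines[max(0, i - 3):i + 3]
```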
Replicas are waiting for the checkpoint indication via the leader's member key in DCS. The key is normally updated only once per HA loop.
Without waking the main thread up, replicas would have to wait up to `loop_wait` seconds longer than necessary.
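A minimal sketch of the wake-up mechanism: the HA loop sleeps on an event instead of a plain `sleep()`, so another thread can cut the wait short (`run_cycle` is a placeholder for one HA iteration):

```python
import threading

wake_event = threading.Event()

def ha_loop(run_cycle, loop_wait):
    while True:
        run_cycle()
        # Returns early if wakeup() is called, otherwise after loop_wait.
        wake_event.wait(timeout=loop_wait)
        wake_event.clear()

def wakeup():
    # Called when the leader member key changes, e.g. after a CHECKPOINT.
    wake_event.set()
```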
It is safe to call pg_rewind on a replica only when pg_control on the primary contains information about the latest timeline. Postgres usually performs an immediate checkpoint right after promotion, and in most cases this works just fine. Unfortunately, we regularly receive complaints that it takes too long (minutes) until the checkpoint is done, during which replicas can't perform a rewind, while issuing the checkpoint manually helped immediately. So Patroni now does the same: when the promotion has happened and Postgres is not running in recovery, we explicitly issue a CHECKPOINT.
We are intentionally not using the AsyncExecutor here, because we want the HA loop to continue its normal flow.
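The idea could be sketched like this, assuming a `query()` helper that executes SQL on the local node and returns rows (an illustration, not Patroni's actual code):

```python
import threading

def ensure_checkpoint_after_promote(query):
    # Only checkpoint once the promotion went through and we are a primary.
    if not query('SELECT pg_is_in_recovery()')[0][0]:
        # A plain thread instead of the AsyncExecutor, so the HA loop
        # keeps running its normal flow while the CHECKPOINT executes.
        threading.Thread(target=query, args=('CHECKPOINT',),
                         daemon=True).start()
```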
* Convert postgresql.py into a package
* Factor out cancellable process into a separate class
* Factor out connection handler into a separate class
* Move postmaster into postgresql package
* Factor out pg_rewind into a separate class
* Factor out bootstrap into a separate class
* Factor out slots handler into a separate class
* Factor out postgresql config handler into a separate class
* Move callback_executor into postgresql package
This is just a careful refactoring, without functional changes.