.. _readme:
============
Introduction
============
Patroni is a template for high availability (HA) PostgreSQL solutions using Python. Patroni originated as a fork of `Governor <https://github.com/compose/governor>`__, the project from Compose. It includes plenty of new features.
For additional background info, see:
* `PostgreSQL HA with Kubernetes and Patroni <https://www.youtube.com/watch?v=iruaCgeG7qs>`__, talk by Josh Berkus at KubeCon 2016 (video)
* `Feb. 2016 Zalando Tech blog post <https://tech.zalando.de/blog/zalandos-patroni-a-template-for-high-availability-postgresql/>`__
Development Status
------------------
Patroni is in active development and accepts contributions. See our :ref:`Contributing <contributing>` section below for more details.
We report information about new releases :ref:`here <releases>`.
Technical Requirements/Installation
-----------------------------------
Go :ref:`here <installation>` for guidance on installing and upgrading Patroni on various platforms.
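As a quick example, installing from PyPI with the extra for the DCS used in the examples below typically looks like this (other extras such as ``consul``, ``zookeeper``, or ``kubernetes`` exist as well):

::

    > pip install "patroni[etcd]"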
.. _running_configuring:
Planning the Number of PostgreSQL Nodes
---------------------------------------
Patroni/PostgreSQL nodes are decoupled from DCS nodes (except when Patroni implements RAFT on its own), so there is
no requirement on the minimum number of PostgreSQL nodes. Running a cluster consisting of one primary and one standby is
perfectly fine. You can add more standby nodes later.
Running and Configuring
-----------------------
The following section assumes that the Patroni repository has been cloned from https://github.com/patroni/patroni,
since you will need the example configuration files ``postgres0.yml`` and ``postgres1.yml``. If you installed Patroni
with pip, you can obtain those files from the git repository and replace ``./patroni.py`` below with the ``patroni`` command.
To get started, do the following from different terminals:
::

    > etcd --data-dir=data/etcd --enable-v2=true
    > ./patroni.py postgres0.yml
    > ./patroni.py postgres1.yml
You will then see a high-availability cluster start up. Test different settings in the YAML files to see how the cluster's behavior changes. Kill some of the components to see how the system behaves.
Add more ``postgres*.yml`` files to create an even larger cluster.
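Each additional member's YAML file mainly needs its own member name, ports, and data directory; the remaining keys mirror the shipped example files. The sketch below is illustrative only (the key names follow the example configuration, but the member name, ports, and credentials are placeholders you would adapt):

::

    scope: batman                    # same cluster name in every member's file
    name: postgresql2                # must be unique per member
    restapi:
      listen: 127.0.0.1:8010
      connect_address: 127.0.0.1:8010
    etcd:
      host: 127.0.0.1:2379
    postgresql:
      listen: 127.0.0.1:5434
      connect_address: 127.0.0.1:5434
      data_dir: data/postgresql2     # every member needs its own data directory
      authentication:
        superuser:
          username: postgres
          password: same-as-in-postgres0.yml   # credentials must match across members
        replication:
          username: replicator
          password: same-as-in-postgres0.yml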
Patroni provides an `HAProxy <http://www.haproxy.org/>`__ configuration, which will give your application a single endpoint for connecting to the cluster's leader. To configure,
run:
::

    > haproxy -f haproxy.cfg

Then connect to the cluster's leader through the HAProxy endpoint:

::

    > psql --host 127.0.0.1 --port 5000 postgres
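The shipped ``haproxy.cfg`` determines the leader by health-checking each member's Patroni REST API (ports 8008 and 8009 in the example configuration): the ``/primary`` endpoint answers with HTTP 200 only on the leader and with a 503 on replicas. You can verify this yourself, assuming the demo setup above:

::

    > curl -s -o /dev/null -w '%{http_code}\n' http://127.0.0.1:8008/primary
    200
    > curl -s -o /dev/null -w '%{http_code}\n' http://127.0.0.1:8009/primary
    503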
YAML Configuration
------------------
Go :ref:`here <yaml_configuration>` for comprehensive information about settings for etcd, consul, and ZooKeeper. And for an example, see `postgres0.yml <https://github.com/patroni/patroni/blob/master/postgres0.yml>`__.
Environment Configuration
-------------------------
Go :ref:`here <environment>` for comprehensive information about configuring (overriding) settings via environment variables.
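Most settings have a ``PATRONI_``-prefixed environment variable counterpart, and environment variables take precedence over values from the YAML file. As a rough sketch (the variable names follow the environment configuration reference; the values are illustrative), an extra member could be started with overrides instead of a new file:

::

    > export PATRONI_SCOPE=batman
    > export PATRONI_NAME=postgresql2
    > export PATRONI_RESTAPI_LISTEN=127.0.0.1:8010
    > export PATRONI_RESTAPI_CONNECT_ADDRESS=127.0.0.1:8010
    > export PATRONI_POSTGRESQL_LISTEN=127.0.0.1:5434
    > export PATRONI_POSTGRESQL_CONNECT_ADDRESS=127.0.0.1:5434
    > export PATRONI_POSTGRESQL_DATA_DIR=data/postgresql2
    > ./patroni.py postgres0.yml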
Replication Choices
-------------------
Patroni uses Postgres' streaming replication, which is asynchronous by default. Patroni's asynchronous replication configuration allows for the ``maximum_lag_on_failover`` setting. This setting ensures that a follower will not be promoted if it is more than a certain number of bytes behind the leader. This setting should be increased or decreased based on business requirements. It's also possible to use synchronous replication for better durability guarantees. See :ref:`replication modes documentation <replication_modes>` for details.
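These replication-related settings live in the cluster-wide (dynamic) configuration stored in the DCS. They are usually seeded under ``bootstrap.dcs`` in the YAML file and changed later with ``patronictl edit-config``. A minimal sketch, with values shown only for illustration:

::

    bootstrap:
      dcs:
        ttl: 30
        loop_wait: 10
        retry_timeout: 10
        maximum_lag_on_failover: 1048576  # bytes of lag beyond which a follower is not promoted
        synchronous_mode: false           # set to true to enable synchronous replication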
Applications Should Not Use Superusers
--------------------------------------
When connecting from an application, always use a non-superuser. Patroni requires access to the database to function properly. By using a superuser from an application, you can potentially exhaust the entire connection pool, including the connections reserved for superusers via the ``superuser_reserved_connections`` setting. If Patroni cannot connect to the primary because the connection pool is exhausted, it cannot manage the cluster reliably.
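For example, you might create a dedicated application role through the HAProxy endpoint from above and use it for all application connections (the role name and password here are placeholders):

::

    > psql --host 127.0.0.1 --port 5000 --username postgres \
           -c "CREATE ROLE app_user WITH LOGIN PASSWORD 'change-me'"
    > psql --host 127.0.0.1 --port 5000 --username app_user postgres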
Testing Your HA Solution
--------------------------------------
Testing an HA solution is a time-consuming process with many variables, particularly for a cross-platform application. You need a trained system administrator or a consultant to do this work. It is not something we can cover in depth in the documentation.
That said, here are some pieces of your infrastructure you should be sure to test:
* Network (the network in front of your system as well as the NICs [physical or virtual] themselves)
* Disk IO
* file limits (nofile in Linux)
* RAM. Even if you have the OOM killer turned off, the unavailability of RAM could cause issues.
* CPU
* Virtualization Contention (overcommitting the hypervisor)
* Any cgroup limitation (likely to be related to the above)
* ``kill -9`` of any postgres process (except postmaster!). This is a decent simulation of a segfault.
One thing that you should not do is run ``kill -9`` on a postmaster process. This is because doing so does not mimic any real life scenario. If you are concerned your infrastructure is insecure and an attacker could run ``kill -9``, no amount of HA process is going to fix that. The attacker will simply kill the process again, or cause chaos in another way.
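As a rough illustration of the ``kill -9`` test above (the process name and PID are examples; adapt them to your setup), pick any PostgreSQL process other than the postmaster and watch the member recover:

::

    > pgrep -af 'postgres: checkpointer'   # pick a background or client backend, never the postmaster
    12345 postgres: checkpointer
    > kill -9 12345
    > patronictl -c postgres0.yml list     # PostgreSQL performs crash recovery and the member comes back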