- add as known issue in release notes
- fix a broken link in features.md (not related to issue...)
- add to global key providers a warning about the keyring provider with WAL encryption
- add new subtopic in Backup WAL about key rotations during backups for
file-based key providers
Based on the PG-1895 description.
Before this commit, we XLogged the provider ID (keyringId) of the old
key, yet during redo we then attempted to fetch the new key from the
old provider, which obviously fails and crashes the recovery.
So the following steps lead to a recovery stalemate:
- Create a new provider (with a new destination: mount_path, url, etc.).
- Create a new server/global key.
- Rotate the key.
- <Crash!>
This commit fixes it by XLogging the new key's provider ID.
For: PG-1895
There is no reason to do durable_unlink before durable_rename: rename
can handle an existing file. But with this sequence, the cluster may
end up in an unrecoverable state should the server crash in between
these two ops, as there would be no "_keys" file at all.
The current sequence may also cause an issue with backups:
<durable_unlink>, <pg_basebackup gets a file list>, <durable_rename>.
As a result, there is no "_keys" file in the backup.
- add new topic called # Backing up with WAL encryption enabled
- add two subtopics for other WAL methods and restoring a backup created with WAL encryption
- reword to short form option flags
- remove (tech preview)
- remove mentions of WAL being BETA and warning notes
- add WAL tool support to limitations, improve flow, add button to setup
- add limitation regarding WAL shipping standby not supported with WAL encryption
- add mention of Open Source and Enterprise editions being supported for pg_tde
- add none method to basebackup and link to topic
- add Example Patroni configuration for Patroni tool
- improve supported vs unsupported tools section in Limitations
Continued from #523
- add pg_tde archive and restore commands
- update cli-tools.md with paragraphs explaining New and extended tools
- update pg-tde-restore-encrypt tool with new information and better descriptions for clarity
- update the Features topic button for better clarity
We used to assume that the only errors which could happen were ones
which set errno, but that is not the case. We also want to give nice
output on non-zero return values and when the process was killed by a
signal.
Before, pg_basebackup would encrypt streamed WAL according to the keys
in pg_tde/wal_keys in the destination dir.
This commit introduces a number of changes:
pg_basebackup encrypts WAL only if the "-E --encrypt-wal" flag is
provided. In that case, it extracts the principal key, truncates
pg_tde/wal_keys and encrypts WAL with a newly generated WAL key. We
still expect pg_tde/wal_keys and pg_tde/1664_providers in the
destination dir. If these files are not provided but "-E" is
specified, it fails with an error.
We also throw a warning if pg_basebackup runs w/o -E but there is a
wal_keys file on the source, as WAL might be compromised and the
backup broken.
For PG-1603, PG-1857
There was a race condition in the WAL archiving tests: if the
end-of-recovery checkpoint had completed, the tests for the WAL contents
were nonsensical and racy. Solve this by explicitly promoting the
server only after we have looked at the WAL contents, while still making
sure to wait until all WAL has been replayed.
Additionally improve the tests by actually making sure the replica
starts in a good state where all WAL is encrypted and testing both
the plaintext and the encrypted scenarios.
Unfortunately the logic for generating a new key to protect the stream
cipher used to encrypt the WAL stream in our restore command was based
on totally incorrect assumptions due to how the recovery is implemented.
Recovery is a state machine which can go back and forward between one
mode where it streams from a primary and another where it first tries to
fetch WAL from the archive and if that fails from the pg_wal directory,
and in the pg_wal directory we may have files which are encrypted with
whatever keys were there originally.
To handle all the possible scenarios we remove the ability of
pg_tde_restore_encrypt to generate new keys and just have it use
whatever keys there are in the key file. This unfortunately means we open
ourselves to some attacks on the stream cipher if the system is tricked
into encrypting a different WAL stream at the same TLI and LSN as we
already have encrypted. As far as I know this should be rare under
normal operations since normally e.g. the WAL should be the same in the
archive as the one in pg_wal or which we receive through streaming.
Ideally we would want to fix this but for now it is better to have WAL
encryption with this weakness than to not have it at all.
This also incidentally fixes a bug we discovered where generating a
new key invalidated only one key rather than all the keys which should
have become invalid; since we no longer generate a new key, the bug can
no longer occur.
It seems like there are cases when the postmaster has "restarted"
after a backend crash where the WAL cache inherited from the postmaster
is wrong.
I'm not at all sure exactly how and why this happens, but this patch
fixes a bug with this and allows recovery/013_crash_restart to pass with
WAL encryption enabled.
According to the documentation, each backend is supposed to hold
AddinShmemInitLock when calling ShmemInitStruct. We only did that for
half of our calls before this patch.
There is at least one corner case scenario where we have to load the
last record into the cache during a write:
* replica crashes, receives last segment from primary
* replica replays last segment, reaches end
* replica activates new key
* replica replays prepared transaction, has to use old keys again
* old key write function sees that we generated a new key, tries to load
it
In this scenario we could get away with detecting that we are in a
write and asserting if we tried to use the last key.
But in a release build assertions do not fire, and we would end up
writing some unencrypted data to disk and later failing if we have to
run recovery.
It could be a FATAL, but that would still crash the server, and the next
startup would crash again and again...
Instead, to properly avoid this situation we preallocate memory for one
more key in the cache during initialization. Since we can only add one
extra key to the cache during the server's run, this means we no longer
try to allocate in the critical section in any corner case.
While this is not the nicest solution, it is simple and keeps the
current cache and decrypt/encrypt logic the same as before. Any other
solution would be more complex, and even more of a hack, as it would
require dealing with a possibly out of date cache.
To not break recovery when we replay encrypted WAL while WAL encryption
is disabled, the simplest way is to treat disabled WAL encryption just
like enabled WAL encryption. The issue is not big in practice, since it
should only hit users who disable WAL encryption and then crash the
database, but treating both cases the same way keeps the code simple to
understand.
Previously we simply set the LSN for the new key to the first write
location.
This is however not correct, as there are many corner cases around this:
* recovery / replication might write old LSNs
* we can't handle multiple keys with the same TLI/LSN, which can happen
with quick restarts without writes
To support this in this commit we modify the following:
* We only activate new keys outside crash recovery, or immediately if
encryption is turned off
* We also take the already existing last key into account (if it
exists), and only activate a new key if we progressed past its start
location
The remaining changes are just support infrastructure for this:
* Since we might rewrite old records, we use the already existing keys
for those writes, not the active last keys
* We prefetch existing keys during initialization, so it doesn't
accidentally happen in the critical section during a write
There is a remaining bug with stopping WAL encryption, also mentioned in
a TODO message in the code. This will be addressed in a later PR as this
fix already took too long.
The min/max comparisons of LSNs assumed that everything is in the same
timeline. In practice, with replication + recovery combinations, it is
possible that keys span at least 3 timelines, which means the timeline
has to be included in both comparisons, as in other timelines the
restrictions are less strict.
Use a single argument for the wrapped command in the archiving
wrappers.
Instead of giving all of the arguments of the command separately and
trying to figure out which one should be replaced by the path to the
unencrypted WAL segment, we take a single argument and do % parameter
replacement similar to what postgres does with archive_command and
restore_command.
This also means that we can simplify by using system() instead of exec().
We also clean up usage instructions and make the two wrappers more
symmetrical by requiring the same parameters.
Co-authored-by: Andreas Karlsson <andreas.karlsson@percona.com>
Instead of first deleting any leftover key and then writing the new key
we do a single pass through the file where we replace any old key that
we find. To make this happen on redo too we need to stop generating a
separate WAL record for the key deletion for encrypted tables and only
generate that record for unencrypted tables where we still need a key
deletion record.
We expect this optimization to primarily be visible on WAL replay where
only a single backend is used to replay everything, but it also speeds
up table creation in general on workloads with many tables.
We forgot to have a check against trying to delete leftover SMGR keys
for temporary tables, which is a useless operation since they are stored
in memory.
Additionally we forgot to prevent WAL from being written when creating
or removing a key in smgrcreate() for temporary tables.
- Fix whitespace
- Make sure to use the right languages
- Do not wrap short SQL queries unnecessarily
- Add missing end of code block
- Add missing semicolon to SQL query
Instead of having the WAL key code include the headers for the SMGR keys
we move the shared code into a separate header file. Additionally we
clean up some minor header issues.
Just logging that the function was called at DEBUG2 is not very helpful
to anyone and is presumably just a leftover from someone's attempt at
debugging a particular issue they had at some point.
Breaking these particular snippets out as separate functions did not
improve readability and was only done because they used to be called
from multiple locations.
This change has already been done in the WAL key code.
When WAL is streamed during the backup (default mode), it comes in
unencrypted. But we need keys to encrypt it. For now, we expect that
the user would put `pg_tde` dir containing the `1664_key` and
`1664_providers` into the destination directory before starting the
backup. We encrypt the streamed WAL according to internal keys. No
`pg_tde` dir means no streamed WAL encryption.
Also rename enum variants for consistency, plus renumber the types for
the WAL keys, which is fine since this file is newly introduced and
breaking backwards compatibility is therefore not an issue.
Before this commit, WAL keys didn't take the TLI into account at all.
But after pg_rewind, for example, pg_wal/ may contain segments from two
timelines, and the WAL reader choosing the key may pick the wrong one
because LSNs of different TLIs may overlap. There was also another bug:
there is a key with the start LSN 0/30000 in TLI 1, and after the start
in TLI 2, the WAL writer creates a new key with the LSN 0/30000, but in
TLI 2. The reader wouldn't fetch the latest key because w/o TLI, these
are the same.
This commit adds TLI to the internal keys and makes use of it along
with LSN for key comparisons.