Postgres 16 Continuous archiving and PITR "invalid checkpoint" error

Question 1

Here is what I am doing:

I have a Postgres 16 docker container which is continuously archiving WAL files
After 00012 and 00013 were archived, I take a base-backup of the database using pg_basebackup which generates a 00014..backup file (these are example files and actual WALs are longer I know)
Now I copy over the base backup and the archived WALs into another Postgres 16 docker container which is newly created from same docker image (including version)
I make postgres user as owner for all these files
I remove the WALs from the pg_wal directory of base backup and update the restore_command of postgresql.conf (additionally I also undo the archive configs which were there on primary)
I remove 00012 and 00013 WALs as those are already there in base backup, so now archived WALs is only 00014 and the .backup file created
I create a recovery.signal file, which is empty and in the data directory
Then finally I change the name of the current pgdata to pgdata_ini and the backup directory as pgdata so that it acts as my data directory
Then I stop the container and start again but database startup fails due to invalid checkpoint record and could not find required checkpoint record

Can someone point out what I am doing wrong here?

Question 2

Turns out that the invalid checkpoint record happened because the database system identifiers of the two containers were different, which we can check by doing select system_identifier from pg_control_system();

Now this may sound obvious but it has a slight caveat. When we do PITR, the first step is to load the basebackup, and after loading this backup and renaming it to pgdata, but before restarting, the identifier actually ends up being the same as per the above query. (Even though it is actually not for the system)

So we have to do the following to get around this:

Make two copies of the basebackup on new container
Restart the container with one of the backups so that system identifier truly reflects the old one but it is not up-to-date
Then copy the archived WAL files and update the second copy of the basebackup (removing pg_wal files, updating restore_command in conf, adding recovery.signal etc)
Now, we can rename this second basebackup copy as pgdata and restart the container

Now, it will correctly pick up the archived files and recover up-to-date data because the system identifiers are actually same. It just needs one extra restart with the initial basebackup.

Ironscar Ironscar 12 bronze badges · Answer 1 · 2025-01-06 04:04:47Z

Turns out that the invalid checkpoint record happened because the database system identifiers of the two containers were different, which we can check by doing select system_identifier from pg_control_system();

Now this may sound obvious but it has a slight caveat. When we do PITR, the first step is to load the basebackup, and after loading this backup and renaming it to pgdata, but before restarting, the identifier actually ends up being the same as per the above query. (Even though it is actually not for the system)

So we have to do the following to get around this:

Make two copies of the basebackup on new container
Restart the container with one of the backups so that system identifier truly reflects the old one but it is not up-to-date
Then copy the archived WAL files and update the second copy of the basebackup (removing pg_wal files, updating restore_command in conf, adding recovery.signal etc)
Now, we can rename this second basebackup copy as pgdata and restart the container

Now, it will correctly pick up the archived files and recover up-to-date data because the system identifiers are actually same. It just needs one extra restart with the initial basebackup.

Stack Exchange Network

Postgres 16 Continuous archiving and PITR "invalid checkpoint" error

1 Answer 1

Your Answer

Sign up or log in

Post as a guest

Post as a guest

Hot Network Questions

Postgres 16 Continuous archiving and PITR "invalid checkpoint" error

1 Answer 1

Your Answer

Sign up or log in

Post as a guest

Post as a guest

Related

Hot Network Questions