diff options
author | Tom Lane <tgl@sss.pgh.pa.us> | 2022-10-11 18:54:31 -0400 |
---|---|---|
committer | Tom Lane <tgl@sss.pgh.pa.us> | 2022-10-11 18:54:31 -0400 |
commit | 18a4a620e2de0e25a15303a8f78ff415950a14ef (patch) | |
tree | e6eab90fbbd32dc02304ac7c3f462a26cace2461 /src/backend/tcop/postgres.c | |
parent | b8f2687fdc410371bbfa579ab7c4fd4b7a5ed1cb (diff) | |
download | postgresql-18a4a620e2de0e25a15303a8f78ff415950a14ef.tar.gz postgresql-18a4a620e2de0e25a15303a8f78ff415950a14ef.zip |
Harden pmsignal.c against clobbered shared memory.
The postmaster is not supposed to do anything that depends
fundamentally on shared memory contents, because that creates
the risk that a backend crash that trashes shared memory will
take the postmaster down with it, preventing automatic recovery.
In commit 969d7cd43 I lost sight of this principle and coded
AssignPostmasterChildSlot() in such a way that it could fail
or even crash if the shared PMSignalState structure became
corrupted. Remarkably, we've not seen field reports of such
crashes; but I managed to induce one while testing the recent
changes around palloc chunk headers.
To fix, make a semi-duplicative state array inside the postmaster
so that we need consult only local state while choosing a "child
slot" for a new backend. Ensure that other postmaster-executed
routines in pmsignal.c don't have critical dependencies on the
shared state, either. Corruption of PMSignalState might now
lead ReleasePostmasterChildSlot() to conclude that backend X
failed, when actually backend Y was the one that trashed things.
But that doesn't matter, because we'll force a cluster-wide reset
regardless.
Back-patch to all supported branches, since this is an old bug.
Discussion: https://postgr.es/m/3436789.1665187055@sss.pgh.pa.us
Diffstat (limited to 'src/backend/tcop/postgres.c')
0 files changed, 0 insertions, 0 deletions