| author | Amit Kapila <akapila@postgresql.org> | 2023-01-24 09:25:36 +0530 | 
|---|---|---|
| committer | Amit Kapila <akapila@postgresql.org> | 2023-01-24 09:25:36 +0530 | 
| commit | 6c6d6ba3ee2c160b53f727cf8e612014b316d6e4 (patch) | |
| tree | e2b20e5a6ad953cabaa6377eaba91d8d9e3ecea3 /src/backend/optimizer/util | |
| parent | 728f86fec65537eade8d9e751961782ddb527934 (diff) | |
Fix the Drop Database hang.

The DROP DATABASE command waits for the logical replication tablesync worker
to accept a ProcSignalBarrier, while the worker's slot creation waits for the
DROP DATABASE to finish, which leads to a deadlock. This happens because the
tablesync worker holds interrupts while creating a slot.

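
The interaction described above can be illustrated with a toy model. This is only a sketch, not PostgreSQL source: every `toy_*` name and the `ToyBackend` struct are hypothetical stand-ins, loosely modelled on the server's `HOLD_INTERRUPTS()`, `RESUME_INTERRUPTS()`, and `CHECK_FOR_INTERRUPTS()` macros. It assumes the worker stays blocked in slot creation for the whole polling loop, as in the reported hang.

```c
#include <stdbool.h>

/* Hypothetical stand-in for per-backend interrupt state (not server code). */
typedef struct ToyBackend
{
    int  interrupt_holdoff_count;  /* > 0 means interrupts are held off */
    bool barrier_pending;          /* a barrier has been signalled */
    bool barrier_absorbed;         /* the barrier has been processed */
} ToyBackend;

static void toy_hold_interrupts(ToyBackend *b)   { b->interrupt_holdoff_count++; }
static void toy_resume_interrupts(ToyBackend *b) { b->interrupt_holdoff_count--; }

/* Nothing is processed while interrupts are held off. */
static void toy_check_for_interrupts(ToyBackend *b)
{
    if (b->interrupt_holdoff_count > 0)
        return;
    if (b->barrier_pending)
    {
        b->barrier_pending = false;
        b->barrier_absorbed = true;
    }
}

/*
 * Simulate a worker that is blocked in slot creation while another session
 * emits a barrier. The worker polls for interrupts max_polls times.
 * Returns true if the barrier was absorbed, i.e. the session waiting on
 * the barrier could make progress.
 */
bool toy_barrier_absorbed_during_slot_creation(bool hold_interrupts, int max_polls)
{
    ToyBackend worker = {0, false, false};

    if (hold_interrupts)
        toy_hold_interrupts(&worker);

    worker.barrier_pending = true;  /* the other session signals the barrier */

    for (int i = 0; i < max_polls && !worker.barrier_absorbed; i++)
        toy_check_for_interrupts(&worker);

    bool absorbed = worker.barrier_absorbed;

    if (hold_interrupts)
        toy_resume_interrupts(&worker);

    return absorbed;
}
```

In this model, `toy_barrier_absorbed_during_slot_creation(true, n)` returns false for any `n`, which corresponds to the hang. With `hold_interrupts` set to false the barrier is absorbed on the first poll.
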
We prevented cancel/die interrupts while creating a slot in the tablesync
worker because a concurrent DROP SUBSCRIPTION could complete before the
server finishes this command, without removing the slot, which would leave
the slot in existence until the walsender ends. However, the slot is
eventually dropped when the walsender exits, so there is no danger of a
dangling slot.

This patch re-allows cancel/die interrupts while creating a slot and
modifies the test to wait for the number of slots to reach zero, so that it
does not find an ephemeral slot.

The reported hang doesn't happen in PG14, because DROP DATABASE started to
wait for ProcSignalBarrier only in PG15 (commits 4eb2176318 and e2f65f4255).
It is still good to backpatch this to PG14, as it is not a good idea to
prevent interrupts during a network call that could block indefinitely.

Reported-by: Lakshmi Narayanan Sreethar
Diagnosed-by: Andres Freund
Author: Hou Zhijie
Reviewed-by: Vignesh C, Amit Kapila
Backpatch-through: 14, where it was introduced in commit 6b67d72b60
Discussion: https://postgr.es/m/CA+kvmZELXQ4ZD3U=XCXuG3KvFgkuPoN1QrEj8c-rMRodrLOnsg@mail.gmail.com
Diffstat (limited to 'src/backend/optimizer/util')
0 files changed, 0 insertions, 0 deletions
