summaryrefslogtreecommitdiff
path: root/doc/src/sgml/ref/pg_rewind.sgml
diff options
context:
space:
mode:
authorThomas Munro <tmunro@postgresql.org>2021-03-17 17:46:39 +1300
committerThomas Munro <tmunro@postgresql.org>2021-03-17 18:06:52 +1300
commit4e0f0995e923948631c4114ab353b256b51b58ad (patch)
tree487d0e54a578a82588cd82c6cbdaf5dc03b1192d /doc/src/sgml/ref/pg_rewind.sgml
parent4d072bf2a031f343ef796dac6d324d9a03121183 (diff)
Fix race in Parallel Hash Join batch cleanup.
With very unlucky timing and parallel_leader_participation off, PHJ could attempt to access per-batch state just as it was being freed. There was code intended to prevent that by checking for a cleared pointer, but it was buggy. Fix, by introducing an extra barrier phase. The new phase PHJ_BUILD_RUNNING means that it's safe to access the per-batch state to find a batch to help with, and PHJ_BUILD_DONE means that it is too late. The last to detach will free the array of per-batch state as before, but now it will also atomically advance the phase at the same time, so that late attachers can avoid the hazard, without the data race. This mirrors the way per-batch hash tables are freed (see phases PHJ_BATCH_PROBING and PHJ_BATCH_DONE). Revealed by a one-off build farm failure, where BarrierAttach() failed a sanity check assertion, because the memory had been clobbered by dsa_free(). Back-patch to 11, where the code arrived. Reported-by: Michael Paquier <michael@paquier.xyz> Discussion: https://postgr.es/m/20200929061142.GA29096%40paquier.xyz
Diffstat (limited to 'doc/src/sgml/ref/pg_rewind.sgml')
0 files changed, 0 insertions, 0 deletions