reftable/stack: fix corruption on concurrent compaction - user/sven/git.git

diff options

author	Patrick Steinhardt <ps@pks.im>	2024-08-08 16:06:53 +0200
committer	Junio C Hamano <gitster@pobox.com>	2024-08-08 10:14:43 -0700
commit	ed1ad6b44deaf5089c64e0fbcae43a37d5cf666a (patch)
tree	59a2f131e777f5ff9ae2e49f75654da0a1ebfcf1 /builtin/commit.c
parent	128b9aa3e9d3dc0417f8f65a240aad736db97959 (diff)

reftable/stack: fix corruption on concurrent compaction

The locking employed by compaction uses the following schema: 1. Lock "tables.list" and verify that it matches the version we have loaded in core. 2. Lock each of the tables in the user-supplied range of tables that we are supposed to compact. These locks prohibit any concurrent process to compact those tables while we are doing that. 3. Unlock "tables.list". This enables concurrent processes to add new tables to the stack, but also allows them to compact tables outside of the range of tables that we have locked. 4. Perform the compaction. 5. Lock "tables.list" again. 6. Move the compacted table into place. 7. Write the new order of tables, including the compacted table, into the lockfile. 8. Commit the lockfile into place. Letting concurrent processes modify the "tables.list" file while we are doing the compaction is very much part of the design and thus expected. After all, it may take some time to compact tables in the case where we are compacting a lot of very large tables. But there is a bug in the code. Suppose we have two processes which are compacting two slices of the table. Given that we lock each of the tables before compacting them, we know that the slices must be disjunct from each other. But regardless of that, compaction performed by one process will always impact what the other process needs to write to the "tables.list" file. Right now, we do not check whether the "tables.list" has been changed after we have locked it for the second time in (5). This has the consequence that we will always commit the old, cached in-core tables to disk without paying to respect what the other process has written. This scenario would then lead to data loss and corruption. This can even happen in the simpler case of one compacting process and one writing process. The newly-appended table by the writing process would get discarded by the compacting process because it never sees the new table. Fix this bug by re-checking whether our stack is still up to date after locking for the second time. If it isn't, then we adjust the indices of tables to replace in the updated stack. Signed-off-by: Patrick Steinhardt <ps@pks.im> Signed-off-by: Junio C Hamano <gitster@pobox.com>

Diffstat (limited to 'builtin/commit.c')

0 files changed, 0 insertions, 0 deletions


context:
space:
mode: