Commit 9a5e4a6

Fix failure to delete spill files of aborted transactions
Logical decoding's reorderbuffer.c may spill transaction files to disk when transactions are large. These are supposed to be removed when they become "too old" by xid; but file removal requires the boundary LSNs of the transaction to be known. The final_lsn is only set when we see the commit or abort record for the transaction, but nothing sets the value for transactions that crash, so the removal code misbehaves -- in assertion-enabled builds, it crashes by a failed assertion.

To fix, modify the final_lsn of transactions that don't have a value set, to the LSN of the very latest change in the transaction. This causes the spilled files to be removed appropriately.

Author: Atsushi Torikoshi
Reviewed-by: Kyotaro HORIGUCHI, Craig Ringer, Masahiko Sawada
Discussion: https://postgr.es/m/54e4e488-186b-a056-6628-50628e4e4ebc@lab.ntt.co.jp
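The commit message says that file removal needs both boundary LSNs of the transaction. The following is a minimal, standalone sketch of why an unset final_lsn defeats cleanup. It assumes that spill files are laid out one per WAL segment between the first and the final LSN; that layout, the 16 MB segment size, and every name below (Lsn, SEGMENT_SIZE, lsn_to_segment, spill_files_to_remove) are illustrative assumptions, not PostgreSQL definitions.

```c
#include <stdint.h>

/* Hypothetical stand-ins for illustration; not the PostgreSQL types. */
typedef uint64_t Lsn;
#define INVALID_LSN  ((Lsn) 0)
#define SEGMENT_SIZE ((Lsn) 16 * 1024 * 1024)	/* assumed segment size */

/* Map an LSN to the number of the segment that contains it. */
static uint64_t
lsn_to_segment(Lsn lsn)
{
	return lsn / SEGMENT_SIZE;
}

/*
 * Return how many per-segment spill files a cleanup pass would visit,
 * or -1 when the range cannot be determined because a boundary is
 * unset or the boundaries are out of order.
 */
static long
spill_files_to_remove(Lsn first_lsn, Lsn final_lsn)
{
	if (first_lsn == INVALID_LSN || final_lsn == INVALID_LSN)
		return -1;
	if (final_lsn < first_lsn)
		return -1;

	return (long) (lsn_to_segment(final_lsn) - lsn_to_segment(first_lsn) + 1);
}
```

Under these assumptions, a crashed transaction whose final_lsn is still zero yields no usable range, whereas substituting the LSN of its last change gives a range that covers every segment the transaction wrote to.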
1 parent ad592f4 commit 9a5e4a6

2 files changed: +19 -2 lines changed

src/backend/replication/logical/reorderbuffer.c

Lines changed: 17 additions & 2 deletions
@@ -1757,8 +1757,8 @@ ReorderBufferAbortOld(ReorderBuffer *rb, TransactionId oldestRunningXid)
      * Iterate through all (potential) toplevel TXNs and abort all that are
      * older than what possibly can be running. Once we've found the first
      * that is alive we stop, there might be some that acquired an xid earlier
-     * but started writing later, but it's unlikely and they will cleaned up
-     * in a later call to ReorderBufferAbortOld().
+     * but started writing later, but it's unlikely and they will be cleaned
+     * up in a later call to this function.
      */
     dlist_foreach_modify(it, &rb->toplevel_by_lsn)
     {
@@ -1768,6 +1768,21 @@ ReorderBufferAbortOld(ReorderBuffer *rb, TransactionId oldestRunningXid)
 
         if (TransactionIdPrecedes(txn->xid, oldestRunningXid))
         {
+            /*
+             * We set final_lsn on a transaction when we decode its commit or
+             * abort record, but we never see those records for crashed
+             * transactions.  To ensure cleanup of these transactions, set
+             * final_lsn to that of their last change; this causes
+             * ReorderBufferRestoreCleanup to do the right thing.
+             */
+            if (txn->serialized && txn->final_lsn == 0)
+            {
+                ReorderBufferChange *last =
+                    dlist_tail_element(ReorderBufferChange, node, &txn->changes);
+
+                txn->final_lsn = last->lsn;
+            }
+
             elog(DEBUG2, "aborting old transaction %u", txn->xid);
 
             /* remove potential on-disk data, and deallocate this tx */
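The fallback added in this hunk can be tried in isolation with a simplified model. The sketch below is not the PostgreSQL code: Txn, Change, txn_append_change, and txn_fix_final_lsn are hypothetical stand-ins, and a plain singly linked list replaces the dlist used in reorderbuffer.c. It only mirrors the condition from the diff (changes were serialized and final_lsn is still zero) and the assignment from the last change in the list.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical stand-ins for illustration; not the PostgreSQL types. */
typedef uint64_t Lsn;
#define INVALID_LSN ((Lsn) 0)

typedef struct Change
{
	Lsn			lsn;		/* WAL position of this change */
	struct Change *next;
} Change;

typedef struct Txn
{
	bool		serialized;	/* were changes spilled to disk? */
	Lsn			final_lsn;	/* zero until commit/abort is decoded */
	Change	   *head;
	Change	   *tail;
} Txn;

/* Append a caller-owned change to the end of the transaction's list. */
static void
txn_append_change(Txn *txn, Change *change, Lsn lsn)
{
	change->lsn = lsn;
	change->next = NULL;
	if (txn->tail != NULL)
		txn->tail->next = change;
	else
		txn->head = change;
	txn->tail = change;
}

/*
 * If the transaction spilled to disk but never got a final_lsn (no
 * commit or abort record was seen), fall back to the LSN of its last
 * change.  Sketch assumption: the list holds at least one change.
 */
static void
txn_fix_final_lsn(Txn *txn)
{
	if (txn->serialized && txn->final_lsn == INVALID_LSN)
	{
		assert(txn->tail != NULL);
		txn->final_lsn = txn->tail->lsn;
	}
}
```

A transaction that already has a final_lsn, or that never spilled to disk, is left untouched, which matches the guard in the diff.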

src/include/replication/reorderbuffer.h

Lines changed: 2 additions & 0 deletions
@@ -168,6 +168,8 @@ typedef struct ReorderBufferTXN
      * * plain abort record
      * * prepared transaction abort
      * * error during decoding
+     * * for a crashed transaction, the LSN of the last change, regardless of
+     *   what it was.
      * ----
      */
     XLogRecPtr final_lsn;
