PostgreSQL 原始碼解讀(112)- WAL#8(XLogCtrl資料結構)
本節簡單介紹了XLOG全域性(所有程式之間)共享的資料結構:XLogCtlData和XLogCtlInsert。在這兩個結構體中,儲存了REDO point/Lock等相關重要的資訊.
一、資料結構
XLogCtlInsert
WAL插入記錄時使用的共享資料結構
/*
* Shared state data for WAL insertion.
* WAL插入記錄時使用的共享資料結構
*/
typedef struct XLogCtlInsert
{
//包含CurrBytePos和PrevBytePos的lock
slock_t insertpos_lck; /* protects CurrBytePos and PrevBytePos */
/*
* CurrBytePos is the end of reserved WAL. The next record will be
* inserted at that position. PrevBytePos is the start position of the
* previously inserted (or rather, reserved) record - it is copied to the
* prev-link of the next record. These are stored as "usable byte
* positions" rather than XLogRecPtrs (see XLogBytePosToRecPtr()).
* CurrBytePos是保留WAL的結束位置。
* 下一條記錄將插入到那個位置。
* PrevBytePos是先前插入(或者保留)記錄的起始位置——它被複制到下一條記錄的prev-link中。
* 這些儲存為“可用位元組位置”,而不是XLogRecPtrs(參見XLogBytePosToRecPtr())。
*/
uint64 CurrBytePos;
uint64 PrevBytePos;
/*
* Make sure the above heavily-contended spinlock and byte positions are
* on their own cache line. In particular, the RedoRecPtr and full page
* write variables below should be on a different cache line. They are
* read on every WAL insertion, but updated rarely, and we don't want
* those reads to steal the cache line containing Curr/PrevBytePos.
* 確保以上激烈競爭的自旋鎖和位元組位置在它們自己的快取line上。
* 特別是,RedoRecPtr和下面的全頁寫變數應該位於不同的快取line上。
* 它們在每次插入WAL時都被讀取,但很少更新,
* 我們不希望這些讀取竊取包含Curr/PrevBytePos的快取line。
*/
char pad[PG_CACHE_LINE_SIZE];
/*
* fullPageWrites is the master copy used by all backends to determine
* whether to write full-page to WAL, instead of using process-local one.
* This is required because, when full_page_writes is changed by SIGHUP,
* we must WAL-log it before it actually affects WAL-logging by backends.
* Checkpointer sets at startup or after SIGHUP.
* fullpagewrite是所有後臺程式使用的主副本,
* 用於確定是否將整個頁面寫入WAL,而不是使用process-local副本。
* 這是必需的,因為當SIGHUP更改full_page_write時,
* 我們必須在它透過後臺程式實際影響WAL-logging之前對其進行WAL-log記錄。
* Checkpointer檢查點設定在啟動或SIGHUP之後。
*
* To read these fields, you must hold an insertion lock. To modify them,
* you must hold ALL the locks.
* 為了讀取這些域,必須持有insertion lock.
* 如需更新,則需要持有所有這些lock.
*/
//插入時的當前redo point
XLogRecPtr RedoRecPtr; /* current redo point for insertions */
//為PITR強制執行full-page寫?
bool forcePageWrites; /* forcing full-page writes for PITR? */
//是否全頁寫?
bool fullPageWrites;
/*
* exclusiveBackupState indicates the state of an exclusive backup (see
* comments of ExclusiveBackupState for more details). nonExclusiveBackups
* is a counter indicating the number of streaming base backups currently
* in progress. forcePageWrites is set to true when either of these is
* non-zero. lastBackupStart is the latest checkpoint redo location used
* as a starting point for an online backup.
* exclusive sivebackupstate表示排他備份的狀態
* (有關詳細資訊,請參閱exclusive sivebackupstate的註釋)。
* 非排他性備份是一個計數器,指示當前正在進行的流基礎備份的數量。
* forcePageWrites在這兩個值都不為零時被設定為true。
* lastBackupStart用作線上備份起點的最新檢查點的重做位置。
*/
ExclusiveBackupState exclusiveBackupState;
int nonExclusiveBackups;
XLogRecPtr lastBackupStart;
/*
* WAL insertion locks.
* WAL寫入鎖
*/
WALInsertLockPadded *WALInsertLocks;
} XLogCtlInsert;
XLogCtl
XLOG的所有共享記憶體狀態資訊
/*
* Total shared-memory state for XLOG.
* XLOG的所有共享記憶體狀態資訊
*/
typedef struct XLogCtlData
{
XLogCtlInsert Insert;//插入控制器
/* Protected by info_lck: */
//------ 透過info_lck鎖保護
XLogwrtRqst LogwrtRqst;
//Insert->RedoRecPtr最近的複製
XLogRecPtr RedoRecPtr; /* a recent copy of Insert->RedoRecPtr */
//最後的checkpoint的nextXID & epoch
uint32 ckptXidEpoch; /* nextXID & epoch of latest checkpoint */
TransactionId ckptXid;
//最新非同步提交/回滾的LSN
XLogRecPtr asyncXactLSN; /* LSN of newest async commit/abort */
//slot需要的最"老"的LSN
XLogRecPtr replicationSlotMinLSN; /* oldest LSN needed by any slot */
//最後移除/回收的XLOG段
XLogSegNo lastRemovedSegNo; /* latest removed/recycled XLOG segment */
/* Fake LSN counter, for unlogged relations. Protected by ulsn_lck. */
//---- "偽裝"的LSN計數器,用於不需要記錄日誌的關係.透過ulsn_lck鎖保護
XLogRecPtr unloggedLSN;
slock_t ulsn_lck;
/* Time and LSN of last xlog segment switch. Protected by WALWriteLock. */
//---- 切換後最新的xlog段的時間線和LSN,透過WALWriteLock鎖保護
pg_time_t lastSegSwitchTime;
XLogRecPtr lastSegSwitchLSN;
/*
* Protected by info_lck and WALWriteLock (you must hold either lock to
* read it, but both to update)
* 透過info_lck和WALWriteLock保護
* (必須持有其中之一才能讀取,必須全部持有才能更新)
*/
XLogwrtResult LogwrtResult;
/*
* Latest initialized page in the cache (last byte position + 1).
* 在快取中最後初始化的page(最後一個位元組位置 + 1)
*
* To change the identity of a buffer (and InitializedUpTo), you need to
* hold WALBufMappingLock. To change the identity of a buffer that's
* still dirty, the old page needs to be written out first, and for that
* you need WALWriteLock, and you need to ensure that there are no
* in-progress insertions to the page by calling
* WaitXLogInsertionsToFinish().
* 如需改變緩衝區的標識(以及InitializedUpTo),需要持有WALBufMappingLock鎖.
* 改變標記為dirty的緩衝區的識別符號,舊的page需要先行寫出,因此必須持有WALWriteLock鎖,
* 而且必須確保沒有正在透過呼叫WaitXLogInsertionsToFinish()進行執行中的插入page操作
*/
XLogRecPtr InitializedUpTo;
/*
* These values do not change after startup, although the pointed-to pages
* and xlblocks values certainly do. xlblock values are protected by
* WALBufMappingLock.
* 在啟動後這些值不會修改,雖然pointed-to pages和xlblocks值通常會更改.
* xlblock的值透過WALBufMappingLock鎖保護.
*/
//未寫入的XLOG pages的快取
char *pages; /* buffers for unwritten XLOG pages */
//ptr-s的第一個位元組 + XLOG_BLCKSZ
XLogRecPtr *xlblocks; /* 1st byte ptr-s + XLOG_BLCKSZ */
//已分配的xlog緩衝的索引最高值
int XLogCacheBlck; /* highest allocated xlog buffer index */
/*
* Shared copy of ThisTimeLineID. Does not change after end-of-recovery.
* If we created a new timeline when the system was started up,
* PrevTimeLineID is the old timeline's ID that we forked off from.
* Otherwise it's equal to ThisTimeLineID.
* ThisTimeLineID的共享複製.
* 在完成恢復後不要修改.
* 如果在系統啟動後建立了一個新的時間線,PrevTimeLineID是從舊時間線分叉的ID.
* 否則,PrevTimeLineID = ThisTimeLineID
*/
TimeLineID ThisTimeLineID;
TimeLineID PrevTimeLineID;
/*
* SharedRecoveryInProgress indicates if we're still in crash or archive
* recovery. Protected by info_lck.
* SharedRecoveryInProgress標記是否處於當機或者歸檔恢復中,透過info_lck鎖保護.
*/
bool SharedRecoveryInProgress;
/*
* SharedHotStandbyActive indicates if we're still in crash or archive
* recovery. Protected by info_lck.
* SharedHotStandbyActive標記是否處於當機或者歸檔恢復中,透過info_lck鎖保護.
*/
bool SharedHotStandbyActive;
/*
* WalWriterSleeping indicates whether the WAL writer is currently in
* low-power mode (and hence should be nudged if an async commit occurs).
* Protected by info_lck.
* WalWriterSleeping標記WAL writer程式是否處於"節能"模式
* (因此,如果發生非同步提交,應該對其進行微操作).
* 透過info_lck鎖保護.
*/
bool WalWriterSleeping;
/*
* recoveryWakeupLatch is used to wake up the startup process to continue
* WAL replay, if it is waiting for WAL to arrive or failover trigger file
* to appear.
* recoveryWakeupLatch等待WAL arrive或者failover觸發檔案出現,
* 如出現則喚醒啟動程式繼續執行WAL回放.
*
*/
Latch recoveryWakeupLatch;
/*
* During recovery, we keep a copy of the latest checkpoint record here.
* lastCheckPointRecPtr points to start of checkpoint record and
* lastCheckPointEndPtr points to end+1 of checkpoint record. Used by the
* checkpointer when it wants to create a restartpoint.
* 在恢復期間,我們儲存最後檢查點記錄的一個複製在這裡.
* lastCheckPointRecPtr指向檢查點的起始位置
* lastCheckPointEndPtr指向執行檢查點的結束點+1位置
* 在checkpointer程式希望建立一個重新啟動的點時使用.
*
* Protected by info_lck.
* 使用info_lck鎖保護.
*/
XLogRecPtr lastCheckPointRecPtr;
XLogRecPtr lastCheckPointEndPtr;
CheckPoint lastCheckPoint;
/*
* lastReplayedEndRecPtr points to end+1 of the last record successfully
* replayed. When we're currently replaying a record, ie. in a redo
* function, replayEndRecPtr points to the end+1 of the record being
* replayed, otherwise it's equal to lastReplayedEndRecPtr.
* lastReplayedEndRecPtr指向最後一個成功回放的記錄的結束點 + 1的位置.
* 如果正處於redo函式回放記錄期間,那麼replayEndRecPtr指向正在恢復的記錄的結束點 + 1的位置,
* 否則replayEndRecPtr = lastReplayedEndRecPtr
*/
XLogRecPtr lastReplayedEndRecPtr;
TimeLineID lastReplayedTLI;
XLogRecPtr replayEndRecPtr;
TimeLineID replayEndTLI;
/* timestamp of last COMMIT/ABORT record replayed (or being replayed) */
//最後的COMMIT/ABORT回放(或正在回放)記錄的時間戳
TimestampTz recoveryLastXTime;
/*
* timestamp of when we started replaying the current chunk of WAL data,
* only relevant for replication or archive recovery
* 我們開始回放當前的WAL chunk的時間戳(僅與複製或存檔恢復相關)
*/
TimestampTz currentChunkStartTime;
/* Are we requested to pause recovery? */
//是否請求暫停恢復
bool recoveryPause;
/*
* lastFpwDisableRecPtr points to the start of the last replayed
* XLOG_FPW_CHANGE record that instructs full_page_writes is disabled.
* lastFpwDisableRecPtr指向最後已回放的XLOG_FPW_CHANGE記錄(禁用對整個頁面的寫指令)的起始點.
*/
XLogRecPtr lastFpwDisableRecPtr;
//鎖結構
slock_t info_lck; /* locks shared variables shown above */
} XLogCtlData;
static XLogCtlData *XLogCtl = NULL;
二、跟蹤分析
跟蹤任意一個後臺程式,列印全域性變數XLogCtl.
(gdb) p XLogCtl
$6 = (XLogCtlData *) 0x7f391e00ea80
(gdb) p *XLogCtl
$7 = {Insert = {insertpos_lck = 0 '\000', CurrBytePos = 5494680728, PrevBytePos = 5494680616,
pad = '\000' <repeats 127 times>, RedoRecPtr = 5510830896, forcePageWrites = false, fullPageWrites = true,
exclusiveBackupState = EXCLUSIVE_BACKUP_NONE, nonExclusiveBackups = 0, lastBackupStart = 0,
WALInsertLocks = 0x7f391e013100}, LogwrtRqst = {Write = 5510831008, Flush = 5510831008}, RedoRecPtr = 5510830896,
ckptXidEpoch = 0, ckptXid = 2036, asyncXactLSN = 5510830896, replicationSlotMinLSN = 0, lastRemovedSegNo = 0,
unloggedLSN = 1, ulsn_lck = 0 '\000', lastSegSwitchTime = 1545962218, lastSegSwitchLSN = 5507670464, LogwrtResult = {
Write = 5510831008, Flush = 5510831008}, InitializedUpTo = 5527601152, pages = 0x7f391e014000 "\230\320\006",
xlblocks = 0x7f391e00f088, XLogCacheBlck = 2047, ThisTimeLineID = 1, PrevTimeLineID = 1,
archiveCleanupCommand = '\000' <repeats 1023 times>, SharedRecoveryInProgress = false, SharedHotStandbyActive = false,
WalWriterSleeping = true, recoveryWakeupLatch = {is_set = 0, is_shared = true, owner_pid = 0}, lastCheckPointRecPtr = 0,
lastCheckPointEndPtr = 0, lastCheckPoint = {redo = 0, ThisTimeLineID = 0, PrevTimeLineID = 0, fullPageWrites = false,
nextXidEpoch = 0, nextXid = 0, nextOid = 0, nextMulti = 0, nextMultiOffset = 0, oldestXid = 0, oldestXidDB = 0,
oldestMulti = 0, oldestMultiDB = 0, time = 0, oldestCommitTsXid = 0, newestCommitTsXid = 0, oldestActiveXid = 0},
lastReplayedEndRecPtr = 0, lastReplayedTLI = 0, replayEndRecPtr = 0, replayEndTLI = 0, recoveryLastXTime = 0,
currentChunkStartTime = 0, recoveryPause = false, lastFpwDisableRecPtr = 0, info_lck = 0 '\000'}
(gdb)
其中:
1.XLogCtl->Insert是XLogCtlInsert結構體變數.
2.RedoRecPtr為5510830896 -> 1/48789B30,該值與pg_control檔案中的REDO location相對應.
[xdb@localhost ~]$ pg_controldata|grep REDO
Latest checkpoint's REDO location: 1/48789B30
Latest checkpoint's REDO WAL file: 000000010000000100000048
3.ThisTimeLineID&PrevTimeLineID,時間線ID,值為1.
其他相關資訊可對照結構體定義閱讀.
三、參考資料
PostgreSQL 原始碼解讀(4)- 插入資料#3(heap_insert)
PG Source Code
來自 “ ITPUB部落格 ” ,連結:http://blog.itpub.net/6906/viewspace-2374770/,如需轉載,請註明出處,否則將追究法律責任。
相關文章
- PostgreSQL 原始碼解讀(205)- 查詢#118(資料結構RangeTblEntry)SQL原始碼資料結構
- PostgreSQL 原始碼解讀(108)- 後臺程式#1(PGPROC資料結構)SQL原始碼資料結構
- PostgreSQL 原始碼解讀(109)- WAL#5(相關資料結構)SQL原始碼資料結構
- PostgreSQL 原始碼解讀(206)- 查詢#119(資料結構RangSubselect等)SQL原始碼資料結構
- PostgreSQL 原始碼解讀(197)- 查詢#112(排序#5 - mergeruns)SQL原始碼排序
- PostgreSQL 原始碼解讀(204)- 查詢#117(資料結構SelectStmt&Value)SQL原始碼資料結構
- PostgreSQL 原始碼解讀(207)- 查詢#120(資料結構FromExpr&JoinExpr)SQL原始碼資料結構
- PostgreSQL 原始碼解讀(1)- 插入資料#1SQL原始碼
- PostgreSQL 原始碼解讀(178)- 查詢#95(聚合函式)#1相關資料結構SQL原始碼函式資料結構
- PostgreSQL 原始碼解讀(2)- 插入資料#2(RelationPutHeapTuple)SQL原始碼APT
- PostgreSQL 原始碼解讀(5)- 插入資料#4(ExecInsert)SQL原始碼
- PostgreSQL 原始碼解讀(6)- 插入資料#5(ExecModifyTable)SQL原始碼
- PostgreSQL 原始碼解讀(8)- 插入資料#7(ExecutePlan)SQL原始碼
- PostgreSQL 原始碼解讀(10)- 插入資料#9(ProcessQuery)SQL原始碼
- PostgreSQL 原始碼解讀(13)- 插入資料#12(PostgresMain)SQL原始碼AI
- PostgreSQL 原始碼解讀(44)- 查詢語句#29(等價類相關資料結構)SQL原始碼資料結構
- PostgreSQL 原始碼解讀(3)- 如何閱讀原始碼SQL原始碼
- PostgreSQL 原始碼解讀(4)- 插入資料#3(heap_insert)SQL原始碼
- PostgreSQL:原始碼目錄結構SQL原始碼
- PostgreSQL 原始碼解讀(12)- 插入資料#11(exec_simple_query)SQL原始碼
- PostgreSQL 原始碼解讀(7)- 插入資料#6(ExecProcNode和ExecPro...SQL原始碼
- PostgreSQL 原始碼解讀(9)- 插入資料#8(ExecutorRun和standard...SQL原始碼
- PostgreSQL 原始碼解讀(11)- 插入資料#10(PortalRunMulti和Por...SQL原始碼
- PostgreSQL 原始碼解讀(22)- 查詢語句#7(PlannedStmt結構詳解-日誌分析)SQL原始碼
- redis資料結構原始碼閱讀——字串編碼過程Redis資料結構原始碼字串編碼
- PostgreSQL 原始碼解讀(14)- Insert語句(如何構造PlannedStmt)SQL原始碼
- PostgreSQL 原始碼解讀(219)- Locks(Overview)SQL原始碼View
- PostgreSQL 原始碼解讀(241)- plpgsql(CreateFunction)SQL原始碼Function
- PostgreSQL 原始碼解讀(92)- 分割槽表#1(資料插入路由#1)SQL原始碼路由
- PostgreSQL 原始碼解讀(94)- 分割槽表#2(資料插入路由#2)SQL原始碼路由
- 比特幣原始碼研讀(3)資料結構-交易Transaction比特幣原始碼資料結構
- PostgreSQL 原始碼解讀(240)- HTAB簡介SQL原始碼
- PostgreSQL 原始碼解讀(220)- Locks(LOCK Struct)SQL原始碼Struct
- PostgreSQL 原始碼解讀(221)- Locks(PROCLOCK Struct)SQL原始碼Struct
- 簡單易懂的Vue資料繫結原始碼解讀Vue原始碼
- 比特幣原始碼研讀(2)資料結構-區塊Block比特幣原始碼資料結構BloC
- 比特幣原始碼研讀(4)資料結構-交易池TransactionPool比特幣原始碼資料結構
- CesiumJS 2022^ 原始碼解讀[0] - 文章目錄與原始碼工程結構JS原始碼