PostgreSQL9.5：pg_rewind 快速恢復備節點

francs發表於2016-11-08

瞭解 PG 的朋友應該知道 PG 的主備切換並不容易，步驟較嚴謹，在啟用備節點前需主動關閉主節點，否則再想以備節點角色拉起主節點會比較困難，之前部落格介紹過主備切換，PostgreSQL HOT-Standby 的主備切換，PG 9.5 版本已經將 pg_rewind 加入到原始碼，當主備發生切換時，可以將原來主庫通過同步模式恢復，避免重做備庫。這樣對於較大的庫來說，節省了大量重做備庫時間。

pg_rewind 會將目標庫的資料檔案，配置檔案複製到本地目錄，由於 pg_rewind 不會讀取所有未發生變化的資料塊，所以速度比重做備庫要快很多，

一環境準備

流複製環境
192.168.2.37/1931 主節點(主機名 db1)
192.168.2.38/1931 備節點(主機名 db2)
備註：流複製環境參考 PostgreSQL：使用 pg_basebackup 搭建流複製環境，本文略。

–pg_rewind 前提條件
1 full_page_writes
2 wal_log_hints 設定成 on 或者 PG 在初始化時開啟 checksums 功能

二主備切換

–備節點 recovery.conf 配置: db2 上操作

[pg95@db2 pg_root]$ grep ^[a-z] recovery.conf 
recovery_target_timeline = `latest`
standby_mode = on
primary_conninfo = `host=192.168.2.37 port=1931 user=repuser`           # e.g. `host=localhost port=5432`

–啟用備節點: db2 上操作

[pg95@db2 pg_root]$ pg_ctl promote -D $PGDATA
server promoting

[pg95@db2 pg_root]$ pg_controldata | grep cluster
Database cluster state:               in production

–備節點啟用後，建立一張測試表並插入資料

[pg95@db2 pg_root]$ psql
psql (9.5alpha1)
Type "help" for help.

postgres=# create table test_2(id int4);
CREATE TABLE
                   
postgres=# insert into test_2(id) select n from generate_series(1,10000) n;
INSERT 0 10000

–停原來主節點: db1 上操作

[pg95@db1 ~]$ pg_controldata | grep cluster
Database cluster state:               in production

[pg95@db1 ~]$ pg_ctl stop -m fast -D $PGDATA
waiting for server to shut down....... done
server stopped

備註：停完原主庫後，千萬不能立即以備節點形式拉起老庫，否則在執行 pg_rewind 時會報，”target server must be shut down cleanly” 錯誤。

–pg_rewind: db1 上操作

[pg95@db1 pg_root]$ pg_ctl stop -m fast -D $PGDATA
waiting for server to shut down......... done
server stopped

[pg95@db1 pg_root]$ pg_rewind --target-pgdata $PGDATA --source-server=`host=192.168.2.38 port=1931 user=postgres dbname=postgres` -P 
connected to server
target server needs to use either data checksums or "wal_log_hints = on"

備註：執行 pg_rewind 丟擲以上錯誤，錯誤內容很明顯。

–pg_rewind 程式碼分析

  364     /*
  365      * Target cluster need to use checksums or hint bit wal-logging, this to
  366      * prevent from data corruption that could occur because of hint bits.
  367      */
  368     if (ControlFile_target.data_checksum_version != PG_DATA_CHECKSUM_VERSION &&
  369         !ControlFile_target.wal_log_hints)
  370     {
  371         pg_fatal("target server needs to use either data checksums or "wal_log_hints = on"
");
  372     }
  373

備註：資料庫在 initdb 時需要開啟 checksums 或者設定 “wal_log_hints = on”，接著設定主，備節點的 wal_log_hints 引數並重啟資料庫。

–再次 pg_rewind, db1 上操作

[pg95@db1 pg_root]$ pg_rewind --target-pgdata $PGDATA --source-server=`host=192.168.2.38 port=1931 user=postgres dbname=postgres` -P
connected to server
The servers diverged at WAL position 0/1300CEB0 on timeline 5.
Rewinding from last common checkpoint at 0/1200008C on timeline 5
reading source file list
reading target file list
reading WAL in target
need to copy 59 MB (total source directory size is 76 MB)
61185/61185 kB (100%) copied
creating backup label and updating control file
Done!

備註：pg_rewind 成功。

–調整 recovery.conf 檔案: db1 操作
[pg95@db1 ~]$ cd $PGDATA
[pg95@db1 pg_root]$ mv recovery.done recovery.conf

備註：注意是否需要修改 primary_conninfo 配置。

[pg95@db1 pg_root]$ grep ^[a-z] recovery.conf 
recovery_target_timeline = `latest`
standby_mode = on
primary_conninfo = `host=192.168.2.38 port=1931 user=repuser`           # e.g. `host=localhost port=5432`

–啟動原主庫， db1 上操作

[pg95@db1 pg_root]$ pg_ctl start -D $PGDATA
server starting

[pg95@db1 pg_root]$ pg_controldata | grep cluster
Database cluster state:               in archive recovery

–資料驗證, db1 上操作

[pg95@db1 pg_root]$ psql
psql (9.5alpha1)
Type "help" for help.

postgres=# select count(*) from test_2;
 count 
-------
 10000
(1 row)

備註：pg_rewind 成功，原主庫現在是以備庫角色啟動，而且資料表 test_2 也同步過來了。

三 pg_rewind 原理

The basic idea is to copy everything from the new cluster to the old cluster, except for the blocks that we know to be the same.

    1)Scan the WAL log of the old cluster, starting from the last checkpoint before the point where the new cluster`s timeline history forked off from the old cluster. For each WAL record, make a note of the data blocks that were touched. This yields a list of all the data blocks that were changed in the old cluster, after the new cluster forked off.

    2)Copy all those changed blocks from the new cluster to the old cluster.

    3)Copy all other files like clog, conf files etc. from the new cluster to old cluster. Everything except the relation files.

    4) Apply the WAL from the new cluster, starting from the checkpoint created at failover. (Strictly speaking, pg_rewind doesn`t apply the WAL, it just creates a backup label file indicating that when PostgreSQL is started, it will start replay from that checkpoint and apply all the required WAL.)

四參考

PostgreSQL HOT-Standby 的主備切換
 PostgreSQL：使用 pg_basebackup 搭建流複製環境
 pg_rewind

oop主節點（NameNode）備份策略以及恢復方法
2014-04-10
OOP
hadoop主節點（NameNode）備份策略以及恢復方法
2017-08-04
Hadoop
RAC一個節點恢復另一個節點在帶庫上的備份
2009-07-27
MySQL8.4備份恢復快速命令
2024-05-10
MySql
RAC恢復到單例項節點上
2014-09-05
單例
oracle9iRAC恢復一個節點
2012-10-30
Oracle
Oracle備份恢復之熱備份恢復及異機恢復
2018-02-05
Oracle
vertica單節點故障恢復 Startup Failed, ASR Required
2019-07-09
AIUI
Oracle RAC恢復成單節點資料庫
2014-07-10
Oracle資料庫
【備份恢復】從備份恢復資料庫
2015-01-20
資料庫
【管理篇備份恢復】備份恢復基礎
2010-05-22
2 Day DBA-管理方案物件-執行備份和恢復-快速恢復區大小
2014-02-04
物件
RAC資料庫的RMAN備份異機恢復到單節點資料庫
2017-08-28
資料庫
oracle快速恢復區
2017-05-09
Oracle
【備份恢復】資料恢復指導
2016-10-20
資料恢復
MySQL備份與恢復——基於Xtrabackup物理備份恢復
2021-07-12
MySql
備份與恢復--利用備份的控制檔案恢復
2009-01-16
Mysql備份恢復
2019-04-18
MySql
oracle冷備恢復
2021-10-08
Oracle
Postgresql 備份恢復
2016-07-28
SQL
redis備份恢復
2016-04-27
Redis
oracle 備用恢復
2014-03-25
Oracle
mysql 備份恢復
2010-01-19
MySql
備份和恢復
2024-07-09
備份與恢復：Polardb資料庫資料基於時間點恢復
2020-11-12
資料庫
備份與恢復：polardb資料庫備份與恢復
2020-11-13
資料庫
【備份恢復】Oracle 資料備份與恢復微實踐
2015-03-27
Oracle
MySQL 非常規恢復與物理備份恢復
2019-04-30
MySql
oracle冷備份、恢復和異機恢復
2014-03-11
Oracle
備份與恢復（Parameter 檔案恢復篇）
2012-07-19
ROSE HA切換節點導致DG失敗、恢復
2014-12-16
ROS
【備份與恢復】控制檔案的恢復（不完全恢復）
2012-06-25
【物理熱備】（下）備份恢復系統表空間手工備份恢復
2016-10-16
Redis當機快速恢復
2021-08-02
Redis
2 Day DBA-管理方案物件-執行備份和恢復-監控快速恢復區的使用
2014-02-04
物件
postgreSQL 恢復至故障點精準恢復
2019-01-01
SQL
詳解叢集級備份恢復：物理細粒度備份恢復
2023-05-12
【備份恢復】noarchive模式下使用增量備份恢復資料庫
2016-10-17
Hive模式資料庫

PostgreSQL9.5：pg_rewind 快速恢復備節點

一 環境準備

二 主備切換

三 pg_rewind 原理

四 參考

相關文章

一環境準備

二主備切換

四參考