ORA-19599 When Backing up an Archivelog that is Corrupt

潇湘隐者發表於2024-04-19

原文網址 : https://www.cnblogs.com/kerrycode/p/18145275

前幾天遇到了一起備份失敗案例，RMAN備份過程中遇到了歸檔日誌損壞的情況，還是第一次遇到這種案例，這裡記錄一下這個案例的具體情況。

備份作業失敗，檢查RMAN備份的輸出日誌，發現一個歸檔日誌檔案損壞（corrupt）了，如下所示：

RMAN-08137: warning: archived log not deleted, needed for standby or upstream capture process
RMAN-08515: archived log file name=/eapdblog/eap_1_666_1155313416.arc thread=1 sequence=666
RMAN-08137: warning: archived log not deleted, needed for standby or upstream capture process
RMAN-08515: archived log file name=/eapdblog/eap_1_667_1155313416.arc thread=1 sequence=667
RMAN-03009: failure of backup command on dev_0 channel at 04/09/2024 09:44:50
ORA-27192: skgfcls: sbtclose2 returned error - failed to close file
ORA-19511: non RMAN, but media manager or vendor specific failure, error text:
   Vendor specific error: OB2_EndObjectBackup() failed ERR(-2)
ORA-19599: block number 316064 is corrupt in archived log /eapdblog/eap_1_660_1155313416.arc

檢查驗證歸檔日誌，發現歸檔日誌檔案eap_1_660_1155313416.arc確實損壞。如下所示：

RMAN> validate archivelog all;

Starting validate at 09-APR-24
using target database control file instead of recovery catalog
allocated channel: ORA_DISK_1
channel ORA_DISK_1: SID=261 device type=DISK
channel ORA_DISK_1: starting validation of archived log
channel ORA_DISK_1: specifying archived log(s) for validation
input archived log thread=1 sequence=660 RECID=645 STAMP=1165788069
input archived log thread=1 sequence=663 RECID=648 STAMP=1165824445
input archived log thread=1 sequence=664 RECID=649 STAMP=1165828881
input archived log thread=1 sequence=665 RECID=650 STAMP=1165829178
input archived log thread=1 sequence=666 RECID=651 STAMP=1165829976
input archived log thread=1 sequence=667 RECID=652 STAMP=1165830268
channel ORA_DISK_1: validation complete, elapsed time: 00:00:01
List of Archived Logs
=====================
Thrd Seq     Status Blocks Failing Blocks Examined Name
---- ------- ------ -------------- --------------- ---------------
1    660     FAILED 8              346599          /eapdblog/eap_1_660_1155313416.arc
1    663     OK     0              382900          /eapdblog/eap_1_663_1155313416.arc
1    664     OK     0              94593           /eapdblog/eap_1_664_1155313416.arc
1    665     OK     0              1748            /eapdblog/eap_1_665_1155313416.arc
1    666     OK     0              17557           /eapdblog/eap_1_666_1155313416.arc
1    667     OK     0              4226            /eapdblog/eap_1_667_1155313416.arc
validate found one or more corrupt blocks
See trace file /eapdb/diag/rdbms/eap/eap/trace/eap_ora_917867.trc for details
Finished validate at 09-APR-24

RMAN> exit

檢查告警日誌，也看到下面資訊。

2024-04-08T23:15:05.730996+08:00

***
Corrupt block seq: 660 blocknum=316064.
Bad header found during backing up archived log
Data in bad block - flag:1. format:34. bno:93696. seq:649
beg:16 cks:21324
calculated check value: 21324

Reread of seq=660, blocknum=316064, file=/eapdblog/eap_1_660_1155313416.arc, found same corrupt data
Reread of seq=660, blocknum=316064, file=/eapdblog/eap_1_660_1155313416.arc, found same corrupt data
Reread of seq=660, blocknum=316064, file=/eapdblog/eap_1_660_1155313416.arc, found same corrupt data
Reread of seq=660, blocknum=316064, file=/eapdblog/eap_1_660_1155313416.arc, found same corrupt data
Reread of seq=660, blocknum=316064, file=/eapdblog/eap_1_660_1155313416.arc, found same corrupt data
2024-04-08T23:15:21.671470+08:00

***
Corrupt block seq: 660 blocknum=316064.
Bad header found during backing up archived log
Data in bad block - flag:1. format:34. bno:93696. seq:649
beg:16 cks:21324
calculated check value: 21324

Reread of seq=660, blocknum=316064, file=/eapdblog/eap_1_660_1155313416.arc, found same corrupt data
Reread of seq=660, blocknum=316064, file=/eapdblog/eap_1_660_1155313416.arc, found same corrupt data
Reread of seq=660, blocknum=316064, file=/eapdblog/eap_1_660_1155313416.arc, found same corrupt data
Reread of seq=660, blocknum=316064, file=/eapdblog/eap_1_660_1155313416.arc, found same corrupt data
Reread of seq=660, blocknum=316064, file=/eapdblog/eap_1_660_1155313416.arc, found same corrupt data
2024-04-08T23:15:36.695623+08:00

雖然知道歸檔日誌損壞了，但是不清楚什麼原因導致歸檔日誌損壞，之前也見過別人分享的案例ORA-1578 ORA-353 ORA-19599 Corrupt blocks with zeros when filesystemio_options=SETALL on ext4 file system using Linux (Doc ID 1487957.1)，但是當前環境如下所示，跟Doc ID 1487957.1中案例環境完全不一樣

作業系統  ：Red Hat Enterprise Linux release 8.8 (Ootpa)

資料庫版本： Oracle 19c Enterprise Edition 19.20.0.0.0

檔案系統為： xfs

開了Service Requests，然後提交各種日誌，以及損壞歸檔日誌的dump檔案，最後官方反饋跟未公開的兩個bug非常相似（下面截圖）。不過這種現象發生的頻率非常少。還是第一次遇到這種錯誤。官方技術支援建議，如果這種情況出現的頻率很少，建議觀察，如果出現頻率很高，建議修改filesystemio_options為directio來規避這個問題。

sqlplus / as sysdba
oradebug setmypid
oradebug tracefile_name 
alter system dump logfile '/eapdblog/eap_1_660_1155313416.arc' VALIDATE;

做了如下操作處理，然後重新做了RMAN完整備份，又觀察了好幾天，暫時一直未遇到這個錯誤。

手工刪除這個損壞的歸檔日誌
RMAN > crosscheck archivelog all;
RMAN> DELETE EXPIRED ARCHIVELOG sequence 660;

Oracle OCP(58)：ARCHIVELOG 管理
2019-06-06
OracleHive
【轉】恢復archivelog介紹
2019-05-31
Hive
PG 自動刪除archivelog
2021-10-14
Hive
打 patch 報錯：corrupt patch at line 36
2024-08-20
獎金up up up！單個漏洞最高獎勵2萬元！
2022-06-17
Level Up
2024-10-17
wake up
2024-10-04
rman 還原歸檔日誌(restore archivelog
2018-11-21
RESTHive
rman備份archivelog出現ORA-19625
2019-10-18
Hive
case when 語句
2022-08-02
day day up
2024-11-18
Flashback database必須要有之前的archivelog嗎？
2019-01-19
DatabaseHive
理解RMAN backup database plus archivelog delete all input命令
2018-06-27
DatabaseHivedelete
Git 錯誤：fatel: loose object ... is corrupt 解決辦法
2018-09-20
GitObject
Extract rows from a CORRUPT table creating ROWID from DBA_EXTENTS
2019-04-28
ext4 lvreduce報錯superblock or the partition table is likely to be corrupt
2021-04-09
VRBloC
E. Level Up
2024-08-01
mysql中case when的使用
2018-07-27
MySql
sql case when, Exist ,group by ,聚合
2024-03-15
SQL
Oracle case when改寫SQL
2020-05-23
OracleSQL
Laravel query when 的查詢
2019-06-18
Laravel
SpokenEnglish01_ When's it due?
2019-05-08
[Reactive] Run functions when data changes
2024-09-24
ReactFunction
SQL Server CASE WHEN ... THEN ... ELSE ... END
2024-07-25
SQLServer
What are general rules when deciding on index?
2022-02-07
Index
drools中的條件 when
2022-05-24
TLS Poison - When TLS Hack you
2021-04-23
TLS
clean_up_log.sh
2019-02-22
ALi CTF 2015 write up
2020-08-19
up主進軍之路
2020-10-17
SQLServer使用case when中的order by
2018-12-11
SQLServer
What is the Impact on the Database When Modifying the OS Date
2019-06-16
Database
翻譯|Where and When to Fetch Data With Redux
2019-04-28
Redux
在CSS中如何使用 when/else
2021-12-30
CSS
For example, when you want to get the ball to the ground
2021-11-04
【YashanDB知識庫】archivelog磁碟滿導致資料庫abnormal
2024-09-14
Hive資料庫ORM
Original error: Error: socket hang up
2024-05-23
Error
2016 ALICTF xxFileSystem write-up
2020-08-19

ORA-19599 When Backing up an Archivelog that is Corrupt

相關文章