Instance Crash Caused by Loss of the Private Network Interface
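A customer's 10.2.0.4 RAC database hit a network failure that crashed the instance and produced a large number of ORA-27300 errors.
The detailed error messages from the alert log are: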
Wed Nov 21 16:37:36 2012
Errors in file /u01/oracle/app/admin/orcl/udump/orcl2_ora_29173.trc:
ORA-00603: ORACLE server session terminated by fatal error
ORA-27504: IPC error creating OSD context
ORA-27300: OS system dependent operation:if_not_found failed with status: 0
ORA-27301: OS failure message: Error 0
ORA-27302: failure occurred at: skgxpvaddr9
ORA-27303: additional information: requested interface 10.0.1.2 not found. Check
output from ifconfig command
Wed Nov 21 16:37:36 2012
Errors in file /u01/oracle/app/admin/orcl/udump/orcl2_ora_29198.trc:
ORA-00603: ORACLE server session terminated by fatal error
ORA-27504: IPC error creating OSD context
ORA-27300: OS system dependent operation:if_not_found failed with status: 0
ORA-27301: OS failure message: Error 0
ORA-27302: failure occurred at: skgxpvaddr9
ORA-27303: additional information: requested interface 10.0.1.2 not found.
Check output from ifconfig command
Wed Nov 21 16:37:56 2012
Trace dumping is performing id=[cdmp_20121121163746]
Wed Nov 21 16:38:00 2012
ospid 28424: network interface with IP address 10.0.1.2 no longer operational
requested interface 10.0.1.2 not found. Check output from ifconfig command
Wed Nov 21 16:38:07 2012
Error: KGXGN aborts the instance (6)
Wed Nov 21 16:38:07 2012
Errors in file /u01/oracle/app/admin/orcl/bdump/orcl2_lmon_28422.trc:
ORA-29702: error occurred in Cluster Group Service operation
LMON: terminating instance due to error 29702
Wed Nov 21 16:38:07 2012
Errors in file /u01/oracle/app/admin/orcl/bdump/orcl2_lms1_28430.trc:
ORA-29702: error occurred in Cluster Group Service operation
Wed Nov 21 16:38:07 2012
Errors in file /u01/oracle/app/admin/orcl/bdump/orcl2_lms3_28438.trc:
ORA-29702: error occurred in Cluster Group Service operation
.
.
.
Wed Nov 21 16:38:09 2012
Errors in file /u01/oracle/app/admin/orcl/bdump/orcl2_j000_28635.trc:
ORA-29702: error occurred in Cluster Group Service operation
ORA-29702: error occurred in Cluster Group Service operation
Wed Nov 21 16:38:09 2012
Errors in file /u01/oracle/app/admin/orcl/bdump/orcl2_mman_28450.trc:
ORA-29702: error occurred in Cluster Group Service operation
Wed Nov 21 16:38:09 2012
Errors in file /u01/oracle/app/admin/orcl/bdump/orcl2_asmb_28496.trc:
ORA-15064: communication failure with ASM instance
ORA-03113: end-of-file on communication channel
Wed Nov 21 16:38:10 2012
Errors in file /u01/oracle/app/admin/orcl/bdump/orcl2_pmon_28416.trc:
ORA-29702: error occurred in Cluster Group Service operation
Wed Nov 21 16:38:10 2012
Errors in file /u01/oracle/app/admin/orcl/bdump/orcl2_smon_28462.trc:
ORA-29702: error occurred in Cluster Group Service operation
Wed Nov 21 17:25:50 2012
Starting ORACLE instance (normal)
LICENSE_MAX_SESSION = 0
LICENSE_SESSIONS_WARNING = 0
Interface type 1 eth1 10.0.1.0 configured from OCR for use as a cluster
interconnect
Interface type 1 eth0 172.18.19.0 configured from OCR for use as a public interface
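Clearly, the problem that brought the RAC node down came from the operating system or hardware layer. The ORA-27504 error was raised because of the OS-related errors ORA-27300, ORA-27301, ORA-27302 and ORA-27303. Those errors state explicitly that the private network interface address 10.0.1.2 could not be found, and they direct the administrator to check the output of the operating system's ifconfig command.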
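The address named in the ORA-27303 lines can be pulled out of an alert log mechanically. The following is a minimal Python sketch, assuming the log text has already been read into a string. The function name `missing_interfaces` and the regular expression are illustrative and not part of any Oracle tooling. The sample text is copied from the log excerpt above.

```python
import re

# Matches the "requested interface <IPv4> not found" message seen in the alert log.
IFACE_NOT_FOUND = re.compile(
    r"requested interface (\d{1,3}(?:\.\d{1,3}){3}) not found"
)

def missing_interfaces(alert_log_text):
    """Return the distinct IP addresses reported as missing, in first-seen order."""
    seen = []
    for match in IFACE_NOT_FOUND.finditer(alert_log_text):
        ip = match.group(1)
        if ip not in seen:
            seen.append(ip)
    return seen

# Lines copied from the alert log excerpt in this article.
sample = """ORA-27303: additional information: requested interface 10.0.1.2 not found. Check
output from ifconfig command
ospid 28424: network interface with IP address 10.0.1.2 no longer operational
requested interface 10.0.1.2 not found. Check output from ifconfig command
"""

print(missing_interfaces(sample))  # prints ['10.0.1.2']
```

Comparing the extracted address with the interconnect subnet reported at startup (eth1, 10.0.1.0) shows that the missing address belongs to the private interconnect, not the public interface.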
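Oracle's network heartbeat depends on the private network, so once the private interface disappeared it is no surprise that the database node crashed.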
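Clearly this should not be treated as an Oracle bug: the error messages Oracle produced already point to the cause. The key to resolving the problem is finding out why the network interface failed at the operating system level.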
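As a starting point for that OS-level investigation, a small check can confirm whether the interconnect address is still configured on the host. This is a rough Python sketch, not a monitoring solution. It assumes a Linux host with the iproute2 `ip` tool available; the helper names and the sample output (including the address 172.18.19.12) are hypothetical. The regular expression also accepts the older `inet addr:` style printed by some ifconfig versions.

```python
import re
import subprocess

def addresses_in(command_output):
    """Extract IPv4 addresses from `ip -4 addr show` or ifconfig-style text."""
    return set(
        re.findall(r"inet (?:addr:)?(\d{1,3}(?:\.\d{1,3}){3})", command_output)
    )

def interface_present(ip, command_output):
    """True if the given IP appears as a configured address in the output."""
    return ip in addresses_in(command_output)

def current_addresses():
    """Run `ip -4 addr show` and return its text (assumes iproute2 is installed)."""
    result = subprocess.run(
        ["ip", "-4", "addr", "show"],
        capture_output=True, text=True, check=True,
    )
    return result.stdout

# Hypothetical output for a host whose private interface has gone missing.
sample_output = """2: eth0: <BROADCAST,MULTICAST,UP,LOWER_UP> mtu 1500
    inet 172.18.19.12/24 brd 172.18.19.255 scope global eth0
"""

print(interface_present("10.0.1.2", sample_output))  # prints False
```

On a live node, `interface_present("10.0.1.2", current_addresses())` would perform the same check against the real interface list. A result of `False` only confirms the symptom; the cause (cabling, NIC, driver, switch, or a configuration change) still has to be traced in the operating system logs.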
Source: ITPUB Blog, http://blog.itpub.net/4227/viewspace-1060815/. If you reproduce this article, please credit the source; otherwise legal liability will be pursued.