ORA-600(KSFD_DECAIOPC)和ORA-600(kfioReapIO00)錯誤

dbhelper發表於2015-01-15

由於共享磁碟問題導致的兩個ORA-600錯誤。

 

 

客戶的10.2.0.4 RAC for Linux X86-64,在告警日誌中出現了大量的錯誤資訊:

Tue Apr 24 16:15:04 2012
Errors in file /u01/admin/orcl/udump/orcl1_ora_10437.trc:
ORA-00600: internal error code, arguments: [KSFD_DECAIOPC], [0xFC213CBF0], [], [], [], [], [], []
ORA-07445: exception encountered: core dump [<0x9293a0>] [SIGSEGV] [Address not mapped to object] [0x0000007CA] [] []
ORA-07445: exception encountered: core dump [<0xb5814a>] [SIGSEGV] [Address not mapped to object] [0xFFFFFFFFFFFFFFF9] [] []
ORA-00333: redo log read error block 2 count 8192
ORA-00202: control file: '+ASM_DISK1/orcl/controlfile/current.256.757170241'
ORA-15081: failed to submit an I/O operation to a disk
Tue Apr 24 16:15:14 2012
WARNING: kfk failed to open a disk[/dev/oracleasm/disks/DISK4]
Tue Apr 24 16:15:14 2012
Errors in file /u01/admin/orcl/udump/orcl1_ora_10437.trc:
ORA-15025: could not open disk '/dev/oracleasm/disks/DISK4'
ORA-27041: unable to open file
Linux-x86_64 Error: 24: Too many open files
Additional information: 3
ORA-00600: internal error code, arguments: [KSFD_DECAIOPC], [0xFC213CBF0], [], [], [], [], [], []
ORA-07445: exception encountered: core dump [<0x9293a0>] [SIGSEGV] [Address not mapped to object] [0x0000007CA] [] []
ORA-07445: exception encountered: core dump [<0xb5814a>] [SIGSEGV] [Address not mapped to object] [0xFFFFFFFFFFFFFFF9] [] []
ORA-00333: redo log read error block 2 count 8192
ORA-00202: control file: '+ASM_DISK1/orcl/controlfile/current.256.757170241'
ORA-15081: failed to submit an I/O operation to a disk
WARNING: kfk failed to open a disk[/dev/oracleasm/disks/DISK2]

當前的版本10.2.0.4ORA-600錯誤資訊KSFD_DECAIOPC,都符合bug 8433026的描述,但是當前資料庫並未配置STREAM環境,雖然當前庫配置了DSG的複製應用,但是畢竟和流應用還是有所區別。

從錯誤資訊可以判斷,當前的問題要問題來自ASM磁碟組中部分磁碟存在異常,導致讀取時出現錯誤。

除了KSFD_DECAIOPC錯誤外,由於底層共享儲存的問題,還導致了另外的ORA-600錯誤:

Tue Jul 10 10:39:53 2012
Errors in file /u01/admin/orcl/udump/orcl1_ora_19546.trc:
ORA-00600: internal error code, arguments: [kfioReapIO00], [0], [52], [], [], [], [], []
ORA-00333: redo log read error block 2 count 8192
ORA-00600: internal error code, arguments: [KSFD_DECAIOPC], [0xFC7D6ADB8], [], [], [], [], [], []
ORA-07445: exception encountered: core dump [<0x9293a0>] [SIGSEGV] [Address not mapped to object] [0x0000007CA] [] []
ORA-07445: exception encountered: core dump [<0xb5814a>] [SIGSEGV] [Address not mapped to object] [0xFFFFFFFFFFFFFFF9] [] []
ORA-00333: redo log read error block 2 count 8192
ORA-00202: control file: '+ASM_DISK1/orcl/controlfile/current.256.757170241'
ORA-15081: failed to submit an I/O operation to a disk

顯然這兩個ORA-600都是由於底層磁碟錯誤所引起的,而當硬體人員解決了共享磁碟錯誤後,ASM例項沒有經過重啟就恢復了正常,此後也沒有類似錯誤的出現。

 

來自 “ ITPUB部落格 ” ,連結:http://blog.itpub.net/8494287/viewspace-1402978/,如需轉載,請註明出處,否則將追究法律責任。

相關文章