ASM REACTING TO PARTITION ERRORS [ID 1062954.1]
ASM REACTING TO PARTITION ERRORS [ID 1062954.1]
--------------------------------------------------------------------------------
修改時間 10-AUG-2011 型別 PROBLEM 狀態 PUBLISHED
修改時間 10-AUG-2011 型別 PROBLEM 狀態 PUBLISHED
In this Document
Symptoms
Cause
Solution
Symptoms
Cause
Solution
--------------------------------------------------------------------------------
Applies to:
Oracle Server - Enterprise Edition - Version: 10.1.0.4 to 11.2.0.1.0 - Release: 10.1 to 11.2
Linux x86
Haansoft Linux x86-64
Oracle Server - Enterprise Edition - Version: 10.1.0.4 to 11.2.0.1.0 - Release: 10.1 to 11.2
Linux x86
Haansoft Linux x86-64
Symptoms
Randomly disks that belonged to ASM disk groups show as PROVISIONED or at times as CANDIDATE in v$asm_disk.header_status. Upon dismount, disk groups with those disks will not mount.
Randomly disks that belonged to ASM disk groups show as PROVISIONED or at times as CANDIDATE in v$asm_disk.header_status. Upon dismount, disk groups with those disks will not mount.
From ASM alert log:
ERROR: diskgroup was not mounted
ORA-15032: not all alterations performed
ORA-15063: ASM discovered an insufficient number of disks for diskgroup ""
ORA-15032: not all alterations performed
ORA-15063: ASM discovered an insufficient number of disks for diskgroup ""
Or
ORA-15032: not all alterations performed
ORA-15040: diskgroup is incomplete
ORA-15042: ASM disk "disk number here>" is missing
ORA-15063: ASM discovered an insufficient number of disks for diskgroup ""
ORA-15040: diskgroup is incomplete
ORA-15042: ASM disk "disk number here>" is missing
ORA-15063: ASM discovered an insufficient number of disks for diskgroup ""
This seems to occur when new LUNs are either added or configured on the cluster, but this behavior. has occurred several times (10+), on more than one cluster and across separate data centers.
At times the disks' v$asm_header_status is of member but still the disk groups will not mount, upon attempt to re-mount the disk group.
While troubleshooting the issue, it has been noticed that the OS partition table for the devices employed by the ASM disks, is wiped out (does not exist). This issue is similarly reproduced when dd is used to wiped the devices although this does not explain why some times the disks will show with v$asm_disk.header_status=member, and still cannot be mountable.
Cause
It turns out that the inq.Linux command is incorrectly writing to /dev/sd device, which is wiping out the partition table. Depending on which mpath device /dev/sd is part of, where once notices the corruption.
It turns out that the inq.Linux command is incorrectly writing to /dev/sd
This is caused by EMC bug which has older version (prior to versions 6.3.0.0-771) of the Linux inq utility/command. The eNav utility calls inq.Linux.
Details from EMC bug:
1) older versions of this command scanned all devices in /dev, not just scsi disks, and so included /dev/kmsg
2) older versions of this command incorrectly matched /dev/kmsg and /dev/sd thinking it was multiple paths to the same device, when it is not.
3) older versions of this command allocated a 216 byte inquiry buffer. This was apparently sufficient for EMC devices, but was too small for certain other disks. The scsi layer would return an error if the buffer is undersized.
1) older versions of this command scanned all devices in /dev, not just scsi disks, and so included /dev/kmsg
2) older versions of this command incorrectly matched /dev/kmsg and /dev/sd
3) older versions of this command allocated a 216 byte inquiry buffer. This was apparently sufficient for EMC devices, but was too small for certain other disks. The scsi layer would return an error if the buffer is undersized.
The above 3 conditions basically caused the errors to erroneously get routed to /dev/sd instead of /dev/kmsg, which then wipes out the corresponding partition tables. All three conditions are fixed in versions of the INQ command after 6.3.0.0-771.
It is assumed that any installations with over 500 scsi disks attached (includes multiple paths to the same disk via multipathing, etc....so in that case, only 250 or so LUNs if the environment has two paths per LUN, minus the number of locally attached disks) which would cause /dev/sd to exist and that were running the eNav utility were at risk for similar corruption.
Note: Verified/confirmed by customer and sources outside Oracle (RedHat, EMC, Maryville) however no further details, like the EMC bug number or any other additional information, were furthermore provided.
Solution
An immediate work-around is to comment out the calling of the inq.Linux command to prevent it from happening across your environment. For this one has to contact either Maryville support (vendor of eNav) or EMC.
Another appropriate solution is to upgrade to a newer version of inq.Linux.
來自 “ ITPUB部落格 ” ,連結:http://blog.itpub.net/20674423/viewspace-721540/,如需轉載,請註明出處,否則將追究法律責任。
相關文章
- partition table and partition indexIndex
- PARTITION partition01
- PARTITION partition02
- PARTITION partition04
- doesn't contain a valid partition tableAI
- Pruning、Reference Partition、Exchange Partition
- partition timestamp(0) not use partition
- Identify If A Disk/Part Is Still Used By ASM,Used by ASM Or Used by ASM_603210.1IDEASM
- Android Transform + ASM 初探AndroidORMASM
- Metric Collection ErrorsError
- oracle之errorsOracleError
- oracle 之errorsOracleError
- RMAN Duplicate Database From RAC ASM To RAC ASM [ID 461479.1]DatabaseASM
- Duplicate database from non ASM to ASM to a different host [ID 382669.1]DatabaseASM
- Database Capture and Replay: Common Errors and Reasons (文件 ID 463263.1)DatabaseAPTError
- PARTITION SPILT
- hive partitionHive
- over (partition by)
- exchange partition
- oracle partitionOracle
- split partition
- Clique Partition
- Go Errors 詳解GoError
- slave-skip-errorsError
- HIbernate errors...Error
- Data Warehouse Guide閱讀筆記(七):partition tableGUIIDE筆記
- How to Create a BNE Log For Web Adi Issues and Errors? [ID 817023.1]WebError
- DIAGNOSING ORA-3113 ERRORS [ID 1020463.6]Error
- Partition Pruning和Partition-Wise Joins
- How to Convert a Single-Instance ASM to Cluster ASM [ID 452758.1]ASM
- How to Prepare Storage for ASM (Doc ID 452924.1)ASM
- ASM條帶揭密----_asm_stripesize、_asm_stripewidth引數的設定和影響ASM
- oracle partition by group by,詳解partition by和group by對比Oracle
- partition table update partition-key result in changing tablespace
- TiDB 原始碼閱讀系列文章(二十)Table PartitionTiDB原始碼
- ORA-31600: invalid input value INDEX PARTITION for parameterIndex
- ORA-04045: errors during recompilation/revalidation of LBACSYS.LBAC_EVENTSError
- com.android.tools.build:gradle:2.0.0-alpha3 build errorsAndroidUIGradleError