Some Errors Encountered While Installing 11gR2 RAC

Posted by murkey on 2015-03-09

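       This was my first 11gR2 installation. I ran into many problems, and tracking down the errors took quite some time. The issues are recorded below: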

I. root.sh fails during the grid installation with the following errors

 

Errors in file :

ORA-27091: unable to queue I/O

ORA-15081: failed to submit an I/O operation to a disk

ORA-06512: at line 4

PROT-1: Failed to initialize ocrconfig

Command return code of 255 (65280) from command: /oracle/grid/bin/ocrconfig -upgrade grid oinstall

Failed to create Oracle Cluster Registry configuration, rc 255

CRS-2500: Cannot stop resource 'ora.crsd' as it is not running

CRS-4000: Command Stop failed, or completed with errors.

Command return code of 1 (256) from command: /oracle/grid/bin/crsctl stop resource ora.crsd -init

Stop of resource "ora.crsd -init" failed

Failed to stop CRSD

CRS-2673: Attempting to stop 'ora.asm' on 'rac1'

CRS-2677: Stop of 'ora.asm' on 'rac1' succeeded

CRS-2673: Attempting to stop 'ora.ctssd' on 'rac1'

CRS-2677: Stop of 'ora.ctssd' on 'rac1' succeeded

CRS-2673: Attempting to stop 'ora.cssdmonitor' on 'rac1'

CRS-2677: Stop of 'ora.cssdmonitor' on 'rac1' succeeded

CRS-2673: Attempting to stop 'ora.cssd' on 'rac1'

CRS-2677: Stop of 'ora.cssd' on 'rac1' succeeded

CRS-2673: Attempting to stop 'ora.gpnpd' on 'rac1'

CRS-2677: Stop of 'ora.gpnpd' on 'rac1' succeeded

CRS-2673: Attempting to stop 'ora.gipcd' on 'rac1'

CRS-2677: Stop of 'ora.gipcd' on 'rac1' succeeded

CRS-2673: Attempting to stop 'ora.mdnsd' on 'rac1'

CRS-2677: Stop of 'ora.mdnsd' on 'rac1' succeeded

Initial cluster configuration failed.  See /oracle/grid/cfgtoollogs/crsconfig/rootcrs_rac1.log for details

 

Solution: the problem was resolved by following the Oracle support note "Linux: root.sh Fails to Format Voting disks when Placing OCR/Voting Disks on ASM Using asmlib" [ID 955550.1], quoted below.

Applies to:

Oracle Server - Enterprise Edition - Version: 11.2.0.0 to 11.2.0.2 - Release: 11.2 to 11.2
Linux x86
Red Hat Enterprise Linux Advanced Server x86-64 (AMD Opteron Architecture)
x86 64 bit (for Enterprise Linux only)
Linux x86-64
x86 32 bit (for Enterprise Linux only)
Grid Infrastructure, clusterware, CRS, voting disk

Symptoms

Oracle Grid Infrastructure installation with Oracle ASM using ASMlib fails during root.sh execution.

At the completion of the Grid Infrastructure installation, the root.sh script executed on the first node in the cluster fails to format the OCR and voting disks, which are placed on ASM storage.

Example of the error when root.sh fails:

CRS-2676: Start of 'ora.ctssd' on 'auw2k3' succeeded
ASM created and started successfully.
DiskGroup DATA created successfully.

Errors in file :
ORA-27091: unable to queue I/O
ORA-15081: failed to submit an I/O operation to a disk
ORA-06512: at line 4
PROT-1: Failed to initialize ocrconfig
Command return code of 255 (65280) from command: /u01/app/11.2.0/grid/bin/ocrconfig -upgrade oragrid oinstall
Failed to create Oracle Cluster Registry configuration, rc 255
CRS-2500: Cannot stop resource 'ora.crsd' as it is not running
CRS-4000: Command Stop failed, or completed with errors.
Command return code of 1 (256) from command: /u01/app/11.2.0/grid/bin/crsctl stop resource ora.crsd -init
Stop of resource "ora.crsd -init" failed

 

Changes

New installation

Cause

The diskgroup is successfully created, but root.sh fails while generating the OCR keys when it invokes ocrconfig, because the user ID configured for the ASMlib driver differs from the grid software owner.

The Oracle Grid Infrastructure installation generates a log file in $GRID_HOME/cfgtoollogs/crsconfig.
Per log file 'rootcrs_.log':

2009-10-21 12:57:26: Querying for existing CSS voting disks
2009-10-21 12:57:26: Performing initial configuration for cluster
2009-10-21 12:57:28: Start of resource "ora.ctssd -init" Succeeded
2009-10-21 12:57:28: Configuring ASM via ASMCA
2009-10-21 12:57:28: Executing as oragrid: /u01/app/11.2.0/grid/bin/asmca -silent -diskGroupName DATA -diskList ORCL:ASMD40,ORCL:ASMD41 -redundancy EXTERNAL -configureLocalASM
2009-10-21 12:57:28: Running as user oragrid: /u01/app/11.2.0/grid/bin/asmca -silent -diskGroupName DATA -diskList ORCL:ASMD40,ORCL:ASMD41 -redundancy EXTERNAL -configureLocalASM
2009-10-21 12:57:28:   Invoking "/u01/app/11.2.0/grid/bin/asmca -silent -diskGroupName DATA -diskList ORCL:ASMD40,ORCL:ASMD41 -redundancy EXTERNAL -configureLocalASM" as user "oragrid"
2009-10-21 12:58:02: Creating or upgrading OCR keys
2009-10-21 12:58:04: Command return code of 255 (65280) from command: /u01/app/11.2.0/grid/bin/ocrconfig -upgrade oragrid oinstall
2009-10-21 12:58:04: Failed to create Oracle Cluster Registry configuration, rc 255
2009-10-21 12:58:04: Exiting exclusive mode
2009-10-21 12:58:04: Command return code of 1 (256) from command: /u01/app/11.2.0/grid/bin/crsctl stop resource ora.crsd -init
2009-10-21 12:58:04: Stop of resource "ora.crsd -init" failed


In the above example the grid software owner is oragrid, but the ASMlib device driver has been configured with owner oracle (a different user), even though both users belong to the dba group:

% /usr/sbin/oracleasm configure
ORACLEASM_ENABLED=true
ORACLEASM_UID=oracle
ORACLEASM_GID=dba
ORACLEASM_SCANBOOT=true
ORACLEASM_SCANORDER=""
ORACLEASM_SCANEXCLUDE=""
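
The ownership mismatch described above can be checked before root.sh is run. The following is only a minimal sketch: the function name asmlib_owner_matches is hypothetical, and it assumes the `oracleasm configure` output has the KEY=value form shown above.

```shell
# Hypothetical helper (not part of oracleasm): read `oracleasm configure`
# output on stdin and compare ORACLEASM_UID with the expected owner ($1).
asmlib_owner_matches() {
    expected_owner="$1"
    # Keep only the value after "ORACLEASM_UID=" and strip optional quotes.
    configured_owner=$(sed -n 's/^ORACLEASM_UID=//p' | tr -d '"' | head -n 1)
    if [ "$configured_owner" = "$expected_owner" ]; then
        echo "OK: ASMlib owner is $configured_owner"
        return 0
    fi
    echo "MISMATCH: ASMlib owner is '$configured_owner', expected '$expected_owner'"
    return 1
}

# Example use as root, assuming the grid software owner is oragrid:
#   /usr/sbin/oracleasm configure | asmlib_owner_matches oragrid
```

With the configuration shown above, this would report a mismatch between oracle and oragrid, which is the condition the note identifies as the cause.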

Solution

Please note all tasks below are done as the root user!

1. To resolve this problem, first deconfigure Grid.
Run the following script on any node on which root.sh was executed:

% $GRID_HOME/crs/install/rootcrs.pl -deconfig -force


2. Reconfigure ASMlib
Delete the ASMlib devices used for the OCR and voting ASM diskgroup. In the example above, the devices used are:
ORCL:ASMD40
ORCL:ASMD41

% /usr/sbin/oracleasm deletedisk ASMD40
% /usr/sbin/oracleasm deletedisk ASMD41


3.  Reconfigure the ASMlib driver to use the correct user ID. In this case the "oragrid" user is the software owner of the Grid Infrastructure installation. Make sure the following is done on all nodes:

% /usr/sbin/oracleasm configure -u oragrid
% /usr/sbin/oracleasm configure


4.  Stop and start ASMlib on all nodes

% /usr/sbin/oracleasm exit
% /usr/sbin/oracleasm init
% /usr/sbin/oracleasm scandisks


5.  Re-create the ASMlib disks used for the Grid Infrastructure installation

% oracleasm createdisk ASMD40 /dev/sdh1
% oracleasm createdisk ASMD41 /dev/sdi1


Then run scandisks again on all nodes:

% /usr/sbin/oracleasm scandisks

 

The ASMlib devices should now be correctly configured. This can be confirmed by running 'oracleasm configure' or 'ls -la /dev/oracleasm/disks/'.


6.  Re-run 'root.sh'

 

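II. When configuring the ASM disk group, no disks can be found in the asmca configuration interface.

         Solution: disable the asm_diskstring setting in init+ASM1.ORA, then run asmca again to complete the configuration.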

 

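III. dbca does not detect the ASM disk groups when creating a database

         Solution: the ASM disk groups were not detected because usermod -G dba,asmdba,oper oracle had not been run. Run that command as root, then start dbca again.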
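
Whether the oracle user already has the groups named in that usermod command can be checked first. The sketch below is only an illustration: missing_groups is a hypothetical helper, and the group names are the ones used in this post.

```shell
# Hypothetical helper: print the required groups that are absent from a
# space-separated group list such as the output of `id -nG oracle`.
missing_groups() {
    have=" $1 "            # pad so every group name is delimited by spaces
    shift
    missing=""
    for group in "$@"; do
        case "$have" in
            *" $group "*) ;;                   # already a member
            *) missing="$missing $group" ;;    # not in the list
        esac
    done
    # Print the missing names without the leading space.
    echo "${missing# }"
}

# Example use, assuming the database software owner is oracle:
#   missing_groups "$(id -nG oracle)" dba asmdba oper
```

An empty result means all three groups are present. Padding the list with spaces keeps dba from being mistaken for a match inside asmdba.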

IV. During installation, an error is reported: "The system on which you are attempting to install Oracle RAC is not part of a valid cluster"

 

 

This is an 11gR2 bug. The cause is that inventory.xml lacks markers such as CRS=TRUE.

Update the values in /home/oracle/oracle11g/oraInventory/ContentsXML/inventory.xml (on both nodes):

1. On oracle1 and oracle2, change to $GRID_HOME/oui/bin

2. ./runInstaller -attachHome ORACLE_HOME="/oracle_crs/product/11.2/crs" ORACLE_HOME_NAME="ORADB_11_2"

3. ./runInstaller -updateNodeList ORACLE_HOME="/oracle_crs/product/11.2/crs" ORACLE_HOME_NAME="ORADB_11_2" "CLUSTER_NODES={oracle1,oracle2}" CRS=TRUE
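
After the steps above, the inventory can be checked for the CRS marker. This is a rough sketch only: inventory_has_crs_home is a hypothetical name, and it assumes the marker is stored as a CRS="true" attribute on a HOME element in inventory.xml.

```shell
# Hypothetical helper: read inventory.xml on stdin and succeed only if
# some line carries the CRS="true" attribute (assumed marker format).
inventory_has_crs_home() {
    grep -q 'CRS="true"'
}

# Example use with the inventory path from this post:
#   inventory_has_crs_home < /home/oracle/oracle11g/oraInventory/ContentsXML/inventory.xml \
#       && echo "CRS home registered" || echo "CRS marker missing"
```

Run it on both nodes, since the inventory is updated on each node separately.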

 

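After updating inventory.xml on both nodes, run the database installation again and proceed step by step to complete the database software installation. (Here we chose to install only the database software. The database itself is created later with dbca.)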

 

 

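V. Oracle base and software installation directories

 The $ORACLE_HOME software location must not be placed under $ORACLE_BASE. If the two directories are nested, the installer reports an error:

[INS-32026] The software Location specified should not be under Oracle base location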
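
The directory relationship behind this error can be checked before starting the installer. The sketch below is an assumption-laden illustration: home_outside_base is a hypothetical helper, it compares the two paths purely as text (symbolic links are not resolved), and /oracle/base is a made-up example path.

```shell
# Hypothetical pre-check: succeed only when the software location ($2)
# is NOT located under the Oracle base directory ($1).
home_outside_base() {
    base="${1%/}"    # drop one trailing slash, if any
    home="${2%/}"
    case "$home/" in
        "$base"/*) echo "conflict: $home is under $base"; return 1 ;;
        *)         echo "ok: $home is outside $base";     return 0 ;;
    esac
}

# Example use with hypothetical paths:
#   home_outside_base /oracle/base /oracle/grid
```

Appending a slash before matching keeps a sibling directory such as /oracle/base2 from being treated as a subdirectory of /oracle/base.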

 

VI. ./root.sh

... (output omitted)

 /oracle/grid/bin/clscfg.bin: error while loading shared libraries: libcap.so.1: cannot open shared object file: No such file or directory

Failed to create keys in the OLR, rc = 127, 32512

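Solution:

Install the libcap package from the operating system installation disc. Many RAC installation guides recommend installing all development packages when the operating system is installed.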
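
Whether the dynamic linker can see the library named in the error can be checked before running root.sh. This is a minimal sketch: has_shared_lib is a hypothetical helper that reads `ldconfig -p` output on stdin, and it assumes each cache line starts with the library name.

```shell
# Hypothetical helper: succeed if the first field of any input line is
# exactly the library name given as $1.
has_shared_lib() {
    awk -v lib="$1" '$1 == lib { found = 1 } END { exit !found }'
}

# Example use on a node:
#   ldconfig -p | has_shared_lib libcap.so.1 || echo "libcap.so.1 is missing"
```

Comparing the whole first field avoids a false match when only a different version such as libcap.so.2 is installed.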

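When root.sh is run again, it reports that a configuration already exists, so the previous configuration has to be removed first.

Run $ORACLE_HOME/install/roothas.pl -deconfig -force (on a cluster node, the corresponding script used in the first section above is $GRID_HOME/crs/install/rootcrs.pl -deconfig -force).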

 

 

From the ITPUB blog, link: http://blog.itpub.net/500314/viewspace-1453627/. If you repost this article, please credit the source; otherwise legal action may be taken.
