Solaris10下安裝Oracle1106RAC環境(五)

yangtingkun發表於2008-07-28

安裝一套RAC11G的測試環境,作業系統為Solaris 10,共享磁碟管理使用Volumn Cluster ManagerOracle版本為1106

記錄一下本次RAC安裝過程中遇到的錯誤。

Solaris10下安裝Oracle1106RAC環境(一):http://yangtingkun.itpub.net/post/468/467067

Solaris10下安裝Oracle1106RAC環境(二):http://yangtingkun.itpub.net/post/468/467250

Solaris10下安裝Oracle1106RAC環境(三):http://yangtingkun.itpub.net/post/468/467251

Solaris10下安裝Oracle1106RAC環境(四):http://yangtingkun.itpub.net/post/468/467252

 

 

首先碰到是由於共享儲存沒有設定成功。

對於配置好的VOLUMN CLUSTER MANAGER,只需要在主節點上使用vxassist make來建立裸裝置,從節點上的對應的裸裝置會自動建立。

開始的時候設定不正常,主節點劃分裸裝置後,從節點上沒有自動建立。於是採用相同的命令在從節點上建立裸裝置。

OracleCLUSTER安裝過程中,並沒有報錯,但是在節點2上執行cluster的配置指令碼:/data/oracle/product/11.1/crs/root.sh的時候報錯,錯誤資訊如下:

root@newtrade2 # . /data/oracle/product/11.1/crs/root.sh
WARNING: directory '/data/oracle/product/11.1' is not owned by root
WARNING: directory '/data/oracle/product' is not owned by root
WARNING: directory '/data/oracle' is not owned by root
WARNING: directory '/data' is not owned by root
Checking to see if Oracle CRS stack is already configured

Setting the permissions on OCR backup directory
Setting up Network socket directories
Oracle Cluster Registry configuration upgraded successfully
The directory '/data/oracle/product/11.1' is not owned by root. Changing owner to root
The directory '/data/oracle/product' is not owned by root. Changing owner to root
The directory '/data/oracle' is not owned by root. Changing owner to root
The directory '/data' is not owned by root. Changing owner to root
Successfully accumulated necessary OCR keys.
Using ports: CSS=49895 CRS=49896 EVMC=49898 and EVMR=49897.
node :
node 1: newtrade1 newtrade1-priv newtrade1
node 2: newtrade2 newtrade2-priv newtrade2
Creating OCR keys for user 'root', privgrp 'root'..
Operation successful.
Now formatting voting device: /dev/vx/rdsk/datavg/vot1
Format of 1 voting devices complete.
Startup will be queued to init within 30 seconds.
Adding daemons to inittab
Expecting the CRS daemons to be up within 600 seconds.
Failure at final check of Oracle CRS stack.
10

而這個錯誤導致的原因就是共享儲存沒有設定成功。正常配置的情況下,不會出現這個錯誤資訊。

第二個錯誤和第一個錯誤有關,由於發生了第一個錯誤,導致CLUSTER安裝失敗,當時只是透過runInstaller工具清除了CLUSTER安裝部分,而徹底清除環境,導致第二次安裝完成後,執行/data/oracle/product/11.1/crs/root.sh指令碼的時候報錯,錯誤資訊為:

root@newtrade1 # . /data/oracle/product/11.1/crs/root.sh
Checking to see if Oracle CRS stack is already configured
Oracle CRS stack is already configured and will be running under init(1M)

嘗試手工啟動crs

root@newtrade1 # /etc/init.d/init.crs start
Startup will be queued to init within 30 seconds.

等待很長時間,CLUSTER也沒有正常啟動,檢查/tmp目錄下對應的日誌檔案:

root@newtrade1 # more crsctl.2209
Oracle Cluster Registry initialization failed with invalid format: PROC-22: The OCR backend has an invalid format
root@newtrade1 # more crsctl.2211
Oracle Cluster Registry initialization failed with invalid format: PROC-22: The OCR backend has an invalid format
root@newtrade1 # more crsctl.2217
Oracle Cluster Registry initialization failed with invalid format: PROC-22: The OCR backend has an invalid format

需要進行的操作是將整個CLUSTER環境完整的清除。關於清除CLUSTER環境可以參考文章:

11g可以透過crs目錄下的install/rootdelete.sh來清除root環境:

root@newtrade1 # cd /data/oracle/product/11.1/database/install
root@newtrade1 # ./rootdelete.sh remote nosharedvar
Getting local node name
NODE = newtrade1
PRKO-2006 : Invalid node name: newtrade1
Oracle Cluster Registry initialization failed with invalid format: PROC-22: The OCR backend has an invalid format
Oracle CRS stack is not running.
Oracle CRS stack is down now.
Removing script. for Oracle Cluster Ready services
Updating ocr file for downgrade
Cleaning up SCR settings in '/var/opt/oracle/scls_scr'
Cleaning up Network socket directories

在節點1和節點2分別執行上面的操作,然後利用runInstaller來刪除安裝的環境,最後是透過作業系統命令刪除對應的路徑。

然後重新安裝CLUSTER即可。

第三個錯誤是安裝Oracle Database軟體過程中碰到的錯誤,Oracle報錯:

Unable to create directory /data/oracle. You may not have the proper permission to create the directory, or there is no space left in the volume.

Unable to create the Oracle Base you have specified.

這個錯誤是由於11gCluster安裝完成後,將目錄的屬主改為了root

root@newtrade1 # ls -l /data
total 4954390
drwxr-xr-x  10 oracle   oinstall     512 Nov 11  2007 clusterware
drwxr-xr-x   6 oracle   oinstall     512 Jul 11 14:52 database
drwx------   2 oracle   oinstall    8192 Jul  8 17:26 lost+found
drwxr-xr-x   4 root     oinstall     512 Jul 11 15:01 oracle
-rw-r--r--   1 oracle   oinstall 495223321 Jul  8 23:06 solaris.sparc64_11gR1_clusterware.zip
-rw-r--r--   1 oracle   oinstall 2040142241 Jul  8 23:52 solaris.sparc64_11gR1_database.zip

Oraclecluster安裝之後執行的指令碼會改變CLUSTER目錄為root屬主,這是正常的,不過11g不但改變了CLUSTER目錄的屬主,連其的父目錄也一起進行了修改,因此這裡需要手工將其他目錄改回oracle使用者屬主,在兩個節點分別執行:

root@newtrade1 # chown oracle:oinstall /data
root@newtrade1 # chown oracle:oinstall /data/oracle
root@newtrade1 # chown oracle:oinstall /data/oracle/product
root@newtrade1 # chown oracle:oinstall /data/oracle/product/11.1

然後就可以順利的進行安裝了。

第四個問題是由於RAC資料庫的特殊性,需要先配置監聽,然後在建立資料庫。如果這裡順序錯誤,那麼DBCA會出現下面的提示:

Failed to retrieve network listener resources required for the Real Application Clusters high availability extensions configurations on the following nodes:[newtrade1, newtrade2].

Do you want listeners on port 1521 with prefix LISTENER to be created on nodes [newtrade1, newtrade2] automatically? If you world like to configure the listener with differenct properties, run NetCA before continuing.

根據描述,這裡可以有兩種方法,如果執行預設的設定,那麼只要直接點選NEXT就可以了,如果需要對監聽進行自定義設定,那麼需要另啟動一個視窗,利用NETCA工具設定監聽後,在返回當前頁面進行安裝。

總的來說,有了前面幾次安裝10.2的經歷,這次11g RAC的安裝還是比較順利的,就是由於共享儲存的設定不當,引發了幾個問題,其他部分還是相當順利的。

 

來自 “ ITPUB部落格 ” ,連結:http://blog.itpub.net/4227/viewspace-413542/,如需轉載,請註明出處,否則將追究法律責任。

相關文章