10g: VIP failed over after the NIC link dropped; how is it normally moved back once the network recovers?

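Posted by tolywang on 2009-02-17
Node 2 logged the following in /var/log/messages. The VIP network segment did go down for a short time in the morning; this has been confirmed with the network engineer.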

Feb 16 06:54:01 ecsrac02 kernel: bnx2: eth0 NIC Link is Down
Feb 16 06:56:07 ecsrac02 kernel: bnx2: eth0 NIC Link is Up, 1000 Mbps full duplex  
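As a quick check of the outage window, the two kernel messages above can be compared directly. This is a minimal Python sketch, not part of the original post: the helper names `parse_syslog_time` and `link_outage_seconds` are hypothetical, and the year 2009 is an assumption taken from the post date, because syslog timestamps carry no year.

```python
from datetime import datetime

# The two kernel lines quoted from /var/log/messages on ecsrac02.
LOG_LINES = [
    "Feb 16 06:54:01 ecsrac02 kernel: bnx2: eth0 NIC Link is Down",
    "Feb 16 06:56:07 ecsrac02 kernel: bnx2: eth0 NIC Link is Up, 1000 Mbps full duplex",
]


def parse_syslog_time(line, year=2009):
    """Parse the leading 'Mon DD HH:MM:SS' stamp; the year is an assumption."""
    stamp = " ".join(line.split()[:3])  # e.g. "Feb 16 06:54:01"
    return datetime.strptime(f"{year} {stamp}", "%Y %b %d %H:%M:%S")


def link_outage_seconds(lines, year=2009):
    """Seconds between the first 'Link is Down' and the next 'Link is Up'."""
    down = None
    for line in lines:
        if "NIC Link is Down" in line and down is None:
            down = parse_syslog_time(line, year)
        elif "NIC Link is Up" in line and down is not None:
            return (parse_syslog_time(line, year) - down).total_seconds()
    return None  # no complete down/up pair found


print(link_outage_seconds(LOG_LINES))
```

For these two lines the gap is 126 seconds (06:54:01 to 06:56:07).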





Alert log on node 2:

Mon Feb 16 05:11:30 2009
Thread 2 advanced to log sequence 3583 (LGWR switch)
  Current log# 10 seq# 3583 mem# 0: /ocfs_data1/ecsdb/redo10a.log
  Current log# 10 seq# 3583 mem# 1: /ocfs_index1/ecsdb/redo10b.log
Mon Feb 16 05:15:30 2009
Thread 2 advanced to log sequence 3584 (LGWR switch)
  Current log# 11 seq# 3584 mem# 0: /ocfs_data1/ecsdb/redo11a.log
  Current log# 11 seq# 3584 mem# 1: /ocfs_index1/ecsdb/redo11b.log
Mon Feb 16 06:54:42 2009
ALTER SYSTEM SET service_names='' SCOPE=MEMORY SID='ecsdb2';
Mon Feb 16 06:54:42 2009
Immediate Kill Session#: 1036, Serial#: 55845
Immediate Kill Session: sess: 0x25cb1d5b8  OS pid: 30068
Immediate Kill Session#: 1038, Serial#: 13595
Immediate Kill Session: sess: 0x25cb1eb20  OS pid: 11821
Immediate Kill Session#: 1042, Serial#: 50814
Immediate Kill Session: sess: 0x25cb215f0  OS pid: 16328
Immediate Kill Session#: 1047, Serial#: 10969
Immediate Kill Session: sess: 0x25fb36f48  OS pid: 14038
Immediate Kill Session#: 1113, Serial#: 46642
Immediate Kill Session: sess: 0x25fb631b0  OS pid: 31750
Immediate Kill Session#: 1118, Serial#: 26560
Immediate Kill Session: sess: 0x25cb54360  OS pid: 7687
Mon Feb 16 06:54:43 2009
Process OS id : 7687 alive after kill
Errors in file
Immediate Kill Session#: 1131, Serial#: 57076
Immediate Kill Session: sess: 0x25fb6f258  OS pid: 7157
Immediate Kill Session#: 1179, Serial#: 25165
Immediate Kill Session: sess: 0x25fb8f418  OS pid: 28197
Immediate Kill Session#: 1180, Serial#: 24265
Immediate Kill Session: sess: 0x25cb7daf8  OS pid: 11359
Immediate Kill Session#: 1205, Serial#: 54612
Immediate Kill Session: sess: 0x25fba0a60  OS pid: 19502
Immediate Kill Session#: 1210, Serial#: 38467
Immediate Kill Session: sess: 0x25cb91c10  OS pid: 26109
Immediate Kill Session#: 1211, Serial#: 60805
Immediate Kill Session: sess: 0x25fba4a98  OS pid: 6343
Immediate Kill Session#: 1246, Serial#: 6259
Immediate Kill Session: sess: 0x25cba9d60  OS pid: 21726
                                                                                             






The listener on node 2 turned out to be down; everything else looks normal.

ecsrac02$
ecsrac02$crs_stat -t
Name           Type           Target    State     Host        
------------------------------------------------------------
ora.ecsdb.db   application    ONLINE    ONLINE    ecsrac02   
ora....b1.inst application    ONLINE    ONLINE    ecsrac01   
ora....b2.inst application    ONLINE    ONLINE    ecsrac02   
ora....01.lsnr application    ONLINE    ONLINE    ecsrac01   
ora....c01.gsd application    ONLINE    ONLINE    ecsrac01   
ora....c01.ons application    ONLINE    ONLINE    ecsrac01   
ora....c01.vip application    ONLINE    ONLINE    ecsrac01   
ora....02.lsnr application    ONLINE    OFFLINE               
ora....c02.gsd application    ONLINE    ONLINE    ecsrac02   
ora....c02.ons application    ONLINE    ONLINE    ecsrac02   
ora....c02.vip application    ONLINE    ONLINE    ecsrac01   


--------------------  
ora....c02.vip application    ONLINE    ONLINE    ecsrac01
This row shows that the VIP of node 2 has failed over to node 1.


ifconfig on node 1 also shows both virtual IPs on node 1.
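The same reading of the crs_stat -t output can be done mechanically. The following is a rough Python sketch under stated assumptions: the function names (`parse_crs_stat`, `not_at_target`, `vip_hosts`, `hosts_with_multiple_vips`) are hypothetical, the input is the listing pasted above, and the parser assumes whitespace-separated columns with an empty Host column for OFFLINE resources, as in this listing. It has not been checked against other crs_stat layouts.

```python
from collections import Counter

# The `crs_stat -t` listing from ecsrac02, copied from the post.
CRS_STAT_OUTPUT = """\
Name           Type           Target    State     Host
------------------------------------------------------------
ora.ecsdb.db   application    ONLINE    ONLINE    ecsrac02
ora....b1.inst application    ONLINE    ONLINE    ecsrac01
ora....b2.inst application    ONLINE    ONLINE    ecsrac02
ora....01.lsnr application    ONLINE    ONLINE    ecsrac01
ora....c01.gsd application    ONLINE    ONLINE    ecsrac01
ora....c01.ons application    ONLINE    ONLINE    ecsrac01
ora....c01.vip application    ONLINE    ONLINE    ecsrac01
ora....02.lsnr application    ONLINE    OFFLINE
ora....c02.gsd application    ONLINE    ONLINE    ecsrac02
ora....c02.ons application    ONLINE    ONLINE    ecsrac02
ora....c02.vip application    ONLINE    ONLINE    ecsrac01
"""


def parse_crs_stat(text):
    """Return one dict per resource row of the listing."""
    rows = []
    for line in text.splitlines():
        parts = line.split()
        # Skip the header, the dashed rule, and blank lines.
        if len(parts) < 4 or parts[0] == "Name":
            continue
        name, _rtype, target, state = parts[:4]
        host = parts[4] if len(parts) > 4 else None  # blank when OFFLINE
        rows.append({"name": name, "target": target, "state": state, "host": host})
    return rows


def not_at_target(rows):
    """Names of resources whose State differs from their Target."""
    return [r["name"] for r in rows if r["state"] != r["target"]]


def vip_hosts(rows):
    """Map each VIP resource name to the host it is currently running on."""
    return {r["name"]: r["host"] for r in rows if r["name"].endswith(".vip")}


def hosts_with_multiple_vips(rows):
    """Hosts that currently carry more than one VIP."""
    counts = Counter(h for h in vip_hosts(rows).values() if h)
    return sorted(h for h, n in counts.items() if n > 1)


rows = parse_crs_stat(CRS_STAT_OUTPUT)
print(not_at_target(rows))
print(vip_hosts(rows))
print(hosts_with_multiple_vips(rows))
```

On this listing the sketch flags the node 2 listener as the only resource not at its target state, and reports ecsrac01 as holding both VIPs, which matches the ifconfig observation.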






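Questions:

1.  At first crs_stat -t showed both the instance and the listener on node 2 as OFFLINE. Later only the listener was OFFLINE. I do not know what Oracle actually does after the VIP fails over.

2.  The alert log on node 2 shows the following commands: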
ALTER SYSTEM SET service_names='' SCOPE=MEMORY SID='ecsdb2';
Mon Feb 16 06:54:42 2009
Immediate Kill Session#: 1036, Serial#: 55845
Immediate Kill Session: sess: 0x25cb1d5b8  OS pid: 30068
Immediate Kill Session#: 1038, Serial#: 13595
Immediate Kill Session: sess: 0x25cb1eb20  OS pid: 11821

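When service_names is set to an empty string, which services on node 2 are shut down, and what does node 1 do?

3.  Restarting the listener on node 2 still does not bring it ONLINE: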
ecsrac02$crs_stop  ora.ecsrac02.LISTENER_ECSRAC02.lsnr  
Target set to OFFLINE for `ora.ecsrac02.LISTENER_ECSRAC02.lsnr`
ecsrac02$
ecsrac02$crs_start   ora.ecsrac02.LISTENER_ECSRAC02.lsnr
ecsrac01 : CRS-1019: Resource ora.ecsrac02.LISTENER_ECSRAC02.lsnr (application) cannot run on ecsrac01
ecsrac02 : CRS-1018: Resource ora.ecsrac02.vip (application) is already running on ecsrac01

CRS-0223: Resource 'ora.ecsrac02.LISTENER_ECSRAC02.lsnr' has placement error.

ecsrac02$
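To make the three CRS messages above easier to read side by side, they can be split into error code, resource, and detail. This is only an illustrative Python sketch; the regular expression and the name `parse_crs_errors` are hypothetical and are fitted to the three lines quoted here, not to the full set of CRS message formats.

```python
import re

# The error lines returned by crs_start, copied from the transcript above.
CRS_START_OUTPUT = """\
ecsrac01 : CRS-1019: Resource ora.ecsrac02.LISTENER_ECSRAC02.lsnr (application) cannot run on ecsrac01
ecsrac02 : CRS-1018: Resource ora.ecsrac02.vip (application) is already running on ecsrac01
CRS-0223: Resource 'ora.ecsrac02.LISTENER_ECSRAC02.lsnr' has placement error.
"""

# Code, resource name (optionally quoted), optional "(application)", detail.
CRS_LINE = re.compile(
    r"(CRS-\d+): Resource '?([^\s']+)'?\s+(?:\(application\)\s+)?(.+)"
)


def parse_crs_errors(text):
    """Return (code, resource, detail) tuples for each CRS message line."""
    found = []
    for line in text.splitlines():
        match = CRS_LINE.search(line)
        if match:
            code, resource, detail = match.groups()
            found.append((code, resource, detail.strip()))
    return found


for code, resource, detail in parse_crs_errors(CRS_START_OUTPUT):
    print(code, resource, "->", detail)
```

Read this way, CRS-1018 reports that ora.ecsrac02.vip is still running on ecsrac01. One plausible reading, offered here as an assumption rather than a confirmed diagnosis, is that the node 2 listener cannot be placed on ecsrac02 while its VIP remains on the other node.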

Do I need to restart all services on node 2?   [root@ecsrac02 ~]# /etc/init.d/init.crs  start

Source: ITPUB blog, link: http://blog.itpub.net/35489/viewspace-553504/. Please credit the source when reposting.
