文件 ID 300388.1 私有網路卡MTU大小不同,則例項無法啟動

張玉龍發表於2016-10-24


Instances Unable To Start If MTU Size Is Different for Cluster_interconnect (文件 ID 300388.1)

如果Cluster_interconnect的MTU大小不同,則例項無法啟動


Oracle Server - Enterprise Edition - Version 9.0.1.0 to 11.2.0.3 [Release 9.0.1 to 11.2]
Information in this document applies to any platform.
***Checked for relevance on 07-Jan-2010***


如果用於互連的網路卡(私有網路卡)的MTU大小在叢集成員節點上不同,則RAC例項將不能啟動


網路配置


MTU大小在私有網路介面上設定,例如,兩個叢集成員的兩個介面:

node 1
eth0 Link encap:Ethernet HWaddr 00:0E:0C:08:4B:D5 
inet addr: xxx.x.x.x Bcast:xxx.x.x.x Mask:255.255.255.0
inet6 addr: fe80::20e:cff:fe08:4bd5/64 Scope:Link
UP BROADCAST RUNNING MULTICAST MTU:9000 Metric:1

node 2
eth0 Link encap:Ethernet HWaddr 00:0E:0C:08:03:59 
inet addr: xxx.x.x.x Bcast:xxx.x.x.x Mask:255.255.255.0
inet6 addr: fe80::20e:cff:fe08:359/64 Scope:Link
UP BROADCAST RUNNING MULTICAST *MTU:1500* Metric:1


如果您配置了不同的MTU大小,啟動將掛起,並在alert-log中出現以下錯誤:

Tue Mar  1 01:50:35 2005
lmon registered with NM - instance id 2 (internal mem no 1)
Tue Mar  1 01:50:36 2005
Reconfiguration started (old inc 0, new inc 2)
List of nodes:
0 1
Global Resource Directory frozen
Update rdomain variables
Communication channels reestablished
* domain 0 valid = 0 according to instance 0
Tue Mar  1 01:55:44 2005
IPC Send timeout to 0.0 inc 9 for msg type 53 from opid 5
Tue Mar  1 01:59:25 2005
Trace dumping is performing id=[cdmp_20050301095925]
Tue Mar  1 01:59:31 2005
Reconfiguration started (old inc 2, new inc 3)
List of nodes:
1
通常,會在alert-file和後臺程式(LMD和LEMON)的跟蹤中看到超時。


解決方案:


1. 使用oradebug ipc標識Oracle RAC正在使用的介面 - Metalink note 181489.1
2. 檢查網路配置,例如使用ifconfig,例如:/sbin/ifconfig eth0
3. 使用適合所有介面的資料包大小來ping網路卡的IP地址。 使用-M開關以避免包拆分,例如:
   ping <nodename> -s <largest-size - 適合> -M do
4. 配置叢集互連線口在所有叢集成員節點上具有相同的MTU大小


參考:
BUG:3328693 - ONE NODE IN RAC CLUSTER WILL NOT START UP LMON TIMEOUT DETECTED
NOTE:181489.1 - Tuning Inter-Instance Performance in RAC and OPS

相關文章