IDENTIFIER: 864D2CE3 NIM thread blocked

cccgw發表於2009-11-21
#errpt
IDENTIFIER TIMESTAMP T C RESOURCE_NAME DESCRIPTION
864D2CE3 0920030307 P S topsvcs NIM thread blocked[@more@]

摘自http://zhsumin.itpub.net/post/8419/399295

IBM主機2節點HA執行Oracle 10.2,OS errpt報錯:NIM thread blocked

錯誤資訊:

#errpt
IDENTIFIER TIMESTAMP T C RESOURCE_NAME DESCRIPTION
864D2CE3 0920030307 P S topsvcs NIM thread blocked

#]errpt -aj 864D2CE3
---------------------------------------------------------------------------
LABEL: TS_NIM_ERROR_STUCK_
IDENTIFIER: 864D2CE3

Date/Time: Thu Sep 20 03:03:42 BEIST 2007
Sequence Number: 758
Machine Id: 00CEB22C4C00
Node Id: whdcp5
Class: S
Type: PERM
Resource Name: topsvcs

Description
NIM thread blocked

Probable Causes
A thread in a Topology Services Network Interface Module (NIM) process
was blocked
Topology Services NIM process cannot get timely access to CPU

User Causes
Excessive memory consumption is causing high memory contention
Excessive disk I/O is causing high memory contention

Recommended Actions
Examine I/O and memory activity on the system
Reduce load on the system
Tune virtual memory parameters
Call IBM Service if problem persists

Failure Causes
Excessive virtual memory activity prevents NIM from making progress
Excessive disk I/O traffic is interfering with paging I/O

Recommended Actions
Examine I/O and memory activity on the system
Reduce load on the system
Tune virtual memory parameters
Call IBM Service if problem persists

Detail Data
DETECTING MODULE
rsct,nim_control.C,1.39.1.10,5677
ERROR ID
6XnGH40C8Lw4/DM21L.V2.1...................
REFERENCE CODE

Thread which was blocked
command receive thread
Interval in seconds during which process was blocked
16
Interface name
tty0

故障分析:

在上述例子中,我們看到了系統說明了這是一個NIM 執行緒被鎖死的錯誤,以及可能導致的

原因及處理方法。 一般這種錯誤可能是由於系統資源耗盡或者異常大量io 造成的。

Thread which was blocked 中指定了被鎖死的程式。在Interval in seconds during which process was blocked 中註明了執行緒被鎖定的時間。在interface
中說明的受影響的adapter(介面卡)
故障解決:
如果沒有產生adapter down event , 那麼這個資訊可以忽略不計。因為她不是使cluster 會產生警示的錯誤報告 。 
但是下面有兩種方法可以阻止或減少該種錯誤報告的產生。 
1.    升級 bos.rte.libpthreads 的包到最新的級別。
2.    降低NIM failure detact rate.
   smitty hacmp
  cluster config
    cluster topology
      configure Network Modules
       Change a Network Module using Predefined Values
   rs232 Ethernet 的值都調慢。 

來自 “ ITPUB部落格 ” ,連結:http://blog.itpub.net/668365/viewspace-1028927/,如需轉載,請註明出處,否則將追究法律責任。

相關文章