Oracle RAC 11.2.0.4: CHM (Cluster Health Monitor) files grow without limit

Posted by babyyellow on 2017-05-25



The files in which CHM stores its data have been growing without any limit.

This is a serious bug.

Solution:

Reference: http://blog.itpub.net/28697282/viewspace-1579187/


Affects:

Product (Component): Oracle Server (PCW)
Range of versions believed to be affected: Versions BELOW 12.2
Versions confirmed as being affected:
Platforms affected: Generic (all / most platforms affected)

Fixed:

The fix for 20186278 is first included in

Description

BDB retention time suddenly changes to 1094795585 or some other huge value.
This may cause the crfclust.bdb file to grow in size, as crflogd tries to keep
data for the longer retention.
Rediscovery Notes 
When hitting this problem, crflogd.log may show the retention size being 
suddenly updated to a large size. 
 
 ------------------------------------------------------------------
 2014-02-03 15:36:29.799: [ CRFLOGD][2363541248]updateDBRetention: new retention is 1094795585 seconds and 4702111234474983745 bytes
 ------------------------------------------------------------------
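A minimal sketch of how such entries could be picked out of a log automatically. The function name `find_bad_retention` is hypothetical, the log file path is passed in because the note does not say where crflogd.log lives, and the default ceiling of 259200 seconds (three days) is only an illustrative assumption that should be adjusted to your own retention policy.

```shell
#!/bin/sh
# find_bad_retention LOGFILE [LIMIT_SECONDS]
# Print every updateDBRetention entry whose retention (in seconds) exceeds
# LIMIT_SECONDS. The default limit is an assumed, illustrative ceiling.
find_bad_retention() {
    logfile=$1
    limit=${2:-259200}
    awk -v limit="$limit" '
        /updateDBRetention: new retention is/ {
            # The number of seconds is the field right after the word "is".
            for (i = 1; i < NF; i++)
                if ($i == "is" && $(i + 1) + 0 > limit + 0) { print; break }
        }' "$logfile"
}

# Example (the path is a placeholder, not taken from the note):
#   find_bad_retention /path/to/crflogd.log
```

Any line printed by the function is a candidate for the problem described here; an empty result means no retention update above the chosen ceiling was logged in that file.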
 
Current retention value may be checked with following command. 
If the retention value (unit: seconds) shows a large value like 
1094795585, you are likely to have hit this problem.
 
 $ oclumon manage -get repsize
 CHM Repository Size = 1094795585
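The check above can be sketched as a small script. The helper names `parse_repsize` and `repsize_is_suspect` are hypothetical, the parsing assumes the output line looks exactly like the sample shown, and the 259200-second ceiling is an illustrative assumption rather than a documented threshold.

```shell
#!/bin/sh
# parse_repsize: read "oclumon manage -get repsize" output on stdin and
# print only the numeric retention value (seconds).
parse_repsize() {
    sed -n 's/^[[:space:]]*CHM Repository Size = \([0-9][0-9]*\).*$/\1/p'
}

# repsize_is_suspect VALUE [LIMIT]
# Succeeds (exit status 0) when VALUE is larger than LIMIT.
# The default LIMIT is an assumed ceiling; adjust it as needed.
repsize_is_suspect() {
    [ "$1" -gt "${2:-259200}" ]
}

# Usage on a cluster node (requires oclumon in PATH):
#   size=$(oclumon manage -get repsize | parse_repsize)
#   repsize_is_suspect "$size" && echo "retention $size looks abnormal"
```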
 
 
Workaround 
Modify the BDBSIZE parameter in the config file crf.ora to a
suitably lower value. To do so, follow these steps:
 
1. Issue "$GI_HOME/bin/crsctl stop res ora.crf -init" on all the nodes of the cluster.
2. Locate the config file $GRID_HOME/crf/admin/crf.ora
3. Manually edit the crf.ora file on every node of the cluster and
   change the BDBSIZE tag entry: remove the value (set it to blank) or
   set it to a desired value, e.g. 61511. Do not delete the BDBSIZE tag itself.
4. Restart the ora.crf daemon on every node.
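Solution

        The bug is fixed in version 12.2. As a workaround:
        1.  Run "$GI_HOME/bin/crsctl stop res ora.crf -init" on all nodes of the cluster.
        2.  Go to the directory that contains $GRID_HOME/crf/admin/crf.ora.
        3.  On every node of the cluster, manually edit crf.ora: in the BDBSIZE line, remove the value (set it to blank),
            or set it to the desired value, e.g. 61511. Do not delete the BDBSIZE tag itself.
        4.  Restart ora.crf on every node of the cluster.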
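The manual edit in step 3 above could be scripted roughly as follows. This is a sketch under assumptions: `set_bdbsize` is a hypothetical helper, and it assumes crf.ora stores the setting as a `BDBSIZE=<value>` line, which should be verified against the actual file before use. It keeps a `.bak` copy and never removes the BDBSIZE tag itself.

```shell
#!/bin/sh
# set_bdbsize FILE [VALUE]
# Rewrite the BDBSIZE line in FILE, keeping the tag itself.
# With no VALUE the setting is blanked. Assumes a KEY=VALUE layout.
set_bdbsize() {
    file=$1
    value=${2:-}
    # Accept only digits (or an empty value) to avoid corrupting the file.
    case "$value" in
        *[!0-9]*) echo "BDBSIZE must be numeric or empty" >&2; return 2 ;;
    esac
    cp -p "$file" "$file.bak" || return 1
    sed "s/^BDBSIZE=.*/BDBSIZE=$value/" "$file.bak" > "$file"
}

# Possible use on each node, with ora.crf stopped first:
#   crsctl stop res ora.crf -init
#   set_bdbsize "$GRID_HOME/crf/admin/crf.ora" 61511
#   crsctl start res ora.crf -init
```

Writing the result back through a redirect, rather than replacing the file, leaves the original file's ownership and permissions untouched.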

########################################
Run the following on both nodes (adjust the directory path to your own environment).

crsctl stat res ora.crf -init -t
crsctl stop res ora.crf -init
cd /data/grid/11gr2/crf/db/rac-product1
rm -f *.bdb
crsctl start res ora.crf -init

######################################
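Before deleting the .bdb files as in the commands above, it may help to record how much space they occupy, so the effect of the cleanup can be confirmed afterwards. A minimal sketch; `bdb_usage_bytes` is a hypothetical helper, and it relies on `find -maxdepth`, which GNU and BSD find provide but POSIX does not require.

```shell
#!/bin/sh
# bdb_usage_bytes DIR
# Print the total size in bytes of the *.bdb files directly under DIR.
bdb_usage_bytes() {
    find "$1" -maxdepth 1 -type f -name '*.bdb' -exec ls -ln {} + |
        awk '{ total += $5 } END { print total + 0 }'   # field 5 of ls -ln is the size
}

# Example, using the directory from the commands above:
#   bdb_usage_bytes /data/grid/11gr2/crf/db/rac-product1
```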

For more about Oracle CHM (Cluster Health Monitor):

See the official Oracle documentation:
 
  Oracle® Clusterware Administration and Deployment Guide
  11g Release 2 (11.2)
  Part Number E16794-17

  or the My Oracle Support document:
  Cluster Health Monitor (CHM) FAQ (Doc ID 1328466.1)

From the ITPUB blog, link: http://blog.itpub.net/133735/viewspace-2139836/. If you repost this article, please credit the source; otherwise legal action may be pursued.
