Failure 1 contacting Cluster Synchronization Services daemon_1466098.1
Failure 1 contacting Cluster Synchronization Services daemon (Doc ID 1466098.1)
In this Document
Applies to:Oracle Database - Enterprise Edition - Version 10.1.0.2 to 11.1.0.7 [Release 10.1 to 11.1]Information in this document applies to any platform. PurposeThis note provides steps to troubleshoot CSSD check failure from "crsctl check crs" or "crsctl check cssd" command. Depending on the version, the error message can be slightly different: 10gR2
Failure 1 contacting CSS daemon
Cannot communicate with CRS Cannot communicate with EVM
Failure 1 contacting Cluster Synchronization Services daemon
Cannot communicate with Cluster Ready Services Cannot communicate with Event Manager
CRS-4638: Oracle High Availability Services is online
CRS-4535: Cannot communicate with Cluster Ready Services CRS-4530: Communications failure contacting Cluster Synchronization Services daemon CRS-4534: Cannot communicate with Event Manager
Note: 1. The note discusses CSSD check error only, CRSD and EVMD is out of the scope of this note. 2. The note applies only when CRS is running in cluster mode, it does not apply to local cssd (localconfig) that's required to run single instance ASM. 3. For 11gR2, refer to Document 1050908.1 section "Case 3: OCSSD.BIN does not start"
Troubleshooting Steps
1. First step is to check whether CRS is up or not, a good sign of CRS is up is that RAC database is running fine, and the following daemons are running:
# ps -ef| egrep 'crsd.bin|ocssd.bin|evmd.bin' | grep -v grep
crsuser 5031 1 0 Jun26 ? 00:10:37 /ocw/crs/bin/ocssd.bin crsuser 5156 1 0 Jun26 ? 00:00:43 /ocw/crs/bin/evmd.bin crsuser 5479 1 0 Jun26 ? 00:02:32 /ocw/crs/bin/crsd.bin reboot
If CRS is up and only check fails, check Document 370605.1, if it does not apply, check cssdOUT.log, ocssd.log and ocssd.trc in If CRS is down, start with "crsctl start crs" command, if it does not start, go to Step 2. If the problem is CRS is not auto starting upon node reboot, check whether CRS is enabled or not (By default CRS is enabled for auto start upon node reboot) To verify whether its currently enabled or not:
# cat $SCRBASE/$HOSTNAME/root/crsstart
enable
SCRBASE is /etc/oracle/scls_scr on Linux and AIX, /var/opt/oracle/scls_scr on hp-ux and Solaris To enable:
# $GRID_HOME/bin/crsctl enable crs
2. Execute the following as root user and check the exit code:
# su -
# echo $?
# su - crsuser -c "/ocw/crs/bin/crsctl check boot" If there's an error, the error message will be printed on the screen and the error code will be non-zero
3.1. OS is not at appropriate run level:
grep init.cssd /etc/inittab h2:35:respawn:/etc/init.d/init.cssd fatal >/dev/null 2>&1
Above shows CRS will run at run level 3 and 5. Note depend on platform, CRS comes up at different run level.
who -r
3.2. "init.cssd fatal" is not up
ps -ef|grep "init.cssd fatal"|grep -v grep
root 4519 1 0 22:29 ? 00:00:00 /bin/sh /etc/init.d/init.cssd fatal
[ CSSD]2012-07-11 10:21:22.211 [1108330816] >TRACE: clsclisten: Permission denied for (ADDRESS=(PROTOCOL=ipc)(KEY=racnode1DBG_CSSD))
[ clsdmt]Fail to listen to (ADDRESS=(PROTOCOL=ipc)(KEY=racnode1DBG_CSSD)) OR [ CSSD]2012-07-11 10:21:24.418 [1150290240] >TRACE: clsclisten: Permission denied for (ADDRESS=(PROTOCOL=ipc)(KEY=Oracle_CSS_LclLstnr_crsclu_1)) [ CSSD]2012-07-11 10:21:24.418 [1150290240] >ERROR: clssgmclientlsnr: listening failed for (ADDRESS=(PROTOCOL=ipc)(KEY=Oracle_CSS_LclLstnr_crsclu_1)) (3)
The solution is to disable CRS from auto starting, reboot the node, remove all network socket files, and enable CRS from auto starting. CRS provide full clusterware functionality and doesn't need Vendor clusterware to be installed; but if you happened to have CRS on top of Vendor clusterware in your environment, then Vendor clusterware need to come up fully before CRS can be started, to verify, as crs user:
$ $CRS_HOME/bin/lsnodes -n
racnode1 1 racnode1 0 If vendor clusterware is not fully up, likely ocssd.log will have similar messages like following:
2010-08-30 18:28:13.207: [ CSSD][36]clssnm_skgxninit: skgxncin failed, will retry
2010-08-30 18:28:14.207: [ CSSD][36]clssnm_skgxnmon: skgxn init failed 2010-08-30 18:28:14.208: [ CSSD][36]################################### 2010-08-30 18:28:14.208: [ CSSD][36]clssscExit: CSSD signal 11 in thread skgxnmon
Jan 25 14:46:43 racnode1 logger: Cluster Ready Services completed waiting on dependencies.
Jan 25 14:46:43 racnode1 last message repeated 2 times Jan 25 14:46:43 racnode1 logger: Oracle CSS Family monitor starting. Jan 25 14:46:43 racnode1 logger: Running CRSD with TZ = Jan 25 14:46:44 racnode1 logger: Oracle CSS restart. 0, 1 ##>> CRS is coming up, good example OR Jul 19 16:45:33 racnode1 logger: Cluster Ready Services waiting on dependencies. Diagnostics in /tmp/crsctl.2372.
##>> refer to the /tmp/crsctl.
ReferencesNOTE:1050908.1 - Troubleshoot Grid Infrastructure Startup Issues |
來自 “ ITPUB部落格 ” ,連結:http://blog.itpub.net/17252115/viewspace-1156736/,如需轉載,請註明出處,否則將追究法律責任。
相關文章
- oracle 11GR2新特性 Cluster Time Synchronization Service 配置Oracle
- CRS-4535: Cannot communicate with Cluster Ready Services
- Oracle10g新特色:CRS(Cluster Ready Services)Oracle
- Oracle10g New Feature:CRS(Cluster Ready Services) (zt)Oracle
- 安裝Oracle cluster報錯 Failure at final check of Oracle CRS Stack 10OracleAI
- 2.3.6.2 Synchronization of Multiple ApplicationsAPP
- MySQL Cluster研究(1)MySql
- Failure at scls_scr_create with code 1AI
- 1. MySQL Galera Cluster全解析 Part 1 Galera Cluster 簡介MySql
- 練習使用list failure,advise failure; repair failure;AI
- ORA-27302: failure occurred at: sskgxprcv1AI
- Oracle cluster table(1)_概念介紹Oracle
- Failure is not fatal, but failure to change might be.AI
- Oracle RAC(Cluster)的重構(整理)(1)Oracle
- 解決ntp的錯誤 no server suitable for synchronization foundServerUI
- CLUSTER內部結構的總結(1)
- design for failureAI
- Web services框架Web框架
- Assertion failure 錯誤AI
- Ext success failure 呼叫AI
- RMAN-03009: failure of backup command on ORA_DISK_1 channelAI
- 例項重啟 - ORA-27302: failure occurred at: sskgxpsnd1AI
- Salesforce Integration 概覽(四) Batch Data Synchronization(批量資料的同步)SalesforceBAT
- 2.4.1.2 Nonedefault Services in a CDBNone
- 2.4 Overview of Services in a CDBView
- DCOM services errorError
- Web services 介紹Web
- DRIVER_POWER_STATE_FAILUREAI
- Vagrant:NS_ERROR_FAILUREErrorAI
- Assertion failure in -[UICollectionViewData validateLayoutInRect:]AIUIView
- redis.cluster/memcached.cluster/wmware esxiRedis
- XML安全之Web ServicesXMLWeb
- Amazon Web Services (目錄)Web
- 漫談Oracle RAC servicesOracle
- SQLNET.AUTHENTICATION_SERVICESSQL
- Overview of Oracle Net Services (280)ViewOracle
- Web Services體系 (轉)Web
- Web Services 平臺 (轉)Web