OHASD fails to start on one RAC node: waiting for init.ohasd to be started
Cluster failed to start due to problem with socket pipe npohasd (Doc ID 1612325.1)
APPLIES TO:
Oracle Database - Enterprise Edition - Version 11.2.0.1 and later
Information in this document applies to any platform.
SYMPTOMS
The CRS stack does not come up on one node.
Socket permission issue with Grid Infrastructure: after a server reboot, the CRS stack fails to come up with crsctl start crs.
test-133(root)/>ps -ef|grep init
root 28717 28382 2 19:03:20 pts/9 0:00 grep d.bin
root 28756 1 0 10:01 ? 00:00:00 /bin/sh /etc/init.d/init.ohasd run
root 28676 27170 0 19:02:57 pts/5 0:00
rm -rf /tmp/.oracle/* /usr/tmp/.oracle/* /var/tmp/.oracle/*
test-133(root)/>/oracle/app/11.2.0/grid/bin/crsctl start crs
test-133(root)/>ps -ef|grep d.bin
root 28717 28382 2 19:03:20 pts/9 0:00 grep d.bin
root 28680 0 19:02:57 0:00 /oracle/app/11.2.0/grid/bin/ohasd.bin reboot
ls -lrt /tmp/.oracle
prw-r--r-- 1 root root 0 Jan 6 09:50 npohasd
tusc output: the process appears to be stuck in a sleeping state.
test-133:(root)/>/hpk/tusc -faep -T %H:%M:%S -p 28680
( Attached to process 28680 ("/oracle/app/11.2.0/grid/bin/ohasd.bin reboot") [64-bit] )
19:03:42 [28680] open(0x40000000007789b0, O_WRONLY|0x800, 023240) [sleeping]
tusc: ttrace(TT_PROC_STOP, 0, 0, 0, 0, 0): Permission denied
CHANGES
The problem started after patching failed and the server was rebooted.
CAUSE
Permission issue
The binaries were relinked and the server was restarted again. init.ohasd then came up fine, but ohasd and the other daemons would not start, and no sockets were created.
When the OS starts S96ohasd, it waits for init.ohasd to write to the pipe.
What happened here: init.ohasd was started, and then all socket files were removed manually. When ohasd is started again, it waits there because those socket files were removed.
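The waiting behaviour seen in the tusc trace (an open with O_WRONLY that stays in [sleeping]) matches how an ordinary named pipe behaves: a write-only open blocks until some process opens the same pipe for reading. The following is a minimal sketch of that general behaviour, assuming npohasd acts like a standard FIFO, as the leading p in the ls output suggests. It is not Oracle code; the temporary directory and the name demo_pipe are hypothetical.

```shell
#!/bin/sh
# Illustration only: generic named-pipe (FIFO) behaviour, not Oracle code.
# The directory and the name "demo_pipe" are hypothetical.
dir=$(mktemp -d)
pipe="$dir/demo_pipe"
mkfifo "$pipe"

# Writer: the redirection opens the FIFO write-only, which blocks
# until another process opens the FIFO for reading.
( echo "hello" > "$pipe"; : > "$dir/writer.done" ) &
writer_pid=$!

# Give the writer a moment, then see whether it got past the open.
sleep 1
if [ -e "$dir/writer.done" ]; then
    state_before="finished"
else
    state_before="blocked"
fi
echo "writer before reader attaches: $state_before"

# Reader: opening the FIFO for reading releases the blocked writer.
msg=$(cat "$pipe")
wait "$writer_pid"

if [ -e "$dir/writer.done" ]; then
    state_after="finished"
else
    state_after="blocked"
fi
echo "reader received: $msg"
echo "writer after reader attaches: $state_after"

rm -rf "$dir"
```

In this sketch the writer stays blocked for as long as nothing reads from the pipe, which is the same shape as a daemon that sleeps in open() when the process expected at the other end of the pipe is not there.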
SOLUTION
WORKAROUND:
Clear all sockets under /var/tmp/.oracle or /tmp/.oracle, if any, and then open two terminals on the same node, the one where the stack is not coming up.
1) On terminal 1, as the root user, start the CRS stack with crsctl start crs.
2) At the same time, on terminal 2, as the root user, use dd to read from the npohasd pipe once it has been created.
3) Back on terminal 1, the CRS stack should now start coming up.
4) Once the entire CRS stack is up, press CTRL+C to exit the dd command running on terminal 2.
Finally, check and validate that all resources are online.
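The workaround steps can be sketched as a small script. The text above does not spell out the exact commands, so everything here is an assumption to verify before use: the Grid home is taken from the session output in SYMPTOMS, the pipe is assumed to be /var/tmp/.oracle/npohasd, and the dd and crsctl arguments are commonly used forms rather than something this note confirms. The helper names run, terminal1, terminal2 and check are hypothetical. By default the script only prints what it would run.

```shell
#!/bin/sh
# Sketch of the two-terminal workaround. ASSUMPTIONS (not stated in the text):
#   - GRID_HOME matches the path seen in the SYMPTOMS session output
#   - the pipe lives at /var/tmp/.oracle/npohasd (it may be /tmp/.oracle)
#   - the dd and "crsctl stat res" arguments below
GRID_HOME=/oracle/app/11.2.0/grid
PIPE=/var/tmp/.oracle/npohasd
DRY_RUN=${DRY_RUN:-1}   # 1 = only print commands, 0 = execute them

# Print the command in dry-run mode, otherwise execute it.
run() {
    if [ "$DRY_RUN" = "1" ]; then
        echo "would run: $*"
    else
        "$@"
    fi
}

# Step 1 (terminal 1, root): start the stack.
terminal1() {
    run "$GRID_HOME/bin/crsctl" start crs
}

# Step 2 (terminal 2, root): wait for the pipe, then read from it.
terminal2() {
    if [ "$DRY_RUN" != "1" ]; then
        until [ -p "$PIPE" ]; do sleep 1; done
    fi
    run dd if="$PIPE" of=/dev/null bs=1024 count=1
}

# Final check: list the lower-stack resources and their state.
check() {
    run "$GRID_HOME/bin/crsctl" stat res -t -init
}

case "${1:-plan}" in
    terminal1) terminal1 ;;
    terminal2) terminal2 ;;
    check)     check ;;
    *)         DRY_RUN=1; terminal1; terminal2; check ;;  # print the plan only
esac
```

Run without arguments, the script prints the three commands in order. To apply it, each step would be run from its own root session on the affected node, for example DRY_RUN=0 with the argument terminal1 in one terminal and terminal2 in the other, followed by check once the stack is up.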
Source: ITPUB blog, link: http://blog.itpub.net/22039464/viewspace-2123312/. If you repost this article, please credit the source; otherwise legal action may be pursued.