當心ORACLE 12.2 RAC新特性引入的BUG導致ORA-4031
前幾日朋友有套資料庫升級到oracle 12C r2後出現了ORA-4031, 是一套on Linux x86 64-bit 2節點的RAC資料庫,Hard parse並不高,並且沒有使用AMM, ASMM。SGA分配80G, shared pool固定在13Gb.
— nodel 1 db alert log file
2018-05-29T13:44:51.107475+08:00 Thread 1 advanced to log sequence 6680 (LGWR switch) Current log# 14 seq# 6680 mem# 0: +SSDDG1/anbob/ONLINELOG/group_14.260.944852559 2018-05-29T13:57:28.332831+08:00 Errors in file /oracle/app/diag/rdbms/anbob/anbob1/trace/anbob1_m001_38524.trc (incident=1804178): ORA-04031: unable to allocate 640 bytes of shared memory ("shared pool"," SELECT source, (case...","kkqctdrvTD: co","ckydef:kkqcscpcky") 2018-05-29T13:57:28.332861+08:00 Errors in file /oracle/app/diag/rdbms/anbob/anbob1/trace/anbob1_ora_38534.trc (incident=1804506): ORA-04031: unable to allocate 40 bytes of shared memory ("shared pool","select con#,obj#,rcon#,enabl...","SQLA","tmp") Incident details in: /oracle/app/diag/rdbms/anbob/anbob1/incident/incdir_1804506/anbob1_ora_38534_i1804506.trc Incident details in: /oracle/app/diag/rdbms/anbob/anbob1/incident/incdir_1804178/anbob1_m001_38524_i1804178.trc Use ADRCI or Support Workbench to package the incident. See Note 411.1 at My Oracle Support for error and packaging details. Use ADRCI or Support Workbench to package the incident. See Note 411.1 at My Oracle Support for error and packaging details. Errors in file /oracle/app/diag/rdbms/anbob/anbob1/trace/anbob1_m001_38524.trc (incident=1804179): ORA-04031: unable to allocate 4152 bytes of shared memory ("shared pool"," SELECT source, (case...","9003.kgght","9003.kgght") ORA-04031: unable to allocate 640 bytes of shared memory ("shared pool"," SELECT source, (case...","kkqctdrvTD: co","ckydef:kkqcscpcky") Incident details in: /oracle/app/diag/rdbms/anbob/anbob1/incident/incdir_1804179/anbob1_m001_38524_i1804179.trc Use ADRCI or Support Workbench to package the incident. See Note 411.1 at My Oracle Support for error and packaging details. 2018-05-29T13:57:28.588932+08:00 ORA-04031 heap dump being written to trace file /oracle/app/diag/rdbms/anbob/anbob1/incident/incdir_1804506/anbob1_ora_38534_i1804506.trc 2018-05-29T13:57:28.971700+08:00 Errors in file /oracle/app/diag/rdbms/anbob/anbob1/trace/anbob1_ora_36389.trc (incident=1804658): ORA-04031: unable to allocate 424 bytes of shared memory ("shared pool","EXEC_OPR_POS","KGLS^62f5cb39","KGLS MEM BLOCK") Incident details in: /oracle/app/diag/rdbms/anbob/anbob1/incident/incdir_1804658/anbob1_ora_36389_i1804658.trc Use ADRCI or Support Workbench to package the incident. See Note 411.1 at My Oracle Support for error and packaging details. 2018-05-29T13:57:29.086756+08:00 Errors in file /oracle/app/diag/rdbms/anbob/anbob1/trace/anbob1_ora_75557.trc (incident=1800986): ORA-04031: unable to allocate 40 bytes of shared memory ("shared pool","select * from (select id,msi...","SQLA","tmp") Incident details in: /oracle/app/diag/rdbms/anbob/anbob1/incident/incdir_1800986/anbob1_ora_75557_i1800986.trc Use ADRCI or Support Workbench to package the incident. See Note 411.1 at My Oracle Support for error and packaging details.
M001程式在申請”shared pool”遇到ora-4031, 當然問題時間段的AWR沒有生成, 當時建議做heapdump如下
SQL> connect / as sysdba SQL> alter session set events 'immediate trace name heapdump level 536870914';
OR
sqlplus "/ as sysdba" oradebug setmypid oradebug unlimit oradebug dump heapdump 536870914 oradebug tracefile_name exit
讓現場的收集了部分trace file, 並沒有什麼異常資訊,這時可以看問題時間點前最近的AWR,找”SGA breakdown difference”部份,初步判斷異常變化。
Note:
“gc index split transaction” 記憶體區使用超過了5Gb, 這在以前的版本中是很少見的. 並且繼續確認了幾份AWR,該heap 記憶體區時一直增長的, 如果並且如果手動從X$KSMSS 確認當前的”gc index split transaction”也是在持續增長。
gc index split transaction
gc index split transaction 是用於Oracle 12cr2 引入的新特性”fast index split wait”, 該特性是在KCL layer實現,用於RAC的Global Cache,使用它表示一個程式index splits等待遠端或本地程式, 基本上每個session在連線時都會分配一組buffer. 所以這個現象只存在於RAC環境。並且配置_gc_fast_index_split_wait=0沒有什麼作用。
似乎是因為使用了kclxidinit() used kghalp (permanent memory) 分配的是perm trunk,而不是kghalf (freeable memory)
解決方案
Mos中查詢了一下,比較符合bug 27163928.
重啟可以臨時解決問題,但是會逐漸再次因為fast index split wait增長導致ora-4031.
or
安裝one-off patch 27163928
or
升級到oracle 18.1(if released!)
來自 “ ITPUB部落格 ” ,連結:http://blog.itpub.net/23368118/viewspace-2155593/,如需轉載,請註明出處,否則將追究法律責任。
相關文章
- 【ASK_ORACLE】Oracle 12.2 Bug導致網路卡出現故障後RAC庫未向TCP註冊本地VIP監聽OracleTCP
- Oracle 12.2 BUG :分割槽維護導致的 ORA-600 KKPOFPCD3Oracle
- Oracle 12.2 新特性: Online PDB relocate (PDB hot move)Oracle
- ORACLE 11.2.0.4 rac for linux 鏈路宕導致的單節點異常當機OracleLinux
- Oracle 12.2 RAC on Linux Best Practice DocumentationOracleLinux
- Oracle RAC啟動因CTSS導致的異常Oracle
- Oracle RAC日常運維-NetworkManager導致叢集故障Oracle運維
- Oracle 12.2新特性: PDB級閃回資料庫(Flashback PDB)Oracle資料庫
- Oracle 12.2 新特性 | PDB不同字符集變更深入解析Oracle
- RAC節點hang住, oracle bug導致了cpu過高,無法啟動叢集隔離Oracle
- Oracle 12.2 新特性:線上PDB資料庫克隆(Pluggable Hot Clone)Oracle資料庫
- DRM特性引起的RAC節點當機
- 淺複製導致的bug
- 【ASK_ORACLE】因process用盡導致的rac重啟的解決方法Oracle
- C++各版本引入的新特性C++
- 【ASK_ORACLE】安裝Oracle RAC 12.2的GI軟體時報錯CLSRSC-614Oracle
- Oracle 12.2 RAC修改public ip address或public ip(subnet (netmask) or interface)Oracle
- 分而治之:Oracle 18c及12.2分割槽新特性的N種優化實踐Oracle優化
- ORACLE RAC 兩節點db_32k_cache_size設定不當導致表truncate失敗之ORA-00379Oracle
- Oracle ORA-4031解決思路Oracle
- Oracle:Redhat 7 + Oracle RAC 11g 安裝 bug 總結OracleRedhat
- Go1.18 新特性:引入新的 netip 網路庫Go
- 【12.2】Oracle 12C R2新特性-外部表支援分割槽了(Partitioning External Tables)Oracle
- Containerd 的 Bug 導致容器被重建!如何避免?AI
- 12.2新特性之ADG多節點啟動MRP程式
- Oracle 12c因bug導致ORA-04031問題處理過程Oracle
- oracle rac+adg調整redo日誌組導致adg備庫ogg抽取程式abendOracle
- ORACLE RAC 11.2.0.4 ASM加盤導致叢集重啟之ASM sga設定過小OracleASM
- GreatSQL的sp中新增新的sp_instr引入的bug解析SQL
- 又見想當然導致的誤譯
- 在https中引入http資源所導致的問題HTTP
- Oracle 12.2 OJVM安裝OracleJVM
- 記php-fpm重啟導致的一個bugPHP
- 開心檔之Java 9 新特性Java
- 記一次儲存問題導致的rac故障案例
- ORACLE 12.2中的更改與新功能Oracle
- oracle RACOracle
- C# 9.0中引入的新特性init和record的使用思考C#