Company

Advanced Search
Home | General Database | Platform | Articles | Scripts | Online Documentation
Back

Articles

Date 2010-01-29 10:22:50
Component CRS
Title ERROR: clssgmGetGrock: Group ID of 702053 exceeds max value
Version 11.1.0.7
Problem

When you are using Oracle Clusterware 11g Release 1 cluster and you are facing an cluster outage due to node evictions it is very good possible that you are running into bug 860459.

To analyse if you are running into this issue you must see the error below in the ocssd.log files in the Clustereware home($ORACLE_HOME/log/<hostname>/cssd/ocssd*)
Perform a search on "exceeds max value".  If you find these lines you are running into the issue.
[ CSSD]2010-01-25 [1115699552] >ERROR: ASSERT clssgm.c 1594
[ CSSD]2010-01-25 [1115699552] >ERROR: clssgmGetGrock: Group ID of 702053 exceeds max value for global groups
[ CSSD]2010-01-25 [1115699552] >TRACE: clssgmDiscOmonReady: omon was posted for member 3
[ CSSD]2010-01-25 [1115699552] >ERROR: ###################################
[ CSSD]2010-01-25 [1115699552] >ERROR: clssscExit: CSSD aborting from thread GM Peer Lsnr
[ CSSD]2010-01-25 [1115699552] >ERROR: ###################################
 
You will see also that the master node must be evicted to run into this bug.
You can find the master node if you search on CLSS-3001.
 
[ CSSD]CLSS-3000: reconfiguration successful, incarnation 103410002 with 1 nodes
[ CSSD]CLSS-3001: local node number 1, master node number 1
Solution

The bug is fixed in Patch Setup Update 2.  For more information about this pathset check My Oracle Support ID 810663.1

Bugs Fixed by 11.1.0.7 CRS PSU #2:
 
All bugs fixed by 11.1.0.7 CRS Bundle Patch #1 and below bugs...
6140790 CRSD ISSUES ALERT LOG MESSAGE FOR CRSD-1205 WITHOUT RESOURCE NAME OR LOG FILE
6355663 UMASK IN S0CLSRDMAI.C AND SCLSRUTL.C SHOULD BE REMOVED
6486556 LX64-070801.27 - CSS KILLS NODES TO AVOID SPLITBRAIN
6986682 POSTROOTPATCH.SH SHOULD ALLOW OPTION TO NOT START THE STACK
7357394 AIX11107: CPU STARVATION ON ONE NODE, OTHER NODES CRASHED DURING CSS RECONFIG
7364519 APPSST GSI 10G:REMOTE NODES INVENTORY GET CORRUPTED AFTER APPLYING ROLLING PATCH
7631837 TB: CRSD CORE DUMP IN PROCR_CACHING_RETRY AT PROCR.C:7357
8214307 NODE REBOOT SLOW DUE TO EXCESS INIT.CRS STOP
8328259 TAF/RAC: QUERIES DO NOT FAIL OVER (SESSIONS DO) IN CERTAIN CONDITIONS - SOLARIS
8373758 TB-CMP: 11107 SERVICE CAN'T BE BROUGHT UP BY 11107 SRVCTL WHEN WITH 11.2 CRS
8374326 CANNOT DISABLE AUTOSTART IN CRS SCRIPTS SUSE 10 ORACLE 11I AND 10G RAC
8429716 DATABASE HANGS AT 1900 USER LOAD
8476516 CLONE.PL COPIES THE WRONG INIT.CSSD
8531031 CRS BUNDLE 8287931 INSTALL IS INCOMPLETE, CORRUPTION
8557163 LOCAL JOIN IN CSSD IS TAKING AROUND 10 MINS
8586117 "CRSD.BIN" IS NOT RESPAWNED CORRECTLY AFTER KILL IF HOSTNAME HAS CAPITAL LETTER
8595233 PROBLEMS ON PRIVATE NETWORK CAUSES ALL NODES TO BE EVICTED
8604549 TB_WB: ALL NODES REBOOT WHEN GLOBAL GROUP ID REACHES TO 524288
8619821 CORE DUMP FROM CSSD : CLSSGMINITIALRECV()
8733944 NODE REBOOT DURING CRS 11.2 BETA UPGRADE
8737425 INTENSIVE BURSTS OF CLOSE(2) SYSTEM CALLS WHEN CRSD.BIN FORKS CHILD PROCESSES
8781354 POSTROOTPATCH.SH FAILS W/"ARGUMENT LIST TOO LONG" DUE TO CORES
9189026 UNABLE TO BRING THE CLUSTER SERVICES UP ON THE THE 4NODE CLUSTER AFTER APPLYING
9189171 ROLLING UPGRADE OF 11.1.0.7 BUNDLE 2 TO 11.2 IS BROKEN

© RACHelp 2010 | About me | Disclaimers | Contact