
Note: All the posts are based on practical approach avoiding lengthy theory. All have been tested on some development servers. Please don’t test any post on production servers until you are sure.

Sunday, June 02, 2013

INS-20802 Grid Infrastructure Configuration Failed - 11gR2 on Windows

Yesterday, while installing Oracle RAC 11gR2 (11.2.0.3) at a client site, I got the error below.

INS-20802 Grid Infrastructure Configuration Failed
Although all the cluvfy verification checks had already passed, the above error occurred at 84% of the installation, during the Grid Infrastructure Configuration step. Before this error, the remote operations went smoothly and all GI folders were copied to the remote host successfully. I did not cancel the installation and decided to investigate the cause first.


1- I checked the installation logs (C:\Program Files\Oracle\Inventory\logs), which did not provide enough information.
installActions2013-06-01_03-41-08PM
INFO: Starting 'Grid Infrastructure Configuration'
INFO: Starting 'Grid Infrastructure Configuration'
INFO: 2013-06-01 13:30:04: Checking for super user privileges
INFO: 2013-06-01 13:30:04: superUser=Administrator groupName=Administrators
INFO: 2013-06-01 13:30:04: domain=WORKGROUP user=ADMINISTRATOR
INFO: 2013-06-01 13:30:04: C:\app\11.2.0\grid\bin\crssetup.exe getsystem
INFO: 2013-06-01 13:30:04: Executing cmd: C:\app\11.2.0\grid\bin\crssetup.exe getsystem
INFO: 2013-06-01 13:30:05: Command output:
INFO: > SYSTEM
INFO: >End Command output
INFO: 2013-06-01 13:30:05: User has Administrator privileges
INFO: Using configuration parameter file: C:\app\11.2.0\grid\crs\install\crsconfig_params
INFO: Creating trace directory
INFO:
WARNING:
INFO: Completed Plugin named: Grid Infrastructure Configuration
INFO: Grid Infrastructure Configuration failed.
INFO: Grid Infrastructure Configuration failed.
INFO: ConfigClient.executeToolsInAggregate action performed
INFO: Exiting ConfigClient.executeToolsInAggregate method
INFO: Calling event ConfigToolsExecuted
INFO:
2- So I continued with the other logs (C:\app\11.2.0\grid\cfgtoollogs\crsconfig) and found that the problem was with the remote node(s).

rootcrs_gispsddb1
2013-06-01 13:55:17: Succeeded in writing the checkpoint:'ROOTCRS_REMOTENODES' with status:FAIL
2013-06-01 13:55:17: CkptFile: C:\app\Administrator\Clusterware\ckptGridHA_gispsddb1.xml
2013-06-01 13:55:17: Sync the checkpoint file 'C:\app\Administrator\Clusterware\ckptGridHA_gispsddb1.xml'
2013-06-01 13:55:17: Configure Oracle Grid Infrastructure on remote nodes... failed
2013-06-01 13:59:10: The configuration parameter file C:\app\11.2.0\grid\crs\install\crsconfig_params is valid
2013-06-01 13:59:10: Checking for super user privileges
2013-06-01 13:59:10: superUser=Administrator groupName=Administrators
2013-06-01 13:59:10: domain=WORKGROUP user=ADMINISTRATOR
2013-06-01 13:59:10: C:\app\11.2.0\grid\bin\crssetup.exe getsystem
2013-06-01 13:59:10: Executing cmd: C:\app\11.2.0\grid\bin\crssetup.exe getsystem
2013-06-01 13:59:10: Command output:

3- Knowing that the issue was with the remote node (the client had only 2 nodes), I checked the logs on the remote node.
rootcrs_gispsddb2

2013-06-01 13:54:37: Running C:\app\11.2.0\grid\bin\crsctl check cluster -n gispsddb2
2013-06-01 13:54:37: Executing cmd: C:\app\11.2.0\grid\bin\crsctl check cluster -n gispsddb2
2013-06-01 13:54:38: Checking the status of cluster
2013-06-01 13:54:43: Executing cmd: C:\app\11.2.0\grid\bin\crsctl check cluster -n gispsddb2
2013-06-01 13:54:43: Checking the status of cluster
2013-06-01 13:54:48: Executing cmd: C:\app\11.2.0\grid\bin\crsctl check cluster -n gispsddb2
2013-06-01 13:54:48: Checking the status of cluster
2013-06-01 13:54:53: Executing cmd: C:\app\11.2.0\grid\bin\crsctl check cluster -n gispsddb2
2013-06-01 13:54:53: Checking the status of cluster
2013-06-01 13:54:58: Executing cmd: C:\app\11.2.0\grid\bin\crsctl check cluster -n gispsddb2
2013-06-01 13:54:59: Checking the status of cluster
2013-06-01 13:55:04: Executing cmd: C:\app\11.2.0\grid\bin\crsctl check cluster -n gispsddb2
2013-06-01 13:55:04: Checking the status of cluster
2013-06-01 13:55:09: Executing cmd: C:\app\11.2.0\grid\bin\crsctl check cluster -n gispsddb2
2013-06-01 13:55:09: Oracle CRS stack installed and running
2013-06-01 13:55:09: Running C:\app\11.2.0\grid\bin\crsctl check cluster -n gispsddb2
2013-06-01 13:55:09: Executing cmd: C:\app\11.2.0\grid\bin\crsctl check cluster -n gispsddb2
2013-06-01 13:55:09: ckpt: C:\app\11.2.0\grid\bin\cluutil -ckpt -oraclebase C:\app\Administrator -chkckpt -name ROOTCRS_NODECONFIG
2013-06-01 13:55:09: Running as user Administrator: C:\app\11.2.0\grid\bin\cluutil -ckpt -oraclebase C:\app\Administrator -chkckpt -name ROOTCRS_NODECONFIG
2013-06-01 13:55:09: s_run_as_user2: Running C:\app\11.2.0\grid\bin\cluutil -ckpt -oraclebase C:\app\Administrator -chkckpt -name ROOTCRS_NODECONFIG
2013-06-01 13:55:10: C:\app\11.2.0\grid\bin\cluutil successfully executed
2013-06-01 13:55:10: checkpoint ROOTCRS_NODECONFIG does not exist
2013-06-01 13:55:10: Running as user Administrator: C:\app\11.2.0\grid\bin\cluutil -ckpt -oraclebase C:\app\Administrator -writeckpt -name ROOTCRS_NODECONFIG -state START
2013-06-01 13:55:10: s_run_as_user2: Running C:\app\11.2.0\grid\bin\cluutil -ckpt -oraclebase C:\app\Administrator -writeckpt -name ROOTCRS_NODECONFIG -state START
2013-06-01 13:55:10: C:\app\11.2.0\grid\bin\cluutil successfully executed
2013-06-01 13:55:10: Succeeded in writing the checkpoint:'ROOTCRS_NODECONFIG' with status:START
2013-06-01 13:55:10: CkptFile: C:\app\Administrator\Clusterware\ckptGridHA_gispsddb2.xml
2013-06-01 13:55:10: Sync the checkpoint file 'C:\app\Administrator\Clusterware\ckptGridHA_gispsddb2.xml'
2013-06-01 13:55:10: Configuring node
2013-06-01 13:55:11: adding nodeapps...
2013-06-01 13:55:11: upgrade_opt=
2013-06-01 13:55:11: nodevip=GISPSDDB2-vip/255.255.255.0/public
2013-06-01 13:55:11: DHCP_flag=0
2013-06-01 13:55:11: nodes_to_add=GISPSDDB2
2013-06-01 13:55:11: add nodeapps for static IP
2013-06-01 13:55:11: Running srvctl config nodeapps to detect if VIP exists
2013-06-01 13:55:15: vip exists
2013-06-01 13:55:15: Invoking "C:\app\11.2.0\grid\bin\srvctl add vip -n gispsddb2 -k 1 -A "GISPSDDB2-vip/255.255.255.0/public" "
2013-06-01 13:55:15: trace file=C:\app\11.2.0\grid\cfgtoollogs\crsconfig\srvmcfg0.log
2013-06-01 13:55:15: Executing C:\app\11.2.0\grid\bin\srvctl add vip -n gispsddb2 -k 1 -A "GISPSDDB2-vip/255.255.255.0/public"
2013-06-01 13:55:15: Executing cmd: C:\app\11.2.0\grid\bin\srvctl add vip -n gispsddb2 -k 1 -A "GISPSDDB2-vip/255.255.255.0/public"
2013-06-01 13:55:16: Command output:
> PRKO-2200 : Invalid interface specified on command line: public
>End Command output
2013-06-01 13:55:16: "C:\app\11.2.0\grid\bin\srvctl add vip -n gispsddb2 -k 1 -A "GISPSDDB2-vip/255.255.255.0/public" " failed with status 1.
So the problem occurred while adding the VIP for node 2, and it was related to the public interface. I checked ncpa.cpl and found no issue with the public interface; the names appeared identical in the Windows GUI. I then verified the interfaces on both nodes as below.
On node 1:
C:\app\11.2.0\grid\BIN>oifcfg iflist
public 172.16.139.0
Private 70.70.70.0
On node 2:
Because the cluster configuration had not completed successfully on node 2, I had to run the check from the Grid installation media.
E:\gridMedia\stage\ext\BIN>oifcfg iflist
Public 172.16.139.0
Private 70.70.70.0
Using oifcfg, I found that the public interface name on node 2 ("Public") differed in case from node 1's ("public"). I asked the client to rename the interface to "public", and it was done. After this I simply clicked the Retry button on the same installation screen and the installation went smoothly :)
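The comparison I did by eye can be sketched in a few lines of Python. This is only an illustration, not an Oracle tool: the function names parse_iflist and case_mismatches are hypothetical, and the sketch assumes the plain "name subnet" output of oifcfg iflist shown above has been captured as text from each node.

```python
def parse_iflist(output):
    """Parse 'oifcfg iflist' text of the form '<name> <subnet>' into a dict.

    Assumption: the subnet is the last token on each line, and everything
    before it is the interface name (Windows names may contain spaces).
    """
    interfaces = {}
    for line in output.splitlines():
        parts = line.split()
        if len(parts) >= 2:
            interfaces[" ".join(parts[:-1])] = parts[-1]
    return interfaces


def case_mismatches(node_a, node_b):
    """Return (name_a, name_b, subnet) where names differ only by letter case."""
    mismatches = []
    for name_a, subnet_a in node_a.items():
        for name_b, subnet_b in node_b.items():
            same_ignoring_case = name_a.lower() == name_b.lower()
            if name_a != name_b and same_ignoring_case and subnet_a == subnet_b:
                mismatches.append((name_a, name_b, subnet_a))
    return mismatches


# Output captured from the two nodes in this post
node1 = parse_iflist("public 172.16.139.0\nPrivate 70.70.70.0")
node2 = parse_iflist("Public 172.16.139.0\nPrivate 70.70.70.0")

for a, b, subnet in case_mismatches(node1, node2):
    print(f"{subnet}: node1 has '{a}', node2 has '{b}'")
```

For the two nodes here, only the public interface is reported; "Private" matches exactly on both nodes and is left alone.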


NOTE: INS-20802 is a 'catch-all' error that only indicates that the Grid Infrastructure Configuration has failed. There can be myriad causes for it, e.g. an operating system function failure, a failure to install a driver, etc.
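Since INS-20802 itself says so little, the quickest way to the real cause is to look for failing commands and Oracle message codes in the rootcrs_<node> logs. Below is a minimal sketch of such a scan in Python. The function name find_failures and the list of message prefixes are my own assumptions for illustration (extend them as needed); the sample text is taken from the node 2 log above.

```python
import re

# Assumption: a small, illustrative set of Oracle message prefixes
ERROR_CODE = re.compile(r"\b(?:PRKO|PRCR|PRVF|CRS)-\d+\b")
# Failure phrases as they appear in the log excerpts in this post
FAILED = re.compile(r"failed with status \d+|\.\.\. failed|status:FAIL")


def find_failures(log_text):
    """Return (line_number, line) pairs that look like errors or failures."""
    hits = []
    for number, line in enumerate(log_text.splitlines(), start=1):
        if ERROR_CODE.search(line) or FAILED.search(line):
            hits.append((number, line.strip()))
    return hits


# A few lines copied from rootcrs_gispsddb2
sample = r"""2013-06-01 13:55:15: vip exists
> PRKO-2200 : Invalid interface specified on command line: public
>End Command output
2013-06-01 13:55:16: "C:\app\11.2.0\grid\bin\srvctl add vip -n gispsddb2 -k 1 -A "GISPSDDB2-vip/255.255.255.0/public" " failed with status 1."""

for number, line in find_failures(sample):
    print(f"line {number}: {line}")
```

To scan a real log, read the file contents into a string and pass it to find_failures in place of the sample.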
