DBA survival BLOG

DBA stuff and Oracle Data Guard

Oracle Grid Infrastructure 18c patching part 1: Some history

Posted on November 16, 2018 by Ludovico

Down the memory lane

Although I sometimes feel I have been working with Oracle Grid Infrastructure for as long as it has existed, my memory does not always serve me well. I still like to go through the Oracle RAC family history from time to time:

8i -> no Oracle cluster existed. RAC was leveraging 3rd-party clusters (like TruCluster, AIX HACMP, Sun Cluster)…
9i -> if I remember correctly, Oracle hired some developers of TruCluster after the acquisition of Compaq by HP. Oracle CRS was born and was quite similar to TruCluster. (The commands were almost the same: crs_stat instead of caa_stat, etc.)
10g -> Oracle re-branded CRS to Clusterware
11g -> With the addition of ASM (and other components), Oracle created the concept of “Grid Infrastructure”, composed of Clusterware and additional products. All the new versions still use the name Grid Infrastructure, and new products have been added through the years (ACFS, RHP, QoS …)

But there are gaps in my memories. For example, I cannot remember ever having upgraded an Oracle cluster from 9i to 10g or from 10g to 11g. At that time I was working for several customers, and every new release was installed on new hardware.

My first, real upgrade (as far as I can remember) was from 11gR2 to 12c, where the upgrade process was a nice, OUI-driven, out-of-place install.

The process was (still is 🙂 ) nice and smooth:

The installer copies, prepares and links the binaries on all the nodes in a new Oracle Home
The upgrade process is rolling: the first node puts the cluster in upgrade mode
The last node performs the final steps and takes the cluster out of upgrade mode.

This is about Upgrading to a new release. But what about patching?

In-place patching

Patching of Grid Infrastructure has always been in-place and, I will not hide it, quite painful.

If you wanted to patch a Grid Infrastructure before release 12cR2, you had to:

read the documentation carefully and check for possible conflicts
back up the Grid Home
copy the patch to the host
evacuate all the services and databases from the cluster node that you want to patch
patch the binaries (depending on the versions and patches, this might be easy with opatchauto or quite painful with manual unlocking/locking and manual opatch steps)
restart/relocate the services back on the node
repeat the tasks for every node

The disadvantages of in-place patching are many:

Need to stage the patch on every node
Need to repeat the patching process for every node
No easy rollback (some bad problems might lead to deconfiguring the cluster from one node and then adding it back to the cluster)

Out-of-place patching

Out-of-place patching has proven to be a much better solution. I have been doing it regularly for a while for Oracle Database homes and I am very satisfied with it. I am implementing it at CERN as well, and it will unlock new levels of server consolidation 🙂

I have written a blog series here, and presented it a few times.

But out-of-place patching for Grid Infrastructure is VERY recent.

12cR2: opatchauto

Oracle 12cR2 introduced out-of-place patching as a new feature of opatchauto.

This MOS document explains it in quite some detail:

Grid Infrastructure Out of Place ( OOP ) Patching using opatchauto (Doc ID 2419319.1)

The process is the following:

a preparation process clones the active Oracle Home on the current node and patches it
a switch process switches the active Oracle Home from the old one to the prepared clone
those two phases are repeated for each node

The good thing is that the preparation can be done in advance on all the nodes and the switch can be triggered only if all the clones are patched successfully.
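The two phases can be sketched as a dry run that only prints the commands, first the preparation on every node and then the switch. This is a minimal sketch under assumptions: the node names and the patch directory are placeholders, print_oop_plan is a hypothetical helper, and the -prepare-clone and -switch-clone flags are written as I understand them from the MOS note above, so check the note for the exact syntax of your version.

```shell
#!/bin/bash
# Dry-run sketch: prints the two-phase plan, does not execute anything.
# Assumptions: node names and patch directory are placeholders; opatchauto
# is expected to run as root on each node from the active Grid Home.

print_oop_plan() {
    local patch_dir="$1"; shift
    local node
    # Phase 1: clone and patch the home on every node (can be done in advance)
    for node in "$@"; do
        echo "ssh root@${node} opatchauto apply ${patch_dir} -prepare-clone"
    done
    # Phase 2: switch to the prepared clone, node by node,
    # to be triggered only once every clone has been patched successfully
    for node in "$@"; do
        echo "ssh root@${node} opatchauto apply -switch-clone"
    done
}
```

Calling print_oop_plan /path/to/patch server1 server2 prints four lines: two prepare commands followed by two switch commands.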

However, the staging of the patch, the cloning and patching must still happen on every node, making the concept of golden images quite useless for patching.

It is worth mentioning, at this point, that Grid Infrastructure Golden Images ARE A THING, and that they were introduced by Rapid Home Provisioning release 12cR2, where automatic cluster provisioning was included as a new feature.

These Grid Infrastructure golden images have already been mentioned here and here.

I have discussed Rapid Home Provisioning itself here, but I will add a couple of thoughts in the next paragraph.

18c and the brand new Independent local-mode Automaton

I was an early tester of the Rapid Home Provisioning product when it was released with Oracle 12.1.0.2. I have presented it at UKOUG and in a RAC SIG webinar.
https://www.youtube.com/watch?v=vaB4RWjYPq0
https://www.ludovicocaldara.net/dba/rhp-presentation/

I liked the product A LOT, despite a few bugs typical of an initial release. The concept of out-of-place patching that RHP uses is the best one, in my opinion, to cope with frequent patches and upgrades.

Now, with Oracle 18c, the Rapid Home Provisioning Independent Local-mode Automaton comes into play. There is not much information about it, even in the Oracle documentation, but a few things are clear:

The Independent local-mode automaton comes without additional licenses as it is not part of the RHP Server/Client infrastructure
It is 100% local to the cluster where it is used
Its main “job” is to allow moving Grid Infrastructure Homes from a non-patched version to an out-of-place patched one.

$ rhpctl move gihome -sourcehome Oracle_home_path -destinationhome Oracle_home_path
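To make the two placeholders concrete, here is a minimal sketch that assembles the call for a pair of homes and prints it without running anything. The helper name move_gihome_cmd and the example paths are assumptions for illustration only; the sanity checks (absolute paths, two distinct homes) simply reflect what out-of-place means.

```shell
#!/bin/bash
# Sketch: build the "rhpctl move gihome" call for two Grid Homes.
# Nothing is executed; the command is only printed.
# move_gihome_cmd is a hypothetical helper, not part of rhpctl.

move_gihome_cmd() {
    # Strip a trailing slash so that /a/b and /a/b/ compare as equal
    local src="${1%/}" dst="${2%/}"
    # Both homes must be absolute paths
    case "$src" in /*) ;; *) echo "source home must be an absolute path" >&2; return 1 ;; esac
    case "$dst" in /*) ;; *) echo "destination home must be an absolute path" >&2; return 1 ;; esac
    # Out-of-place means moving between two distinct homes
    [ "$src" != "$dst" ] || { echo "source and destination are the same home" >&2; return 1; }
    echo "rhpctl move gihome -sourcehome ${src} -destinationhome ${dst}"
}
```

For example, move_gihome_cmd /u01/app/grid/crs1830 /u01/app/grid/crs1840 prints the command with both paths filled in (crs1830 is a made-up source home).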

I will not disclose more here, as the rest of this blog series is focused on this new product 🙂

Stay tuned for details, examples and feedback from its usage at CERN 😉

—

Ludo

Port conflict with “Oracle Remote Method Invocation (ORMI)” during Grid Infrastructure install

Posted on November 13, 2018 by Ludovico

After years of installing Grid Infrastructures, today I got, for the first time, an error on something new:

$ /u01/app/grid/crs1840/gridSetup.sh -silent -responseFile /u01/app/grid/crs1840/inventory/response/CERNDB_Grid_Config.rsp ORACLE_HOME_NAME=crs1840
Launching Oracle Grid Infrastructure Setup Wizard...

[FATAL] [INS-13013] Target environment does not meet some mandatory requirements.
   CAUSE: Some of the mandatory prerequisites are not met. See logs for details. /tmp/GridSetupActions2018-11-13_12-40-03PM/gridSetupActions2018-11-13_12-40-03PM.log
   ACTION: Identify the list of failed prerequisite checks from the log: /tmp/GridSetupActions2018-11-13_12-40-03PM/gridSetupActions2018-11-13_12-40-03PM.log. Then either from the log file or from installation manual find the appropriate configuration to meet the prerequisites and fix it manually.

Looking at the logs (which I do not have now as I removed them as part of the failed install cleanup 🙁 ), the error is generated by the cluster verification utility (CVU) on this check:

Verifying Port Availability for component "Oracle Remote Method Invocation (ORMI)"

The components verified by the CVU can be found inside $ORACLE_HOME/cv/cvdata/. In my case, precisely:

$ grep -i ORMI $ORACLE_HOME/cv/cvdata/18/crsinst_prereq.xml
         <PORT NAME="Oracle Remote Method Invocation (ORMI)" VALUE="23791" PROTOCOL="TCP" NETWORK_TYPE="PUBLIC"/>
         <PORT NAME="Oracle Remote Method Invocation (ORMI)" VALUE="23792" PROTOCOL="TCP" NETWORK_TYPE="PUBLIC"/>

This check is critical, so the install fails.

In my case the port was used by mcollectived.

[root@server1 work]# netstat -anp | grep 23791

[root@server1 work]# netstat -anp | grep 23792
tcp 0 0 x.x.x.x:23792 x.x.x.x:61613 ESTABLISHED 2298/ruby

[root@server1 work]# ps -eaf | grep 2298
root 2298 1 0 11:16 ? 00:00:02 /opt/puppetlabs/puppet/bin/ruby /opt/puppetlabs/puppet/bin/mcollectived --config=/etc/puppetlabs/mcollective/server.cfg --pidfile=/var/run/puppetlabs/mcollective.pid --daemonize
root 47116 4114 0 12:50 pts/0 00:00:00 grep --color=auto 2298

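The port was taken dynamically: mcollectived was using it as the ephemeral source port of an outgoing connection (to port 61613 in the output above), which is why previous runs of CVU did not encounter the problem.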
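Since the conflict depends on which ephemeral ports happen to be in use, a quick pre-check before launching the installer can save a failed run. The following is a minimal sketch, not an Oracle tool: cvu_ports and port_in_use are hypothetical helpers, the parsing assumes the PORT elements look like the two lines shown above, and it assumes that ss (from iproute2) is available.

```shell
#!/bin/bash
# Sketch: list the TCP ports that CVU checks and detect the ones already in use.
# Assumptions: PORT elements are formatted like the grep output above;
# "ss -tan" prints a header line and the local address:port in column 4.

# Read the prerequisite XML on stdin, print one TCP port number per line
cvu_ports() {
    sed -n '/PROTOCOL="TCP"/s/.*<PORT NAME="[^"]*" VALUE="\([0-9][0-9]*\)".*/\1/p' | sort -un
}

# Read "ss -tan" output on stdin, succeed if the given local port appears in it
port_in_use() {
    awk -v p="$1" '
        NR > 1 { n = split($4, a, ":"); if (a[n] == p) found = 1 }
        END    { exit !found }'
}

# Usage (not run here):
#   for p in $(cvu_ports < "$ORACLE_HOME/cv/cvdata/18/crsinst_prereq.xml"); do
#       ss -tan | port_in_use "$p" && echo "port $p is already in use"
#   done
```

The check looks at the local port of every TCP socket, listening or not, because in my case the conflict came from an established outgoing connection rather than from a listener.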

A rare port conflict that might happen when configuring GI 🙂

—

Ludo

Grid Infrastructure 18c: changes in gridSetup.sh -applyRU and -createGoldImage

Posted on November 6, 2018 by Ludovico

Starting with release 12cR2, Grid Infrastructure binaries are no longer shipped as an installer, but as a zip file that is uncompressed directly in the Oracle Home path.
This opened a few new possibilities including patching the software before the Grid Infrastructure configuration.
My former colleague Markus Flechtner wrote an excellent blog post about it, here: https://www.markusdba.net/?p=294

Now, with 18c, a couple of things have changed compared to Markus' blog post.

The -applyRU switch replaces the -applyPSU

While it is possible to apply several sub-patches of a PSU one by one:

./gridSetup.sh -silent -applyOneOffs <path to sub-patch>
e.g.

./gridSetup.sh -silent -applyOneOffs /work/p28659165_180000_Linux-x86-64/28659165/28547619
./gridSetup.sh -silent -applyOneOffs /work/p28659165_180000_Linux-x86-64/28659165/28655784
./gridSetup.sh -silent -applyOneOffs /work/p28659165_180000_Linux-x86-64/28659165/28655916
...

it was also possible to apply them all at once with:

./gridSetup.sh -silent -applyPSU <path to PSU>

For consistency with the patch naming, the switch is now called -applyRU.

E.g.:

# [ oracle@server:/u01/app/grid/crs1840 [16:38:40] [18.4.0.0.0 [GRID] SID=GRID] 255 ] #
$ ./gridSetup.sh -silent -applyRU /u01/app/oracle/stage/p28659165_180000_Linux-x86-64/28659165
Preparing the home to patch...
Applying the patch  /u01/app/oracle/stage/p28659165_180000_Linux-x86-64/28659165...
Successfully applied the patch.
The log can be found at: /u01/app/oraInventory/logs/GridSetupActions2018-11-02_04-39-54PM/installerPatchActions_2018-11-02_04-39-54PM.log
Launching Oracle Grid Infrastructure Setup Wizard...

[FATAL] [INS-40426] Grid installation option has not been specified.
   ACTION: Specify the valid installation option.

There is still no option to skip the launch of the Setup Wizard, but it is safe to ignore the error because the patch has been applied successfully.
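If you prefer not to rely on the "Successfully applied the patch" message alone, the home's patch inventory can be checked after the run. The sketch below is an assumption-laden helper, not part of OPatch: patches_applied is a hypothetical name, and it expects the output of opatch lspatches on stdin, assuming one "patch_id;description" line per installed patch. Note that the inventory lists the sub-patches of the RU (for example 28547619, 28655784 and 28655916 from the paths above), not necessarily the bundle number itself.

```shell
#!/bin/bash
# Sketch: verify that a set of patch ids shows up in "opatch lspatches" output.
# Assumption: each installed patch is listed as "<patch_id>;<description>".

# Read the lspatches listing on stdin; succeed only if every id given
# as an argument is present, otherwise report the first missing one.
patches_applied() {
    local listing id
    listing="$(cat)"
    for id in "$@"; do
        printf '%s\n' "$listing" | grep -q "^${id};" || {
            echo "missing patch: ${id}" >&2
            return 1
        }
    done
}

# Usage (not run here):
#   "$ORACLE_HOME/OPatch/opatch" lspatches | patches_applied 28547619 28655784 28655916
```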

The -createGoldImage does not work anymore if the Home is not attached

I tried to create the golden image as per Markus' post, but I got this error:

# [ oracle@server:/u01/app/grid/crs1840 [09:43:39] [18.4.0.0.0 [GRID] SID=GRID] 0 ] #
$ ./gridSetup.sh -createGoldImage -destinationlocation  /u01/app/oracle/stage/golden_images/crs1840 -silent
Launching Oracle Grid Infrastructure Setup Wizard...

[FATAL] [INS-32715] The source home (/u01/app/grid/crs1840) is not registered in the central inventory.
   ACTION: Ensure that the source home is registered in the central inventory.

There are two ways to work around the issue:

Create a zip file manually, as all the content needed to install the patched version is right there. No need to touch anything as the software is not configured yet.

Configure the software with CRS_SWONLY before creating the gold image:

$ cat grid1840_swonly.rsp
oracle.install.responseFileVersion=/oracle/install/rspfmt_crsinstall_response_schema_v18.0.0
INVENTORY_LOCATION=/u01/app/oraInventory
oracle.install.option=CRS_SWONLY
ORACLE_BASE=/u01/app/oracle
oracle.install.asm.OSDBA=dba
oracle.install.asm.OSASM=asmdba
oracle.install.crs.config.scanType=LOCAL_SCAN
oracle.install.crs.config.gpnp.configureGNS=false
oracle.install.crs.config.autoConfigureClusterNodeVIP=false
oracle.install.crs.config.gpnp.gnsOption=CREATE_NEW_GNS
oracle.install.crs.config.clusterNodes=server1,server2
oracle.install.asm.configureGIMRDataDG=false
oracle.install.crs.config.useIPMI=false
oracle.install.asm.storageOption=ASM
oracle.install.asmOnNAS.configureGIMRDataDG=false
oracle.install.asm.diskGroup.name=OCRVOT
oracle.install.asm.diskGroup.AUSize=1
oracle.install.asm.gimrDG.AUSize=1
oracle.install.asm.configureAFD=false
oracle.install.crs.configureRHPS=false
oracle.install.crs.config.ignoreDownNodes=false
oracle.install.config.managementOption=NONE
oracle.install.config.omsPort=0
oracle.install.crs.rootconfig.executeRootScript=false

$ ./gridSetup.sh -silent -responseFile grid1840_swonly.rsp ORACLE_HOME_NAME=crs1840
Launching Oracle Grid Infrastructure Setup Wizard...

The response file for this session can be found at:
 /u01/app/grid/crs1840/install/response/grid_2018-11-05_01-18-28PM.rsp

You can find the log of this install session at:
 /u01/app/oraInventory/logs/GridSetupActions2018-11-05_01-18-28PM/gridSetupActions2018-11-05_01-18-28PM.log

As a root user, execute the following script(s):
        1. /u01/app/grid/crs1840/root.sh

Execute /u01/app/grid/crs1840/root.sh on the following nodes:
[server1, server2]

[root@server1 dbs01]# /u01/app/grid/crs1840/root.sh
Check /u01/app/grid/crs1840/install/root_server1.cern.ch_2018-11-05_14-13-58-835084539.log for the output of root script

[root@server2 dbs01]# /u01/app/grid/crs1840/root.sh
Check /u01/app/grid/crs1840/install/root_server2.cern.ch_2018-11-05_14-15-18-835087641.log for the output of root script

$ ./gridSetup.sh -createGoldImage -destinationlocation  /u01/app/oracle/stage/golden_images/crs1840 -silent
Launching Oracle Grid Infrastructure Setup Wizard...

Successfully Setup Software.
Gold Image location: /u01/app/oracle/stage/golden_images/crs1840/grid_home_2018-11-05_02-25-52PM.zip
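For the first option, the manual zip, a minimal sketch could look like the following. It is only an illustration under assumptions: zip_home is a hypothetical helper, the zip utility is assumed to be installed, and the home is assumed to be patched but not yet configured, so that nothing in it is owned by root.

```shell
#!/bin/bash
# Sketch: create a zip of an unconfigured, already patched Grid Home.
# Assumptions: "zip" is installed; the destination is an absolute path
# outside the home itself; run as the software owner.

zip_home() {
    local home="$1" dest="$2"
    [ -d "$home" ] || { echo "no such home: ${home}" >&2; return 1; }
    # The archive is created from inside the home, so dest must be absolute
    case "$dest" in /*) ;; *) echo "destination must be an absolute path" >&2; return 1 ;; esac
    # -r recurse, -y store symbolic links as links, -q quiet;
    # relative paths inside the archive let it be unzipped into any new home
    ( cd "$home" && zip -qry "$dest" . )
}

# Usage (not run here):
#   zip_home /u01/app/grid/crs1840 /u01/app/oracle/stage/golden_images/crs1840/crs1840_manual.zip
```

The resulting file can then be unzipped into a new Oracle Home path, like the zip shipped by Oracle.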

HTH

—

Ludo