DBA survival BLOG

DBA stuff and Oracle Data Guard

Data Guard, Easy Connect and the Observer for multiple configurations

Posted on August 14, 2020 by Ludovico

EZConnect

One of the challenges of automation in bin Oracle Environments is dealing with tnsnames.ora files.
These files might grow big and are sometimes hard to distribute/maintain properly.
The worst is when manual modifications are needed: manual operations, if not made carefully, can screw up the connection to the databases.
The best solution is always using LDAP naming resolution. I have seen customers using OID, OUD, Active Directory, openldapd, all with a great level of control and automation. However, some customer don’t have/want this possibility and keep relying on TNS naming resolution.
When Data Guard (and eventually RAC) are in place, the tnsnames.ora gets filled by entries for the DGConnectIdentifiers and StaticConnectIdentifier. If I add the observer, an additional entry is required to access the dbname_CFG service created by the Fast Start Failover.

Actually, all these entries are not required if I use Easy Connect.

My friend Franck Pachot wrote a couple of nice blog posts about Easy Connect while working with me at CERN:
https://medium.com/@FranckPachot/19c-easy-connect-e0c3b77968d7

https://medium.com/@FranckPachot/19c-ezconnect-and-wallet-easy-connect-and-external-password-file-8e326bb8c9f5

Basic Data Guard configuration

The basic configuration with Data Guard is quite simple to achieve with Easy Connect. In this examples I have:
– The primary database TOOLCDB1_SITE1
– The duplicated database for standby TOOLCDB1_SITE2

After setting up the static registration (no Grid Infrastructure in my lab):

SID_LIST_LISTENER=
  (SID_LIST=
    (SID_DESC=
      (GLOBAL_DBNAME=TOOLCDB1_SITE1_DGMGRL)
      (SID_NAME=TOOLCDB1)
      (ORACLE_HOME=/u01/app/oracle/product/db_19_8_0)
    )
  )

SID_LIST_LISTENER=

(SID_LIST=

(SID_DESC=

(GLOBAL_DBNAME=TOOLCDB1_SITE1_DGMGRL)

(SID_NAME=TOOLCDB1)

(ORACLE_HOME=/u01/app/oracle/product/db_19_8_0)

)

and copying the passwordfile, the configuration can be created with:

DGMGRL> create configuration TOOLCDB1 as primary database is TOOLCDB1_SITE1 connect identifier is 'newbox01:1521/TOOLCDB1_SITE1';
Configuration "toolcdb1" created with primary database "toolcdb1_site1"

DGMGRL>  edit database TOOLCDB1_SITE1 set property 'StaticConnectIdentifier'='newbox01:1521/TOOLCDB1_SITE1_DGMGRL';
Property "StaticConnectIdentifier" updated

DGMGRL>  add database TOOLCDB1_SITE2 as connect identifier is 'newbox02:1521/TOOLCDB1_SITE2';
Database "toolcdb1_site2" added

DGMGRL>  edit database TOOLCDB1_SITE2 set property 'StaticConnectIdentifier'='newbox02:1521/TOOLCDB1_SITE2_DGMGRL';
Property "StaticConnectIdentifier" updated

DGMGRL>  enable configuration;
Enabled.

DGMGRL> create configuration TOOLCDB1 as primary database is TOOLCDB1_SITE1 connect identifier is 'newbox01:1521/TOOLCDB1_SITE1';

Configuration "toolcdb1" created with primary database "toolcdb1_site1"

DGMGRL> edit database TOOLCDB1_SITE1 set property 'StaticConnectIdentifier'='newbox01:1521/TOOLCDB1_SITE1_DGMGRL';

Property "StaticConnectIdentifier" updated

DGMGRL> add database TOOLCDB1_SITE2 as connect identifier is 'newbox02:1521/TOOLCDB1_SITE2';

Database "toolcdb1_site2" added

DGMGRL> edit database TOOLCDB1_SITE2 set property 'StaticConnectIdentifier'='newbox02:1521/TOOLCDB1_SITE2_DGMGRL';

Property "StaticConnectIdentifier" updated

DGMGRL> enable configuration;

Enabled.

That’s it.

Now, if I want to have the configuration observed, I need to activate the Fast Start Failover:

DGMGRL> edit database toolcdb1_site1 set property LogXptMode='SYNC';
Property "logxptmode" updated

DGMGRL> edit database toolcdb1_site2 set property LogXptMode='SYNC';
Property "logxptmode" updated

DGMGRL> edit database toolcdb1_site1 set property FastStartFailoverTarget='toolcdb1_site2';
Property "faststartfailovertarget" updated

DGMGRL> edit database toolcdb1_site2 set property FastStartFailoverTarget='toolcdb1_site1';
Property "faststartfailovertarget" updated

DGMGRL> edit configuration set protection mode as maxavailability;
Succeeded.

DGMGRL> enable fast_start failover;
Enabled in Zero Data Loss Mode.

DGMGRL> edit database toolcdb1_site1 set property LogXptMode='SYNC';

Property "logxptmode" updated

DGMGRL> edit database toolcdb1_site2 set property LogXptMode='SYNC';

Property "logxptmode" updated

DGMGRL> edit database toolcdb1_site1 set property FastStartFailoverTarget='toolcdb1_site2';

Property "faststartfailovertarget" updated

DGMGRL> edit database toolcdb1_site2 set property FastStartFailoverTarget='toolcdb1_site1';

Property "faststartfailovertarget" updated

DGMGRL> edit configuration set protection mode as maxavailability;

Succeeded.

DGMGRL> enable fast_start failover;

Enabled in Zero Data Loss Mode.

With just two databases, FastStartFailoverTarget is not explicitly needed, but I usually do it as other databases might be added to the configuration in the future.
After that, the broker complains that FSFO is enabled but there is no observer yet:

DGMGRL> show fast_start failover;

Fast-Start Failover: Enabled in Zero Data Loss Mode

  Protection Mode:    MaxAvailability
  Lag Limit:          0 seconds

  Threshold:          180 seconds
  Active Target:      toolcdb1_site2
  Potential Targets:  "toolcdb1_site2"
    toolcdb1_site2 valid
  Observer:           (none)
  Shutdown Primary:   TRUE
  Auto-reinstate:     TRUE
  Observer Reconnect: 180 seconds
  Observer Override:  FALSE

Configurable Failover Conditions
  Health Conditions:
    Corrupted Controlfile          YES
    Corrupted Dictionary           YES
    Inaccessible Logfile            NO
    Stuck Archiver                  NO
    Datafile Write Errors          YES

  Oracle Error Conditions:
    (none)


DGMGRL> show configuration;

Configuration - toolcdb1

  Protection Mode: MaxAvailability
  Members:
  toolcdb1_site1 - Primary database
    Warning: ORA-16819: fast-start failover observer not started

    toolcdb1_site2 - (*) Physical standby database

Fast-Start Failover: Enabled in Zero Data Loss Mode

Configuration Status:
WARNING   (status updated 39 seconds ago)

DGMGRL> show fast_start failover;

Fast-Start Failover: Enabled in Zero Data Loss Mode

Protection Mode: MaxAvailability

Lag Limit: 0 seconds

Threshold: 180 seconds

Active Target: toolcdb1_site2

Potential Targets: "toolcdb1_site2"

toolcdb1_site2 valid

Observer: (none)

Shutdown Primary: TRUE

Auto-reinstate: TRUE

Observer Reconnect: 180 seconds

Observer Override: FALSE

Configurable Failover Conditions

Health Conditions:

Corrupted Controlfile YES

Corrupted Dictionary YES

Inaccessible Logfile NO

Stuck Archiver NO

Datafile Write Errors YES

Oracle Error Conditions:

(none)

DGMGRL> show configuration;

Configuration - toolcdb1

Protection Mode: MaxAvailability

Members:

toolcdb1_site1 - Primary database

Warning: ORA-16819: fast-start failover observer not started

toolcdb1_site2 - (*) Physical standby database

Fast-Start Failover: Enabled in Zero Data Loss Mode

Configuration Status:

WARNING (status updated 39 seconds ago)

Observer for multiple configurations

This feature has been introduced in 12.2 but it is still not widely used.
Before 12.2, the Observer was a foreground process: the DBAs had to start it in a wrapper script executed with nohup in order to keep it live.
Since 12.2, the observer can run as a background process as far as there is a valid wallet for the connection to the databases.
Also, 12.2 introduced the capability of starting multiple configurations with a single dgmgrl command: “START OBSERVING”.

For more information about it, you can check the documentation here:
https://docs.oracle.com/en/database/oracle/oracle-database/19/dgbkr/using-data-guard-broker-to-manage-switchovers-failovers.html#GUID-BC513CDB-1E06-4EB3-9FE1-E1331E15E492

How to set it up with Easy Connect?

First, I need a wallet. And here comes the first compromise:
Having a single dgmgrl session to start all my configurations means that I have a single wallet for all the databases that I want to observe.
Fair enough, all the DBs (CDBs?) are managed by the same team in this case.
If I have only observers on my host I can easily point to the wallet from my central sqlnet.ora:

WALLET_LOCATION =
   (SOURCE =
      (METHOD = FILE)
      (METHOD_DATA = (DIRECTORY = /u01/app/oracle/admin/observers/wallet))
  )
SQLNET.WALLET_OVERRIDE = TRUE

WALLET_LOCATION =

(SOURCE =

(METHOD = FILE)

(METHOD_DATA = (DIRECTORY = /u01/app/oracle/admin/observers/wallet))

)

SQLNET.WALLET_OVERRIDE = TRUE

Otherwise I need to create a separate TNS_ADMIN for my observer management environment.
Then, I create the wallet:

$ WALLET_DIR=$ORACLE_BASE/admin/observers/wallet
$ mkdir -p $WALLET_DIR
$ orapki wallet create -wallet $WALLET_DIR -auto_login_local -pwd Password2020
Oracle PKI Tool Release 21.0.0.0.0 - Production
Version 21.0.0.0.0
Copyright (c) 2004, 2020, Oracle and/or its affiliates. All rights reserved.

Operation is successfully completed.

$ WALLET_DIR=$ORACLE_BASE/admin/observers/wallet

$ mkdir -p $WALLET_DIR

$ orapki wallet create -wallet $WALLET_DIR -auto_login_local -pwd Password2020

Oracle PKI Tool Release 21.0.0.0.0 - Production

Version 21.0.0.0.0

Operation is successfully completed.

Now I need to add the connection descriptors.

Which connection descriptors do I need?
The Observer uses the DGConnectIdentifier to keep observing the databases, but needs a connection to both of them using the TOOLCDB1_CFG service (unless I specify something different with the broker configuration property ConfigurationWideServiceName) to connect to the configuration and get the DGConnectIdentifier information. Again, you can check it in the doc. or the note Oracle 12.2 – Simplified OBSERVER Management for Multiple Fast-Start Failover Configurations (Doc ID 2285891.1)

So I need to specify three secrets for three connection descriptors:

$ mkstore -wrl "$TNS_ADMIN" -createCredential newbox01,newbox02:1521/TOOLCDB1_CFG sysdg
Oracle Secret Store Tool Release 21.0.0.0.0 - Production
Version 21.0.0.0.0
Copyright (c) 2004, 2020, Oracle and/or its affiliates. All rights reserved.

Your secret/Password is missing in the command line
Enter your secret/Password:
Re-enter your secret/Password:
Enter wallet password:

$ mkstore -wrl "$TNS_ADMIN" -createCredential newbox01:1521/TOOLCDB1_SITE1 sysdg
Oracle Secret Store Tool Release 21.0.0.0.0 - Production
Version 21.0.0.0.0
Copyright (c) 2004, 2020, Oracle and/or its affiliates. All rights reserved.

Your secret/Password is missing in the command line
Enter your secret/Password:
Re-enter your secret/Password:
Enter wallet password:


$ mkstore -wrl "$TNS_ADMIN" -createCredential newbox02:1521/TOOLCDB1_SITE2 sysdg
Oracle Secret Store Tool Release 21.0.0.0.0 - Production
Version 21.0.0.0.0
Copyright (c) 2004, 2020, Oracle and/or its affiliates. All rights reserved.

Your secret/Password is missing in the command line
Enter your secret/Password:
Re-enter your secret/Password:
Enter wallet password:

$ mkstore -wrl "$TNS_ADMIN" -createCredential newbox01,newbox02:1521/TOOLCDB1_CFG sysdg

Oracle Secret Store Tool Release 21.0.0.0.0 - Production

Version 21.0.0.0.0

Your secret/Password is missing in the command line

Enter your secret/Password:

Re-enter your secret/Password:

Enter wallet password:

$ mkstore -wrl "$TNS_ADMIN" -createCredential newbox01:1521/TOOLCDB1_SITE1 sysdg

Oracle Secret Store Tool Release 21.0.0.0.0 - Production

Version 21.0.0.0.0

Your secret/Password is missing in the command line

Enter your secret/Password:

Re-enter your secret/Password:

Enter wallet password:

$ mkstore -wrl "$TNS_ADMIN" -createCredential newbox02:1521/TOOLCDB1_SITE2 sysdg

Oracle Secret Store Tool Release 21.0.0.0.0 - Production

Version 21.0.0.0.0

Your secret/Password is missing in the command line

Enter your secret/Password:

Re-enter your secret/Password:

Enter wallet password:

The first one will be used for the initial connection. The other two to observe the Primary and Standby.
I need to be careful that the first EZConnect descriptor matches EXACTLY what I put in observer.ora (see next step) and the last two match my DGConnectIdentifier (unless I specify something different with ObserverConnectIdentifier), otherwise I will get some errors and the observer will not observe correctly (or will not start at all).

The dgmgrl needs then a file named observer.ora.
$ORACLE_BASE/admin/observers or the central TNS_ADMIN would be good locations, but what if I have observers that must be started from multiple Oracle Homes?
In that case, having a observer.ora in $ORACLE_HOME/network/admin (or $ORACLE_BASE/homes/{OHNAME}/network/admin/ if Read-Only Oracle Home is enabled) would be a better solution: in this case I would need to start one session per Oracle Home

The content of my observer.ora must be something like:

BROKER_CONFIGS=
   (
     (CONFIG=
       (NAME=TOOLCDB1)
       (CONNECT_ID=newbox01,newbox02:1521/TOOLCDB1_CFG)
       (CONFIG_HOME=/export/soft/oracle/admin/TOOLCDB1/observer)
     )
   )

BROKER_CONFIGS=

(

(CONFIG=

(NAME=TOOLCDB1)

(CONNECT_ID=newbox01,newbox02:1521/TOOLCDB1_CFG)

(CONFIG_HOME=/export/soft/oracle/admin/TOOLCDB1/observer)

)

This is the example for my configuration, but I can put as many (CONFIG=…) as I want in order to observe multiple configurations.
Then, if everything is configured properly, I can start all the observers with a single command:

DGMGRL> SET OBSERVERCONFIGFILE=/u01/app/oracle/admin/observers/observer.ora
DGMGRL> START OBSERVING
ObserverConfigFile=observer.ora
observer configuration file parsing succeeded
Submitted command "START OBSERVER" using connect identifier "newbox01,newbox02:1521/TOOLCDB1_CFG"

Check superobserver.log, individual observer logs and Data Guard Broker logs for execution details.

DGMGRL> show observers
ObserverConfigFile=/u01/app/oracle/admin/observers/observer.ora
observer configuration file parsing succeeded
Submitted command "SHOW OBSERVER" using connect identifier "newbox01,newbox02:1521/TOOLCDB1_CFG"
Connected to "TOOLCDB1_SITE2"

Configuration - toolcdb1

  Primary:            toolcdb1_site1
  Active Target:      toolcdb1_site2

Observer "newbox03.trivadistraining.com1" - Master

  Host Name:                    newbox03.trivadistraining.com
  Last Ping to Primary:         1 second ago
  Last Ping to Target:          2 seconds ago

DGMGRL> SET OBSERVERCONFIGFILE=/u01/app/oracle/admin/observers/observer.ora

DGMGRL> START OBSERVING

ObserverConfigFile=observer.ora

observer configuration file parsing succeeded

Submitted command "START OBSERVER" using connect identifier "newbox01,newbox02:1521/TOOLCDB1_CFG"

Check superobserver.log, individual observer logs and Data Guard Broker logs for execution details.

DGMGRL> show observers

ObserverConfigFile=/u01/app/oracle/admin/observers/observer.ora

observer configuration file parsing succeeded

Submitted command "SHOW OBSERVER" using connect identifier "newbox01,newbox02:1521/TOOLCDB1_CFG"

Connected to "TOOLCDB1_SITE2"

Configuration - toolcdb1

Primary: toolcdb1_site1

Active Target: toolcdb1_site2

Observer "newbox03.trivadistraining.com1" - Master

Host Name: newbox03.trivadistraining.com

Last Ping to Primary: 1 second ago

Last Ping to Target: 2 seconds ago

Troubleshooting

If the observer does not work, sometimes it is not easy to understand the cause.

Has SYSDG been granted to SYSDG user? Is SYSDG account unlocked?
Does sqlnet.ora contain the correct wallet location?
Is the wallet accessible in autologin?
Are the entries in the wallet correct? (check with “sqlplus /@connstring as sysdg”)

Missing pieces

Here, a few features that I think would be a nice addition in the future:

Awareness for the ORACLE_HOME to be used for each observer
Possibility to specify a different TNS_ADMIN per observer (different wallets)
Integration with Grid Infrastructure (srvctl add observer…) and support for multiple observers

—

Ludovico

FPP local-mode: Steps to remove/add node from a cluster if RHP fails to move gihome

Posted on July 9, 2019 by Ludovico

I am getting more and more experience with patching clusters with the local-mode automaton. The whole process would be very complex, but the local-mode automaton makes it really easy.

I have had nevertheless a couple of clusters where the process did not work:

#1: The very first cluster that I installed in 18c

This cluster has “kind of failed” patching the first node. Actually, the rhpctl command exited with an error:

$ rhpctl move gihome -sourcehome /u01/crs/crs1830 -desthome /u01/crs/crs1860 -node server1
server1.cern.ch: Audit ID: 2
server1.cern.ch: verifying versions of Oracle homes ...
server1.cern.ch: verifying owners of Oracle homes ...
server1.cern.ch: verifying groups of Oracle homes ...
server1.cern.ch: starting to move the Oracle Grid Infrastructure home from "/u01/crs/crs1830" to "/u01/crs/crs1860" on server cluster "AISTEST-RAC16"
[...]
2019/07/08 09:45:06 CLSRSC-329: Replacing Clusterware entries in file 'oracle-ohasd.service'
PRCG-1239 : failed to close a proxy connection
Connection refused to host: server1.cern.ch; nested exception is:
        java.net.ConnectException: Connection refused (Connection refused)
PRCG-1079 : Internal error: ClientFactoryImpl-submitAction-error1
PROC-32: Cluster Ready Services on the local node is not running Messaging error [gipcretConnectionRefused] [29]

$ rhpctl move gihome -sourcehome /u01/crs/crs1830 -desthome /u01/crs/crs1860 -node server1

server1.cern.ch: Audit ID: 2

server1.cern.ch: verifying versions of Oracle homes ...

server1.cern.ch: verifying owners of Oracle homes ...

server1.cern.ch: verifying groups of Oracle homes ...

server1.cern.ch: starting to move the Oracle Grid Infrastructure home from "/u01/crs/crs1830" to "/u01/crs/crs1860" on server cluster "AISTEST-RAC16"

[...]

2019/07/08 09:45:06 CLSRSC-329: Replacing Clusterware entries in file 'oracle-ohasd.service'

PRCG-1239 : failed to close a proxy connection

Connection refused to host: server1.cern.ch; nested exception is:

java.net.ConnectException: Connection refused (Connection refused)

PRCG-1079 : Internal error: ClientFactoryImpl-submitAction-error1

PROC-32: Cluster Ready Services on the local node is not running Messaging error [gipcretConnectionRefused] [29]

But actually, the helper kept running and configured everything properly:

$ tail -f /ORA/dbs01/oracle/crsdata/server1/crsconfig/crs_postpatch_server1_2019-07-08_09-41-36AM.log
2019-07-08 09:55:25:
2019-07-08 09:55:25: Succeeded in writing the checkpoint:'ROOTCRS_POSTPATCH' with status:SUCCESS
2019-07-08 09:55:25: Executing cmd: /u01/crs/crs1860/bin/clsecho -p has -f clsrsc -m 672
2019-07-08 09:55:25: Executing cmd: /u01/crs/crs1860/bin/clsecho -p has -f clsrsc -m 672
2019-07-08 09:55:25: Command output:
>  CLSRSC-672: Post-patch steps for patching GI home successfully completed.
>End Command output
2019-07-08 09:55:25: CLSRSC-672: Post-patch steps for patching GI home successfully completed.

$ tail -f /ORA/dbs01/oracle/crsdata/server1/crsconfig/crs_postpatch_server1_2019-07-08_09-41-36AM.log

2019-07-08 09:55:25:

2019-07-08 09:55:25: Succeeded in writing the checkpoint:'ROOTCRS_POSTPATCH' with status:SUCCESS

2019-07-08 09:55:25: Executing cmd: /u01/crs/crs1860/bin/clsecho -p has -f clsrsc -m 672

2019-07-08 09:55:25: Command output:

> CLSRSC-672: Post-patch steps for patching GI home successfully completed.

>End Command output

2019-07-08 09:55:25: CLSRSC-672: Post-patch steps for patching GI home successfully completed.

The cluster was OK on the first node, with the correct patch level. The second node, however, was failing with:

$  rhpctl move gihome -sourcehome /u01/crs/crs1830 -desthome /u01/crs/crs1860 -node server2
server1.cern.ch: retrieving status of databases ...
server1.cern.ch: retrieving status of services of databases ...
PRCT-1011 : Failed to run "rhphelper". Detailed error: <HLP_EMSG>,RHPHELP_procCmdLine-05,</HLP_EMSG>,<HLP_VRES>3</HLP_VRES>,<HLP_IEEMSG>,PRCG-1079 : Internal error: RHPHELP122_main-01,</HLP_IEEMSG>,<HLP_ERES>1</HLP_ERES>

$ rhpctl move gihome -sourcehome /u01/crs/crs1830 -desthome /u01/crs/crs1860 -node server2

server1.cern.ch: retrieving status of databases ...

server1.cern.ch: retrieving status of services of databases ...

PRCT-1011 : Failed to run "rhphelper". Detailed error: <HLP_EMSG>,RHPHELP_procCmdLine-05,</HLP_EMSG>,<HLP_VRES>3</HLP_VRES>,<HLP_IEEMSG>,PRCG-1079 : Internal error: RHPHELP122_main-01,</HLP_IEEMSG>,<HLP_ERES>1</HLP_ERES>

I am not sure about the cause, but let’s assume it is irrelevant for the moment.

#2: A cluster with new GI home not properly linked with RAC

This was another funny case, where the first node patched successfully, but the second one failed upgrading in the middle of the process with a java NullPointer exception. We did a few bad tries of prePatch and postPatch to solve, but after that the second node of the cluster was in an inconsistent state: in ROLLING_UPGRADE mode and not possible to patch anymore.

Common solution: removing the node from the cluster and adding it back

In both cases we were in the following situation:

one node was successfully patched to 18.6
one node was not patched and was not possible to patch it anymore (at least without heavy interventions)

So, for me, the easiest solution has been removing the failing node and adding it back with the new patched version.

Steps to remove the node

Although the steps are described here: https://docs.oracle.com/en/database/oracle/oracle-database/18/cwadd/adding-and-deleting-cluster-nodes.html#GUID-8ADA9667-EC27-4EF9-9F34-C8F65A757F2A, there are a few differences that I will highlight:

Stop of the cluster:

(root)# crsctl stop crs

1	(root)# crsctl stop crs

The actual procedure to remove a node asks to deconfigure the databases and managed homes from the active cluster version. But as we manage our homes with golden images, we do not need this; we rather want to keep all the entries in the OCR so that when we add it back, everything is in place.

Once stopped the CRS, we have deinstalled the CRS home on the failing node:

(oracle)$ $OH/deinstall/deinstall -local

1	(oracle)$ $OH/deinstall/deinstall -local

This complained about the CRS that was down, but it continued and ask for this script to be executed:

/u01/crs/crs1830/crs/install/rootcrs.sh -force  -deconfig -paramfile "/tmp/deinstall2019-07-08_11-37-20AM/response/deinstall_1830.rsp"

1	/u01/crs/crs1830/crs/install/rootcrs.sh -force -deconfig -paramfile "/tmp/deinstall2019-07-08_11-37-20AM/response/deinstall_1830.rsp"

We’ve got errors also for this script, but the remove process was OK afterall.

Then, from the surviving node:

root # crsctl delete node -n server2
oracle $ srvctl stop vip -vip server2
root $ srvctl remove vip -vip server2

root # crsctl delete node -n server2

oracle $ srvctl stop vip -vip server2

root $ srvctl remove vip -vip server2

Adding the node back

From the surviving node, we ran gridSetup.sh and followed the steps to ad the node.

Wait before running root.sh.

In our case, we have originally installed the cluster starting with a SW_ONLY install. This type of installation keeps some leftovers in the configuration files that prevent the root.sh from configuring the cluster…we have had to modify rootconfig.sh:

check/modify /u01/crs/crs1860/crs/config/rootconfig.sh and change this:
# before:
# SW_ONLY=true
# after:
SW_ONLY=false

check/modify /u01/crs/crs1860/crs/config/rootconfig.sh and change this:

# before:

# SW_ONLY=true

# after:

SW_ONLY=false

then, after running root.sh and the config tools, everything was back as before removing the node form the cluster.

For one of the clusters , both nodes were at the same patch level, but the cluster was still in ROLLING_PATCH mode. So we have had to do a

(root) # crsctl stop rollingpatch

1	(root) # crsctl stop rollingpatch

—

Ludo

(unsupported) DST_UPGRADE_STATE is DATAPUMP(1) but no data pump jobs are running. How to fix?

Posted on April 30, 2019 by Ludovico

This blog post contains unsupported commands. Do not try to use them without the supervision of Oracle Support!

I have just run across an Oracle Database who had a broken configuration in database_properties.

The database was in process of being upgraded to 18c, but the DST upgrade step was not working because of wrong entries in the view DATABASE_PROPERTIES:

SQL> SELECT version FROM v$timezone_file;

       VERSION
--------------
            14


SQL> SELECT PROPERTY_NAME, SUBSTR(property_value, 1, 30) value
2 FROM DATABASE_PROPERTIES
3 WHERE PROPERTY_NAME LIKE 'DST_%'
4 ORDER BY PROPERTY_NAME;

PROPERTY_NAME               VALUE
--------------------------- ------------------------------
DST_PRIMARY_TT_VERSION      14
DST_SECONDARY_TT_VERSION    4
DST_UPGRADE_STATE           DATAPUMP(1)

SQL> SELECT version FROM v$timezone_file;

VERSION

--------------

SQL> SELECT PROPERTY_NAME, SUBSTR(property_value, 1, 30) value

2 FROM DATABASE_PROPERTIES

3 WHERE PROPERTY_NAME LIKE 'DST_%'

4 ORDER BY PROPERTY_NAME;

PROPERTY_NAME VALUE

--------------------------- ------------------------------

DST_PRIMARY_TT_VERSION 14

DST_SECONDARY_TT_VERSION 4

DST_UPGRADE_STATE DATAPUMP(1)

The MOS note Updating the RDBMS DST version in 12c Release 1 (12.1.0.1 and up) using DBMS_DST (Doc ID 1509653.1) states that I had to check note How To Cleanup Orphaned DataPump Jobs In DBA_DATAPUMP_JOBS ? (Doc ID 336014.1) to solve the problem.

In fact, there should have been an orphan data pump job trying to import the timezone file. But in my case, no jobs at all, do data pump job tables.

Second, the secondary time zone being lower than the primary one was, to me, sign of an old upgrade went wrong.

Trying to begin a new prepare phase was failing with:

SQL> exec DBMS_DST.BEGIN_PREPARE(31);
BEGIN DBMS_DST.BEGIN_PREPARE(31); END;

*
ERROR at line 1:
ORA-56920: a prepare or upgrade window or an on-demand or datapump-job loading of a secondary time zone data file is in an active state
ORA-06512: at "SYS.DBMS_SYS_ERROR", line 79
ORA-06512: at "SYS.DBMS_DST", line 1390
ORA-06512: at line 1

SQL> exec DBMS_DST.BEGIN_PREPARE(31);

BEGIN DBMS_DST.BEGIN_PREPARE(31); END;

ERROR at line 1:

ORA-56920: a prepare or upgrade window or an on-demand or datapump-job loading of a secondary time zone data file is in an active state

ORA-06512: at "SYS.DBMS_SYS_ERROR", line 79

ORA-06512: at "SYS.DBMS_DST", line 1390

ORA-06512: at line 1

Trying to end the old one was failing as well:

SQL> EXEC DBMS_DST.END_PREPARE;
BEGIN DBMS_DST.END_PREPARE; END;

*
ERROR at line 1:
ORA-56924: prepare window does not exist
ORA-06512: at "SYS.DBMS_SYS_ERROR", line 79
ORA-06512: at "SYS.DBMS_DST", line 1470
ORA-06512: at line 1

SQL> EXEC DBMS_DST.END_PREPARE;

BEGIN DBMS_DST.END_PREPARE; END;

ERROR at line 1:

ORA-56924: prepare window does not exist

ORA-06512: at "SYS.DBMS_SYS_ERROR", line 79

ORA-06512: at "SYS.DBMS_DST", line 1470

ORA-06512: at line 1

Trying to unload the secondary was failing as well:

SQL> exec dbms_dst.UNLOAD_SECONDARY
BEGIN dbms_dst.UNLOAD_SECONDARY; END;

*
ERROR at line 1:
ORA-56938: no secondary time zone data file being loaded by on-demand or a datapump job
ORA-06512: at "SYS.DBMS_DST", line 1975
ORA-06512: at "SYS.DBMS_SYS_ERROR", line 79
ORA-06512: at "SYS.DBMS_DST", line 1950
ORA-06512: at line 1

SQL> exec dbms_dst.UNLOAD_SECONDARY

BEGIN dbms_dst.UNLOAD_SECONDARY; END;

ERROR at line 1:

ORA-56938: no secondary time zone data file being loaded by on-demand or a datapump job

ORA-06512: at "SYS.DBMS_DST", line 1975

ORA-06512: at "SYS.DBMS_SYS_ERROR", line 79

ORA-06512: at "SYS.DBMS_DST", line 1950

ORA-06512: at line 1

I double-checked ALL the notes to clean-up the situation and made sure that there was nothing actually running regarding a DST upgrade.

I am pretty evil trying unsupported stuff. So I have decided to check the underlying table:

sys@ACCINT:SQL> select text from dba_views where view_name='DATABASE_PROPERTIES';

TEXT
--------------------------------------------------------------------------------
select name, value$, comment$
  from x$props

sys@ACCINT:SQL> select text from dba_views where view_name='DATABASE_PROPERTIES';

TEXT

--------------------------------------------------------------------------------

select name, value$, comment$

from x$props

Fixed tables are not writable, but sys.props$ is, and it was containing the same bad data:

NAME                                          
------------------------------------------
VALUE$                                    
------------------------------------------
COMMENT$                                  
------------------------------------------
DST_UPGRADE_STATE                         
DATAPUMP(1)                               
State of Day Light Saving Time Upgrade    
                                          
DST_PRIMARY_TT_VERSION                    
14                                        
Version of primary timezone data file     
                                          
DST_SECONDARY_TT_VERSION                  
4                                         
Version of secondary timezone data file

NAME

------------------------------------------

VALUE$

------------------------------------------

COMMENT$

------------------------------------------

DST_UPGRADE_STATE

DATAPUMP(1)

State of Day Light Saving Time Upgrade

DST_PRIMARY_TT_VERSION

Version of primary timezone data file

DST_SECONDARY_TT_VERSION

Version of secondary timezone data file

So I did what I knew was wrong, after taking a guaranteed restore point. Do not try this at home without the supervision of Oracle Support!

SQL> update props$ set value$=0 where name='DST_SECONDARY_TT_VERSION';

1 row updated.

SQL> update props$ set value$='NONE' where name='DST_UPGRADE_STATE';

1 row updated.

SQL> select * from props$ where name like 'DST%';

NAME                                     
-----------------------------------------
VALUE$                                   
-----------------------------------------
COMMENT$                                 
-----------------------------------------
DST_UPGRADE_STATE                        
NONE                                     
State of Day Light Saving Time Upgrade   
                                         
DST_PRIMARY_TT_VERSION                   
14                                       
Version of primary timezone data file    
                                         
DST_SECONDARY_TT_VERSION                 
0                                        
Version of secondary timezone data file  


3 rows selected.

SQL> commit;

Commit complete.

SQL> update props$ set value$=0 where name='DST_SECONDARY_TT_VERSION';

1 row updated.

SQL> update props$ set value$='NONE' where name='DST_UPGRADE_STATE';

1 row updated.

SQL> select * from props$ where name like 'DST%';

NAME

-----------------------------------------

VALUE$

-----------------------------------------

COMMENT$

-----------------------------------------

DST_UPGRADE_STATE

NONE

State of Day Light Saving Time Upgrade

DST_PRIMARY_TT_VERSION

Version of primary timezone data file

DST_SECONDARY_TT_VERSION

Version of secondary timezone data file

3 rows selected.

SQL> commit;

Commit complete.

Trying again:

SQL> exec DBMS_DST.BEGIN_PREPARE(31);
A prepare window has been successfully started.

PL/SQL procedure successfully completed.

SQL> exec DBMS_DST.BEGIN_PREPARE(31);

A prepare window has been successfully started.

PL/SQL procedure successfully completed.

The rest of the upgrade procedure went smoothly.

—

Ludovico

First draft of a Common Oracle Environment… for the Cloud Database (and not only)

Posted on April 16, 2019 by Ludovico

I have just published on GitHub a draft of a common Oracle environment scripts that make the shell environment a little bit smarter than what it is by default. It uses some function and aliases that I have published during the past years.

You can start playing with:

# Connect as oracle
sudo su - oracle

# Clone this repository
git clone https://github.com/ludovicocaldara/COE.git

# Enable the profile scripts
echo ". ~/COE/profile.sh" >> $HOME/.bash_profile

# Load the new profile
. ~/.bash_profile

# Connect as oracle

sudo su - oracle

# Clone this repository

git clone https://github.com/ludovicocaldara/COE.git

# Enable the profile scripts

echo ". ~/COE/profile.sh" >> $HOME/.bash_profile

# Load the new profile

. ~/.bash_profile

Ideal for the Oracle Cloud Infrastructure

If you are new to the Oracle Cloud, probably you do not have environment scripts that makes it easy to interact with the database.

The environment scripts that I have published work out-of the box in the cloud (just make sure that you have rlwrap installed so that you can have a better CLI experience).

Actually, they work great as well on-premises, but I assume that you already have something automatic there.

Some examples

My famous Smart Prompt 😉 (including version, edition, exit code, etc)

# [ oracle@ludodb01:/home/oracle [22:18:59] [18.4.0.0.0 [DBMS EE] SID=CDB] 0 ] #
#

1 2	# [ oracle@ludodb01:/home/oracle [22:18:59] [18.4.0.0.0 [DBMS EE] SID=CDB] 0 ] # #

u : gets the status of the databases

# [ oracle@ludodb01:/home/oracle [22:18:59] [18.4.0.0.0 [DBMS EE] SID=CDB] 0 ] #
# u
DB_Unique_Name           DB_Name  ludodb01       Oracle_Home
------------------------ -------- -------------- --------------------------------------------------
CDB_fra1cw               CDB      CDB            /u01/app/oracle/product/18.0.0.0/dbhome_1

# [ oracle@ludodb01:/home/oracle [22:18:59] [18.4.0.0.0 [DBMS EE] SID=CDB] 0 ] #

# u

DB_Unique_Name DB_Name ludodb01 Oracle_Home

------------------------ -------- -------------- --------------------------------------------------

CDB_fra1cw CDB CDB /u01/app/oracle/product/18.0.0.0/dbhome_1

pmon: just displays the running pmon processes

# [ oracle@ludodb01:/home/oracle [22:27:17] [18.4.0.0.0 [DBMS EE] SID=CDB] 0 ] #
# pmon
grid      8093     1  0 Mar25 ?        00:01:39 asm_pmon_+ASM1
grid     10293     1  0 Mar25 ?        00:01:43 apx_pmon_+APX1
oracle   11077     1  0 Mar25 ?        00:01:47 ora_pmon_CDB

# [ oracle@ludodb01:/home/oracle [22:27:17] [18.4.0.0.0 [DBMS EE] SID=CDB] 0 ] #

# pmon

grid 8093 1 0 Mar25 ? 00:01:39 asm_pmon_+ASM1

grid 10293 1 0 Mar25 ? 00:01:43 apx_pmon_+APX1

oracle 11077 1 0 Mar25 ? 00:01:47 ora_pmon_CDB

db : sets the environment for a specific DB_NAME, DB_UNIQUE_NAME or SID

# [ oracle@ludodb01:/u01/app/oracle/diag/rdbms/cdb_fra1cw/CDB/trace [22:33:53] [18.4.0.0.0 [DBMS EE] SID=CDB] 0 ] #
# db CDB
DB_UNIQUE_NAME  = CDB_fra1cw
ORACLE_SID      = CDB
ROLE            = PRIMARY
VERSION         = 18.4.0.0.0
ORACLE_HOME     = /u01/app/oracle/product/18.0.0.0/dbhome_1
NLS_LANG        = AMERICAN_AMERICA.AL32UTF8

# [ oracle@ludodb01:/u01/app/oracle/diag/rdbms/cdb_fra1cw/CDB/trace [22:33:53] [18.4.0.0.0 [DBMS EE] SID=CDB] 0 ] #

# db CDB

DB_UNIQUE_NAME = CDB_fra1cw

ORACLE_SID = CDB

ROLE = PRIMARY

VERSION = 18.4.0.0.0

ORACLE_HOME = /u01/app/oracle/product/18.0.0.0/dbhome_1

NLS_LANG = AMERICAN_AMERICA.AL32UTF8

svcstat : shows the running services (and the corresponding pdb, host, etc) as I described in my previous post

# [ oracle@ludodb01:/home/oracle [22:28:03] [18.4.0.0.0 [DBMS EE] SID=CDB] 0 ] #
# svcstat
DB_Unique_Name           Service_Name                   PDB                            ludodb01
------------------------ ------------------------------ ------------------------------ --------------
cdb_fra1cw               pdb_service_test               PDB1                           ONLINE

# [ oracle@ludodb01:/home/oracle [22:28:03] [18.4.0.0.0 [DBMS EE] SID=CDB] 0 ] #

# svcstat

DB_Unique_Name Service_Name PDB ludodb01

------------------------ ------------------------------ ------------------------------ --------------

cdb_fra1cw pdb_service_test PDB1 ONLINE

s_ : smart alias for sqlplus: connects as sysdba/sysasm by default, or with any arguments that you pass:

# [ oracle@ludodb01:/home/oracle [22:29:14] [18.4.0.0.0 [DBMS EE] SID=CDB] 0 ] #
# s_

SQL*Plus: Release 18.0.0.0.0 - Production on Mon Apr 15 22:30:22 2019
Version 18.4.0.0.0

Copyright (c) 1982, 2018, Oracle.  All rights reserved.


Connected to:
Oracle Database 18c Enterprise Edition Release 18.0.0.0.0 - Production
Version 18.4.0.0.0

SQL> show user
USER is "SYS"
SQL> Disconnected from Oracle Database 18c Enterprise Edition Release 18.0.0.0.0 - Production
Version 18.4.0.0.0

# [ oracle@ludodb01:/home/oracle [22:30:30] [18.4.0.0.0 [DBMS EE] SID=CDB] 0 ] #
# s_ pippo/pippo

SQL*Plus: Release 18.0.0.0.0 - Production on Mon Apr 15 22:30:34 2019
Version 18.4.0.0.0

Copyright (c) 1982, 2018, Oracle.  All rights reserved.

ERROR:
ORA-01017: invalid username/password; logon denied


Enter user-name:

# [ oracle@ludodb01:/home/oracle [22:29:14] [18.4.0.0.0 [DBMS EE] SID=CDB] 0 ] #

# s_

SQL*Plus: Release 18.0.0.0.0 - Production on Mon Apr 15 22:30:22 2019

Version 18.4.0.0.0

Connected to:

Oracle Database 18c Enterprise Edition Release 18.0.0.0.0 - Production

Version 18.4.0.0.0

SQL> show user

USER is "SYS"

SQL> Disconnected from Oracle Database 18c Enterprise Edition Release 18.0.0.0.0 - Production

Version 18.4.0.0.0

# [ oracle@ludodb01:/home/oracle [22:30:30] [18.4.0.0.0 [DBMS EE] SID=CDB] 0 ] #

# s_ pippo/pippo

SQL*Plus: Release 18.0.0.0.0 - Production on Mon Apr 15 22:30:34 2019

Version 18.4.0.0.0

ERROR:

ORA-01017: invalid username/password; logon denied

Enter user-name:

adr_, dg_ rman_, cm_, lsn_ : aliases for common oracle binaries
genpasswd : generates random passwords (default length 30)

# [ oracle@ludodb01:/home/oracle [22:32:35] [18.4.0.0.0 [DBMS EE] SID=CDB] 0 ] #
# genpasswd
+gagDCqVSgqHqsU+-IdeA0nx_-HVZ1

# [ oracle@ludodb01:/home/oracle [22:33:00] [18.4.0.0.0 [DBMS EE] SID=CDB] 0 ] #
# genpasswd 12
DiU9nHiwPB9y

# [ oracle@ludodb01:/home/oracle [22:32:35] [18.4.0.0.0 [DBMS EE] SID=CDB] 0 ] #

# genpasswd

+gagDCqVSgqHqsU+-IdeA0nx_-HVZ1

# [ oracle@ludodb01:/home/oracle [22:33:00] [18.4.0.0.0 [DBMS EE] SID=CDB] 0 ] #

# genpasswd 12

DiU9nHiwPB9y

lsoh: lists the Oracle Homes attached to the inventory

# [ oracle@ludodb01:/u01/app/oracle/diag/rdbms/cdb_fra1cw/CDB/trace [22:33:53] [18.4.0.0.0 [DBMS EE] SID=CDB] 0 ] #
# lsoh

HOME                        LOCATION                                                VERSION      EDITION
--------------------------- ------------------------------------------------------- ------------ ---------
OraGrid180                  /u01/app/18.0.0.0/grid                                  18.4.0.0.0   GRID
OraDB18000_home1            /u01/app/oracle/product/18.0.0.0/dbhome_1               18.4.0.0.0   DBMS EE

# [ oracle@ludodb01:/u01/app/oracle/diag/rdbms/cdb_fra1cw/CDB/trace [22:33:53] [18.4.0.0.0 [DBMS EE] SID=CDB] 0 ] #

# lsoh

HOME LOCATION VERSION EDITION

--------------------------- ------------------------------------------------------- ------------ ---------

OraGrid180 /u01/app/18.0.0.0/grid 18.4.0.0.0 GRID

OraDB18000_home1 /u01/app/oracle/product/18.0.0.0/dbhome_1 18.4.0.0.0 DBMS EE

setoh: sets the Oracle Home given its name in the inventory

# [ oracle@ludodb01:/u01/app/oracle/diag/rdbms/cdb_fra1cw/CDB/trace [22:35:38] [18.4.0.0.0 [DBMS EE] SID=CDB] 0 ] #
# setoh OraGrid180
VERSION         = 18.4.0.0.0
ORACLE_HOME     = /u01/app/18.0.0.0/grid

# [ oracle@ludodb01:/u01/app/oracle/diag/rdbms/cdb_fra1cw/CDB/trace [22:35:38] [18.4.0.0.0 [DBMS EE] SID=CDB] 0 ] #

# setoh OraGrid180

VERSION = 18.4.0.0.0

ORACLE_HOME = /u01/app/18.0.0.0/grid

You might want to install the same environment for oracle, grid (if you have role separation, it should be the case for Cloud DB Systems) and (eventually) root.

I am curious to know if it works well for your environment.

Cheers

—

Ludo

Oracle Clusterware Services Status at a glance, fast!

Posted on March 20, 2019 by Ludovico

If you use Oracle Clusterware or you deploy your databases to the Oracle Cloud, you probably have some application services defined with srvctl for your database.

If you have many databases, services and nodes, it might be annoying, when doing maintenance or service relocation, to have a quick overview about how services are distributed across the nodes and what’s their status.

With srvctl (the official tool for that), it is a per-database operation:

$ srvctl status service
PRKO-2082 : Missing mandatory option -db

1 2	$ srvctl status service PRKO-2082 : Missing mandatory option -db

If you have many databases, you have to run db by db.

It is also slow! For example, this database has 20 services. Getting the status takes 27 seconds:

# [ oracle@server1:/home/oracle/ [15:52:00] [11.2.0.4.0 [DBMS EE] SID=HRDEV1] 1 ] #
$ time srvctl status service -d hrdev_site1
Service SERVICE_NUMBER_01 is running on instance(s) HRDEV4
Service SERVICE_NUMBER_02 is running on instance(s) HRDEV4
Service SERVICE_NUMBER_03 is running on instance(s) HRDEV4
Service SERVICE_NUMBER_04 is running on instance(s) HRDEV4
Service SERVICE_NUMBER_05 is running on instance(s) HRDEV4
Service SERVICE_NUMBER_06 is running on instance(s) HRDEV4
Service SERVICE_NUMBER_07 is running on instance(s) HRDEV4
Service SERVICE_NUMBER_08 is running on instance(s) HRDEV4
Service SERVICE_NUMBER_09 is running on instance(s) HRDEV4
Service SERVICE_NUMBER_10 is running on instance(s) HRDEV4
Service SERVICE_NUMBER_11 is running on instance(s) HRDEV4
Service SERVICE_NUMBER_12 is running on instance(s) HRDEV4
Service SERVICE_NUMBER_13 is running on instance(s) HRDEV4
Service SERVICE_NUMBER_14 is running on instance(s) HRDEV4
Service SERVICE_NUMBER_15 is running on instance(s) HRDEV4
Service SERVICE_NUMBER_16 is running on instance(s) HRDEV4
Service SERVICE_NUMBER_17 is running on instance(s) HRDEV4
Service SERVICE_NUMBER_18 is running on instance(s) HRDEV4
Service SERVICE_NUMBER_19 is running on instance(s) HRDEV4
Service SERVICE_NUMBER_20 is running on instance(s) HRDEV4

real    0m27.858s
user    0m1.365s
sys     0m1.143s

# [ oracle@server1:/home/oracle/ [15:52:00] [11.2.0.4.0 [DBMS EE] SID=HRDEV1] 1 ] #

$ time srvctl status service -d hrdev_site1

Service SERVICE_NUMBER_01 is running on instance(s) HRDEV4

Service SERVICE_NUMBER_02 is running on instance(s) HRDEV4

Service SERVICE_NUMBER_03 is running on instance(s) HRDEV4

Service SERVICE_NUMBER_04 is running on instance(s) HRDEV4

Service SERVICE_NUMBER_05 is running on instance(s) HRDEV4

Service SERVICE_NUMBER_06 is running on instance(s) HRDEV4

Service SERVICE_NUMBER_07 is running on instance(s) HRDEV4

Service SERVICE_NUMBER_08 is running on instance(s) HRDEV4

Service SERVICE_NUMBER_09 is running on instance(s) HRDEV4

Service SERVICE_NUMBER_10 is running on instance(s) HRDEV4

Service SERVICE_NUMBER_11 is running on instance(s) HRDEV4

Service SERVICE_NUMBER_12 is running on instance(s) HRDEV4

Service SERVICE_NUMBER_13 is running on instance(s) HRDEV4

Service SERVICE_NUMBER_14 is running on instance(s) HRDEV4

Service SERVICE_NUMBER_15 is running on instance(s) HRDEV4

Service SERVICE_NUMBER_16 is running on instance(s) HRDEV4

Service SERVICE_NUMBER_17 is running on instance(s) HRDEV4

Service SERVICE_NUMBER_18 is running on instance(s) HRDEV4

Service SERVICE_NUMBER_19 is running on instance(s) HRDEV4

Service SERVICE_NUMBER_20 is running on instance(s) HRDEV4

real 0m27.858s

user 0m1.365s

sys 0m1.143s

Instead of operating row-by-row (get the status for each service), why not relying on the cluster resources with crsctl and get the big picture once?

$ time crsctl stat res -f -w "(TYPE = ora.service.type)"
...
...

real    0m0.655s
user    0m0.169s
sys     0m0.098s

$ time crsctl stat res -f -w "(TYPE = ora.service.type)"

...

real 0m0.655s

user 0m0.169s

sys 0m0.098s

crsctl stat res -f returns a list of ATTRIBUTE_NAME=value for each service, eventually more than one if the service is not singleton/single instance but uniform/multi instance.

By parsing them with some awk code can provide nice results!

STATE, INTERNAL_STATE and TARGET are useful in this case and might be used to display colours as well.

Green: Status ONLINE, Target ONLINE, STABLE
Black: Status OFFLINE, Target OFFLNE, STABLE
Red: Status ONLINE, Target OFFLINE, STABLE
Yellow: all other cases

Here’s the code:

if [ -f /etc/oracle/olr.loc ] ; then
        export ORA_CLU_HOME=`cat /etc/oracle/olr.loc 2>/dev/null | grep crs_home | awk -F= '{print $2}'`
        export CRS_EXISTS=1
        export CRSCTL=$ORA_CLU_HOME/bin/crsctl
else
        export CRS_EXISTS=0
fi

svcstat ()
{
    if [ $CRS_EXISTS -eq 1 ]; then
        ${CRSCTL} stat res -f -w "(TYPE = ora.service.type)" | awk -F= '
function print_row() {
        dbbcol="";
        dbecol="";
        instbcol="";
        instecol="";
        instances=res["INSTANCE_COUNT 1"];
        for(i=1;i<=instances;i++) {
                # if at least one of the services is online, the service is online (then I paint it green)
                if (res["STATE " i] == "ONLINE" ) {
                        dbbcol="\033[0;32m";
                        dbecol="\033[0m";
                }
        }
        # db unique name is always the second part of the resource name
        # because it does not change, I can get it once from the resource name
        res["DB_UNIQUE_NAME"]=substr(substr(res["NAME"],5),1,index(substr(res["NAME"],5),".")-1);

        # same for service name
        res["SERVICE_NAME"]=substr(res["NAME"],index(substr(res["NAME"],5),".")+5,length(substr(res["NAME"],index(substr(res["NAME"],5),".")+5))-4);

        #starting printing the first part of the information
        printf ("%s%-24s %-30s%s",dbbcol, res["DB_UNIQUE_NAME"], res["SERVICE_NAME"], dbecol);

        # here, instance need to map to the correct server.
        # the mapping is node by attribute TARGET_SERVER (not last server)
        for ( n in node ) {
                node_name=node[n];
                status[node_name]="";
                for (i=1; i<=instances; i++) {
                        # we are on the instance that matches the server
                        if (node_name == res["TARGET_SERVER " i]) {
                                res["SERVER_NAME " i]=node_name;
                                if (status[node_name] !~ "ONLINE") {
                                        # when a service relocates both instances get the survival target_server
                                        # but just one is ONLINE... so we need to get always the ONLINE one.
                                        #printf("was::%s:", status[node_name]);
                                        status[node_name]=res["STATE " i];
                                }

                                # colors modes
                                if ( res["STATE " i] == "ONLINE" && res["INTERNAL_STATE " i] == "STABLE" ) {
                                        # online and stable: GREEN
                                        status[node_name]=sprintf("\033[0;32m%-14s\033[0m", status[node_name]);
                                }
                                else if ( res["STATE " i] != "ONLINE" && res["INTERNAL_STATE " i] == "STABLE" ) {
                                        # offline and stable
                                        if ( res["TARGET " i] == "OFFLINE" ) {
                                                # offline, stable, target offline: BLACK
                                                status[node_name]=sprintf("%-14s", status[node_name]);
                                        }
                                        else {
                                                # offline, stable, target online: RED
                                                status[node_name]=sprintf("\033[0;31m%-14s\033[0m", status[node_name]);
                                        }
                                }
                                else {
                                        # all other cases: offline and starting, online and stopping, clearning, etc.: YELLOW
                                        status[node_name]=sprintf("\033[0;33m%-14s\033[0m", status[node_name]);
                                }
                                #printf("%s %s %s %s\n", status[node_name], node[n], res["STATE " i], res["INTERNAL_STATE " i]);
                        }
                }
               printf(" %-14s", status[node_name]);
        }
        printf("\n");
}
function pad (string, len, char) {
        ret = string;
        for ( i = length(string); i<len ; i++) {
                ret = sprintf("%s%s",ret,char);
        }
        return ret;
}
BEGIN {
        debug = 0;
        first = 1;
        afterempty=1;
        # this loop should set:
        # node[1]=server1; node[2]=server2; nodes=2;
        nodes=0;
        while ("olsnodes" | getline a) {
                nodes++;
                node[nodes] = a;
        }
        fmt="%-24s %-30s";
        printf (fmt, "DB_Unique_Name", "Service_Name");
        for ( n in node ) {
                printf (" %-14s", node[n]);
        }
        printf ("\n");
        printf (fmt, pad("",24,"-"), pad("",30,"-"));
        for ( n in node ) {
                printf (" %s", pad("",14,"-"));
        }
        printf ("\n");

}
# MAIN awk svcstat
{
        if ( $1 == "NAME" ) {
                if ( first != 1 && res["NAME"] == $2 ) {
                        if ( debug == 1 ) print "Secondary instance";
                        instance++;
                }
                else {
                        if ( first != 1 ) {
                                print_row();
                        }
                        first = 0;
                        instance=1;
                        delete res;
                        res["NAME"] = $2;
                }
        }
        else  {
                res[$1 " " instance] = $2 ;

        }
}
END {
        #if ( debug == 1 ) for (key in res) { print key ": " res[key] }
        print_row();
}
';
    else
        echo "svcstat not available on non-clustered environments";
        false;
    fi
}

100

101

102

103

104

105

106

107

108

109

110

111

112

113

114

115

116

117

118

119

120

121

122

123

124

125

126

127

128

129

130

131

132

133

134

135

136

137

138

139

140

141

if [ -f /etc/oracle/olr.loc ] ; then

export ORA_CLU_HOME=`cat /etc/oracle/olr.loc 2>/dev/null | grep crs_home | awk -F= '{print $2}'`

export CRS_EXISTS=1

export CRSCTL=$ORA_CLU_HOME/bin/crsctl

else

export CRS_EXISTS=0

svcstat ()

{

if [ $CRS_EXISTS -eq 1 ]; then

${CRSCTL} stat res -f -w "(TYPE = ora.service.type)" | awk -F= '

function print_row() {

dbbcol="";

dbecol="";

instbcol="";

instecol="";

instances=res["INSTANCE_COUNT 1"];

for(i=1;i<=instances;i++) {

# if at least one of the services is online, the service is online (then I paint it green)

if (res["STATE " i] == "ONLINE" ) {

dbbcol="\033[0;32m";

dbecol="\033[0m";

}

# db unique name is always the second part of the resource name

# because it does not change, I can get it once from the resource name

res["DB_UNIQUE_NAME"]=substr(substr(res["NAME"],5),1,index(substr(res["NAME"],5),".")-1);

# same for service name

res["SERVICE_NAME"]=substr(res["NAME"],index(substr(res["NAME"],5),".")+5,length(substr(res["NAME"],index(substr(res["NAME"],5),".")+5))-4);

#starting printing the first part of the information

printf ("%s%-24s %-30s%s",dbbcol, res["DB_UNIQUE_NAME"], res["SERVICE_NAME"], dbecol);

# here, instance need to map to the correct server.

# the mapping is node by attribute TARGET_SERVER (not last server)

for ( n in node ) {

node_name=node[n];

status[node_name]="";

for (i=1; i<=instances; i++) {

# we are on the instance that matches the server

if (node_name == res["TARGET_SERVER " i]) {

res["SERVER_NAME " i]=node_name;

if (status[node_name] !~ "ONLINE") {

# when a service relocates both instances get the survival target_server

# but just one is ONLINE... so we need to get always the ONLINE one.

#printf("was::%s:", status[node_name]);

status[node_name]=res["STATE " i];

}

# colors modes

if ( res["STATE " i] == "ONLINE" && res["INTERNAL_STATE " i] == "STABLE" ) {

# online and stable: GREEN

status[node_name]=sprintf("\033[0;32m%-14s\033[0m", status[node_name]);

}

else if ( res["STATE " i] != "ONLINE" && res["INTERNAL_STATE " i] == "STABLE" ) {

# offline and stable

if ( res["TARGET " i] == "OFFLINE" ) {

# offline, stable, target offline: BLACK

status[node_name]=sprintf("%-14s", status[node_name]);

}

else {

# offline, stable, target online: RED

status[node_name]=sprintf("\033[0;31m%-14s\033[0m", status[node_name]);

}

else {

# all other cases: offline and starting, online and stopping, clearning, etc.: YELLOW

status[node_name]=sprintf("\033[0;33m%-14s\033[0m", status[node_name]);

}

#printf("%s %s %s %s\n", status[node_name], node[n], res["STATE " i], res["INTERNAL_STATE " i]);

}

printf(" %-14s", status[node_name]);

}

printf("\n");

}

function pad (string, len, char) {

ret = string;

for ( i = length(string); i<len ; i++) {

ret = sprintf("%s%s",ret,char);

}

return ret;

}

BEGIN {

debug = 0;

first = 1;

afterempty=1;

# this loop should set:

# node[1]=server1; node[2]=server2; nodes=2;

nodes=0;

while ("olsnodes" | getline a) {

nodes++;

node[nodes] = a;

}

fmt="%-24s %-30s";

printf (fmt, "DB_Unique_Name", "Service_Name");

for ( n in node ) {

printf (" %-14s", node[n]);

}

printf ("\n");

printf (fmt, pad("",24,"-"), pad("",30,"-"));

for ( n in node ) {

printf (" %s", pad("",14,"-"));

}

printf ("\n");

}

# MAIN awk svcstat

{

if ( $1 == "NAME" ) {

if ( first != 1 && res["NAME"] == $2 ) {

if ( debug == 1 ) print "Secondary instance";

instance++;

}

else {

if ( first != 1 ) {

print_row();

}

first = 0;

instance=1;

delete res;

res["NAME"] = $2;

}

else {

res[$1 " " instance] = $2 ;

}

END {

#if ( debug == 1 ) for (key in res) { print key ": " res[key] }

print_row();

}

else

echo "svcstat not available on non-clustered environments";

false;

}

Here’s what you can expect, for 92 services distributed on 4 nodes and a dozen of databases (the output is snipped and the names are masked):

$ time svcstat
DB_Unique_Name     Service_Name       server1  server2  server3  server4
------------------ ------------------ -------- -------- -------- --------
hrdev_site1        SERVICE_NUMBER_01                             ONLINE
hrdev_site1        SERVICE_NUMBER_02                             ONLINE
...
hrdev_site1        SERVICE_NUMBER_20                             ONLINE
hrstg_site1        SERVICE_NUMBER_21                    ONLINE  
hrstg_site1        SERVICE_NUMBER_22                    ONLINE  
...
hrstg_site1        SERVICE_NUMBER_41                    ONLINE  
hrtest_site1       SERVICE_NUMBER_42           ONLINE           
hrtest_site1       SERVICE_NUMBER_43           ONLINE           
...
hrtest_site1       SERVICE_NUMBER_62           ONLINE           
hrtest_site1       SERVICE_NUMBER_63           ONLINE           
hrtest_site1       SERVICE_NUMBER_64           ONLINE           
hrtest_site1       SERVICE_NUMBER_65           ONLINE           
hrtest_site1       SERVICE_NUMBER_66           ONLINE           
erpdev_site1       SERVICE_NUMBER_67  ONLINE                    
erptest_site1      SERVICE_NUMBER_68  ONLINE                    
cmsstg_site1       SERVICE_NUMBER_69  ONLINE                    
cmsstg_site1       SERVICE_NUMBER_70  ONLINE                    
...
cmsstg_site1       SERVICE_NUMBER_74  ONLINE                    
cmsstg_site1       SERVICE_NUMBER_75  ONLINE                    
cmstest_site1      SERVICE_NUMBER_76  ONLINE                    
...
cmstest_site1      SERVICE_NUMBER_81  ONLINE                    
kbtest_site1       SERVICE_NUMBER_82                    ONLINE           
...
kbtest_site1       SERVICE_NUMBER_84                    ONLINE           
reporting_site1    SERVICE_NUMBER_85  ONLINE                    
paydev_site1       SERVICE_NUMBER_86           ONLINE           
payrep_site1       SERVICE_NUMBER_87           ONLINE           
...
paytest_site1      SERVICE_NUMBER_90           ONLINE           
paytest_site1      SERVICE_NUMBER_91           ONLINE           
crm_site1          SERVICE_NUMBER_92                             ONLINE

real    0m0.358s
user    0m0.232s
sys     0m0.134s

$ time svcstat

DB_Unique_Name Service_Name server1 server2 server3 server4

------------------ ------------------ -------- -------- -------- --------

hrdev_site1 SERVICE_NUMBER_01 ONLINE

hrdev_site1 SERVICE_NUMBER_02 ONLINE

...

hrdev_site1 SERVICE_NUMBER_20 ONLINE

hrstg_site1 SERVICE_NUMBER_21 ONLINE

hrstg_site1 SERVICE_NUMBER_22 ONLINE

...

hrstg_site1 SERVICE_NUMBER_41 ONLINE

hrtest_site1 SERVICE_NUMBER_42 ONLINE

hrtest_site1 SERVICE_NUMBER_43 ONLINE

...

hrtest_site1 SERVICE_NUMBER_62 ONLINE

hrtest_site1 SERVICE_NUMBER_63 ONLINE

hrtest_site1 SERVICE_NUMBER_64 ONLINE

hrtest_site1 SERVICE_NUMBER_65 ONLINE

hrtest_site1 SERVICE_NUMBER_66 ONLINE

erpdev_site1 SERVICE_NUMBER_67 ONLINE

erptest_site1 SERVICE_NUMBER_68 ONLINE

cmsstg_site1 SERVICE_NUMBER_69 ONLINE

cmsstg_site1 SERVICE_NUMBER_70 ONLINE

...

cmsstg_site1 SERVICE_NUMBER_74 ONLINE

cmsstg_site1 SERVICE_NUMBER_75 ONLINE

cmstest_site1 SERVICE_NUMBER_76 ONLINE

...

cmstest_site1 SERVICE_NUMBER_81 ONLINE

kbtest_site1 SERVICE_NUMBER_82 ONLINE

...

kbtest_site1 SERVICE_NUMBER_84 ONLINE

reporting_site1 SERVICE_NUMBER_85 ONLINE

paydev_site1 SERVICE_NUMBER_86 ONLINE

payrep_site1 SERVICE_NUMBER_87 ONLINE

...

paytest_site1 SERVICE_NUMBER_90 ONLINE

paytest_site1 SERVICE_NUMBER_91 ONLINE

crm_site1 SERVICE_NUMBER_92 ONLINE

real 0m0.358s

user 0m0.232s

sys 0m0.134s

I’d be curious to know if it works well for your environment, please comment here. 🙂

Thanks

—

Ludo

Oracle Grid Infrastructure 18c patching part 3: Executing out-of-place patching with the local-mode automaton

Posted on January 13, 2019 by Ludovico

I wish I had more time to blog in the recent weeks. Sorry for the delay in this blog series 🙂

If you have not read the two previous blog posts, please do it now. I suppose here that you have the Independent Local-Mode Automaton already enabled.

What does the Independent Local-mode Automaton?

The automaton automates the process of moving the active Grid Infrastructure Oracle Home from the current one to a new one. The new one can be either at a higher patch level or at a lower one. Of course, you will probably want to patch your grid infrastructure, going then to a higher level of patching.

Preparing the new Grid Infrastructure Oracle Home

The GI home, starting from 12.2, is just a zip that is extracted directly in the new Oracle Home. In this blog post I suppose that you want to patch your Grid Infrastructure from an existing 18.3 to a brand new 18.4 (18.5 will be released very soon).

So, if your current OH is /u01/app/grid/crs1830, you might want to prepare the new home in /u01/app/grid/crs1840 by unzipping the software and then patching using the steps described here.

If you already have a golden image with the correct version, you can unzip it directly.

Beware of four important things:

You have to register the new Oracle home in the Central Inventory using the SW_ONLY install, as described here.
You must do it for all the nodes in the cluster prior to upgrading
The response file must contain the same groups (DBA, OPER, etc) as the current active Home, otherwise errors will appear.
You must relink by hand your Oracle binaries with the RAC option:
$ cd /u01/app/grid/1crs1840/rdbms/lib
$ make -f ins_rdbms.mk rac_on ioracle

In fact, after every attach to the central inventory the binaries are relinked without RAC option, so it is important to activate RAC again to avoid bad problems when upgrading the ASM with the new Automaton.

Executing the move gihome

If everything is correct, you should have now the current and new Oracle Homes, correctly registered in the Central Inventory, with the RAC option activated.

You can now do a first eval to check if everything looks good:

# [ oracle@server1:/u01/app/oracle/home [12:01:52] [18.3.0.0.0 [GRID] SID=GRID] 0 ] #
$ rhpctl move gihome -sourcehome /u01/app/grid/crs1830 -desthome /u01/app/grid/crs1840 -eval
server2.cern.ch: Audit ID: 4
server2.cern.ch: Evaluation in progress for "move gihome" ...
server2.cern.ch: verifying versions of Oracle homes ...
server2.cern.ch: verifying owners of Oracle homes ...
server2.cern.ch: verifying groups of Oracle homes ...
server2.cern.ch: Evaluation finished successfully for "move gihome".

# [ oracle@server1:/u01/app/oracle/home [12:01:52] [18.3.0.0.0 [GRID] SID=GRID] 0 ] #

$ rhpctl move gihome -sourcehome /u01/app/grid/crs1830 -desthome /u01/app/grid/crs1840 -eval

server2.cern.ch: Audit ID: 4

server2.cern.ch: Evaluation in progress for "move gihome" ...

server2.cern.ch: verifying versions of Oracle homes ...

server2.cern.ch: verifying owners of Oracle homes ...

server2.cern.ch: verifying groups of Oracle homes ...

server2.cern.ch: Evaluation finished successfully for "move gihome".

My personal suggestion at least at your first experiences with the automaton, is to move the Oracle Home on one node at a time. This way, YOU control the relocation of the services and resources before doing the actual move operation.

Here is the execution for the first node:

# [ oracle@server1:/u01/app/oracle/home [15:17:26] [18.3.0.0.0 [GRID] SID=GRID] 0 ] #
$ rhpctl move gihome -sourcehome /u01/app/grid/crs1830 -desthome /u01/app/grid/crs1840 -node server1
server2.cern.ch: Audit ID: 4
server2.cern.ch: verifying versions of Oracle homes ...
server2.cern.ch: verifying owners of Oracle homes ...
server2.cern.ch: verifying groups of Oracle homes ...
server2.cern.ch: starting to move the Oracle Grid Infrastructure home from "/u01/app/grid/crs1830" to "/u01/app/grid/crs1840" on server cluster "CRSTEST-RAC16"
server2.cern.ch: Executing prepatch and postpatch on nodes: "server1".
server2.cern.ch: Executing root script on nodes [server1].
server2.cern.ch: Successfully executed root script on nodes [server1].
server2.cern.ch: Executing root script on nodes [server1].
Using configuration parameter file: /u01/app/grid/crs1840/crs/install/crsconfig_params
The log of current session can be found at:
  /u01/app/oracle/crsdata/server1/crsconfig/crs_postpatch_server1_2018-11-14_03-27-43PM.log
Oracle Clusterware active version on the cluster is [18.0.0.0.0]. The cluster upgrade state is [NORMAL]. The cluster active patch level is [70732493].
CRS-2791: Starting shutdown of Oracle High Availability Services-managed resources on 'server1'
CRS-2673: Attempting to stop 'ora.crsd' on 'server1'
CRS-2790: Starting shutdown of Cluster Ready Services-managed resources on server 'server1'
CRS-2673: Attempting to stop 'ora.LISTENER_SCAN2.lsnr' on 'server1'
CRS-2673: Attempting to stop 'ora.mgmt.ghchkpt.acfs' on 'server1'
CRS-2673: Attempting to stop 'ora.helper336.hlp' on 'server1'
CRS-2673: Attempting to stop 'ora.chad' on 'server1'
CRS-2673: Attempting to stop 'ora.chad' on 'server2'
CRS-2673: Attempting to stop 'ora.LISTENER.lsnr' on 'server1'
CRS-2673: Attempting to stop 'ora.OCRVOT.dg' on 'server1'
CRS-2673: Attempting to stop 'ora.MGMT.dg' on 'server1'
CRS-2673: Attempting to stop 'ora.helper' on 'server1'
CRS-2673: Attempting to stop 'ora.cvu' on 'server1'
CRS-2673: Attempting to stop 'ora.qosmserver' on 'server1'
CRS-2677: Stop of 'ora.helper336.hlp' on 'server1' succeeded
CRS-2677: Stop of 'ora.OCRVOT.dg' on 'server1' succeeded
CRS-2677: Stop of 'ora.MGMT.dg' on 'server1' succeeded
CRS-2673: Attempting to stop 'ora.asm' on 'server1'
CRS-2677: Stop of 'ora.LISTENER_SCAN2.lsnr' on 'server1' succeeded
CRS-2677: Stop of 'ora.LISTENER.lsnr' on 'server1' succeeded
CRS-2673: Attempting to stop 'ora.scan2.vip' on 'server1'
CRS-2677: Stop of 'ora.helper' on 'server1' succeeded
CRS-2677: Stop of 'ora.cvu' on 'server1' succeeded
CRS-2677: Stop of 'ora.scan2.vip' on 'server1' succeeded
CRS-2677: Stop of 'ora.asm' on 'server1' succeeded
CRS-2673: Attempting to stop 'ora.ASMNET1LSNR_ASM.lsnr' on 'server1'
CRS-2677: Stop of 'ora.mgmt.ghchkpt.acfs' on 'server1' succeeded
CRS-2673: Attempting to stop 'ora.MGMT.GHCHKPT.advm' on 'server1'
CRS-2677: Stop of 'ora.MGMT.GHCHKPT.advm' on 'server1' succeeded
CRS-2673: Attempting to stop 'ora.proxy_advm' on 'server1'
CRS-2677: Stop of 'ora.chad' on 'server2' succeeded
CRS-2677: Stop of 'ora.chad' on 'server1' succeeded
CRS-2673: Attempting to stop 'ora.mgmtdb' on 'server1'
CRS-2677: Stop of 'ora.qosmserver' on 'server1' succeeded
CRS-2677: Stop of 'ora.ASMNET1LSNR_ASM.lsnr' on 'server1' succeeded
CRS-2677: Stop of 'ora.proxy_advm' on 'server1' succeeded
CRS-2677: Stop of 'ora.mgmtdb' on 'server1' succeeded
CRS-2673: Attempting to stop 'ora.MGMTLSNR' on 'server1'
CRS-2677: Stop of 'ora.MGMTLSNR' on 'server1' succeeded
CRS-2673: Attempting to stop 'ora.server1.vip' on 'server1'
CRS-2677: Stop of 'ora.server1.vip' on 'server1' succeeded
CRS-2672: Attempting to start 'ora.MGMTLSNR' on 'server2'
CRS-2672: Attempting to start 'ora.qosmserver' on 'server2'
CRS-2672: Attempting to start 'ora.scan2.vip' on 'server2'
CRS-2672: Attempting to start 'ora.cvu' on 'server2'
CRS-2672: Attempting to start 'ora.server1.vip' on 'server2'
CRS-2676: Start of 'ora.cvu' on 'server2' succeeded
CRS-2676: Start of 'ora.server1.vip' on 'server2' succeeded
CRS-2676: Start of 'ora.MGMTLSNR' on 'server2' succeeded
CRS-2672: Attempting to start 'ora.mgmtdb' on 'server2'
CRS-2676: Start of 'ora.scan2.vip' on 'server2' succeeded
CRS-2672: Attempting to start 'ora.LISTENER_SCAN2.lsnr' on 'server2'
CRS-2676: Start of 'ora.LISTENER_SCAN2.lsnr' on 'server2' succeeded
CRS-2676: Start of 'ora.qosmserver' on 'server2' succeeded
CRS-2676: Start of 'ora.mgmtdb' on 'server2' succeeded
CRS-2672: Attempting to start 'ora.chad' on 'server2'
CRS-2676: Start of 'ora.chad' on 'server2' succeeded
CRS-2673: Attempting to stop 'ora.ons' on 'server1'
CRS-2677: Stop of 'ora.ons' on 'server1' succeeded
CRS-2673: Attempting to stop 'ora.net1.network' on 'server1'
CRS-2677: Stop of 'ora.net1.network' on 'server1' succeeded
CRS-2792: Shutdown of Cluster Ready Services-managed resources on 'server1' has completed
CRS-2677: Stop of 'ora.crsd' on 'server1' succeeded
CRS-2673: Attempting to stop 'ora.asm' on 'server1'
CRS-2673: Attempting to stop 'ora.crf' on 'server1'
CRS-2673: Attempting to stop 'ora.drivers.acfs' on 'server1'
CRS-2673: Attempting to stop 'ora.mdnsd' on 'server1'
CRS-2677: Stop of 'ora.drivers.acfs' on 'server1' succeeded
CRS-2677: Stop of 'ora.crf' on 'server1' succeeded
CRS-2677: Stop of 'ora.mdnsd' on 'server1' succeeded
CRS-2677: Stop of 'ora.asm' on 'server1' succeeded
CRS-2673: Attempting to stop 'ora.cluster_interconnect.haip' on 'server1'
CRS-2677: Stop of 'ora.cluster_interconnect.haip' on 'server1' succeeded
CRS-2673: Attempting to stop 'ora.ctssd' on 'server1'
CRS-2673: Attempting to stop 'ora.evmd' on 'server1'
CRS-2677: Stop of 'ora.ctssd' on 'server1' succeeded
CRS-2677: Stop of 'ora.evmd' on 'server1' succeeded
CRS-2673: Attempting to stop 'ora.cssd' on 'server1'
CRS-2677: Stop of 'ora.cssd' on 'server1' succeeded
CRS-2673: Attempting to stop 'ora.gipcd' on 'server1'
CRS-2673: Attempting to stop 'ora.gpnpd' on 'server1'
CRS-2677: Stop of 'ora.gipcd' on 'server1' succeeded
CRS-2677: Stop of 'ora.gpnpd' on 'server1' succeeded
CRS-2793: Shutdown of Oracle High Availability Services-managed resources on 'server1' has completed
CRS-4133: Oracle High Availability Services has been stopped.
2018/11/14 15:30:10 CLSRSC-329: Replacing Clusterware entries in file 'oracle-ohasd.service'
CRS-4123: Starting Oracle High Availability Services-managed resources
CRS-2672: Attempting to start 'ora.mdnsd' on 'server1'
CRS-2672: Attempting to start 'ora.evmd' on 'server1'
CRS-2676: Start of 'ora.mdnsd' on 'server1' succeeded
CRS-2676: Start of 'ora.evmd' on 'server1' succeeded
CRS-2672: Attempting to start 'ora.gpnpd' on 'server1'
CRS-2676: Start of 'ora.gpnpd' on 'server1' succeeded
CRS-2672: Attempting to start 'ora.gipcd' on 'server1'
CRS-2676: Start of 'ora.gipcd' on 'server1' succeeded
CRS-2672: Attempting to start 'ora.cssdmonitor' on 'server1'
CRS-2672: Attempting to start 'ora.crf' on 'server1'
CRS-2676: Start of 'ora.cssdmonitor' on 'server1' succeeded
CRS-2672: Attempting to start 'ora.cssd' on 'server1'
CRS-2672: Attempting to start 'ora.diskmon' on 'server1'
CRS-2676: Start of 'ora.diskmon' on 'server1' succeeded
CRS-2676: Start of 'ora.crf' on 'server1' succeeded
CRS-2676: Start of 'ora.cssd' on 'server1' succeeded
CRS-2672: Attempting to start 'ora.cluster_interconnect.haip' on 'server1'
CRS-2672: Attempting to start 'ora.ctssd' on 'server1'
CRS-2676: Start of 'ora.ctssd' on 'server1' succeeded
CRS-2676: Start of 'ora.cluster_interconnect.haip' on 'server1' succeeded
CRS-2672: Attempting to start 'ora.asm' on 'server1'
CRS-2676: Start of 'ora.asm' on 'server1' succeeded
CRS-2672: Attempting to start 'ora.storage' on 'server1'
CRS-2676: Start of 'ora.storage' on 'server1' succeeded
CRS-2672: Attempting to start 'ora.crsd' on 'server1'
CRS-2676: Start of 'ora.crsd' on 'server1' succeeded
CRS-6017: Processing resource auto-start for servers: server1
CRS-2673: Attempting to stop 'ora.server1.vip' on 'server2'
CRS-2673: Attempting to stop 'ora.LISTENER_SCAN1.lsnr' on 'server2'
CRS-2672: Attempting to start 'ora.ons' on 'server1'
CRS-2672: Attempting to start 'ora.chad' on 'server1'
CRS-2677: Stop of 'ora.server1.vip' on 'server2' succeeded
CRS-2672: Attempting to start 'ora.server1.vip' on 'server1'
CRS-2677: Stop of 'ora.LISTENER_SCAN1.lsnr' on 'server2' succeeded
CRS-2673: Attempting to stop 'ora.scan1.vip' on 'server2'
CRS-2677: Stop of 'ora.scan1.vip' on 'server2' succeeded
CRS-2672: Attempting to start 'ora.scan1.vip' on 'server1'
CRS-2676: Start of 'ora.chad' on 'server1' succeeded
CRS-2676: Start of 'ora.server1.vip' on 'server1' succeeded
CRS-2672: Attempting to start 'ora.LISTENER.lsnr' on 'server1'
CRS-2676: Start of 'ora.scan1.vip' on 'server1' succeeded
CRS-2672: Attempting to start 'ora.LISTENER_SCAN1.lsnr' on 'server1'
CRS-2676: Start of 'ora.LISTENER.lsnr' on 'server1' succeeded
CRS-2679: Attempting to clean 'ora.asm' on 'server1'
CRS-2676: Start of 'ora.LISTENER_SCAN1.lsnr' on 'server1' succeeded
CRS-2681: Clean of 'ora.asm' on 'server1' succeeded
CRS-2672: Attempting to start 'ora.asm' on 'server1'
CRS-2676: Start of 'ora.ons' on 'server1' succeeded
ORA-15150: instance lock mode 'EXCLUSIVE' conflicts with other ASM instance(s)
CRS-2674: Start of 'ora.asm' on 'server1' failed
CRS-2672: Attempting to start 'ora.asm' on 'server1'
ORA-15150: instance lock mode 'EXCLUSIVE' conflicts with other ASM instance(s)
CRS-2674: Start of 'ora.asm' on 'server1' failed
CRS-2679: Attempting to clean 'ora.proxy_advm' on 'server1'
CRS-2681: Clean of 'ora.proxy_advm' on 'server1' succeeded
CRS-2672: Attempting to start 'ora.proxy_advm' on 'server1'
CRS-2676: Start of 'ora.proxy_advm' on 'server1' succeeded
CRS-2672: Attempting to start 'ora.asm' on 'server1'
ORA-15150: instance lock mode 'EXCLUSIVE' conflicts with other ASM instance(s)
CRS-2674: Start of 'ora.asm' on 'server1' failed
CRS-2672: Attempting to start 'ora.MGMT.GHCHKPT.advm' on 'server1'
CRS-2676: Start of 'ora.MGMT.GHCHKPT.advm' on 'server1' succeeded
CRS-2672: Attempting to start 'ora.mgmt.ghchkpt.acfs' on 'server1'
CRS-2676: Start of 'ora.mgmt.ghchkpt.acfs' on 'server1' succeeded
===== Summary of resource auto-start failures follows =====
CRS-2807: Resource 'ora.asm' failed to start automatically.
CRS-6016: Resource auto-start has completed for server server1
CRS-6024: Completed start of Oracle Cluster Ready Services-managed resources
CRS-4123: Oracle High Availability Services has been started.
Oracle Clusterware active version on the cluster is [18.0.0.0.0]. The cluster upgrade state is [ROLLING PATCH]. The cluster active patch level is [70732493].
2018/11/14 15:35:23 CLSRSC-4015: Performing install or upgrade action for Oracle Trace File Analyzer (TFA) Collector.
2018/11/14 15:37:11 CLSRSC-4003: Successfully patched Oracle Trace File Analyzer (TFA) Collector.
2018/11/14 15:37:13 CLSRSC-672: Post-patch steps for patching GI home successfully completed.
server2.cern.ch: Successfully executed root script on nodes [server1].
server2.cern.ch: Updating inventory on nodes: server1.
========================================
server2.cern.ch:
Starting Oracle Universal Installer...

The inventory pointer is located at /etc/oraInst.loc
'UpdateNodeList' was successful.
server2.cern.ch: Updated inventory on nodes: server1.
server2.cern.ch: Updating inventory on nodes: server1.
========================================
server2.cern.ch:
Starting Oracle Universal Installer...

The inventory pointer is located at /etc/oraInst.loc
'UpdateNodeList' was successful.
server2.cern.ch: Updated inventory on nodes: server1.
server2.cern.ch: Continue by running 'rhpctl move gihome -destwc <workingcopy_name> -continue [-root | -sudouser <sudo_username> -sudopath <path_to_sudo_binary>]'.
server2.cern.ch: completed the move of Oracle Grid Infrastructure home on server cluster "CRSTEST-RAC16"

100

101

102

103

104

105

106

107

108

109

110

111

112

113

114

115

116

117

118

119

120

121

122

123

124

125

126

127

128

129

130

131

132

133

134

135

136

137

138

139

140

141

142

143

144

145

146

147

148

149

150

151

152

153

154

155

156

157

158

159

160

161

162

163

164

165

166

167

168

169

170

171

172

173

174

175

176

177

178

179

180

181

182

183

184

185

186

187

188

189

190

191

192

193

194

# [ oracle@server1:/u01/app/oracle/home [15:17:26] [18.3.0.0.0 [GRID] SID=GRID] 0 ] #

$ rhpctl move gihome -sourcehome /u01/app/grid/crs1830 -desthome /u01/app/grid/crs1840 -node server1

server2.cern.ch: Audit ID: 4

server2.cern.ch: verifying versions of Oracle homes ...

server2.cern.ch: verifying owners of Oracle homes ...

server2.cern.ch: verifying groups of Oracle homes ...

server2.cern.ch: starting to move the Oracle Grid Infrastructure home from "/u01/app/grid/crs1830" to "/u01/app/grid/crs1840" on server cluster "CRSTEST-RAC16"

server2.cern.ch: Executing prepatch and postpatch on nodes: "server1".

server2.cern.ch: Executing root script on nodes [server1].

server2.cern.ch: Successfully executed root script on nodes [server1].

server2.cern.ch: Executing root script on nodes [server1].

Using configuration parameter file: /u01/app/grid/crs1840/crs/install/crsconfig_params

The log of current session can be found at:

/u01/app/oracle/crsdata/server1/crsconfig/crs_postpatch_server1_2018-11-14_03-27-43PM.log

Oracle Clusterware active version on the cluster is [18.0.0.0.0]. The cluster upgrade state is [NORMAL]. The cluster active patch level is [70732493].

CRS-2791: Starting shutdown of Oracle High Availability Services-managed resources on 'server1'

CRS-2673: Attempting to stop 'ora.crsd' on 'server1'

CRS-2790: Starting shutdown of Cluster Ready Services-managed resources on server 'server1'

CRS-2673: Attempting to stop 'ora.LISTENER_SCAN2.lsnr' on 'server1'

CRS-2673: Attempting to stop 'ora.mgmt.ghchkpt.acfs' on 'server1'

CRS-2673: Attempting to stop 'ora.helper336.hlp' on 'server1'

CRS-2673: Attempting to stop 'ora.chad' on 'server1'

CRS-2673: Attempting to stop 'ora.chad' on 'server2'

CRS-2673: Attempting to stop 'ora.LISTENER.lsnr' on 'server1'

CRS-2673: Attempting to stop 'ora.OCRVOT.dg' on 'server1'

CRS-2673: Attempting to stop 'ora.MGMT.dg' on 'server1'

CRS-2673: Attempting to stop 'ora.helper' on 'server1'

CRS-2673: Attempting to stop 'ora.cvu' on 'server1'

CRS-2673: Attempting to stop 'ora.qosmserver' on 'server1'

CRS-2677: Stop of 'ora.helper336.hlp' on 'server1' succeeded

CRS-2677: Stop of 'ora.OCRVOT.dg' on 'server1' succeeded

CRS-2677: Stop of 'ora.MGMT.dg' on 'server1' succeeded

CRS-2673: Attempting to stop 'ora.asm' on 'server1'

CRS-2677: Stop of 'ora.LISTENER_SCAN2.lsnr' on 'server1' succeeded

CRS-2677: Stop of 'ora.LISTENER.lsnr' on 'server1' succeeded

CRS-2673: Attempting to stop 'ora.scan2.vip' on 'server1'

CRS-2677: Stop of 'ora.helper' on 'server1' succeeded

CRS-2677: Stop of 'ora.cvu' on 'server1' succeeded

CRS-2677: Stop of 'ora.scan2.vip' on 'server1' succeeded

CRS-2677: Stop of 'ora.asm' on 'server1' succeeded

CRS-2673: Attempting to stop 'ora.ASMNET1LSNR_ASM.lsnr' on 'server1'

CRS-2677: Stop of 'ora.mgmt.ghchkpt.acfs' on 'server1' succeeded

CRS-2673: Attempting to stop 'ora.MGMT.GHCHKPT.advm' on 'server1'

CRS-2677: Stop of 'ora.MGMT.GHCHKPT.advm' on 'server1' succeeded

CRS-2673: Attempting to stop 'ora.proxy_advm' on 'server1'

CRS-2677: Stop of 'ora.chad' on 'server2' succeeded

CRS-2677: Stop of 'ora.chad' on 'server1' succeeded

CRS-2673: Attempting to stop 'ora.mgmtdb' on 'server1'

CRS-2677: Stop of 'ora.qosmserver' on 'server1' succeeded

CRS-2677: Stop of 'ora.ASMNET1LSNR_ASM.lsnr' on 'server1' succeeded

CRS-2677: Stop of 'ora.proxy_advm' on 'server1' succeeded

CRS-2677: Stop of 'ora.mgmtdb' on 'server1' succeeded

CRS-2673: Attempting to stop 'ora.MGMTLSNR' on 'server1'

CRS-2677: Stop of 'ora.MGMTLSNR' on 'server1' succeeded

CRS-2673: Attempting to stop 'ora.server1.vip' on 'server1'

CRS-2677: Stop of 'ora.server1.vip' on 'server1' succeeded

CRS-2672: Attempting to start 'ora.MGMTLSNR' on 'server2'

CRS-2672: Attempting to start 'ora.qosmserver' on 'server2'

CRS-2672: Attempting to start 'ora.scan2.vip' on 'server2'

CRS-2672: Attempting to start 'ora.cvu' on 'server2'

CRS-2672: Attempting to start 'ora.server1.vip' on 'server2'

CRS-2676: Start of 'ora.cvu' on 'server2' succeeded

CRS-2676: Start of 'ora.server1.vip' on 'server2' succeeded

CRS-2676: Start of 'ora.MGMTLSNR' on 'server2' succeeded

CRS-2672: Attempting to start 'ora.mgmtdb' on 'server2'

CRS-2676: Start of 'ora.scan2.vip' on 'server2' succeeded

CRS-2672: Attempting to start 'ora.LISTENER_SCAN2.lsnr' on 'server2'

CRS-2676: Start of 'ora.LISTENER_SCAN2.lsnr' on 'server2' succeeded

CRS-2676: Start of 'ora.qosmserver' on 'server2' succeeded

CRS-2676: Start of 'ora.mgmtdb' on 'server2' succeeded

CRS-2672: Attempting to start 'ora.chad' on 'server2'

CRS-2676: Start of 'ora.chad' on 'server2' succeeded

CRS-2673: Attempting to stop 'ora.ons' on 'server1'

CRS-2677: Stop of 'ora.ons' on 'server1' succeeded

CRS-2673: Attempting to stop 'ora.net1.network' on 'server1'

CRS-2677: Stop of 'ora.net1.network' on 'server1' succeeded

CRS-2792: Shutdown of Cluster Ready Services-managed resources on 'server1' has completed

CRS-2677: Stop of 'ora.crsd' on 'server1' succeeded

CRS-2673: Attempting to stop 'ora.asm' on 'server1'

CRS-2673: Attempting to stop 'ora.crf' on 'server1'

CRS-2673: Attempting to stop 'ora.drivers.acfs' on 'server1'

CRS-2673: Attempting to stop 'ora.mdnsd' on 'server1'

CRS-2677: Stop of 'ora.drivers.acfs' on 'server1' succeeded

CRS-2677: Stop of 'ora.crf' on 'server1' succeeded

CRS-2677: Stop of 'ora.mdnsd' on 'server1' succeeded

CRS-2677: Stop of 'ora.asm' on 'server1' succeeded

CRS-2673: Attempting to stop 'ora.cluster_interconnect.haip' on 'server1'

CRS-2677: Stop of 'ora.cluster_interconnect.haip' on 'server1' succeeded

CRS-2673: Attempting to stop 'ora.ctssd' on 'server1'

CRS-2673: Attempting to stop 'ora.evmd' on 'server1'

CRS-2677: Stop of 'ora.ctssd' on 'server1' succeeded

CRS-2677: Stop of 'ora.evmd' on 'server1' succeeded

CRS-2673: Attempting to stop 'ora.cssd' on 'server1'

CRS-2677: Stop of 'ora.cssd' on 'server1' succeeded

CRS-2673: Attempting to stop 'ora.gipcd' on 'server1'

CRS-2673: Attempting to stop 'ora.gpnpd' on 'server1'

CRS-2677: Stop of 'ora.gipcd' on 'server1' succeeded

CRS-2677: Stop of 'ora.gpnpd' on 'server1' succeeded

CRS-2793: Shutdown of Oracle High Availability Services-managed resources on 'server1' has completed

CRS-4133: Oracle High Availability Services has been stopped.

2018/11/14 15:30:10 CLSRSC-329: Replacing Clusterware entries in file 'oracle-ohasd.service'

CRS-4123: Starting Oracle High Availability Services-managed resources

CRS-2672: Attempting to start 'ora.mdnsd' on 'server1'

CRS-2672: Attempting to start 'ora.evmd' on 'server1'

CRS-2676: Start of 'ora.mdnsd' on 'server1' succeeded

CRS-2676: Start of 'ora.evmd' on 'server1' succeeded

CRS-2672: Attempting to start 'ora.gpnpd' on 'server1'

CRS-2676: Start of 'ora.gpnpd' on 'server1' succeeded

CRS-2672: Attempting to start 'ora.gipcd' on 'server1'

CRS-2676: Start of 'ora.gipcd' on 'server1' succeeded

CRS-2672: Attempting to start 'ora.cssdmonitor' on 'server1'

CRS-2672: Attempting to start 'ora.crf' on 'server1'

CRS-2676: Start of 'ora.cssdmonitor' on 'server1' succeeded

CRS-2672: Attempting to start 'ora.cssd' on 'server1'

CRS-2672: Attempting to start 'ora.diskmon' on 'server1'

CRS-2676: Start of 'ora.diskmon' on 'server1' succeeded

CRS-2676: Start of 'ora.crf' on 'server1' succeeded

CRS-2676: Start of 'ora.cssd' on 'server1' succeeded

CRS-2672: Attempting to start 'ora.cluster_interconnect.haip' on 'server1'

CRS-2672: Attempting to start 'ora.ctssd' on 'server1'

CRS-2676: Start of 'ora.ctssd' on 'server1' succeeded

CRS-2676: Start of 'ora.cluster_interconnect.haip' on 'server1' succeeded

CRS-2672: Attempting to start 'ora.asm' on 'server1'

CRS-2676: Start of 'ora.asm' on 'server1' succeeded

CRS-2672: Attempting to start 'ora.storage' on 'server1'

CRS-2676: Start of 'ora.storage' on 'server1' succeeded

CRS-2672: Attempting to start 'ora.crsd' on 'server1'

CRS-2676: Start of 'ora.crsd' on 'server1' succeeded

CRS-6017: Processing resource auto-start for servers: server1

CRS-2673: Attempting to stop 'ora.server1.vip' on 'server2'

CRS-2673: Attempting to stop 'ora.LISTENER_SCAN1.lsnr' on 'server2'

CRS-2672: Attempting to start 'ora.ons' on 'server1'

CRS-2672: Attempting to start 'ora.chad' on 'server1'

CRS-2677: Stop of 'ora.server1.vip' on 'server2' succeeded

CRS-2672: Attempting to start 'ora.server1.vip' on 'server1'

CRS-2677: Stop of 'ora.LISTENER_SCAN1.lsnr' on 'server2' succeeded

CRS-2673: Attempting to stop 'ora.scan1.vip' on 'server2'

CRS-2677: Stop of 'ora.scan1.vip' on 'server2' succeeded

CRS-2672: Attempting to start 'ora.scan1.vip' on 'server1'

CRS-2676: Start of 'ora.chad' on 'server1' succeeded

CRS-2676: Start of 'ora.server1.vip' on 'server1' succeeded

CRS-2672: Attempting to start 'ora.LISTENER.lsnr' on 'server1'

CRS-2676: Start of 'ora.scan1.vip' on 'server1' succeeded

CRS-2672: Attempting to start 'ora.LISTENER_SCAN1.lsnr' on 'server1'

CRS-2676: Start of 'ora.LISTENER.lsnr' on 'server1' succeeded

CRS-2679: Attempting to clean 'ora.asm' on 'server1'

CRS-2676: Start of 'ora.LISTENER_SCAN1.lsnr' on 'server1' succeeded

CRS-2681: Clean of 'ora.asm' on 'server1' succeeded

CRS-2672: Attempting to start 'ora.asm' on 'server1'

CRS-2676: Start of 'ora.ons' on 'server1' succeeded

ORA-15150: instance lock mode 'EXCLUSIVE' conflicts with other ASM instance(s)

CRS-2674: Start of 'ora.asm' on 'server1' failed

CRS-2672: Attempting to start 'ora.asm' on 'server1'

ORA-15150: instance lock mode 'EXCLUSIVE' conflicts with other ASM instance(s)

CRS-2674: Start of 'ora.asm' on 'server1' failed

CRS-2679: Attempting to clean 'ora.proxy_advm' on 'server1'

CRS-2681: Clean of 'ora.proxy_advm' on 'server1' succeeded

CRS-2672: Attempting to start 'ora.proxy_advm' on 'server1'

CRS-2676: Start of 'ora.proxy_advm' on 'server1' succeeded

CRS-2672: Attempting to start 'ora.asm' on 'server1'

ORA-15150: instance lock mode 'EXCLUSIVE' conflicts with other ASM instance(s)

CRS-2674: Start of 'ora.asm' on 'server1' failed

CRS-2672: Attempting to start 'ora.MGMT.GHCHKPT.advm' on 'server1'

CRS-2676: Start of 'ora.MGMT.GHCHKPT.advm' on 'server1' succeeded

CRS-2672: Attempting to start 'ora.mgmt.ghchkpt.acfs' on 'server1'

CRS-2676: Start of 'ora.mgmt.ghchkpt.acfs' on 'server1' succeeded

===== Summary of resource auto-start failures follows =====

CRS-2807: Resource 'ora.asm' failed to start automatically.

CRS-6016: Resource auto-start has completed for server server1

CRS-6024: Completed start of Oracle Cluster Ready Services-managed resources

CRS-4123: Oracle High Availability Services has been started.

Oracle Clusterware active version on the cluster is [18.0.0.0.0]. The cluster upgrade state is [ROLLING PATCH]. The cluster active patch level is [70732493].

2018/11/14 15:35:23 CLSRSC-4015: Performing install or upgrade action for Oracle Trace File Analyzer (TFA) Collector.

2018/11/14 15:37:11 CLSRSC-4003: Successfully patched Oracle Trace File Analyzer (TFA) Collector.

2018/11/14 15:37:13 CLSRSC-672: Post-patch steps for patching GI home successfully completed.

server2.cern.ch: Successfully executed root script on nodes [server1].

server2.cern.ch: Updating inventory on nodes: server1.

========================================

server2.cern.ch:

Starting Oracle Universal Installer...

The inventory pointer is located at /etc/oraInst.loc

'UpdateNodeList' was successful.

server2.cern.ch: Updated inventory on nodes: server1.

server2.cern.ch: Updating inventory on nodes: server1.

========================================

server2.cern.ch:

Starting Oracle Universal Installer...

The inventory pointer is located at /etc/oraInst.loc

'UpdateNodeList' was successful.

server2.cern.ch: Updated inventory on nodes: server1.

server2.cern.ch: Continue by running 'rhpctl move gihome -destwc <workingcopy_name> -continue [-root | -sudouser <sudo_username> -sudopath <path_to_sudo_binary>]'.

server2.cern.ch: completed the move of Oracle Grid Infrastructure home on server cluster "CRSTEST-RAC16"

From this output you can see at line 15 that the cluster status is NORMAL, then the cluster is stopped on node 1 (lines 16 to 100), then the active version is modified in the oracle-ohasd.service file (line 101), then started back with the new version (lines 102 to 171). The cluster status now is ROLLING PATCH (line 172). The TFA and the node list are updated.

Before continuing with the other(s) node(s), make sure that all the resources are up & running:

# [ oracle@server1:/u01/app/oracle/home [15:37:26] [18.3.0.0.0 [GRID] SID=GRID] 0 ] #
$ crss
HA Resource                                   Targets                          States
-----------                                   -----------------------------    ----------------------------------------
ora.ASMNET1LSNR_ASM.lsnr                      ONLINE,ONLINE                    ONLINE on server1,ONLINE on server2
ora.LISTENER.lsnr                             ONLINE,ONLINE                    ONLINE on server1,ONLINE on server2
ora.LISTENER_SCAN1.lsnr                       ONLINE                           ONLINE on server1
ora.LISTENER_SCAN2.lsnr                       ONLINE                           ONLINE on server2
ora.MGMT.GHCHKPT.advm                         ONLINE,ONLINE                    ONLINE on server1,ONLINE on server2
ora.MGMT.dg                                   ONLINE,ONLINE                    OFFLINE,ONLINE on server2
ora.MGMTLSNR                                  ONLINE                           ONLINE on server2
ora.OCRVOT.dg                                 OFFLINE,ONLINE                   OFFLINE,ONLINE on server2
ora.asm                                       ONLINE,ONLINE,OFFLINE            OFFLINE,ONLINE on server2,OFFLINE
ora.chad                                      ONLINE,ONLINE                    ONLINE on server1,ONLINE on server2
ora.cvu                                       ONLINE                           ONLINE on server2
ora.helper                                    ONLINE,ONLINE                    ONLINE on server1,ONLINE on server2
ora.helper336.hlp                             ONLINE,ONLINE                    ONLINE on server1,ONLINE on server2
ora.server1.vip                             ONLINE                           ONLINE on server1
ora.server2.vip                             ONLINE                           ONLINE on server2
ora.mgmt.ghchkpt.acfs                         ONLINE,ONLINE                    ONLINE on server1,ONLINE on server2
ora.mgmtdb                                    ONLINE                           ONLINE on server2
ora.net1.network                              ONLINE,ONLINE                    ONLINE on server1,ONLINE on server2
ora.ons                                       ONLINE,ONLINE                    ONLINE on server1,ONLINE on server2
ora.proxy_advm                                ONLINE,ONLINE                    ONLINE on server1,ONLINE on server2
ora.qosmserver                                ONLINE                           ONLINE on server2
ora.rhpserver                                 ONLINE                           ONLINE on server2
ora.scan1.vip                                 ONLINE                           ONLINE on server1
ora.LISTENER_LEAF.lsnr
ora.scan2.vip                                 ONLINE                           ONLINE on server2



# [ oracle@server1:/u01/app/oracle/home [15:52:10] [18.4.0.0.0 [GRID] SID=GRID] 1 ] #
$ crsctl query crs releasepatch
Oracle Clusterware release patch level is [59717688] and the complete list of patches [27908644 27923415 28090523 28090553 28090557 28256701 28547619 28655784 28655916 28655963 28656071 ] have been applied on the local node. The release patch string is [18.4.0.0.0].

# [ oracle@server1:/u01/app/oracle/home [15:37:26] [18.3.0.0.0 [GRID] SID=GRID] 0 ] #

$ crss

HA Resource Targets States

----------- ----------------------------- ----------------------------------------

ora.ASMNET1LSNR_ASM.lsnr ONLINE,ONLINE ONLINE on server1,ONLINE on server2

ora.LISTENER.lsnr ONLINE,ONLINE ONLINE on server1,ONLINE on server2

ora.LISTENER_SCAN1.lsnr ONLINE ONLINE on server1

ora.LISTENER_SCAN2.lsnr ONLINE ONLINE on server2

ora.MGMT.GHCHKPT.advm ONLINE,ONLINE ONLINE on server1,ONLINE on server2

ora.MGMT.dg ONLINE,ONLINE OFFLINE,ONLINE on server2

ora.MGMTLSNR ONLINE ONLINE on server2

ora.OCRVOT.dg OFFLINE,ONLINE OFFLINE,ONLINE on server2

ora.asm ONLINE,ONLINE,OFFLINE OFFLINE,ONLINE on server2,OFFLINE

ora.chad ONLINE,ONLINE ONLINE on server1,ONLINE on server2

ora.cvu ONLINE ONLINE on server2

ora.helper ONLINE,ONLINE ONLINE on server1,ONLINE on server2

ora.helper336.hlp ONLINE,ONLINE ONLINE on server1,ONLINE on server2

ora.server1.vip ONLINE ONLINE on server1

ora.server2.vip ONLINE ONLINE on server2

ora.mgmt.ghchkpt.acfs ONLINE,ONLINE ONLINE on server1,ONLINE on server2

ora.mgmtdb ONLINE ONLINE on server2

ora.net1.network ONLINE,ONLINE ONLINE on server1,ONLINE on server2

ora.ons ONLINE,ONLINE ONLINE on server1,ONLINE on server2

ora.proxy_advm ONLINE,ONLINE ONLINE on server1,ONLINE on server2

ora.qosmserver ONLINE ONLINE on server2

ora.rhpserver ONLINE ONLINE on server2

ora.scan1.vip ONLINE ONLINE on server1

ora.LISTENER_LEAF.lsnr

ora.scan2.vip ONLINE ONLINE on server2

# [ oracle@server1:/u01/app/oracle/home [15:52:10] [18.4.0.0.0 [GRID] SID=GRID] 1 ] #

$ crsctl query crs releasepatch

Oracle Clusterware release patch level is [59717688] and the complete list of patches [27908644 27923415 28090523 28090553 28090557 28256701 28547619 28655784 28655916 28655963 28656071 ] have been applied on the local node. The release patch string is [18.4.0.0.0].

You might want as well to relocate manually your resources back to node 1 prior to continuing on node 2.

After that, node 2 can follow the very same procedure:

# [ oracle@server1:/u01/app/oracle/home [15:54:30] [18.4.0.0.0 [GRID] SID=GRID] 130 ] #
$ rhpctl move gihome -sourcehome /u01/app/grid/crs1830 -desthome /u01/app/grid/crs1840 -node server2
server2.cern.ch: Audit ID: 51
server2.cern.ch: Executing prepatch and postpatch on nodes: "server2".
server2.cern.ch: Executing root script on nodes [server2].
server2.cern.ch: Successfully executed root script on nodes [server2].
server2.cern.ch: Executing root script on nodes [server2].
Using configuration parameter file: /u01/app/grid/crs1840/crs/install/crsconfig_params
The log of current session can be found at:
  /u01/app/oracle/crsdata/server2/crsconfig/crs_postpatch_server2_2018-11-14_03-58-21PM.log
Oracle Clusterware active version on the cluster is [18.0.0.0.0]. The cluster upgrade state is [ROLLING PATCH]. The cluster active patch level is [70732493].
CRS-2791: Starting shutdown of Oracle High Availability Services-managed resources on 'server2'
CRS-2673: Attempting to stop 'ora.crsd' on 'server2'
CRS-2790: Starting shutdown of Cluster Ready Services-managed resources on server 'server2'
CRS-2673: Attempting to stop 'ora.LISTENER_SCAN2.lsnr' on 'server2'
CRS-2673: Attempting to stop 'ora.cvu' on 'server2'
CRS-2673: Attempting to stop 'ora.rhpserver' on 'server2'
CRS-2673: Attempting to stop 'ora.OCRVOT.dg' on 'server2'
CRS-2673: Attempting to stop 'ora.MGMT.dg' on 'server2'
CRS-2673: Attempting to stop 'ora.qosmserver' on 'server2'
CRS-2673: Attempting to stop 'ora.LISTENER.lsnr' on 'server2'
CRS-2673: Attempting to stop 'ora.chad' on 'server1'
CRS-2673: Attempting to stop 'ora.chad' on 'server2'
CRS-2673: Attempting to stop 'ora.helper336.hlp' on 'server2'
CRS-2673: Attempting to stop 'ora.helper' on 'server2'
CRS-2677: Stop of 'ora.LISTENER_SCAN2.lsnr' on 'server2' succeeded
CRS-2673: Attempting to stop 'ora.scan2.vip' on 'server2'
CRS-2677: Stop of 'ora.LISTENER.lsnr' on 'server2' succeeded
CRS-2677: Stop of 'ora.chad' on 'server1' succeeded
CRS-2677: Stop of 'ora.chad' on 'server2' succeeded
CRS-2673: Attempting to stop 'ora.mgmtdb' on 'server2'
CRS-2677: Stop of 'ora.OCRVOT.dg' on 'server2' succeeded
CRS-2677: Stop of 'ora.MGMT.dg' on 'server2' succeeded
CRS-2673: Attempting to stop 'ora.asm' on 'server2'
CRS-2677: Stop of 'ora.helper336.hlp' on 'server2' succeeded
CRS-2677: Stop of 'ora.helper' on 'server2' succeeded
CRS-2677: Stop of 'ora.scan2.vip' on 'server2' succeeded
CRS-2677: Stop of 'ora.asm' on 'server2' succeeded
CRS-2673: Attempting to stop 'ora.ASMNET1LSNR_ASM.lsnr' on 'server2'
CRS-2677: Stop of 'ora.cvu' on 'server2' succeeded
CRS-2677: Stop of 'ora.qosmserver' on 'server2' succeeded
CRS-2677: Stop of 'ora.ASMNET1LSNR_ASM.lsnr' on 'server2' succeeded
CRS-2677: Stop of 'ora.mgmtdb' on 'server2' succeeded
CRS-2673: Attempting to stop 'ora.MGMTLSNR' on 'server2'
CRS-2677: Stop of 'ora.MGMTLSNR' on 'server2' succeeded
CRS-2673: Attempting to stop 'ora.server2.vip' on 'server2'
CRS-2672: Attempting to start 'ora.MGMTLSNR' on 'server1'
CRS-2677: Stop of 'ora.server2.vip' on 'server2' succeeded
CRS-2676: Start of 'ora.MGMTLSNR' on 'server1' succeeded
CRS-2672: Attempting to start 'ora.mgmtdb' on 'server1'
CRS-2676: Start of 'ora.mgmtdb' on 'server1' succeeded
CRS-2672: Attempting to start 'ora.chad' on 'server1'
CRS-2676: Start of 'ora.chad' on 'server1' succeeded
Stop JWC
CRS-5014: Agent "ORAROOTAGENT" timed out starting process "/u01/app/grid/crs1830/bin/ghappctl" for action "stop": details at "(:CLSN00009:)" in "/u01/app/oracle/diag/crs/server2/crs/trace/crsd_orarootagent_root.trc"
CRS-2675: Stop of 'ora.rhpserver' on 'server2' failed
CRS-2679: Attempting to clean 'ora.rhpserver' on 'server2'
CRS-2681: Clean of 'ora.rhpserver' on 'server2' succeeded
CRS-2673: Attempting to stop 'ora.mgmt.ghchkpt.acfs' on 'server2'
CRS-2677: Stop of 'ora.mgmt.ghchkpt.acfs' on 'server2' succeeded
CRS-2673: Attempting to stop 'ora.MGMT.GHCHKPT.advm' on 'server2'
CRS-2677: Stop of 'ora.MGMT.GHCHKPT.advm' on 'server2' succeeded
CRS-2673: Attempting to stop 'ora.proxy_advm' on 'server2'
CRS-2677: Stop of 'ora.proxy_advm' on 'server2' succeeded
CRS-2672: Attempting to start 'ora.qosmserver' on 'server1'
CRS-2672: Attempting to start 'ora.scan2.vip' on 'server1'
CRS-2672: Attempting to start 'ora.cvu' on 'server1'
CRS-2672: Attempting to start 'ora.server2.vip' on 'server1'
CRS-2676: Start of 'ora.cvu' on 'server1' succeeded
CRS-2676: Start of 'ora.server2.vip' on 'server1' succeeded
CRS-2676: Start of 'ora.scan2.vip' on 'server1' succeeded
CRS-2672: Attempting to start 'ora.LISTENER_SCAN2.lsnr' on 'server1'
CRS-2676: Start of 'ora.LISTENER_SCAN2.lsnr' on 'server1' succeeded
CRS-2676: Start of 'ora.qosmserver' on 'server1' succeeded
CRS-2673: Attempting to stop 'ora.ons' on 'server2'
CRS-2677: Stop of 'ora.ons' on 'server2' succeeded
CRS-2673: Attempting to stop 'ora.net1.network' on 'server2'
CRS-2677: Stop of 'ora.net1.network' on 'server2' succeeded
CRS-2792: Shutdown of Cluster Ready Services-managed resources on 'server2' has completed
CRS-2677: Stop of 'ora.crsd' on 'server2' succeeded
CRS-2673: Attempting to stop 'ora.asm' on 'server2'
CRS-2673: Attempting to stop 'ora.crf' on 'server2'
CRS-2673: Attempting to stop 'ora.drivers.acfs' on 'server2'
CRS-2673: Attempting to stop 'ora.mdnsd' on 'server2'
CRS-2677: Stop of 'ora.drivers.acfs' on 'server2' succeeded
CRS-2677: Stop of 'ora.crf' on 'server2' succeeded
CRS-2677: Stop of 'ora.mdnsd' on 'server2' succeeded
CRS-2677: Stop of 'ora.asm' on 'server2' succeeded
CRS-2673: Attempting to stop 'ora.cluster_interconnect.haip' on 'server2'
CRS-2677: Stop of 'ora.cluster_interconnect.haip' on 'server2' succeeded
CRS-2673: Attempting to stop 'ora.ctssd' on 'server2'
CRS-2673: Attempting to stop 'ora.evmd' on 'server2'
CRS-2677: Stop of 'ora.ctssd' on 'server2' succeeded
CRS-2677: Stop of 'ora.evmd' on 'server2' succeeded
CRS-2673: Attempting to stop 'ora.cssd' on 'server2'
CRS-2677: Stop of 'ora.cssd' on 'server2' succeeded
CRS-2673: Attempting to stop 'ora.gipcd' on 'server2'
CRS-2673: Attempting to stop 'ora.gpnpd' on 'server2'
CRS-2677: Stop of 'ora.gpnpd' on 'server2' succeeded
CRS-2677: Stop of 'ora.gipcd' on 'server2' succeeded
CRS-2793: Shutdown of Oracle High Availability Services-managed resources on 'server2' has completed
CRS-4133: Oracle High Availability Services has been stopped.
2018/11/14 16:01:42 CLSRSC-329: Replacing Clusterware entries in file 'oracle-ohasd.service'
CRS-4123: Starting Oracle High Availability Services-managed resources
CRS-2672: Attempting to start 'ora.mdnsd' on 'server2'
CRS-2672: Attempting to start 'ora.evmd' on 'server2'
CRS-2676: Start of 'ora.mdnsd' on 'server2' succeeded
CRS-2676: Start of 'ora.evmd' on 'server2' succeeded
CRS-2672: Attempting to start 'ora.gpnpd' on 'server2'
CRS-2676: Start of 'ora.gpnpd' on 'server2' succeeded
CRS-2672: Attempting to start 'ora.gipcd' on 'server2'
CRS-2676: Start of 'ora.gipcd' on 'server2' succeeded
CRS-2672: Attempting to start 'ora.crf' on 'server2'
CRS-2672: Attempting to start 'ora.cssdmonitor' on 'server2'
CRS-2676: Start of 'ora.cssdmonitor' on 'server2' succeeded
CRS-2672: Attempting to start 'ora.cssd' on 'server2'
CRS-2672: Attempting to start 'ora.diskmon' on 'server2'
CRS-2676: Start of 'ora.diskmon' on 'server2' succeeded
CRS-2676: Start of 'ora.crf' on 'server2' succeeded
CRS-2676: Start of 'ora.cssd' on 'server2' succeeded
CRS-2672: Attempting to start 'ora.cluster_interconnect.haip' on 'server2'
CRS-2672: Attempting to start 'ora.ctssd' on 'server2'
CRS-2676: Start of 'ora.ctssd' on 'server2' succeeded
CRS-2676: Start of 'ora.cluster_interconnect.haip' on 'server2' succeeded
CRS-2672: Attempting to start 'ora.asm' on 'server2'
CRS-2676: Start of 'ora.asm' on 'server2' succeeded
CRS-2672: Attempting to start 'ora.storage' on 'server2'
CRS-2676: Start of 'ora.storage' on 'server2' succeeded
CRS-2672: Attempting to start 'ora.crsd' on 'server2'
CRS-2676: Start of 'ora.crsd' on 'server2' succeeded
CRS-6017: Processing resource auto-start for servers: server2
CRS-2673: Attempting to stop 'ora.server2.vip' on 'server1'
CRS-2673: Attempting to stop 'ora.LISTENER_SCAN1.lsnr' on 'server1'
CRS-2672: Attempting to start 'ora.ons' on 'server2'
CRS-2672: Attempting to start 'ora.chad' on 'server2'
CRS-2677: Stop of 'ora.server2.vip' on 'server1' succeeded
CRS-2672: Attempting to start 'ora.server2.vip' on 'server2'
CRS-2677: Stop of 'ora.LISTENER_SCAN1.lsnr' on 'server1' succeeded
CRS-2673: Attempting to stop 'ora.scan1.vip' on 'server1'
CRS-2677: Stop of 'ora.scan1.vip' on 'server1' succeeded
CRS-2672: Attempting to start 'ora.scan1.vip' on 'server2'
CRS-2676: Start of 'ora.server2.vip' on 'server2' succeeded
CRS-2672: Attempting to start 'ora.LISTENER.lsnr' on 'server2'
CRS-2676: Start of 'ora.chad' on 'server2' succeeded
CRS-2676: Start of 'ora.scan1.vip' on 'server2' succeeded
CRS-2672: Attempting to start 'ora.LISTENER_SCAN1.lsnr' on 'server2'
CRS-2676: Start of 'ora.LISTENER.lsnr' on 'server2' succeeded
CRS-2679: Attempting to clean 'ora.asm' on 'server2'
CRS-2676: Start of 'ora.LISTENER_SCAN1.lsnr' on 'server2' succeeded
CRS-2681: Clean of 'ora.asm' on 'server2' succeeded
CRS-2672: Attempting to start 'ora.asm' on 'server2'
CRS-2676: Start of 'ora.ons' on 'server2' succeeded
ORA-15150: instance lock mode 'EXCLUSIVE' conflicts with other ASM instance(s)
CRS-2674: Start of 'ora.asm' on 'server2' failed
CRS-2672: Attempting to start 'ora.asm' on 'server2'
ORA-15150: instance lock mode 'EXCLUSIVE' conflicts with other ASM instance(s)
CRS-2674: Start of 'ora.asm' on 'server2' failed
CRS-2679: Attempting to clean 'ora.proxy_advm' on 'server2'
CRS-2681: Clean of 'ora.proxy_advm' on 'server2' succeeded
CRS-2672: Attempting to start 'ora.proxy_advm' on 'server2'
CRS-2676: Start of 'ora.proxy_advm' on 'server2' succeeded
CRS-2672: Attempting to start 'ora.asm' on 'server2'
ORA-15150: instance lock mode 'EXCLUSIVE' conflicts with other ASM instance(s)
CRS-2674: Start of 'ora.asm' on 'server2' failed
CRS-2672: Attempting to start 'ora.MGMT.GHCHKPT.advm' on 'server2'
CRS-2676: Start of 'ora.MGMT.GHCHKPT.advm' on 'server2' succeeded
CRS-2672: Attempting to start 'ora.mgmt.ghchkpt.acfs' on 'server2'
CRS-2676: Start of 'ora.mgmt.ghchkpt.acfs' on 'server2' succeeded
===== Summary of resource auto-start failures follows =====
CRS-2807: Resource 'ora.asm' failed to start automatically.
CRS-6016: Resource auto-start has completed for server server2
CRS-6024: Completed start of Oracle Cluster Ready Services-managed resources
CRS-4123: Oracle High Availability Services has been started.
Oracle Clusterware active version on the cluster is [18.0.0.0.0]. The cluster upgrade state is [NORMAL]. The cluster active patch level is [59717688].

SQL Patching tool version 18.0.0.0.0 Production on Wed Nov 14 16:09:01 2018
Copyright (c) 2012, 2018, Oracle.  All rights reserved.

Log file for this invocation: /u01/app/oracle/cfgtoollogs/sqlpatch/sqlpatch_181222_2018_11_14_16_09_01/sqlpatch_invocation.log

Connecting to database...OK
Gathering database info...done

Note:  Datapatch will only apply or rollback SQL fixes for PDBs
       that are in an open state, no patches will be applied to closed PDBs.
       Please refer to Note: Datapatch: Database 12c Post Patch SQL Automation
       (Doc ID 1585822.1)

Bootstrapping registry and package to current versions...done
Determining current state...done

Current state of interim SQL patches:
Interim patch 27923415 (OJVM RELEASE UPDATE: 18.3.0.0.180717 (27923415)):
  Binary registry: Installed
  PDB CDB$ROOT: Applied successfully on 13-NOV-18 04.35.06.794463 PM
  PDB GIMR_DSCREP_10: Applied successfully on 13-NOV-18 04.43.16.948526 PM
  PDB PDB$SEED: Applied successfully on 13-NOV-18 04.43.16.948526 PM

Current state of release update SQL patches:
  Binary registry:
    18.4.0.0.0 Release_Update 1809251743: Installed
  PDB CDB$ROOT:
    Applied 18.3.0.0.0 Release_Update 1806280943 successfully on 13-NOV-18 04.35.06.791214 PM
  PDB GIMR_DSCREP_10:
    Applied 18.3.0.0.0 Release_Update 1806280943 successfully on 13-NOV-18 04.43.16.940471 PM
  PDB PDB$SEED:
    Applied 18.3.0.0.0 Release_Update 1806280943 successfully on 13-NOV-18 04.43.16.940471 PM

Adding patches to installation queue and performing prereq checks...done
Installation queue:
  For the following PDBs: CDB$ROOT PDB$SEED GIMR_DSCREP_10
    No interim patches need to be rolled back
    Patch 28655784 (Database Release Update : 18.4.0.0.181016 (28655784)):
      Apply from 18.3.0.0.0 Release_Update 1806280943 to 18.4.0.0.0 Release_Update 1809251743
    No interim patches need to be applied

Installing patches...
Patch installation complete.  Total patches installed: 3

Validating logfiles...done
Patch 28655784 apply (pdb CDB$ROOT): SUCCESS
  logfile: /u01/app/oracle/cfgtoollogs/sqlpatch/28655784/22509982/28655784_apply__MGMTDB_CDBROOT_2018Nov14_16_11_00.log (no errors)
Patch 28655784 apply (pdb PDB$SEED): SUCCESS
  logfile: /u01/app/oracle/cfgtoollogs/sqlpatch/28655784/22509982/28655784_apply__MGMTDB_PDBSEED_2018Nov14_16_11_51.log (no errors)
Patch 28655784 apply (pdb GIMR_DSCREP_10): SUCCESS
  logfile: /u01/app/oracle/cfgtoollogs/sqlpatch/28655784/22509982/28655784_apply__MGMTDB_GIMR_DSCREP_10_2018Nov14_16_11_50.log (no errors)
SQL Patching tool complete on Wed Nov 14 16:12:50 2018
2018/11/14 16:13:40 CLSRSC-4015: Performing install or upgrade action for Oracle Trace File Analyzer (TFA) Collector.
2018/11/14 16:15:28 CLSRSC-4003: Successfully patched Oracle Trace File Analyzer (TFA) Collector.
2018/11/14 16:17:48 CLSRSC-672: Post-patch steps for patching GI home successfully completed.
server2.cern.ch: Updating inventory on nodes: server2.
========================================
server2.cern.ch:
Starting Oracle Universal Installer...

Checking swap space: must be greater than 500 MB.   Actual 16367 MB    Passed
The inventory pointer is located at /etc/oraInst.loc
'UpdateNodeList' was successful.
server2.cern.ch: Updated inventory on nodes: server2.
server2.cern.ch: Updating inventory on nodes: server2.
========================================
server2.cern.ch:
Starting Oracle Universal Installer...

Checking swap space: must be greater than 500 MB.   Actual 16367 MB    Passed
The inventory pointer is located at /etc/oraInst.loc
'UpdateNodeList' was successful.
server2.cern.ch: Updated inventory on nodes: server2.
server2.cern.ch: Completed the 'move gihome' operation on server cluster.

100

101

102

103

104

105

106

107

108

109

110

111

112

113

114

115

116

117

118

119

120

121

122

123

124

125

126

127

128

129

130

131

132

133

134

135

136

137

138

139

140

141

142

143

144

145

146

147

148

149

150

151

152

153

154

155

156

157

158

159

160

161

162

163

164

165

166

167

168

169

170

171

172

173

174

175

176

177

178

179

180

181

182

183

184

185

186

187

188

189

190

191

192

193

194

195

196

197

198

199

200

201

202

203

204

205

206

207

208

209

210

211

212

213

214

215

216

217

218

219

220

221

222

223

224

225

226

227

228

229

230

231

232

233

234

235

236

237

238

239

240

241

242

243

244

245

246

247

248

249

# [ oracle@server1:/u01/app/oracle/home [15:54:30] [18.4.0.0.0 [GRID] SID=GRID] 130 ] #

$ rhpctl move gihome -sourcehome /u01/app/grid/crs1830 -desthome /u01/app/grid/crs1840 -node server2

server2.cern.ch: Audit ID: 51

server2.cern.ch: Executing prepatch and postpatch on nodes: "server2".

server2.cern.ch: Executing root script on nodes [server2].

server2.cern.ch: Successfully executed root script on nodes [server2].

server2.cern.ch: Executing root script on nodes [server2].

Using configuration parameter file: /u01/app/grid/crs1840/crs/install/crsconfig_params

The log of current session can be found at:

/u01/app/oracle/crsdata/server2/crsconfig/crs_postpatch_server2_2018-11-14_03-58-21PM.log

Oracle Clusterware active version on the cluster is [18.0.0.0.0]. The cluster upgrade state is [ROLLING PATCH]. The cluster active patch level is [70732493].

CRS-2791: Starting shutdown of Oracle High Availability Services-managed resources on 'server2'

CRS-2673: Attempting to stop 'ora.crsd' on 'server2'

CRS-2790: Starting shutdown of Cluster Ready Services-managed resources on server 'server2'

CRS-2673: Attempting to stop 'ora.LISTENER_SCAN2.lsnr' on 'server2'

CRS-2673: Attempting to stop 'ora.cvu' on 'server2'

CRS-2673: Attempting to stop 'ora.rhpserver' on 'server2'

CRS-2673: Attempting to stop 'ora.OCRVOT.dg' on 'server2'

CRS-2673: Attempting to stop 'ora.MGMT.dg' on 'server2'

CRS-2673: Attempting to stop 'ora.qosmserver' on 'server2'

CRS-2673: Attempting to stop 'ora.LISTENER.lsnr' on 'server2'

CRS-2673: Attempting to stop 'ora.chad' on 'server1'

CRS-2673: Attempting to stop 'ora.chad' on 'server2'

CRS-2673: Attempting to stop 'ora.helper336.hlp' on 'server2'

CRS-2673: Attempting to stop 'ora.helper' on 'server2'

CRS-2677: Stop of 'ora.LISTENER_SCAN2.lsnr' on 'server2' succeeded

CRS-2673: Attempting to stop 'ora.scan2.vip' on 'server2'

CRS-2677: Stop of 'ora.LISTENER.lsnr' on 'server2' succeeded

CRS-2677: Stop of 'ora.chad' on 'server1' succeeded

CRS-2677: Stop of 'ora.chad' on 'server2' succeeded

CRS-2673: Attempting to stop 'ora.mgmtdb' on 'server2'

CRS-2677: Stop of 'ora.OCRVOT.dg' on 'server2' succeeded

CRS-2677: Stop of 'ora.MGMT.dg' on 'server2' succeeded

CRS-2673: Attempting to stop 'ora.asm' on 'server2'

CRS-2677: Stop of 'ora.helper336.hlp' on 'server2' succeeded

CRS-2677: Stop of 'ora.helper' on 'server2' succeeded

CRS-2677: Stop of 'ora.scan2.vip' on 'server2' succeeded

CRS-2677: Stop of 'ora.asm' on 'server2' succeeded

CRS-2673: Attempting to stop 'ora.ASMNET1LSNR_ASM.lsnr' on 'server2'

CRS-2677: Stop of 'ora.cvu' on 'server2' succeeded

CRS-2677: Stop of 'ora.qosmserver' on 'server2' succeeded

CRS-2677: Stop of 'ora.ASMNET1LSNR_ASM.lsnr' on 'server2' succeeded

CRS-2677: Stop of 'ora.mgmtdb' on 'server2' succeeded

CRS-2673: Attempting to stop 'ora.MGMTLSNR' on 'server2'

CRS-2677: Stop of 'ora.MGMTLSNR' on 'server2' succeeded

CRS-2673: Attempting to stop 'ora.server2.vip' on 'server2'

CRS-2672: Attempting to start 'ora.MGMTLSNR' on 'server1'

CRS-2677: Stop of 'ora.server2.vip' on 'server2' succeeded

CRS-2676: Start of 'ora.MGMTLSNR' on 'server1' succeeded

CRS-2672: Attempting to start 'ora.mgmtdb' on 'server1'

CRS-2676: Start of 'ora.mgmtdb' on 'server1' succeeded

CRS-2672: Attempting to start 'ora.chad' on 'server1'

CRS-2676: Start of 'ora.chad' on 'server1' succeeded

Stop JWC

CRS-5014: Agent "ORAROOTAGENT" timed out starting process "/u01/app/grid/crs1830/bin/ghappctl" for action "stop": details at "(:CLSN00009:)" in "/u01/app/oracle/diag/crs/server2/crs/trace/crsd_orarootagent_root.trc"

CRS-2675: Stop of 'ora.rhpserver' on 'server2' failed

CRS-2679: Attempting to clean 'ora.rhpserver' on 'server2'

CRS-2681: Clean of 'ora.rhpserver' on 'server2' succeeded

CRS-2673: Attempting to stop 'ora.mgmt.ghchkpt.acfs' on 'server2'

CRS-2677: Stop of 'ora.mgmt.ghchkpt.acfs' on 'server2' succeeded

CRS-2673: Attempting to stop 'ora.MGMT.GHCHKPT.advm' on 'server2'

CRS-2677: Stop of 'ora.MGMT.GHCHKPT.advm' on 'server2' succeeded

CRS-2673: Attempting to stop 'ora.proxy_advm' on 'server2'

CRS-2677: Stop of 'ora.proxy_advm' on 'server2' succeeded

CRS-2672: Attempting to start 'ora.qosmserver' on 'server1'

CRS-2672: Attempting to start 'ora.scan2.vip' on 'server1'

CRS-2672: Attempting to start 'ora.cvu' on 'server1'

CRS-2672: Attempting to start 'ora.server2.vip' on 'server1'

CRS-2676: Start of 'ora.cvu' on 'server1' succeeded

CRS-2676: Start of 'ora.server2.vip' on 'server1' succeeded

CRS-2676: Start of 'ora.scan2.vip' on 'server1' succeeded

CRS-2672: Attempting to start 'ora.LISTENER_SCAN2.lsnr' on 'server1'

CRS-2676: Start of 'ora.LISTENER_SCAN2.lsnr' on 'server1' succeeded

CRS-2676: Start of 'ora.qosmserver' on 'server1' succeeded

CRS-2673: Attempting to stop 'ora.ons' on 'server2'

CRS-2677: Stop of 'ora.ons' on 'server2' succeeded

CRS-2673: Attempting to stop 'ora.net1.network' on 'server2'

CRS-2677: Stop of 'ora.net1.network' on 'server2' succeeded

CRS-2792: Shutdown of Cluster Ready Services-managed resources on 'server2' has completed

CRS-2677: Stop of 'ora.crsd' on 'server2' succeeded

CRS-2673: Attempting to stop 'ora.asm' on 'server2'

CRS-2673: Attempting to stop 'ora.crf' on 'server2'

CRS-2673: Attempting to stop 'ora.drivers.acfs' on 'server2'

CRS-2673: Attempting to stop 'ora.mdnsd' on 'server2'

CRS-2677: Stop of 'ora.drivers.acfs' on 'server2' succeeded

CRS-2677: Stop of 'ora.crf' on 'server2' succeeded

CRS-2677: Stop of 'ora.mdnsd' on 'server2' succeeded

CRS-2677: Stop of 'ora.asm' on 'server2' succeeded

CRS-2673: Attempting to stop 'ora.cluster_interconnect.haip' on 'server2'

CRS-2677: Stop of 'ora.cluster_interconnect.haip' on 'server2' succeeded

CRS-2673: Attempting to stop 'ora.ctssd' on 'server2'

CRS-2673: Attempting to stop 'ora.evmd' on 'server2'

CRS-2677: Stop of 'ora.ctssd' on 'server2' succeeded

CRS-2677: Stop of 'ora.evmd' on 'server2' succeeded

CRS-2673: Attempting to stop 'ora.cssd' on 'server2'

CRS-2677: Stop of 'ora.cssd' on 'server2' succeeded

CRS-2673: Attempting to stop 'ora.gipcd' on 'server2'

CRS-2673: Attempting to stop 'ora.gpnpd' on 'server2'

CRS-2677: Stop of 'ora.gpnpd' on 'server2' succeeded

CRS-2677: Stop of 'ora.gipcd' on 'server2' succeeded

CRS-2793: Shutdown of Oracle High Availability Services-managed resources on 'server2' has completed

CRS-4133: Oracle High Availability Services has been stopped.

2018/11/14 16:01:42 CLSRSC-329: Replacing Clusterware entries in file 'oracle-ohasd.service'

CRS-4123: Starting Oracle High Availability Services-managed resources

CRS-2672: Attempting to start 'ora.mdnsd' on 'server2'

CRS-2672: Attempting to start 'ora.evmd' on 'server2'

CRS-2676: Start of 'ora.mdnsd' on 'server2' succeeded

CRS-2676: Start of 'ora.evmd' on 'server2' succeeded

CRS-2672: Attempting to start 'ora.gpnpd' on 'server2'

CRS-2676: Start of 'ora.gpnpd' on 'server2' succeeded

CRS-2672: Attempting to start 'ora.gipcd' on 'server2'

CRS-2676: Start of 'ora.gipcd' on 'server2' succeeded

CRS-2672: Attempting to start 'ora.crf' on 'server2'

CRS-2672: Attempting to start 'ora.cssdmonitor' on 'server2'

CRS-2676: Start of 'ora.cssdmonitor' on 'server2' succeeded

CRS-2672: Attempting to start 'ora.cssd' on 'server2'

CRS-2672: Attempting to start 'ora.diskmon' on 'server2'

CRS-2676: Start of 'ora.diskmon' on 'server2' succeeded

CRS-2676: Start of 'ora.crf' on 'server2' succeeded

CRS-2676: Start of 'ora.cssd' on 'server2' succeeded

CRS-2672: Attempting to start 'ora.cluster_interconnect.haip' on 'server2'

CRS-2672: Attempting to start 'ora.ctssd' on 'server2'

CRS-2676: Start of 'ora.ctssd' on 'server2' succeeded

CRS-2676: Start of 'ora.cluster_interconnect.haip' on 'server2' succeeded

CRS-2672: Attempting to start 'ora.asm' on 'server2'

CRS-2676: Start of 'ora.asm' on 'server2' succeeded

CRS-2672: Attempting to start 'ora.storage' on 'server2'

CRS-2676: Start of 'ora.storage' on 'server2' succeeded

CRS-2672: Attempting to start 'ora.crsd' on 'server2'

CRS-2676: Start of 'ora.crsd' on 'server2' succeeded

CRS-6017: Processing resource auto-start for servers: server2

CRS-2673: Attempting to stop 'ora.server2.vip' on 'server1'

CRS-2673: Attempting to stop 'ora.LISTENER_SCAN1.lsnr' on 'server1'

CRS-2672: Attempting to start 'ora.ons' on 'server2'

CRS-2672: Attempting to start 'ora.chad' on 'server2'

CRS-2677: Stop of 'ora.server2.vip' on 'server1' succeeded

CRS-2672: Attempting to start 'ora.server2.vip' on 'server2'

CRS-2677: Stop of 'ora.LISTENER_SCAN1.lsnr' on 'server1' succeeded

CRS-2673: Attempting to stop 'ora.scan1.vip' on 'server1'

CRS-2677: Stop of 'ora.scan1.vip' on 'server1' succeeded

CRS-2672: Attempting to start 'ora.scan1.vip' on 'server2'

CRS-2676: Start of 'ora.server2.vip' on 'server2' succeeded

CRS-2672: Attempting to start 'ora.LISTENER.lsnr' on 'server2'

CRS-2676: Start of 'ora.chad' on 'server2' succeeded

CRS-2676: Start of 'ora.scan1.vip' on 'server2' succeeded

CRS-2672: Attempting to start 'ora.LISTENER_SCAN1.lsnr' on 'server2'

CRS-2676: Start of 'ora.LISTENER.lsnr' on 'server2' succeeded

CRS-2679: Attempting to clean 'ora.asm' on 'server2'

CRS-2676: Start of 'ora.LISTENER_SCAN1.lsnr' on 'server2' succeeded

CRS-2681: Clean of 'ora.asm' on 'server2' succeeded

CRS-2672: Attempting to start 'ora.asm' on 'server2'

CRS-2676: Start of 'ora.ons' on 'server2' succeeded

ORA-15150: instance lock mode 'EXCLUSIVE' conflicts with other ASM instance(s)

CRS-2674: Start of 'ora.asm' on 'server2' failed

CRS-2672: Attempting to start 'ora.asm' on 'server2'

ORA-15150: instance lock mode 'EXCLUSIVE' conflicts with other ASM instance(s)

CRS-2674: Start of 'ora.asm' on 'server2' failed

CRS-2679: Attempting to clean 'ora.proxy_advm' on 'server2'

CRS-2681: Clean of 'ora.proxy_advm' on 'server2' succeeded

CRS-2672: Attempting to start 'ora.proxy_advm' on 'server2'

CRS-2676: Start of 'ora.proxy_advm' on 'server2' succeeded

CRS-2672: Attempting to start 'ora.asm' on 'server2'

ORA-15150: instance lock mode 'EXCLUSIVE' conflicts with other ASM instance(s)

CRS-2674: Start of 'ora.asm' on 'server2' failed

CRS-2672: Attempting to start 'ora.MGMT.GHCHKPT.advm' on 'server2'

CRS-2676: Start of 'ora.MGMT.GHCHKPT.advm' on 'server2' succeeded

CRS-2672: Attempting to start 'ora.mgmt.ghchkpt.acfs' on 'server2'

CRS-2676: Start of 'ora.mgmt.ghchkpt.acfs' on 'server2' succeeded

===== Summary of resource auto-start failures follows =====

CRS-2807: Resource 'ora.asm' failed to start automatically.

CRS-6016: Resource auto-start has completed for server server2

CRS-6024: Completed start of Oracle Cluster Ready Services-managed resources

CRS-4123: Oracle High Availability Services has been started.

Oracle Clusterware active version on the cluster is [18.0.0.0.0]. The cluster upgrade state is [NORMAL]. The cluster active patch level is [59717688].

SQL Patching tool version 18.0.0.0.0 Production on Wed Nov 14 16:09:01 2018

Log file for this invocation: /u01/app/oracle/cfgtoollogs/sqlpatch/sqlpatch_181222_2018_11_14_16_09_01/sqlpatch_invocation.log

Connecting to database...OK

Gathering database info...done

Note: Datapatch will only apply or rollback SQL fixes for PDBs

that are in an open state, no patches will be applied to closed PDBs.

Please refer to Note: Datapatch: Database 12c Post Patch SQL Automation

(Doc ID 1585822.1)

Bootstrapping registry and package to current versions...done

Determining current state...done

Current state of interim SQL patches:

Interim patch 27923415 (OJVM RELEASE UPDATE: 18.3.0.0.180717 (27923415)):

Binary registry: Installed

PDB CDB$ROOT: Applied successfully on 13-NOV-18 04.35.06.794463 PM

PDB GIMR_DSCREP_10: Applied successfully on 13-NOV-18 04.43.16.948526 PM

PDB PDB$SEED: Applied successfully on 13-NOV-18 04.43.16.948526 PM

Current state of release update SQL patches:

Binary registry:

18.4.0.0.0 Release_Update 1809251743: Installed

PDB CDB$ROOT:

Applied 18.3.0.0.0 Release_Update 1806280943 successfully on 13-NOV-18 04.35.06.791214 PM

PDB GIMR_DSCREP_10:

Applied 18.3.0.0.0 Release_Update 1806280943 successfully on 13-NOV-18 04.43.16.940471 PM

PDB PDB$SEED:

Applied 18.3.0.0.0 Release_Update 1806280943 successfully on 13-NOV-18 04.43.16.940471 PM

Adding patches to installation queue and performing prereq checks...done

Installation queue:

For the following PDBs: CDB$ROOT PDB$SEED GIMR_DSCREP_10

No interim patches need to be rolled back

Patch 28655784 (Database Release Update : 18.4.0.0.181016 (28655784)):

Apply from 18.3.0.0.0 Release_Update 1806280943 to 18.4.0.0.0 Release_Update 1809251743

No interim patches need to be applied

Installing patches...

Patch installation complete. Total patches installed: 3

Validating logfiles...done

Patch 28655784 apply (pdb CDB$ROOT): SUCCESS

logfile: /u01/app/oracle/cfgtoollogs/sqlpatch/28655784/22509982/28655784_apply__MGMTDB_CDBROOT_2018Nov14_16_11_00.log (no errors)

Patch 28655784 apply (pdb PDB$SEED): SUCCESS

logfile: /u01/app/oracle/cfgtoollogs/sqlpatch/28655784/22509982/28655784_apply__MGMTDB_PDBSEED_2018Nov14_16_11_51.log (no errors)

Patch 28655784 apply (pdb GIMR_DSCREP_10): SUCCESS

logfile: /u01/app/oracle/cfgtoollogs/sqlpatch/28655784/22509982/28655784_apply__MGMTDB_GIMR_DSCREP_10_2018Nov14_16_11_50.log (no errors)

SQL Patching tool complete on Wed Nov 14 16:12:50 2018

2018/11/14 16:13:40 CLSRSC-4015: Performing install or upgrade action for Oracle Trace File Analyzer (TFA) Collector.

2018/11/14 16:15:28 CLSRSC-4003: Successfully patched Oracle Trace File Analyzer (TFA) Collector.

2018/11/14 16:17:48 CLSRSC-672: Post-patch steps for patching GI home successfully completed.

server2.cern.ch: Updating inventory on nodes: server2.

========================================

server2.cern.ch:

Starting Oracle Universal Installer...

Checking swap space: must be greater than 500 MB. Actual 16367 MB Passed

The inventory pointer is located at /etc/oraInst.loc

'UpdateNodeList' was successful.

server2.cern.ch: Updated inventory on nodes: server2.

server2.cern.ch: Updating inventory on nodes: server2.

========================================

server2.cern.ch:

Starting Oracle Universal Installer...

Checking swap space: must be greater than 500 MB. Actual 16367 MB Passed

The inventory pointer is located at /etc/oraInst.loc

'UpdateNodeList' was successful.

server2.cern.ch: Updated inventory on nodes: server2.

server2.cern.ch: Completed the 'move gihome' operation on server cluster.

As you can see, there are two differencse here: the second node was in this case the last one, so the cluster status gets back to NORMAL, and the GIMR is patched with datapatch (lines 176-227).

At this point, the cluster has been patched. After some testing, you can safely remove the inactive version of Grid Infrastructure using the deinstall binary ($OLD_OH/deinstall/deinstall).

Quite easy, huh?

If you combine the Independent Local-mode Automaton with a home-developed solution for the creation and the provisioning of Grid Infrastructure Golden Images, you can easily achieve automated Grid Infrastructure patching of a big, multi-cluster environment.

Of course, Fleet Patching and Provisioning remains the Rolls-Royce: if you can afford it, GI patching and much more is completely automated and developed by Oracle, so you will have no headaches when new versions are released. But the local-mode automaton might be enough for your needs.

—

Ludo

Oracle Grid Infrastructure 18c patching part 2: Independent Local-mode Automaton architecture and activation

Posted on December 16, 2018 by Ludovico

The first important step before starting using the new Independent Local-mode Automaton is understanding which are its components inside a cluster.

Resources

Here’s the list of service that you will find when you install a Grid Infrastructure 18c:

# [ oracle@server1:/u01/app/oracle/home [15:14:41] [18.3.0.0.0 [GRID] SID=GRID] 0 ] #
$ crsctl stat res -t
--------------------------------------------------------------------------------
Name           Target  State        Server                   State details
--------------------------------------------------------------------------------
Local Resources
--------------------------------------------------------------------------------
ora.ASMNET1LSNR_ASM.lsnr
               ONLINE  ONLINE       server1                STABLE
               ONLINE  ONLINE       server2                STABLE
ora.LISTENER.lsnr
               ONLINE  ONLINE       server1                STABLE
               ONLINE  ONLINE       server2                STABLE
ora.MGMT.GHCHKPT.advm
               OFFLINE OFFLINE      server1                STABLE
               OFFLINE OFFLINE      server2                STABLE
ora.MGMT.dg
               ONLINE  ONLINE       server1                STABLE
               ONLINE  ONLINE       server2                STABLE
ora.OCRVOT.dg
               ONLINE  ONLINE       server1                STABLE
               ONLINE  ONLINE       server2                STABLE
ora.chad
               ONLINE  ONLINE       server1                STABLE
               ONLINE  ONLINE       server2                STABLE
ora.helper
               OFFLINE OFFLINE      server1                IDLE,STABLE
               OFFLINE OFFLINE      server2                STABLE
ora.mgmt.ghchkpt.acfs
               OFFLINE OFFLINE      server1                STABLE
               OFFLINE OFFLINE      server2                STABLE
ora.net1.network
               ONLINE  ONLINE       server1                STABLE
               ONLINE  ONLINE       server2                STABLE
ora.ons
               ONLINE  ONLINE       server1                STABLE
               ONLINE  ONLINE       server2                STABLE
ora.proxy_advm
               ONLINE  ONLINE       server1                STABLE
               ONLINE  ONLINE       server2                STABLE
--------------------------------------------------------------------------------
Cluster Resources
--------------------------------------------------------------------------------
ora.LISTENER_SCAN1.lsnr
      1        ONLINE  ONLINE       server2                STABLE
ora.LISTENER_SCAN2.lsnr
      1        ONLINE  ONLINE       server1                STABLE
ora.MGMTLSNR
      1        ONLINE  ONLINE       server1                169.254.17.12 10.30.
                                                             200.73,STABLE
ora.asm
      1        ONLINE  ONLINE       server1                Started,STABLE
      2        ONLINE  ONLINE       server2                Started,STABLE
      3        OFFLINE OFFLINE                               STABLE
ora.cvu
      1        ONLINE  ONLINE       server1                STABLE
ora.server1.vip
      1        ONLINE  ONLINE       server1                STABLE
ora.server2.vip
      1        ONLINE  ONLINE       server2                STABLE
ora.mgmtdb
      1        ONLINE  ONLINE       server1                Open,STABLE
ora.qosmserver
      1        ONLINE  ONLINE       server1                STABLE
ora.rhpserver
      1        OFFLINE OFFLINE                               STABLE
ora.scan1.vip
      1        ONLINE  ONLINE       server2                STABLE
ora.scan2.vip
      1        ONLINE  ONLINE       server1                STABLE
--------------------------------------------------------------------------------

# [ oracle@server1:/u01/app/oracle/home [15:14:41] [18.3.0.0.0 [GRID] SID=GRID] 0 ] #

$ crsctl stat res -t

--------------------------------------------------------------------------------

Name Target State Server State details

--------------------------------------------------------------------------------

Local Resources

--------------------------------------------------------------------------------

ora.ASMNET1LSNR_ASM.lsnr

ONLINE ONLINE server1 STABLE

ONLINE ONLINE server2 STABLE

ora.LISTENER.lsnr

ONLINE ONLINE server1 STABLE

ONLINE ONLINE server2 STABLE

ora.MGMT.GHCHKPT.advm

OFFLINE OFFLINE server1 STABLE

OFFLINE OFFLINE server2 STABLE

ora.MGMT.dg

ONLINE ONLINE server1 STABLE

ONLINE ONLINE server2 STABLE

ora.OCRVOT.dg

ONLINE ONLINE server1 STABLE

ONLINE ONLINE server2 STABLE

ora.chad

ONLINE ONLINE server1 STABLE

ONLINE ONLINE server2 STABLE

ora.helper

OFFLINE OFFLINE server1 IDLE,STABLE

OFFLINE OFFLINE server2 STABLE

ora.mgmt.ghchkpt.acfs

OFFLINE OFFLINE server1 STABLE

OFFLINE OFFLINE server2 STABLE

ora.net1.network

ONLINE ONLINE server1 STABLE

ONLINE ONLINE server2 STABLE

ora.ons

ONLINE ONLINE server1 STABLE

ONLINE ONLINE server2 STABLE

ora.proxy_advm

ONLINE ONLINE server1 STABLE

ONLINE ONLINE server2 STABLE

--------------------------------------------------------------------------------

Cluster Resources

--------------------------------------------------------------------------------

ora.LISTENER_SCAN1.lsnr

1 ONLINE ONLINE server2 STABLE

ora.LISTENER_SCAN2.lsnr

1 ONLINE ONLINE server1 STABLE

ora.MGMTLSNR

1 ONLINE ONLINE server1 169.254.17.12 10.30.

200.73,STABLE

ora.asm

1 ONLINE ONLINE server1 Started,STABLE

2 ONLINE ONLINE server2 Started,STABLE

3 OFFLINE OFFLINE STABLE

ora.cvu

1 ONLINE ONLINE server1 STABLE

ora.server1.vip

1 ONLINE ONLINE server1 STABLE

ora.server2.vip

1 ONLINE ONLINE server2 STABLE

ora.mgmtdb

1 ONLINE ONLINE server1 Open,STABLE

ora.qosmserver

1 ONLINE ONLINE server1 STABLE

ora.rhpserver

1 OFFLINE OFFLINE STABLE

ora.scan1.vip

1 ONLINE ONLINE server2 STABLE

ora.scan2.vip

1 ONLINE ONLINE server1 STABLE

--------------------------------------------------------------------------------

As you can see, there are 4 components that are OFFLINE by default:

Three local resources (that are present on each node):

ora.MGMT.GHCHKPT.advm
ora.mgmt.ghchkpt.acfs
ora.helper

One cluster resource (active on only one server at a time, it can relocate):

ora.rhpserver

If you have ever worked with 12c Rapid Home Provisioning, those name should sound familiar.

The GHCHKPT filesystem (ant its relative volume), is used to store some data regarding the ongoing operations across the cluster during the GI home move.

The ora.helper is the process that actually does the operations. It is local because each node needs it to execute some actions at some point.

The rhpserver is the server process that coordinates the operations and delegates them to the helpers.

All those services compose the independent local-mode automaton, that is the default deployment. The full RHP framework (RHP Server and RHP Client) might be configured instead with some additional work.

Important note: Just a few weeks ago Oracle changed the name of Rapid Home Provisioning (RHP) to Fleet Patching and Provisioning (FPP). The name is definitely more appealing now, but it generates again some confusion about product names and acronyms, so beware that in this series sometimes I refer to RHP, sometimes to FPP, but actually it is the same thing.

Tomcat?

You might have noticed that tomcat is deployed now in the GI home, as there are patches specific to it (here I paste the 18.4 version):

$ opatch lspatches
28655963;DBWLM RELEASE UPDATE 18.4.0.0.0 (28655963)
28655784;Database Release Update : 18.4.0.0.181016 (28655784)
28655916;ACFS RELEASE UPDATE 18.4.0.0.0 (28655916)
28656071;OCW RELEASE UPDATE 18.4.0.0.0 (28656071)
28547619;TOMCAT RELEASE UPDATE 18.0.0.0.0 (28547619)
27908644;UPDATE 18.3 DATABASE CLIENT JDK IN ORACLE HOME TO JDK8U171
27923415;OJVM RELEASE UPDATE: 18.3.0.0.180717 (27923415)

$ opatch lspatches

28655963;DBWLM RELEASE UPDATE 18.4.0.0.0 (28655963)

28655784;Database Release Update : 18.4.0.0.181016 (28655784)

28655916;ACFS RELEASE UPDATE 18.4.0.0.0 (28655916)

28656071;OCW RELEASE UPDATE 18.4.0.0.0 (28656071)

28547619;TOMCAT RELEASE UPDATE 18.0.0.0.0 (28547619)

27908644;UPDATE 18.3 DATABASE CLIENT JDK IN ORACLE HOME TO JDK8U171

27923415;OJVM RELEASE UPDATE: 18.3.0.0.180717 (27923415)

Indeed Tomcat is registered in the inventory and patched just like any other product inside the OH:

<COMP NAME="oracle.tomcat.crs" VER="18.0.0.0.0" BUILD_NUMBER="0" BUILD_TIME="20180207.193003" REP_VER="0.0.0.0.0" RELEASE="Production" INV_LOC="Components/oracle.tomcat.crs/18.0.0.0.0/1/" LANGS="ALL_LANGS" XML_INV_LOC="Components21/oracle.tomcat.crs/18.0.0.0.0/" ACT_INST_VER="12.2.0.4.0" DEINST_VER="11.2.0.0.0" INSTALL_TIME="2018.Nov.05 13:27:32 CET" INST_LOC="/u01/app/grid/crs1840/tomcat">
   <EXT_NAME>Tomcat Container</EXT_NAME>
   <DESC>Packages files from the Tomcat Container.</DESC>
   <DESCID>COMPONENT_DESC</DESCID>
   <STG_INFO OSP_VER="10.2.0.0.0"/>
   <CMP_JAR_INFO>
      <INFO NAME="filemapObj" VAL="Components/oracle/tomcat/crs/v18_0_0_0_0/filemap.xml"/>
      <INFO NAME="helpDir" VAL="Components/oracle/tomcat/crs/v18_0_0_0_0/help/"/>
      <INFO NAME="actionsClass" VAL="Components.oracle.tomcat.crs.v18_0_0_0_0.CompActions"/>
      <INFO NAME="resourceClass" VAL="Components.oracle.tomcat.crs.v18_0_0_0_0.resources.CompRes"/>
      <INFO NAME="identifiersXML" VAL="Components/oracle/tomcat/crs/v18_0_0_0_0/identifiers.xml"/>
      <INFO NAME="contextClass" VAL="Components.oracle.tomcat.crs.v18_0_0_0_0.CompContext"/>
      <INFO NAME="fastCopyLogXML" VAL="Components/oracle/tomcat/crs/v18_0_0_0_0/fastCopyLog.xml"/>
   </CMP_JAR_INFO>
   <LOC_INFO INST_DFN_LOC="../Scripts" JAR_NAME="install1.jar"/>
   <BOOK NAME="oracle.tomcat.crs.hs"/>
   <PRE_REQ DEF="F"/>
   <PROD_HOME DEF="F"/>
   <LANG_IDX_MAP>
      <LANG LIST="en fr ar bn pt_BR bg fr_CA ca hr cs da nl ar_EG en_GB et fi de el iw hu is in it ja ko es lv lt ms es_MX no pl pt ro ru zh_CN sk sl es_ES sv th zh_TW tr uk vi"/>
      <LANGSET IDX="1" BITSET="{0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44}"/>
   </LANG_IDX_MAP>
   <PLAT_IDX_MAP>
      <PLAT LIST="46"/>
      <PLATSET IDX="1" BITSET="{0}"/>
   </PLAT_IDX_MAP>
   <DST_IDX_MAP>
      <DST LIST="%ORACLE_HOME% %INVENTORY_LOCATION%"/>
   </DST_IDX_MAP>
   <DEP_GRP_LIST/>
   <DEP_LIST/>
   <REF_LIST>
      <REF NAME="oracle.crs" VER="18.0.0.0.0" HOME_IDX="3"/>
   </REF_LIST>
   <INST_TYPE_LIST>
      <INST_TYPE NAME="Complete" NAME_ID="Maximum" DESC_ID=""/>
   </INST_TYPE_LIST>
   <FILESIZEINFO>
      <DEST VOLUME="%ORACLE_HOME%" SPACE_REQ="3375301"/>
      <DEST VOLUME="%INVENTORY_LOCATION%" SPACE_REQ="2000"/>
   </FILESIZEINFO>
</COMP>

<EXT_NAME>Tomcat Container</EXT_NAME>

<DESC>Packages files from the Tomcat Container.</DESC>

<DESCID>COMPONENT_DESC</DESCID>

<STG_INFO OSP_VER="10.2.0.0.0"/>

<CMP_JAR_INFO>

</CMP_JAR_INFO>

<LOC_INFO INST_DFN_LOC="../Scripts" JAR_NAME="install1.jar"/>

<PRE_REQ DEF="F"/>

<PROD_HOME DEF="F"/>

<LANG_IDX_MAP>

</LANG_IDX_MAP>

<PLAT_IDX_MAP>

</PLAT_IDX_MAP>

<DST_IDX_MAP>

</DST_IDX_MAP>

<DEP_GRP_LIST/>

<DEP_LIST/>

<REF_LIST>

</REF_LIST>

<INST_TYPE_LIST>

<INST_TYPE NAME="Complete" NAME_ID="Maximum" DESC_ID=""/>

</INST_TYPE_LIST>

</FILESIZEINFO>

</COMP>

# [ oracle@server2:/u01/app/grid/crs1830/inventory/Components21/oracle.tomcat.crs/18.0.0.0.0 [08:56:06] [18.3.0.0.0 [GRID] SID=GRID] 0 ] #
$ vi context.xml

<?xml version="1.0" standalone="yes" ?>
<!-- Copyright (c) 1999, 2018, Oracle and/or its affiliates.
All rights reserved. -->
<!-- Do not modify the contents of this file by hand. -->
<COMP_CONTEXT>
   <VAR_LIST SIZE="0">
      <VAR NAME="PROD_HOME" TYPE="String" DESC_RES_ID="" SECURE="F" VAL="/u01/app/grid/crs1830/tomcat" ADV="F" CLONABLE="T" USER_INPUT="DEFAULT"/>
   </VAR_LIST>
   <CONST_LIST SIZE="2">
      <CONST NAME="COMPONENT_DESC" PLAT_SP="F" TYPE="String" TRANS="T" VAL="COMPONENT_DESC_ALL"/>
      <CONST NAME="COMPONENT_NAME" PLAT_SP="F" TYPE="String" TRANS="F" VAL="Tomcat Container"/>
   </CONST_LIST>
</COMP_CONTEXT>

# [ oracle@server2:/u01/app/grid/crs1830/inventory/Components21/oracle.tomcat.crs/18.0.0.0.0 [08:56:06] [18.3.0.0.0 [GRID] SID=GRID] 0 ] #

$ vi context.xml

<?xml version="1.0" standalone="yes" ?>

<COMP_CONTEXT>

<VAR_LIST SIZE="0">

</VAR_LIST>

<CONST_LIST SIZE="2">

</CONST_LIST>

</COMP_CONTEXT>

Out of the box, Tomcat is used for the Quality of Services Management (ora.qosmserver resource):

$ ps -eaf | grep tomcat
oracle    58746 142151  0 13:10 pts/1    00:00:00 grep --color=auto tomcat
oracle   108610      1  0 Dec04 ?        00:25:33 /CRS/dbs01/crs1830/jdk/bin/java -server -Xms128M -Xmx384M -Djava.awt.headless=true -Ddisable.checkForUpdate=true -Djava.util.logging.config.file=/ORA/dbs01/oracle/crsdata/itrac1602/qos/conf/logging.properties -Djava.util.logging.manager=org.apache.juli.ClassLoaderLogManager -DTRACING.ENABLED=false -Djava.rmi.server.hostname=itrac1602.cern.ch -Doracle.http.port=8888 -Doracle.jmx.port=23792 -Doracle.tls.enabled=false -Doracle.jwc.tls.http.enabled=false -Djava.security.manager -Djava.security.policy=/ORA/dbs01/oracle/crsdata/itrac1602/qos/conf/catalina.policy -Djava.security.egd=file:/dev/urandom -Dcatalina.home=/CRS/dbs01/crs1840/tomcat -Dcatalina.base=/ORA/dbs01/oracle/crsdata/itrac1602/qos -Djava.io.tmpdir=/ORA/dbs01/oracle/crsdata/itrac1602/qos/temp -Doracle.home=/CRS/dbs01/crs1840 -classpath /CRS/dbs01/crs1840/tomcat/lib/tomcat-juli.jar:/CRS/dbs01/crs1840/tomcat/lib/bootstrap.jar:/CRS/dbs01/crs1840/jlib/jwc-logging.jar org.apache.catalina.startup.Bootstrap start

$ ps -eaf | grep tomcat

oracle 58746 142151 0 13:10 pts/1 00:00:00 grep --color=auto tomcat

oracle 108610 1 0 Dec04 ? 00:25:33 /CRS/dbs01/crs1830/jdk/bin/java -server -Xms128M -Xmx384M -Djava.awt.headless=true -Ddisable.checkForUpdate=true -Djava.util.logging.config.file=/ORA/dbs01/oracle/crsdata/itrac1602/qos/conf/logging.properties -Djava.util.logging.manager=org.apache.juli.ClassLoaderLogManager -DTRACING.ENABLED=false -Djava.rmi.server.hostname=itrac1602.cern.ch -Doracle.http.port=8888 -Doracle.jmx.port=23792 -Doracle.tls.enabled=false -Doracle.jwc.tls.http.enabled=false -Djava.security.manager -Djava.security.policy=/ORA/dbs01/oracle/crsdata/itrac1602/qos/conf/catalina.policy -Djava.security.egd=file:/dev/urandom -Dcatalina.home=/CRS/dbs01/crs1840/tomcat -Dcatalina.base=/ORA/dbs01/oracle/crsdata/itrac1602/qos -Djava.io.tmpdir=/ORA/dbs01/oracle/crsdata/itrac1602/qos/temp -Doracle.home=/CRS/dbs01/crs1840 -classpath /CRS/dbs01/crs1840/tomcat/lib/tomcat-juli.jar:/CRS/dbs01/crs1840/tomcat/lib/bootstrap.jar:/CRS/dbs01/crs1840/jlib/jwc-logging.jar org.apache.catalina.startup.Bootstrap start

But it is used for the Independent Local Mode Automaton as well, when it is started.

Enabling and starting the independent local-mode automaton

The resources are started using the following commands (as root, the order is quite important):

# /u01/app/grid/crs1830/bin/srvctl enable volume -volume GHCHKPT  -diskgroup mgmt
# /u01/app/grid/crs1830/bin/srvctl enable filesystem -volume GHCHKPT -diskgroup mgmt
# /u01/app/grid/crs1830/bin/srvctl start filesystem -volume GHCHKPT -diskgroup mgmt

# /u01/app/grid/crs1830/bin/srvctl enable volume -volume GHCHKPT -diskgroup mgmt

# /u01/app/grid/crs1830/bin/srvctl enable filesystem -volume GHCHKPT -diskgroup mgmt

# /u01/app/grid/crs1830/bin/srvctl start filesystem -volume GHCHKPT -diskgroup mgmt

Before continuing with the rhpserver resource, you might want to check if the filesystem is mounted:

$ crsctl stat res -t
--------------------------------------------------------------------------------
Name           Target  State        Server                   State details
--------------------------------------------------------------------------------
Local Resources
--------------------------------------------------------------------------------
ora.ASMNET1LSNR_ASM.lsnr
               ONLINE  ONLINE       server1                STABLE
               ONLINE  ONLINE       server2                STABLE
ora.LISTENER.lsnr
               ONLINE  ONLINE       server1                STABLE
               ONLINE  ONLINE       server2                STABLE
ora.MGMT.GHCHKPT.advm
               ONLINE  ONLINE       server1                STABLE
               ONLINE  ONLINE       server2                STABLE
ora.MGMT.dg
               ONLINE  ONLINE       server1                STABLE
               ONLINE  ONLINE       server2                STABLE
ora.OCRVOT.dg
               ONLINE  ONLINE       server1                STABLE
               ONLINE  ONLINE       server2                STABLE
ora.chad
               ONLINE  ONLINE       server1                STABLE
               ONLINE  ONLINE       server2                STABLE
ora.helper
               OFFLINE OFFLINE      server1                IDLE,STABLE
               OFFLINE OFFLINE      server2                STABLE
ora.mgmt.ghchkpt.acfs
               ONLINE  ONLINE       server1                mounted on /opt/orac
                                                             le/rhp_images/chkbas
                                                             e,STABLE
               ONLINE  ONLINE       server2                mounted on /opt/orac
                                                             le/rhp_images/chkbas
                                                             e,STABLE
ora.net1.network
               ONLINE  ONLINE       server1                STABLE
               ONLINE  ONLINE       server2                STABLE
ora.ons
               ONLINE  ONLINE       server1                STABLE
               ONLINE  ONLINE       server2                STABLE
ora.proxy_advm
               ONLINE  ONLINE       server1                STABLE
               ONLINE  ONLINE       server2                STABLE
--------------------------------------------------------------------------------
Cluster Resources
--------------------------------------------------------------------------------
ora.LISTENER_SCAN1.lsnr
      1        ONLINE  ONLINE       server2                STABLE
ora.LISTENER_SCAN2.lsnr
      1        ONLINE  ONLINE       server1                STABLE
ora.MGMTLSNR
      1        ONLINE  ONLINE       server1                169.254.17.12 10.30.
                                                             200.73,STABLE
ora.asm
      1        ONLINE  ONLINE       server1                Started,STABLE
      2        ONLINE  ONLINE       server2                Started,STABLE
      3        OFFLINE OFFLINE                               STABLE
ora.cvu
      1        ONLINE  ONLINE       server1                STABLE
ora.server1.vip
      1        ONLINE  ONLINE       server1                STABLE
ora.server2.vip
      1        ONLINE  ONLINE       server2                STABLE
ora.mgmtdb
      1        ONLINE  ONLINE       server1                Open,STABLE
ora.qosmserver
      1        ONLINE  ONLINE       server1                STABLE
ora.rhpserver
      1        OFFLINE OFFLINE                               STABLE
ora.scan1.vip
      1        ONLINE  ONLINE       server2                STABLE
ora.scan2.vip
      1        ONLINE  ONLINE       server1                STABLE
--------------------------------------------------------------------------------


[root@server2 dbs01]# df -k | grep ghchkpt
/dev/asm/ghchkpt-213              1572864   499572   1073292  32% /opt/oracle/rhp_images/chkbase

$ crsctl stat res -t

--------------------------------------------------------------------------------

Name Target State Server State details

--------------------------------------------------------------------------------

Local Resources

--------------------------------------------------------------------------------

ora.ASMNET1LSNR_ASM.lsnr

ONLINE ONLINE server1 STABLE

ONLINE ONLINE server2 STABLE

ora.LISTENER.lsnr

ONLINE ONLINE server1 STABLE

ONLINE ONLINE server2 STABLE

ora.MGMT.GHCHKPT.advm

ONLINE ONLINE server1 STABLE

ONLINE ONLINE server2 STABLE

ora.MGMT.dg

ONLINE ONLINE server1 STABLE

ONLINE ONLINE server2 STABLE

ora.OCRVOT.dg

ONLINE ONLINE server1 STABLE

ONLINE ONLINE server2 STABLE

ora.chad

ONLINE ONLINE server1 STABLE

ONLINE ONLINE server2 STABLE

ora.helper

OFFLINE OFFLINE server1 IDLE,STABLE

OFFLINE OFFLINE server2 STABLE

ora.mgmt.ghchkpt.acfs

ONLINE ONLINE server1 mounted on /opt/orac

le/rhp_images/chkbas

e,STABLE

ONLINE ONLINE server2 mounted on /opt/orac

le/rhp_images/chkbas

e,STABLE

ora.net1.network

ONLINE ONLINE server1 STABLE

ONLINE ONLINE server2 STABLE

ora.ons

ONLINE ONLINE server1 STABLE

ONLINE ONLINE server2 STABLE

ora.proxy_advm

ONLINE ONLINE server1 STABLE

ONLINE ONLINE server2 STABLE

--------------------------------------------------------------------------------

Cluster Resources

--------------------------------------------------------------------------------

ora.LISTENER_SCAN1.lsnr

1 ONLINE ONLINE server2 STABLE

ora.LISTENER_SCAN2.lsnr

1 ONLINE ONLINE server1 STABLE

ora.MGMTLSNR

1 ONLINE ONLINE server1 169.254.17.12 10.30.

200.73,STABLE

ora.asm

1 ONLINE ONLINE server1 Started,STABLE

2 ONLINE ONLINE server2 Started,STABLE

3 OFFLINE OFFLINE STABLE

ora.cvu

1 ONLINE ONLINE server1 STABLE

ora.server1.vip

1 ONLINE ONLINE server1 STABLE

ora.server2.vip

1 ONLINE ONLINE server2 STABLE

ora.mgmtdb

1 ONLINE ONLINE server1 Open,STABLE

ora.qosmserver

1 ONLINE ONLINE server1 STABLE

ora.rhpserver

1 OFFLINE OFFLINE STABLE

ora.scan1.vip

1 ONLINE ONLINE server2 STABLE

ora.scan2.vip

1 ONLINE ONLINE server1 STABLE

--------------------------------------------------------------------------------

[root@server2 dbs01]# df -k | grep ghchkpt

/dev/asm/ghchkpt-213 1572864 499572 1073292 32% /opt/oracle/rhp_images/chkbase

Now the rhpserver should start without problems, as oracle:

# [ oracle@server1:/u01/app/oracle/home [17:00:49] [18.3.0.0.0 [GRID] SID=GRID] 0 ] #
$ srvctl start rhpserver

1 2	# [ oracle@server1:/u01/app/oracle/home [17:00:49] [18.3.0.0.0 [GRID] SID=GRID] 0 ] # $ srvctl start rhpserver

Please note that if you omit to activate the filesystem first, the rhpserver will fail to start.

As you can see, now both rhpserver and the helper are online:

# [ oracle@server1:/u01/app/oracle/home [17:00:49] [18.3.0.0.0 [GRID] SID=GRID] 0 ] #
$ srvctl start rhpserver

# [ oracle@server1:/u01/app/oracle/home [17:02:39] [18.3.0.0.0 [GRID] SID=GRID] 0 ] #
$ crsctl stat res -t
--------------------------------------------------------------------------------
Name           Target  State        Server                   State details
--------------------------------------------------------------------------------
Local Resources
--------------------------------------------------------------------------------
ora.ASMNET1LSNR_ASM.lsnr
               ONLINE  ONLINE       server1                STABLE
               ONLINE  ONLINE       server2                STABLE
ora.LISTENER.lsnr
               ONLINE  ONLINE       server1                STABLE
               ONLINE  ONLINE       server2                STABLE
ora.MGMT.GHCHKPT.advm
               ONLINE  ONLINE       server1                STABLE
               ONLINE  ONLINE       server2                STABLE
ora.MGMT.dg
               ONLINE  ONLINE       server1                STABLE
               ONLINE  ONLINE       server2                STABLE
ora.OCRVOT.dg
               ONLINE  ONLINE       server1                STABLE
               ONLINE  ONLINE       server2                STABLE
ora.chad
               ONLINE  ONLINE       server1                STABLE
               ONLINE  ONLINE       server2                STABLE
ora.helper
               ONLINE  ONLINE       server1                STABLE
               ONLINE  ONLINE       server2                STABLE
ora.mgmt.ghchkpt.acfs
               ONLINE  ONLINE       server1                mounted on /opt/orac
                                                             le/rhp_images/chkbas
                                                             e,STABLE
               ONLINE  ONLINE       server2                mounted on /opt/orac
                                                             le/rhp_images/chkbas
                                                             e,STABLE
ora.net1.network
               ONLINE  ONLINE       server1                STABLE
               ONLINE  ONLINE       server2                STABLE
ora.ons
               ONLINE  ONLINE       server1                STABLE
               ONLINE  ONLINE       server2                STABLE
ora.proxy_advm
               ONLINE  ONLINE       server1                STABLE
               ONLINE  ONLINE       server2                STABLE
--------------------------------------------------------------------------------
Cluster Resources
--------------------------------------------------------------------------------
ora.LISTENER_SCAN1.lsnr
      1        ONLINE  ONLINE       server2                STABLE
ora.LISTENER_SCAN2.lsnr
      1        ONLINE  ONLINE       server1                STABLE
ora.MGMTLSNR
      1        ONLINE  ONLINE       server1                169.254.17.12 10.30.
                                                             200.73,STABLE
ora.asm
      1        ONLINE  ONLINE       server1                Started,STABLE
      2        ONLINE  ONLINE       server2                Started,STABLE
      3        OFFLINE OFFLINE                               STABLE
ora.cvu
      1        ONLINE  ONLINE       server1                STABLE
ora.server1.vip
      1        ONLINE  ONLINE       server1                STABLE
ora.server2.vip
      1        ONLINE  ONLINE       server2                STABLE
ora.mgmtdb
      1        ONLINE  ONLINE       server1                Open,STABLE
ora.qosmserver
      1        ONLINE  ONLINE       server1                STABLE
ora.rhpserver
      1        ONLINE  ONLINE       server2                STABLE
ora.scan1.vip
      1        ONLINE  ONLINE       server2                STABLE
ora.scan2.vip
      1        ONLINE  ONLINE       server1                STABLE
--------------------------------------------------------------------------------

# [ oracle@server1:/u01/app/oracle/home [17:00:49] [18.3.0.0.0 [GRID] SID=GRID] 0 ] #

$ srvctl start rhpserver

# [ oracle@server1:/u01/app/oracle/home [17:02:39] [18.3.0.0.0 [GRID] SID=GRID] 0 ] #

$ crsctl stat res -t

--------------------------------------------------------------------------------

Name Target State Server State details

--------------------------------------------------------------------------------

Local Resources

--------------------------------------------------------------------------------

ora.ASMNET1LSNR_ASM.lsnr

ONLINE ONLINE server1 STABLE

ONLINE ONLINE server2 STABLE

ora.LISTENER.lsnr

ONLINE ONLINE server1 STABLE

ONLINE ONLINE server2 STABLE

ora.MGMT.GHCHKPT.advm

ONLINE ONLINE server1 STABLE

ONLINE ONLINE server2 STABLE

ora.MGMT.dg

ONLINE ONLINE server1 STABLE

ONLINE ONLINE server2 STABLE

ora.OCRVOT.dg

ONLINE ONLINE server1 STABLE

ONLINE ONLINE server2 STABLE

ora.chad

ONLINE ONLINE server1 STABLE

ONLINE ONLINE server2 STABLE

ora.helper

ONLINE ONLINE server1 STABLE

ONLINE ONLINE server2 STABLE

ora.mgmt.ghchkpt.acfs

ONLINE ONLINE server1 mounted on /opt/orac

le/rhp_images/chkbas

e,STABLE

ONLINE ONLINE server2 mounted on /opt/orac

le/rhp_images/chkbas

e,STABLE

ora.net1.network

ONLINE ONLINE server1 STABLE

ONLINE ONLINE server2 STABLE

ora.ons

ONLINE ONLINE server1 STABLE

ONLINE ONLINE server2 STABLE

ora.proxy_advm

ONLINE ONLINE server1 STABLE

ONLINE ONLINE server2 STABLE

--------------------------------------------------------------------------------

Cluster Resources

--------------------------------------------------------------------------------

ora.LISTENER_SCAN1.lsnr

1 ONLINE ONLINE server2 STABLE

ora.LISTENER_SCAN2.lsnr

1 ONLINE ONLINE server1 STABLE

ora.MGMTLSNR

1 ONLINE ONLINE server1 169.254.17.12 10.30.

200.73,STABLE

ora.asm

1 ONLINE ONLINE server1 Started,STABLE

2 ONLINE ONLINE server2 Started,STABLE

3 OFFLINE OFFLINE STABLE

ora.cvu

1 ONLINE ONLINE server1 STABLE

ora.server1.vip

1 ONLINE ONLINE server1 STABLE

ora.server2.vip

1 ONLINE ONLINE server2 STABLE

ora.mgmtdb

1 ONLINE ONLINE server1 Open,STABLE

ora.qosmserver

1 ONLINE ONLINE server1 STABLE

ora.rhpserver

1 ONLINE ONLINE server2 STABLE

ora.scan1.vip

1 ONLINE ONLINE server2 STABLE

ora.scan2.vip

1 ONLINE ONLINE server1 STABLE

--------------------------------------------------------------------------------

# [ oracle@server2:/u01/app/grid/crs1830 [08:59:43] [18.3.0.0.0 [GRID] SID=GRID] 0 ] #
$ ps -eaf | grep tomca
oracle   132330      1 15 08:48 ?        00:01:39 /u01/app/grid/crs1830/jdk/bin/java -server -Xms128M -Xmx384M -Djava.awt.headless=true -Ddisable.checkForUpdate=true -Djava.util.logging.config.file=/u01/app/oracle/crsdata/server2/rhp/conf/logging.properties -Djava.util.logging.manager=org.apache.juli.ClassLoaderLogManager -DTRACING.ENABLED=false -Djava.rmi.server.hostname=server2.cern.ch -Doracle.http.port=8894 -Doracle.jmx.port=23795 -Doracle.tls.enabled=false -Doracle.jwc.tls.http.enabled=true -Doracle.rhp.storagebase=/opt/oracle/rhp_images -Djava.security.egd=file:/dev/urandom -Dcatalina.home=/u01/app/grid/crs1830/tomcat -Dcatalina.base=/u01/app/oracle/crsdata/server2/rhp -Djava.io.tmpdir=/u01/app/oracle/crsdata/server2/rhp/temp -Doracle.home=/u01/app/grid/crs1830 -classpath /u01/app/grid/crs1830/tomcat/lib/tomcat-juli.jar:/u01/app/grid/crs1830/tomcat/lib/bootstrap.jar:/u01/app/grid/crs1830/jlib/jwc-logging.jar org.apache.catalina.startup.Bootstrap start

# [ oracle@server2:/u01/app/grid/crs1830 [08:59:43] [18.3.0.0.0 [GRID] SID=GRID] 0 ] #

$ ps -eaf | grep tomca

oracle 132330 1 15 08:48 ? 00:01:39 /u01/app/grid/crs1830/jdk/bin/java -server -Xms128M -Xmx384M -Djava.awt.headless=true -Ddisable.checkForUpdate=true -Djava.util.logging.config.file=/u01/app/oracle/crsdata/server2/rhp/conf/logging.properties -Djava.util.logging.manager=org.apache.juli.ClassLoaderLogManager -DTRACING.ENABLED=false -Djava.rmi.server.hostname=server2.cern.ch -Doracle.http.port=8894 -Doracle.jmx.port=23795 -Doracle.tls.enabled=false -Doracle.jwc.tls.http.enabled=true -Doracle.rhp.storagebase=/opt/oracle/rhp_images -Djava.security.egd=file:/dev/urandom -Dcatalina.home=/u01/app/grid/crs1830/tomcat -Dcatalina.base=/u01/app/oracle/crsdata/server2/rhp -Djava.io.tmpdir=/u01/app/oracle/crsdata/server2/rhp/temp -Doracle.home=/u01/app/grid/crs1830 -classpath /u01/app/grid/crs1830/tomcat/lib/tomcat-juli.jar:/u01/app/grid/crs1830/tomcat/lib/bootstrap.jar:/u01/app/grid/crs1830/jlib/jwc-logging.jar org.apache.catalina.startup.Bootstrap start

Now all is set to start using it!

We’ll see how to use it in the next posts.

—

Ludo

Oracle Grid Infrastructure 18c patching part 1: Some history

Posted on November 16, 2018 by Ludovico

Down the memory lane

Although sometimes I think I have been working with Oracle Grid Infrastructure since it exists, sometimes my memory does not work well. I still like to go through the Oracle RAC family history from time to time:

8i -> no Oracle cluster did exist. RAC was leveraging 3rd party clusters (like Tru Cluster, AIX HACMP, Sun Cluster)…
9i -> if I remember well, Oracle hired some developers of Tru Cluster after the acquisition of Compaq by HP. Oracle CRS was born and was quite similar to Tru Cluster. (The commands were almost the same: crs_stat instead of caa_stat, etc)
10g -> Oracle re-branded CRS to Clusterware
11g -> With the addition of ASM (and other components), Oracle created the concept of “Grid Infrastructure”, composed by Clusterware and additional products. All the new versions still use the name Grid Infrastructure and new products have been added through the years (ACFS, RHP, QoS …)

But I have missing souvenirs. For example, I cannot remember having ever upgraded an Oracle Cluster from 9i to 10g or from 10g to 11g. At that time I was working for several customers, and every new release was installed on new Hardware.

My first, real upgrade (as far as I can remember) was from 11gR2 to 12c, where the upgrade process was a nice, OUI-driven, out-of-place install.

The process was (still is 🙂 ) nice and smooth:

The installer copies, prepares and links the binaries on all the nodes in a new Oracle Home
The upgrade process is rolling: the first node puts the cluster in upgrade mode
The last node does the final steps and exists the cluster from the upgrade mode.

This is about Upgrading to a new release. But what about patching?

In-place patching

Patching of Grid Infrastructure has always been in-place and, I will not hide it, quite painful.

If you wanted to patch a Grid Infrastructure before release 12cR2, you had to:

read the documentation carefully and check for possible conflicts
backup the Grid Home
copy the patch on the host
evacuate all the services and databases from the cluster node that you want to patch
patch the binaries (depending on the versions and patches, this might be easy with opatchauto or quite painful with manual unlocking/locking and manual opatch steps)
restart/relocate the services back on the node
repeat the tasks for every node

The disadvantages of in-place patching are many:

Need to stage the patch on every node
Need to repeat the patching process for every node
No easy rollback (some bad problems might lead to deconfiguring the cluster from one node and then adding it back to the cluster)

Out-of-place patching

Out-of-place patching is proven to be much a better solution. I am doing it regularly since a while for Oracle Database homes and I am very satisfied with it. I am implementing it at CERN as well, and it will unlock new levels of server consolidation 🙂

I have written a blog series here, and presented about it a few times.

But out-of-place patching for Grid Infrastructure is VERY recent.

12cR2: opatchauto

Oracle 12cR2 introduced out-of-place patching as a new feature of opatchauto.

This MOS document explains it quite in detail:

Grid Infrastructure Out of Place ( OOP ) Patching using opatchauto (Doc ID 2419319.1)

The process is the following:

a preparation process clones the active Oracle Home on the current node and patches it
a switch process switches the active Oracle Home from the old one to the prepared clone
those two phases are repeated for each node

The good thing is that the preparation can be done in advance on all the nodes and the switch can be triggered only if all the clones are patched successfully.

However, the staging of the patch, the cloning and patching must still happen on every node, making the concept of golden images quite useless for patching.

It is worth to mention, at this point, that Grid Infrastructure Golden Images ARE A THING, and that they have been introduced by Rapid Home Provisioning release 12cR2, where cluster automatic provisioning has been included as a new feature.

This Grid Infrastructure golden images have already been mentioned here and here.

I have discussed about Rapid Home provisioning itself here, but I will ad a couple of thoughts in the next paragraph.

18c and the brand new Independent local-mode Automaton

I have been early tester of the Rapid Home Provisioning product, when it has been released with Oracle 12.1.0.2. I have presented about it at UKOUG and as a RAC SIG webinar.
https://www.youtube.com/watch?v=vaB4RWjYPq0
https://www.ludovicocaldara.net/dba/rhp-presentation/

I liked the product A LOT, despite a few bugs due to the initial release. The concept of out-of-placing patching that RHP uses is the best one, in my opinion, to cope with frequent patches and upgrades.

Now, with Oracle 18c, the Rapid Home Provisioning Independent Local-mode Automaton comes to play. There is not that much documentation about it, even in the Oracle documentation, but a few things are clear:

The Independent local-mode automaton comes without additional licenses as it is not part of the RHP Server/Client infrastructure
It is 100% local to the cluster where it is used
Its main “job” is to allow moving Grid Infrastructure Homes from a non-patched version to an out-of-place patched one.

$ rhpctl move gihome –sourcehome Oracle_home_path -destinationhome Oracle_home_path

1	$ rhpctl move gihome –sourcehome Oracle_home_path -destinationhome Oracle_home_path

I will not disclore more here, as the rest of this blog series is focused on this new product 🙂

Stay tuned for details, examples and feedback from its usage at CERN 😉

—

Ludo

Port conflict with “Oracle Remote Method Invocation (ORMI)” during Grid Infrastructure install

Posted on November 13, 2018 by Ludovico

After years of installing Grid Infrastructures, today I have got for the first time an error on something new:

$ /u01/app/grid/crs1840/gridSetup.sh -silent -responseFile /u01/app/grid/crs1840/inventory/response/CERNDB_Grid_Config.rsp ORACLE_HOME_NAME=crs1840
Launching Oracle Grid Infrastructure Setup Wizard...

[FATAL] [INS-13013] Target environment does not meet some mandatory requirements.
   CAUSE: Some of the mandatory prerequisites are not met. See logs for details. /tmp/GridSetupActions2018-11-13_12-40-03PM/gridSetupActions2018-11-13_12-40-03PM.log
   ACTION: Identify the list of failed prerequisite checks from the log: /tmp/GridSetupActions2018-11-13_12-40-03PM/gridSetupActions2018-11-13_12-40-03PM.log. Then either from the log file or from installation manual find the appropriate configuration to meet the prerequisites and fix it manually.

$ /u01/app/grid/crs1840/gridSetup.sh -silent -responseFile /u01/app/grid/crs1840/inventory/response/CERNDB_Grid_Config.rsp ORACLE_HOME_NAME=crs1840

Launching Oracle Grid Infrastructure Setup Wizard...

[FATAL] [INS-13013] Target environment does not meet some mandatory requirements.

CAUSE: Some of the mandatory prerequisites are not met. See logs for details. /tmp/GridSetupActions2018-11-13_12-40-03PM/gridSetupActions2018-11-13_12-40-03PM.log

ACTION: Identify the list of failed prerequisite checks from the log: /tmp/GridSetupActions2018-11-13_12-40-03PM/gridSetupActions2018-11-13_12-40-03PM.log. Then either from the log file or from installation manual find the appropriate configuration to meet the prerequisites and fix it manually.

Looking at the logs (which I do not have now as I removed them as part of the failed install cleanup 🙁 ), the error is generated by the cluster verification utility (CVU) on this check:

Verifying Port Availability for component "Oracle Remote Method Invocation (ORMI)"

1	Verifying Port Availability for component "Oracle Remote Method Invocation (ORMI)"

The components verified by the CVU can be found inside $ORACLE_HOME/cv/cvdata/. In my case, precisely:

$ grep -i ORMI $ORACLE_HOME/cv/cvdata/18/crsinst_prereq.xml
         <PORT NAME="Oracle Remote Method Invocation (ORMI)" VALUE="23791" PROTOCOL="TCP" NETWORK_TYPE="PUBLIC"/>
         <PORT NAME="Oracle Remote Method Invocation (ORMI)" VALUE="23792" PROTOCOL="TCP" NETWORK_TYPE="PUBLIC"/>

$ grep -i ORMI $ORACLE_HOME/cv/cvdata/18/crsinst_prereq.xml

This check is critical, so the install fails.

In my case the port was used by mcollectived.

[root@server1 work]# netstat -anp | grep 23791

[root@server1 work]# netstat -anp | grep 23792
tcp 0 0 x.x.x.x:23792 x.x.x.x:61613 ESTABLISHED 2298/ruby

[root@server1 work]# ps -eaf | grep 2298
root 2298 1 0 11:16 ? 00:00:02 /opt/puppetlabs/puppet/bin/ruby /opt/puppetlabs/puppet/bin/mcollectived --config=/etc/puppetlabs/mcollective/server.cfg --pidfile=/var/run/puppetlabs/mcollective.pid --daemonize
root 47116 4114 0 12:50 pts/0 00:00:00 grep --color=auto 2298

[root@server1 work]# netstat -anp | grep 23791

[root@server1 work]# netstat -anp | grep 23792

tcp 0 0 x.x.x.x:23792 x.x.x.x:61613 ESTABLISHED 2298/ruby

[root@server1 work]# ps -eaf | grep 2298

root 2298 1 0 11:16 ? 00:00:02 /opt/puppetlabs/puppet/bin/ruby /opt/puppetlabs/puppet/bin/mcollectived --config=/etc/puppetlabs/mcollective/server.cfg --pidfile=/var/run/puppetlabs/mcollective.pid --daemonize

root 47116 4114 0 12:50 pts/0 00:00:00 grep --color=auto 2298

The port has been taken dynamically, and previous runs of CVU did not encounter the problem.

A rare port conflict that might happen when configuring GI 🙂

—

Ludo

Grid Infrastructure 18c: changes in gridSetup.sh -applyRU and -createGoldImage

Posted on November 6, 2018 by Ludovico

Starting with release 12cR2, Grid Infrastructure binaries are no more shipped as an installer, but as a zip file that is uncompressed directly in the Oracle Home path.
This opened a few new possibilities including patching the software before the Grid Infrastructure configuration.
My former colleague Markus Flechtner wrote an excellent blog post about it, here: https://www.markusdba.net/?p=294

Now, with 18c, there are a couple of things that changed comparing to Markus blog.

The -applyRU switch replaces the -applyPSU

While it is possible to apply several sub-patches of a PSU one by one:

./gridSetup.sh -silent -applyOneOffs <path to sub-patch>
e.g.

./gridSetup.sh -silent -applyOneOffs /work/p28659165_180000_Linux-x86-64/28659165/28547619
./gridSetup.sh -silent -applyOneOffs /work/p28659165_180000_Linux-x86-64/28659165/28655784
./gridSetup.sh -silent -applyOneOffs /work/p28659165_180000_Linux-x86-64/28659165/28655916
...

./gridSetup.sh -silent -applyOneOffs <path to sub-patch>

e.g.

./gridSetup.sh -silent -applyOneOffs /work/p28659165_180000_Linux-x86-64/28659165/28547619

./gridSetup.sh -silent -applyOneOffs /work/p28659165_180000_Linux-x86-64/28659165/28655784

./gridSetup.sh -silent -applyOneOffs /work/p28659165_180000_Linux-x86-64/28659165/28655916

...

it was possible to do all at once with:

./gridSetup.sh -silent -applyPSU <path to PSU>

1	./gridSetup.sh -silent -applyPSU <path to PSU>

Now the switch is called, for consistency with the patch naming, -applyRU.

E.g.:

# [ oracle@server:/u01/app/grid/crs1840 [16:38:40] [18.4.0.0.0 [GRID] SID=GRID] 255 ] #
$ ./gridSetup.sh -silent -applyRU /u01/app/oracle/stage/p28659165_180000_Linux-x86-64/28659165
Preparing the home to patch...
Applying the patch  /u01/app/oracle/stage/p28659165_180000_Linux-x86-64/28659165...
Successfully applied the patch.
The log can be found at: /u01/app/oraInventory/logs/GridSetupActions2018-11-02_04-39-54PM/installerPatchActions_2018-11-02_04-39-54PM.log
Launching Oracle Grid Infrastructure Setup Wizard...

[FATAL] [INS-40426] Grid installation option has not been specified.
   ACTION: Specify the valid installation option.

# [ oracle@server:/u01/app/grid/crs1840 [16:38:40] [18.4.0.0.0 [GRID] SID=GRID] 255 ] #

$ ./gridSetup.sh -silent -applyRU /u01/app/oracle/stage/p28659165_180000_Linux-x86-64/28659165

Preparing the home to patch...

Applying the patch /u01/app/oracle/stage/p28659165_180000_Linux-x86-64/28659165...

Successfully applied the patch.

The log can be found at: /u01/app/oraInventory/logs/GridSetupActions2018-11-02_04-39-54PM/installerPatchActions_2018-11-02_04-39-54PM.log

Launching Oracle Grid Infrastructure Setup Wizard...

[FATAL] [INS-40426] Grid installation option has not been specified.

ACTION: Specify the valid installation option.

Still there are no options to avoid the run of the Setup Wizard, but it is safe to ignore the error as the patch has been applied successfully.

The -createGoldImage does not work anymore if the Home is not attached

I have tried to create the golden image as per Markus post, but I get this error:

# [ oracle@server:/u01/app/grid/crs1840 [09:43:39] [18.4.0.0.0 [GRID] SID=GRID] 0 ] #
$ ./gridSetup.sh -createGoldImage -destinationlocation  /u01/app/oracle/stage/golden_images/crs1840 -silent
Launching Oracle Grid Infrastructure Setup Wizard...

[FATAL] [INS-32715] The source home (/u01/app/grid/crs1840) is not registered in the central inventory.
   ACTION: Ensure that the source home is registered in the central inventory.

# [ oracle@server:/u01/app/grid/crs1840 [09:43:39] [18.4.0.0.0 [GRID] SID=GRID] 0 ] #

$ ./gridSetup.sh -createGoldImage -destinationlocation /u01/app/oracle/stage/golden_images/crs1840 -silent

Launching Oracle Grid Infrastructure Setup Wizard...

[FATAL] [INS-32715] The source home (/u01/app/grid/crs1840) is not registered in the central inventory.

ACTION: Ensure that the source home is registered in the central inventory.

To workaround the issue, there are two ways:

Create a zip file manually, as all the content needed to install the patched version is right there. No need to touch anything as the software is not configured yet.

Configure the software with CRS_SWONLY before creating the gold image:

$ cat grid1840_swonly.rsp
oracle.install.responseFileVersion=/oracle/install/rspfmt_crsinstall_response_schema_v18.0.0
INVENTORY_LOCATION=/u01/app/oraInventory
oracle.install.option=CRS_SWONLY
ORACLE_BASE=/u01/app/oracle
oracle.install.asm.OSDBA=dba
oracle.install.asm.OSASM=asmdba
oracle.install.crs.config.scanType=LOCAL_SCAN
oracle.install.crs.config.gpnp.configureGNS=false
oracle.install.crs.config.autoConfigureClusterNodeVIP=false
oracle.install.crs.config.gpnp.gnsOption=CREATE_NEW_GNS
oracle.install.crs.config.clusterNodes=server1,server2
oracle.install.asm.configureGIMRDataDG=false
oracle.install.crs.config.useIPMI=false
oracle.install.asm.storageOption=ASM
oracle.install.asmOnNAS.configureGIMRDataDG=false
oracle.install.asm.diskGroup.name=OCRVOT
oracle.install.asm.diskGroup.AUSize=1
oracle.install.asm.gimrDG.AUSize=1
oracle.install.asm.configureAFD=false
oracle.install.crs.configureRHPS=false
oracle.install.crs.config.ignoreDownNodes=false
oracle.install.config.managementOption=NONE
oracle.install.config.omsPort=0
oracle.install.crs.rootconfig.executeRootScript=false

$ ./gridSetup.sh -silent -responseFile grid1840_swonly.rsp ORACLE_HOME_NAME=crs1840
Launching Oracle Grid Infrastructure Setup Wizard...

The response file for this session can be found at:
 /u01/app/grid/crs1840/install/response/grid_2018-11-05_01-18-28PM.rsp

You can find the log of this install session at:
 /u01/app/oraInventory/logs/GridSetupActions2018-11-05_01-18-28PM/gridSetupActions2018-11-05_01-18-28PM.log

As a root user, execute the following script(s):
        1. /u01/app/grid/crs1840/root.sh

Execute /u01/app/grid/crs1840/root.sh on the following nodes:
[server1, server2]

[root@server1 dbs01]# /u01/app/grid/crs1840/root.sh
Check /u01/app/grid/crs1840/install/root_server1.cern.ch_2018-11-05_14-13-58-835084539.log for the output of root script

[root@server2 dbs01]# /u01/app/grid/crs1840/root.sh
Check /u01/app/grid/crs1840/install/root_server2.cern.ch_2018-11-05_14-15-18-835087641.log for the output of root script

$ ./gridSetup.sh -createGoldImage -destinationlocation  /u01/app/oracle/stage/golden_images/crs1840 -silent
Launching Oracle Grid Infrastructure Setup Wizard...

Successfully Setup Software.
Gold Image location: /u01/app/oracle/stage/golden_images/crs1840/grid_home_2018-11-05_02-25-52PM.zip

$ cat grid1840_swonly.rsp

oracle.install.responseFileVersion=/oracle/install/rspfmt_crsinstall_response_schema_v18.0.0

INVENTORY_LOCATION=/u01/app/oraInventory

oracle.install.option=CRS_SWONLY

ORACLE_BASE=/u01/app/oracle

oracle.install.asm.OSDBA=dba

oracle.install.asm.OSASM=asmdba

oracle.install.crs.config.scanType=LOCAL_SCAN

oracle.install.crs.config.gpnp.configureGNS=false

oracle.install.crs.config.autoConfigureClusterNodeVIP=false

oracle.install.crs.config.gpnp.gnsOption=CREATE_NEW_GNS

oracle.install.crs.config.clusterNodes=server1,server2

oracle.install.asm.configureGIMRDataDG=false

oracle.install.crs.config.useIPMI=false

oracle.install.asm.storageOption=ASM

oracle.install.asmOnNAS.configureGIMRDataDG=false

oracle.install.asm.diskGroup.name=OCRVOT

oracle.install.asm.diskGroup.AUSize=1

oracle.install.asm.gimrDG.AUSize=1

oracle.install.asm.configureAFD=false

oracle.install.crs.configureRHPS=false

oracle.install.crs.config.ignoreDownNodes=false

oracle.install.config.managementOption=NONE

oracle.install.config.omsPort=0

oracle.install.crs.rootconfig.executeRootScript=false

$ ./gridSetup.sh -silent -responseFile grid1840_swonly.rsp ORACLE_HOME_NAME=crs1840

Launching Oracle Grid Infrastructure Setup Wizard...

The response file for this session can be found at:

/u01/app/grid/crs1840/install/response/grid_2018-11-05_01-18-28PM.rsp

You can find the log of this install session at:

/u01/app/oraInventory/logs/GridSetupActions2018-11-05_01-18-28PM/gridSetupActions2018-11-05_01-18-28PM.log

As a root user, execute the following script(s):

1. /u01/app/grid/crs1840/root.sh

Execute /u01/app/grid/crs1840/root.sh on the following nodes:

[server1, server2]

[root@server1 dbs01]# /u01/app/grid/crs1840/root.sh

Check /u01/app/grid/crs1840/install/root_server1.cern.ch_2018-11-05_14-13-58-835084539.log for the output of root script

[root@server2 dbs01]# /u01/app/grid/crs1840/root.sh

Check /u01/app/grid/crs1840/install/root_server2.cern.ch_2018-11-05_14-15-18-835087641.log for the output of root script

$ ./gridSetup.sh -createGoldImage -destinationlocation /u01/app/oracle/stage/golden_images/crs1840 -silent

Launching Oracle Grid Infrastructure Setup Wizard...

Successfully Setup Software.

Gold Image location: /u01/app/oracle/stage/golden_images/crs1840/grid_home_2018-11-05_02-25-52PM.zip

HTH

—

Ludo