DBA survival BLOG

DBA stuff and Oracle Data Guard

Install and configure CMAN 19c in the Oracle Cloud, step by step

Posted on July 12, 2019 by Ludovico

Installing and configuring CMAN is a trivial activity, but having the steps in one place is better than reinventing the wheel.

Prepare for the install

Download the Oracle Client 19.3.0.0 in the Oracle Database 19c download page.

Choose this one: LINUX.X64_193000_client.zip (64-bit) (1,134,912,540 bytes) , not the one named “LINUX.X64_193000_client_home.zip” because it is a preinstalled home that does not contain the CMAN tools.

Access the OCI Console and create a new Compute instance. The default configuration is OK, just make sure that it is Oracle Linux 7 🙂

Do not forget to add your SSH Public Key to access the VM via SSH!

Access the VM using

ssh opc@{public_ip}

1	ssh opc@{public_ip}

Copy the Oracle Client zip in /tmp using your favorite scp program.

Install CMAN

Follow these steps to install CMAN:

# become root
sudo su - root

# install some prereqs (packages, oracle user, kernel params, etc.):
yum install oracle-database-preinstall-19c.x86_64

# prepare the base directory:
mkdir /u01
chown oracle:oinstall /u01

# become oracle
su - oracle

# prepare the Oracle Home dir
mkdir -p /u01/app/oracle/product/cman1930

# unzip the Client install binaries
mkdir -p $HOME/stage
cd $HOME/stage
unzip /tmp/LINUX.X64_193000_client.zip

# prepare the response file:
cat <<EOF > $HOME/cman.rsp
oracle.install.responseFileVersion=/oracle/install/rspfmt_clientinstall_response_schema_v19.0.0
ORACLE_HOSTNAME=$(hostname)
UNIX_GROUP_NAME=oinstall
INVENTORY_LOCATION=/u01/app/oraInventory
SELECTED_LANGUAGES=en
ORACLE_HOME=/u01/app/oracle/product/cman1930
ORACLE_BASE=/u01/app/oracle
oracle.install.client.installType=Custom
oracle.install.client.customComponents="oracle.sqlplus:19.0.0.0.0","oracle.network.client:19.0.0.0.0","oracle.network.cman:19.0.0.0.0","oracle.network.listener:19.0.0.0.0"
EOF

# install!
$HOME/stage/client/runInstaller -silent -responseFile $HOME/cman.rsp  ORACLE_HOME_NAME=cman1930

# back as root:
exit

# finish the install
/u01/app/oraInventory/orainstRoot.sh
/u01/app/oracle/product/cman1930/root.sh

# become root

sudo su - root

# install some prereqs (packages, oracle user, kernel params, etc.):

yum install oracle-database-preinstall-19c.x86_64

# prepare the base directory:

mkdir /u01

chown oracle:oinstall /u01

# become oracle

su - oracle

# prepare the Oracle Home dir

mkdir -p /u01/app/oracle/product/cman1930

# unzip the Client install binaries

mkdir -p $HOME/stage

cd $HOME/stage

unzip /tmp/LINUX.X64_193000_client.zip

# prepare the response file:

cat <<EOF > $HOME/cman.rsp

oracle.install.responseFileVersion=/oracle/install/rspfmt_clientinstall_response_schema_v19.0.0

ORACLE_HOSTNAME=$(hostname)

UNIX_GROUP_NAME=oinstall

INVENTORY_LOCATION=/u01/app/oraInventory

SELECTED_LANGUAGES=en

ORACLE_HOME=/u01/app/oracle/product/cman1930

ORACLE_BASE=/u01/app/oracle

oracle.install.client.installType=Custom

oracle.install.client.customComponents="oracle.sqlplus:19.0.0.0.0","oracle.network.client:19.0.0.0.0","oracle.network.cman:19.0.0.0.0","oracle.network.listener:19.0.0.0.0"

EOF

# install!

$HOME/stage/client/runInstaller -silent -responseFile $HOME/cman.rsp ORACLE_HOME_NAME=cman1930

# back as root:

exit

# finish the install

/u01/app/oraInventory/orainstRoot.sh

/u01/app/oracle/product/cman1930/root.sh

Basic configuration

# as oracle:

mkdir -p /u01/app/oracle/network/admin
export TNS_ADMIN=/u01/app/oracle/network/admin

cat <<EOF > $TNS_ADMIN/cman-test.ora
cman-test = (configuration=
  (address=(protocol=tcp)(host=$(hostname))(port=1521))
  (parameter_list =
    (log_level=ADMIN)
    (max_connections=1024)
    (idle_timeout=0)
    (registration_invited_nodes = *)
    (inbound_connect_timeout=0)
    (session_timeout=0)
    (outbound_connect_timeout=0)
    (max_gateway_processes=16)
    (min_gateway_processes=2)
    (remote_admin=on)
    (trace_level=off)
    (max_cmctl_sessions=4)
    (event_group=init_and_term,memory_ops)
  )
  (rule_list=
    (rule=
       (src=*)(dst=*)(srv=*)(act=accept)
       (action_list=(aut=off)(moct=0)(mct=0)(mit=0)(conn_stats=on))
  ) )
)
EOF

echo "IFILE=${TNS_ADMIN}/cman-test.ora" >> $TNS_ADMIN/cman.ora

# as oracle:

mkdir -p /u01/app/oracle/network/admin

export TNS_ADMIN=/u01/app/oracle/network/admin

cat <<EOF > $TNS_ADMIN/cman-test.ora

cman-test = (configuration=

(address=(protocol=tcp)(host=$(hostname))(port=1521))

(parameter_list =

(log_level=ADMIN)

(max_connections=1024)

(idle_timeout=0)

(registration_invited_nodes = *)

(inbound_connect_timeout=0)

(session_timeout=0)

(outbound_connect_timeout=0)

(max_gateway_processes=16)

(min_gateway_processes=2)

(remote_admin=on)

(trace_level=off)

(max_cmctl_sessions=4)

(event_group=init_and_term,memory_ops)

)

(rule_list=

(rule=

(src=*)(dst=*)(srv=*)(act=accept)

(action_list=(aut=off)(moct=0)(mct=0)(mit=0)(conn_stats=on))

) )

)

EOF

echo "IFILE=${TNS_ADMIN}/cman-test.ora" >> $TNS_ADMIN/cman.ora

This will create a CMAN configuration named cman-test. Beware that it is very basic and insecure. Please read the CMAN documentation if you want something more secure or sophisticated.

The advantage of having the TNS_ADMIN outside the Oracle Home is that if you need to patch CMAN, you can do it out-of-place without the need to copy the configuration files somewhere else.

The advantage of using IFILE inside cman.ora, is that you can manage easily different CMAN configurations in the same host without editing directly cman.ora, with the risk of messing it up.

Preparing the start/stop script

Create a file /u01/app/oracle/scripts/cman_service.sh with this content:

#!/bin/bash -l

LOCAL_PARSE_OPTIONS="a:c:o:"

Usage () {
        cat <<EOF

        Purpose   : Start/stop a CMAN configuration

        Usage: `basename $0` -a {start|stop|reload|restart|status} -c <config_name> -o <oracle_home>

        Options:
                -a action           One in start|stop|reload|restart|status
                -c config_name      Name of the cman instance (e.g. ais-prod, gen-prod, etc.)
                -o oracle_home      The ORACLE_HOME path that must be used for the operation (e.g. cman1930)
EOF
}


CENTRAL_CONFIG_DIR=/ORA/dbs01/oracle/network/admin

while getopts ":${LOCAL_PARSE_OPTIONS}" opt ; do
        case $opt in
                a)
                        L_Action=$OPTARG
                        ;;
                c)
                        L_Config=$OPTARG
                        ;;
                o)
                        L_OH=$OPTARG
                        ;;
                \?)
                        eerror "Invalid option: -$OPTARG"
                        exit 1
                        ;;
                :)
                        eerror "Option -$OPTARG requires an argument."
                        exit 1
                        ;;
        esac
done

if [ ! $L_Config ] ; then
    Usage
        eerror "Please specify a configuration name with -c. Possible values are: "
        ls -1 $CENTRAL_CONFIG_DIR | sed -e "s/\.ora//" | grep -v cman
        exit 1
fi


## if the install step was OK, we should have a valid OH installed with this name:
export ORACLE_HOME=$L_OH
if [ ! -f $ORACLE_HOME/bin/cmctl ] ; then
        Usage
        echo "Please set a valid ORACLE_HOME name with -o."
        exit 1
fi


export TNS_ADMIN=$CENTRAL_CONFIG_DIR
case $L_Action in
        start)
                $OH/bin/cmctl startup -c $L_Config
                ;;
        stop)
                $OH/bin/cmctl shutdown -c $L_Config
                ;;
        reload)
                $OH/bin/cmctl reload -c $L_Config
                ;;
        restart)
                $OH/bin/cmctl shutdown -c $L_Config
                sleep 1
                $OH/bin/cmctl startup -c $L_Config
                ;;
        status)
                $OH/bin/cmctl show status -c $L_Config
                # do it again for the exit code
                $OH/bin/cmctl show status -c $L_Config | grep "The command completed successfully." >/dev/null
                ;;
        *)
                echo "Invalid action"
                exit 1
                ;;
esac

#!/bin/bash -l

LOCAL_PARSE_OPTIONS="a:c:o:"

Usage () {

cat <<EOF

Purpose : Start/stop a CMAN configuration

Usage: `basename $0` -a {start|stop|reload|restart|status} -c <config_name> -o <oracle_home>

Options:

-a action One in start|stop|reload|restart|status

-c config_name Name of the cman instance (e.g. ais-prod, gen-prod, etc.)

-o oracle_home The ORACLE_HOME path that must be used for the operation (e.g. cman1930)

EOF

}

CENTRAL_CONFIG_DIR=/ORA/dbs01/oracle/network/admin

while getopts ":${LOCAL_PARSE_OPTIONS}" opt ; do

case $opt in

L_Action=$OPTARG

;;

L_Config=$OPTARG

;;

L_OH=$OPTARG

;;

\?)

eerror "Invalid option: -$OPTARG"

exit 1

;;

eerror "Option -$OPTARG requires an argument."

exit 1

;;

esac

done

if [ ! $L_Config ] ; then

Usage

eerror "Please specify a configuration name with -c. Possible values are: "

ls -1 $CENTRAL_CONFIG_DIR | sed -e "s/\.ora//" | grep -v cman

exit 1

## if the install step was OK, we should have a valid OH installed with this name:

export ORACLE_HOME=$L_OH

if [ ! -f $ORACLE_HOME/bin/cmctl ] ; then

Usage

echo "Please set a valid ORACLE_HOME name with -o."

exit 1

export TNS_ADMIN=$CENTRAL_CONFIG_DIR

case $L_Action in

start)

$OH/bin/cmctl startup -c $L_Config

;;

stop)

$OH/bin/cmctl shutdown -c $L_Config

;;

reload)

$OH/bin/cmctl reload -c $L_Config

;;

restart)

$OH/bin/cmctl shutdown -c $L_Config

sleep 1

$OH/bin/cmctl startup -c $L_Config

;;

status)

$OH/bin/cmctl show status -c $L_Config

# do it again for the exit code

$OH/bin/cmctl show status -c $L_Config | grep "The command completed successfully." >/dev/null

;;

echo "Invalid action"

exit 1

;;

esac

This is at the same time ORACLE_HOME agnostic and configuration agnostic.

Make it executable:

chmod +x /u01/app/oracle/scripts/cman_service.sh

1	chmod +x /u01/app/oracle/scripts/cman_service.sh

and try to start CMAN:

$ /u01/app/oracle/scripts/cman_service.sh -o /u01/app/oracle/product/cman1930 -c cman-test -a start
VERSION         = 19.3.0.0.0
ORACLE_HOME     = /u01/app/oracle/product/cman1930
VERSION         = 19.3.0.0.0
ORACLE_HOME     = /u01/app/oracle/product/cman1930

CMCTL for Linux: Version 19.0.0.0.0 - Production on 12-JUL-2019 09:23:50

Copyright (c) 1996, 2019, Oracle.  All rights reserved.

Current instance cman-test is not yet started
Connecting to (DESCRIPTION=(address=(protocol=tcp)(host=ocf-cman-1)(port=1521)))
Starting Oracle Connection Manager instance cman-test. Please wait...
CMAN for Linux: Version 19.0.0.0.0 - Production
Status of the Instance
----------------------
Instance name             cman-test
Version                   CMAN for Linux: Version 19.0.0.0.0 - Production
Start date                12-JUL-2019 09:23:50
Uptime                    0 days 0 hr. 0 min. 9 sec
Num of gateways started   2
Average Load level        0
Log Level                 ADMIN
Trace Level               OFF
Instance Config file      /u01/app/oracle/product/cman1930/network/admin/cman.ora
Instance Log directory    /u01/app/oracle/diag/netcman/ocf-cman-1/cman-test/alert
Instance Trace directory  /u01/app/oracle/diag/netcman/ocf-cman-1/cman-test/trace
The command completed successfully.

$ /u01/app/oracle/scripts/cman_service.sh -o /u01/app/oracle/product/cman1930 -c cman-test -a start

VERSION = 19.3.0.0.0

ORACLE_HOME = /u01/app/oracle/product/cman1930

VERSION = 19.3.0.0.0

ORACLE_HOME = /u01/app/oracle/product/cman1930

CMCTL for Linux: Version 19.0.0.0.0 - Production on 12-JUL-2019 09:23:50

Current instance cman-test is not yet started

Connecting to (DESCRIPTION=(address=(protocol=tcp)(host=ocf-cman-1)(port=1521)))

Starting Oracle Connection Manager instance cman-test. Please wait...

CMAN for Linux: Version 19.0.0.0.0 - Production

Status of the Instance

----------------------

Instance name cman-test

Version CMAN for Linux: Version 19.0.0.0.0 - Production

Start date 12-JUL-2019 09:23:50

Uptime 0 days 0 hr. 0 min. 9 sec

Num of gateways started 2

Average Load level 0

Log Level ADMIN

Trace Level OFF

Instance Config file /u01/app/oracle/product/cman1930/network/admin/cman.ora

Instance Log directory /u01/app/oracle/diag/netcman/ocf-cman-1/cman-test/alert

Instance Trace directory /u01/app/oracle/diag/netcman/ocf-cman-1/cman-test/trace

The command completed successfully.

Stop should work as well:

$ /u01/app/oracle/scripts/cman_service.sh -o /u01/app/oracle/product/cman1930 -c cman-test -a stop
VERSION         = 19.3.0.0.0
ORACLE_HOME     = /u01/app/oracle/product/cman1930
VERSION         = 19.3.0.0.0
ORACLE_HOME     = /u01/app/oracle/product/cman1930

CMCTL for Linux: Version 19.0.0.0.0 - Production on 12-JUL-2019 09:28:34

Copyright (c) 1996, 2019, Oracle.  All rights reserved.

Current instance cman-test is already started
Connecting to (DESCRIPTION=(address=(protocol=tcp)(host=ocf-cman-1)(port=1521)))
The command completed successfully.

$ /u01/app/oracle/scripts/cman_service.sh -o /u01/app/oracle/product/cman1930 -c cman-test -a stop

VERSION = 19.3.0.0.0

ORACLE_HOME = /u01/app/oracle/product/cman1930

VERSION = 19.3.0.0.0

ORACLE_HOME = /u01/app/oracle/product/cman1930

CMCTL for Linux: Version 19.0.0.0.0 - Production on 12-JUL-2019 09:28:34

Current instance cman-test is already started

Connecting to (DESCRIPTION=(address=(protocol=tcp)(host=ocf-cman-1)(port=1521)))

The command completed successfully.

Add the service in systemctl

# as root:

cat <<EOF > /etc/systemd/system/cman-test.service
[Unit]
Description=CMAN Service for cman-test
Documentation=http://www.ludovicocaldara.net/dba/cman-oci-install
After=network-online.target

[Service]
User=oracle
Group=oinstall
LimitNOFILE=10240
MemoryLimit=8G
RestartSec=30s
StartLimitInterval=1800s
StartLimitBurst=20
ExecStart=/u01/app/oracle/scripts/cman_service.sh -c cman-test -a start -o /u01/app/oracle/product/cman1930
ExecReload=/u01/app/oracle/scripts/cman_service.sh -c cman-test -a reload -o /u01/app/oracle/product/cman1930
ExecStop=/u01/app/oracle/scripts/cman_service.sh -c cman-test -a stop -o /u01/app/oracle/product/cman1930
KillMode=control-group
Restart=on-failure
Type=forking

[Install]
WantedBy=multi-user.target
Alias=service-cman-test.service
EOF

/usr/bin/systemctl enable cman-test.service

# start
/usr/bin/systemctl start cman-test
# stop
/usr/bin/systemctl stop cman-test

# as root:

cat <<EOF > /etc/systemd/system/cman-test.service

[Unit]

Description=CMAN Service for cman-test

Documentation=http://www.ludovicocaldara.net/dba/cman-oci-install

After=network-online.target

[Service]

User=oracle

Group=oinstall

LimitNOFILE=10240

MemoryLimit=8G

RestartSec=30s

StartLimitInterval=1800s

StartLimitBurst=20

ExecStart=/u01/app/oracle/scripts/cman_service.sh -c cman-test -a start -o /u01/app/oracle/product/cman1930

ExecReload=/u01/app/oracle/scripts/cman_service.sh -c cman-test -a reload -o /u01/app/oracle/product/cman1930

ExecStop=/u01/app/oracle/scripts/cman_service.sh -c cman-test -a stop -o /u01/app/oracle/product/cman1930

KillMode=control-group

Restart=on-failure

Type=forking

[Install]

WantedBy=multi-user.target

Alias=service-cman-test.service

EOF

/usr/bin/systemctl enable cman-test.service

# start

/usr/bin/systemctl start cman-test

# stop

/usr/bin/systemctl stop cman-test

Open firewall ports

By default, new OL7 images use firewalld. Just open the port 1521 from the public zone:

# as root:
firewall-cmd --permanent --add-port=1521/tcp
firewall-cmd --reload

# as root:

firewall-cmd --permanent --add-port=1521/tcp

firewall-cmd --reload

Bonus: have a smart environment!

# as root:
yum install -y git rlwrap

# Connect as oracle
sudo su - oracle

# Clone this repository
git clone https://github.com/ludovicocaldara/COE.git

# Enable the profile scripts
echo ". ~/COE/profile.sh" >> $HOME/.bash_profile

# set the cman1930 home by default:
echo "setoh cman1930" >> $HOME/.bash_profile
echo "export TNS_ADMIN=/u01/app/oracle/network/admin" >> $HOME/.bash_profile

# Load the new profile
. ~/.bash_profile

# as root:

yum install -y git rlwrap

# Connect as oracle

sudo su - oracle

# Clone this repository

git clone https://github.com/ludovicocaldara/COE.git

# Enable the profile scripts

echo ". ~/COE/profile.sh" >> $HOME/.bash_profile

# set the cman1930 home by default:

echo "setoh cman1930" >> $HOME/.bash_profile

echo "export TNS_ADMIN=/u01/app/oracle/network/admin" >> $HOME/.bash_profile

# Load the new profile

. ~/.bash_profile

[root@ocf-cman-1 tmp]# su - oracle
Last login: Fri Jul 12 09:49:09 GMT 2019 on pts/0
VERSION         = 19.3.0.0.0
ORACLE_HOME     = /u01/app/oracle/product/cman1930

# [ oracle@ocf-cman-1:/home/oracle [09:49:54] [19.3.0.0.0 [CLIENT] SID="not set"] 0 ] #
# # ahhh, that;s satisfying

[root@ocf-cman-1 tmp]# su - oracle

Last login: Fri Jul 12 09:49:09 GMT 2019 on pts/0

VERSION = 19.3.0.0.0

ORACLE_HOME = /u01/app/oracle/product/cman1930

# [ oracle@ocf-cman-1:/home/oracle [09:49:54] [19.3.0.0.0 [CLIENT] SID="not set"] 0 ] #

# # ahhh, that;s satisfying

—

Ludo

FPP local-mode: Steps to remove/add node from a cluster if RHP fails to move gihome

Posted on July 9, 2019 by Ludovico

I am getting more and more experience with patching clusters with the local-mode automaton. The whole process would be very complex, but the local-mode automaton makes it really easy.

I have had nevertheless a couple of clusters where the process did not work:

#1: The very first cluster that I installed in 18c

This cluster has “kind of failed” patching the first node. Actually, the rhpctl command exited with an error:

$ rhpctl move gihome -sourcehome /u01/crs/crs1830 -desthome /u01/crs/crs1860 -node server1
server1.cern.ch: Audit ID: 2
server1.cern.ch: verifying versions of Oracle homes ...
server1.cern.ch: verifying owners of Oracle homes ...
server1.cern.ch: verifying groups of Oracle homes ...
server1.cern.ch: starting to move the Oracle Grid Infrastructure home from "/u01/crs/crs1830" to "/u01/crs/crs1860" on server cluster "AISTEST-RAC16"
[...]
2019/07/08 09:45:06 CLSRSC-329: Replacing Clusterware entries in file 'oracle-ohasd.service'
PRCG-1239 : failed to close a proxy connection
Connection refused to host: server1.cern.ch; nested exception is:
        java.net.ConnectException: Connection refused (Connection refused)
PRCG-1079 : Internal error: ClientFactoryImpl-submitAction-error1
PROC-32: Cluster Ready Services on the local node is not running Messaging error [gipcretConnectionRefused] [29]

$ rhpctl move gihome -sourcehome /u01/crs/crs1830 -desthome /u01/crs/crs1860 -node server1

server1.cern.ch: Audit ID: 2

server1.cern.ch: verifying versions of Oracle homes ...

server1.cern.ch: verifying owners of Oracle homes ...

server1.cern.ch: verifying groups of Oracle homes ...

server1.cern.ch: starting to move the Oracle Grid Infrastructure home from "/u01/crs/crs1830" to "/u01/crs/crs1860" on server cluster "AISTEST-RAC16"

[...]

2019/07/08 09:45:06 CLSRSC-329: Replacing Clusterware entries in file 'oracle-ohasd.service'

PRCG-1239 : failed to close a proxy connection

Connection refused to host: server1.cern.ch; nested exception is:

java.net.ConnectException: Connection refused (Connection refused)

PRCG-1079 : Internal error: ClientFactoryImpl-submitAction-error1

PROC-32: Cluster Ready Services on the local node is not running Messaging error [gipcretConnectionRefused] [29]

But actually, the helper kept running and configured everything properly:

$ tail -f /ORA/dbs01/oracle/crsdata/server1/crsconfig/crs_postpatch_server1_2019-07-08_09-41-36AM.log
2019-07-08 09:55:25:
2019-07-08 09:55:25: Succeeded in writing the checkpoint:'ROOTCRS_POSTPATCH' with status:SUCCESS
2019-07-08 09:55:25: Executing cmd: /u01/crs/crs1860/bin/clsecho -p has -f clsrsc -m 672
2019-07-08 09:55:25: Executing cmd: /u01/crs/crs1860/bin/clsecho -p has -f clsrsc -m 672
2019-07-08 09:55:25: Command output:
>  CLSRSC-672: Post-patch steps for patching GI home successfully completed.
>End Command output
2019-07-08 09:55:25: CLSRSC-672: Post-patch steps for patching GI home successfully completed.

$ tail -f /ORA/dbs01/oracle/crsdata/server1/crsconfig/crs_postpatch_server1_2019-07-08_09-41-36AM.log

2019-07-08 09:55:25:

2019-07-08 09:55:25: Succeeded in writing the checkpoint:'ROOTCRS_POSTPATCH' with status:SUCCESS

2019-07-08 09:55:25: Executing cmd: /u01/crs/crs1860/bin/clsecho -p has -f clsrsc -m 672

2019-07-08 09:55:25: Command output:

> CLSRSC-672: Post-patch steps for patching GI home successfully completed.

>End Command output

2019-07-08 09:55:25: CLSRSC-672: Post-patch steps for patching GI home successfully completed.

The cluster was OK on the first node, with the correct patch level. The second node, however, was failing with:

$  rhpctl move gihome -sourcehome /u01/crs/crs1830 -desthome /u01/crs/crs1860 -node server2
server1.cern.ch: retrieving status of databases ...
server1.cern.ch: retrieving status of services of databases ...
PRCT-1011 : Failed to run "rhphelper". Detailed error: <HLP_EMSG>,RHPHELP_procCmdLine-05,</HLP_EMSG>,<HLP_VRES>3</HLP_VRES>,<HLP_IEEMSG>,PRCG-1079 : Internal error: RHPHELP122_main-01,</HLP_IEEMSG>,<HLP_ERES>1</HLP_ERES>

$ rhpctl move gihome -sourcehome /u01/crs/crs1830 -desthome /u01/crs/crs1860 -node server2

server1.cern.ch: retrieving status of databases ...

server1.cern.ch: retrieving status of services of databases ...

PRCT-1011 : Failed to run "rhphelper". Detailed error: <HLP_EMSG>,RHPHELP_procCmdLine-05,</HLP_EMSG>,<HLP_VRES>3</HLP_VRES>,<HLP_IEEMSG>,PRCG-1079 : Internal error: RHPHELP122_main-01,</HLP_IEEMSG>,<HLP_ERES>1</HLP_ERES>

I am not sure about the cause, but let’s assume it is irrelevant for the moment.

#2: A cluster with new GI home not properly linked with RAC

This was another funny case, where the first node patched successfully, but the second one failed upgrading in the middle of the process with a java NullPointer exception. We did a few bad tries of prePatch and postPatch to solve, but after that the second node of the cluster was in an inconsistent state: in ROLLING_UPGRADE mode and not possible to patch anymore.

Common solution: removing the node from the cluster and adding it back

In both cases we were in the following situation:

one node was successfully patched to 18.6
one node was not patched and was not possible to patch it anymore (at least without heavy interventions)

So, for me, the easiest solution has been removing the failing node and adding it back with the new patched version.

Steps to remove the node

Although the steps are described here: https://docs.oracle.com/en/database/oracle/oracle-database/18/cwadd/adding-and-deleting-cluster-nodes.html#GUID-8ADA9667-EC27-4EF9-9F34-C8F65A757F2A, there are a few differences that I will highlight:

Stop of the cluster:

(root)# crsctl stop crs

1	(root)# crsctl stop crs

The actual procedure to remove a node asks to deconfigure the databases and managed homes from the active cluster version. But as we manage our homes with golden images, we do not need this; we rather want to keep all the entries in the OCR so that when we add it back, everything is in place.

Once stopped the CRS, we have deinstalled the CRS home on the failing node:

(oracle)$ $OH/deinstall/deinstall -local

1	(oracle)$ $OH/deinstall/deinstall -local

This complained about the CRS that was down, but it continued and ask for this script to be executed:

/u01/crs/crs1830/crs/install/rootcrs.sh -force  -deconfig -paramfile "/tmp/deinstall2019-07-08_11-37-20AM/response/deinstall_1830.rsp"

1	/u01/crs/crs1830/crs/install/rootcrs.sh -force -deconfig -paramfile "/tmp/deinstall2019-07-08_11-37-20AM/response/deinstall_1830.rsp"

We’ve got errors also for this script, but the remove process was OK afterall.

Then, from the surviving node:

root # crsctl delete node -n server2
oracle $ srvctl stop vip -vip server2
root $ srvctl remove vip -vip server2

root # crsctl delete node -n server2

oracle $ srvctl stop vip -vip server2

root $ srvctl remove vip -vip server2

Adding the node back

From the surviving node, we ran gridSetup.sh and followed the steps to ad the node.

Wait before running root.sh.

In our case, we have originally installed the cluster starting with a SW_ONLY install. This type of installation keeps some leftovers in the configuration files that prevent the root.sh from configuring the cluster…we have had to modify rootconfig.sh:

check/modify /u01/crs/crs1860/crs/config/rootconfig.sh and change this:
# before:
# SW_ONLY=true
# after:
SW_ONLY=false

check/modify /u01/crs/crs1860/crs/config/rootconfig.sh and change this:

# before:

# SW_ONLY=true

# after:

SW_ONLY=false

then, after running root.sh and the config tools, everything was back as before removing the node form the cluster.

For one of the clusters , both nodes were at the same patch level, but the cluster was still in ROLLING_PATCH mode. So we have had to do a

(root) # crsctl stop rollingpatch

1	(root) # crsctl stop rollingpatch

—

Ludo

How to install and access Oracle Weblogic 12.2 in the Oracle Cloud Infrastructure

Posted on July 5, 2019 by Ludovico

I put here the steps required to install and access Weblogic in the OCI (mostly for me in case I need to do it again 😉 ). The assumptions are:

you already have an account for the Oracle Cloud Infrastructure and you can access the OCI console
you already have a Compartment with a VCN and a subnet configured (for test purposes, a VCN created with the default values will be just fine)
you already have a keypair for your SSH client (id_rsa, id_rsa.pub)
you have an X server on your laptop (if you have Windows, I recommend MobaXTerm, but Xming or other servers are just fine)

Create the compute instance

Menu -> Core Infrastructure -> Compute -> Instances -> Create Instance
Choose a name for the Instance, all the other fields defaults are fine for test (Oracle Linux 7.6, VM.Standard2.1, etc.)
Paste your SSH public key
Optionally, under advanced/network, specify a different name for the VM
Click on Create to complete the creation

At some point you will have an instance “Green” ready to access:

Click on it and get the public address:

Using your SSH keypair, you can now access the instance with:

$ ssh opc@{public_ip}

1	$ ssh opc@{public_ip}

Setup sshd for SSH tunneling and X11 forwarding

Edit as root the sshd_config:

$ sudo vi /etc/ssh/sshd_config

1	$ sudo vi /etc/ssh/sshd_config

Modify it so that the following lines are present with these values:

AllowTcpForwarding yes
PermitOpen any
X11Forwarding yes
X11DisplayOffset 10
X11UseLocalhost no

AllowTcpForwarding yes

PermitOpen any

X11Forwarding yes

X11DisplayOffset 10

X11UseLocalhost no

Those values are required for X11 forwarding (required for the graphical installation) and for SSH tunneling (required to access the Weblogic ports without exposing them over internet).

Then restart sshd:

$ sudo systemctl restart sshd

1	$ sudo systemctl restart sshd

Install the packages for X11

# sudo yum install xorg-x11-xauth.x86_64
# sudo yum install libXtst
# # optional, to test if X11 forwarding works
# sudo yum install xterm

# sudo yum install xorg-x11-xauth.x86_64

# sudo yum install libXtst

# # optional, to test if X11 forwarding works

# sudo yum install xterm

At this point, it should be possible to forward X11. You can test by reconnecting with:

$ ssh -XC opc@{public_ip}

1	$ ssh -XC opc@{public_ip}

and then:

$ xterm

$ xterm

Create the oracle user

$ sudo su - root
# groupadd -g 54321 oinstall
# useradd -u 54321 -g oinstall oracle
# passwd oracle

$ sudo su - root

# groupadd -g 54321 oinstall

# useradd -u 54321 -g oinstall oracle

# passwd oracle

At this point, you can reconnect using oracle directly, so X11 forward will work for the oracle user without any additional setup:

$ ssh -XC oracle@{public_ip}

1	$ ssh -XC oracle@{public_ip}

Follow the canonical steps to install weblogic

If you do not know how to do that, follow this good tutorial by Tim Hall (oracle-base):

Oracle WebLogic Server (WLS) 12cR2 (12.2.1) Installation on Oracle Linux 6 and 7

Access the Weblogic console from outside Oracle Cloud

If you configured correctly sshd, once the Oracle Weblogic instance is configured and started, you can tunnel to the port (it should be 7001):

$ ssh -L 7001:{vm_name}:7001 oracle@{public_ip}

1	$ ssh -L 7001:{vm_name}:7001 oracle@{public_ip}

And be able to browse from your laptop using localhost:7001:

HTH

—

Ludovico

Oracle SW_ONLY install leads to relink with rac_off at every attachHome

Posted on July 4, 2019 by Ludovico

OK, I really do not know what other title I should use for this post.

I have developed and presented a few times my personal approach to Oracle Home provisioning and patching. You can read more in this series.

With this approach:

I install the software (either GI or RDBMS) with the option SW_ONLY once
I patch it to the last version
I create a golden image that I evolve for the rest of the release lifecycle

When I need to install it, I just unzip the golden image and attach it to the Central Inventory.

I have discovered quite longtime ago that, every time I was attaching the home to the inventory, the binaries were relinked with rac_off, disregarding the fact that the home that I zipped actually had RAC enabled. This is quite annoying at my work at CERN, as all our databases are RAC.

So my solution to the problem is to detect if the server is on a cluster, and relink on the fly:

### EARLIER, IN THE ENVIRONMENT SCRIPTS
if [ -f /etc/oracle/olr.loc ] ; then
        export CRS_EXISTS=1
else
        export CRS_EXISTS=0
fi

### LATER, AFTER ATTACHING THE ORACLE_HOME:
pushd $ORACLE_HOME/rdbms/lib
if [ $CRS_EXISTS -eq 1 ] ; then
	make -f ins_rdbms.mk rac_on
else
	make -f ins_rdbms.mk rac_off
fi
make -f ins_rdbms.mk ioracle

### EARLIER, IN THE ENVIRONMENT SCRIPTS

if [ -f /etc/oracle/olr.loc ] ; then

export CRS_EXISTS=1

else

export CRS_EXISTS=0

### LATER, AFTER ATTACHING THE ORACLE_HOME:

pushd $ORACLE_HOME/rdbms/lib

if [ $CRS_EXISTS -eq 1 ] ; then

make -f ins_rdbms.mk rac_on

else

make -f ins_rdbms.mk rac_off

make -f ins_rdbms.mk ioracle

This is a simplified snippet of my actual code, but it gives the idea.

What causes the relink with rac_off?

I have discovered recently that the steps used by the runInstaller process to attach the Oracle Home are described in this file:

$ORACLE_HOME/inventory/make/makeorder.xml

1	$ORACLE_HOME/inventory/make/makeorder.xml

and in my case, for all my golden images, it contains:

<ohmd:MAKE
MAKEPATH="/usr/bin/make" FILENAME="rdbms/lib/ins_rdbms.mk" >
<ohmd:TARGET ACTIONTYPE="INSTALL" TARGETNAME="rac_off" >
<ohmd:INPUT_LIST>
<ohmd:INPUT VAL="ORACLE_HOME=%ORACLE_HOME%"/>
</ohmd:INPUT_LIST>
<ohmd:COMP_LIST>
<ohmd:COMP NAME="oracle.rdbms" VERSION="18.0.0.0.0"/>
</ohmd:COMP_LIST>
</ohmd:TARGET>
</ohmd:MAKE>

<ohmd:MAKE

MAKEPATH="/usr/bin/make" FILENAME="rdbms/lib/ins_rdbms.mk" >

<ohmd:TARGET ACTIONTYPE="INSTALL" TARGETNAME="rac_off" >

<ohmd:INPUT_LIST>

<ohmd:INPUT VAL="ORACLE_HOME=%ORACLE_HOME%"/>

</ohmd:INPUT_LIST>

<ohmd:COMP_LIST>

<ohmd:COMP NAME="oracle.rdbms" VERSION="18.0.0.0.0"/>

</ohmd:COMP_LIST>

</ohmd:TARGET>

</ohmd:MAKE>

So, it does not matter how I prepare my images: unless I change this file and put rac_on, the runInstaller keeps relinking with rac_off.

I have thought about changing the file, but then realized that I prefer to check and recompile at runtime, so I can reuse my images also for standalone servers (in case we need them).

Just to avoid surprises, it is convenient to check if a ORACLE_HOME is linked with RAC with this small function:

$ type isRACoh
isRACoh is a function
isRACoh ()
{
    OH2CHECK=${1:-$ORACLE_HOME};
    ar -t $OH2CHECK/rdbms/lib/libknlopt.a | grep --color=auto kcsm.o > /dev/null;
    if [ $? -eq 0 ]; then
        echo "Enabled";
    else
        echo "Disabled";
        false;
    fi
}

$ type isRACoh

isRACoh is a function

isRACoh ()

{

OH2CHECK=${1:-$ORACLE_HOME};

ar -t $OH2CHECK/rdbms/lib/libknlopt.a | grep --color=auto kcsm.o > /dev/null;

if [ $? -eq 0 ]; then

echo "Enabled";

else

echo "Disabled";

false;

}

This is true especially for Grid Infrastructure golden images, as they have the very same behavior of RDBMS homes, with the exception that they might break out-of-place patching if RAC is not enabled: the second ASM instance will not mount because the first will be exclusively mounted without the RAC option.

HTH.

—

Ludovico