DBA survival BLOG

DBA stuff and Oracle Data Guard

Moving Clusterware Interconnect from single NIC/Bond to HAIP

Posted on March 23, 2015 by Ludovico

Very recently I had to configure a customer’s RAC private interconnect from bonding to HAIP to get benefit of both NICs.

So I would like to recap here what the hypothetic steps would be if you need to do the same.

In this example I’ll switch from a single-NIC interconnect (eth1) rather than from a bond configuration, so if you are familiar with the RAC Attack! environment you can try to put everything in place on your own.

First, you need to plan the new network configuration in advance, keeping in mind that there are a couple of important restrictions:

Your interconnect interface naming must be uniform on all nodes in the cluster. The interconnect uses the interface name in its configuration and it doesn’t support different names on different hosts
You must bind the different private interconnect interfaces in different subnets (see Note: 1481481.1 – 11gR2 CSS Terminates/Node Eviction After Unplugging one Network Cable in Redundant Interconnect Environment if you need an explanation)

Implementation

The RAC Attack book uses one interface per node for the interconnect (eth1, using network 172.16.100.0)

To make things a little more complex, we’ll not use the eth1 in the new HAIP configuration, so we’ll test also the deletion of the old interface.

What you need to do is add two new interfaces (host only in your virtualbox) and configure them as eth2 and eth3, e.g. in networks 172.16.101.0 and 172.16.102.0)

eth2      Link encap:Ethernet  HWaddr 08:00:27:32:76:DD
          inet addr:172.16.101.51  Bcast:172.16.101.255  Mask:255.255.255.0
          inet6 addr: fe80::a00:27ff:fe32:76dd/64 Scope:Link
          UP BROADCAST RUNNING MULTICAST  MTU:1500  Metric:1
          RX packets:29 errors:0 dropped:0 overruns:0 frame:0
          TX packets:25 errors:0 dropped:0 overruns:0 carrier:0
          collisions:0 txqueuelen:1000
          RX bytes:2044 (1.9 KiB)  TX bytes:1714 (1.6 KiB)

eth3      Link encap:Ethernet  HWaddr 08:00:27:2E:05:4B
          inet addr:172.16.102.61  Bcast:172.16.102.255  Mask:255.255.255.0
          inet6 addr: fe80::a00:27ff:fe2e:54b/64 Scope:Link
          UP BROADCAST RUNNING MULTICAST  MTU:1500  Metric:1
          RX packets:19 errors:0 dropped:0 overruns:0 frame:0
          TX packets:12 errors:0 dropped:0 overruns:0 carrier:0
          collisions:0 txqueuelen:1000
          RX bytes:1140 (1.1 KiB)  TX bytes:720 (720.0 b)

eth2 Link encap:Ethernet HWaddr 08:00:27:32:76:DD

inet addr:172.16.101.51 Bcast:172.16.101.255 Mask:255.255.255.0

inet6 addr: fe80::a00:27ff:fe32:76dd/64 Scope:Link

UP BROADCAST RUNNING MULTICAST MTU:1500 Metric:1

RX packets:29 errors:0 dropped:0 overruns:0 frame:0

TX packets:25 errors:0 dropped:0 overruns:0 carrier:0

collisions:0 txqueuelen:1000

RX bytes:2044 (1.9 KiB) TX bytes:1714 (1.6 KiB)

eth3 Link encap:Ethernet HWaddr 08:00:27:2E:05:4B

inet addr:172.16.102.61 Bcast:172.16.102.255 Mask:255.255.255.0

inet6 addr: fe80::a00:27ff:fe2e:54b/64 Scope:Link

UP BROADCAST RUNNING MULTICAST MTU:1500 Metric:1

RX packets:19 errors:0 dropped:0 overruns:0 frame:0

TX packets:12 errors:0 dropped:0 overruns:0 carrier:0

collisions:0 txqueuelen:1000

RX bytes:1140 (1.1 KiB) TX bytes:720 (720.0 b)

modify /var/named/racattack in order to use the new addresses (RAC doesn’t care about logical names, it’s just for our convenience):

collabn1 A 192.168.78.51
collabn1-vip A 192.168.78.61
collabn1-priv A 172.16.100.51
collabn1-priv1 A 172.16.101.51
collabn1-priv2 A 172.16.102.61
collabn2 A 192.168.78.52
collabn2-vip A 192.168.78.62
collabn2-priv A 172.16.100.52
collabn2-priv1 A 172.16.101.52
collabn2-priv2 A 172.16.102.62

collabn1 A 192.168.78.51

collabn1-vip A 192.168.78.61

collabn1-priv A 172.16.100.51

collabn1-priv1 A 172.16.101.51

collabn1-priv2 A 172.16.102.61

collabn2 A 192.168.78.52

collabn2-vip A 192.168.78.62

collabn2-priv A 172.16.100.52

collabn2-priv1 A 172.16.101.52

collabn2-priv2 A 172.16.102.62

add also the reverse lookup in in-addr.arpa:

51.101.16.172 PTR collabn1-priv1.racattack.
52.102.16.172 PTR collabn1-priv2.racattack.
61.101.16.172 PTR collabn2-priv1.racattack.
62.102.16.172 PTR collabn2-priv2.racattack.

51.101.16.172 PTR collabn1-priv1.racattack.

52.102.16.172 PTR collabn1-priv2.racattack.

61.101.16.172 PTR collabn2-priv1.racattack.

62.102.16.172 PTR collabn2-priv2.racattack.

restart named on the first node and check that both nodes can ping all the names correctly:

[root@collabn1 named]# ping collabn2-priv1
PING collabn2-priv1.racattack (172.16.101.52) 56(84) bytes of data.
64 bytes from 172.16.101.52: icmp_seq=1 ttl=64 time=1.27 ms
64 bytes from 172.16.101.52: icmp_seq=2 ttl=64 time=0.396 ms
^C
--- collabn2-priv1.racattack ping statistics ---
2 packets transmitted, 2 received, 0% packet loss, time 1293ms
rtt min/avg/max/mdev = 0.396/0.835/1.275/0.440 ms
[root@collabn1 named]# ping collabn2-priv2
PING collabn2-priv2.racattack (172.16.102.62) 56(84) bytes of data.
64 bytes from 172.16.102.62: icmp_seq=1 ttl=64 time=0.924 ms
64 bytes from 172.16.102.62: icmp_seq=2 ttl=64 time=0.251 ms
^C
--- collabn2-priv2.racattack ping statistics ---
2 packets transmitted, 2 received, 0% packet loss, time 1480ms
rtt min/avg/max/mdev = 0.251/0.587/0.924/0.337 ms
[root@collabn1 named]# ping collabn1-priv2
PING collabn1-priv2.racattack (172.16.102.61) 56(84) bytes of data.
64 bytes from 172.16.102.61: icmp_seq=1 ttl=64 time=0.019 ms
64 bytes from 172.16.102.61: icmp_seq=2 ttl=64 time=0.032 ms
^C
--- collabn1-priv2.racattack ping statistics ---
2 packets transmitted, 2 received, 0% packet loss, time 1240ms
rtt min/avg/max/mdev = 0.019/0.025/0.032/0.008 ms
[root@collabn1 named]# ping collabn1-priv1
PING collabn1-priv1.racattack (172.16.101.51) 56(84) bytes of data.
64 bytes from 172.16.101.51: icmp_seq=1 ttl=64 time=0.017 ms
64 bytes from 172.16.101.51: icmp_seq=2 ttl=64 time=0.060 ms
^C
--- collabn1-priv1.racattack ping statistics ---
2 packets transmitted, 2 received, 0% packet loss, time 1224ms
rtt min/avg/max/mdev = 0.017/0.038/0.060/0.022 ms

[root@collabn1 named]# ping collabn2-priv1

PING collabn2-priv1.racattack (172.16.101.52) 56(84) bytes of data.

64 bytes from 172.16.101.52: icmp_seq=1 ttl=64 time=1.27 ms

64 bytes from 172.16.101.52: icmp_seq=2 ttl=64 time=0.396 ms

--- collabn2-priv1.racattack ping statistics ---

2 packets transmitted, 2 received, 0% packet loss, time 1293ms

rtt min/avg/max/mdev = 0.396/0.835/1.275/0.440 ms

[root@collabn1 named]# ping collabn2-priv2

PING collabn2-priv2.racattack (172.16.102.62) 56(84) bytes of data.

64 bytes from 172.16.102.62: icmp_seq=1 ttl=64 time=0.924 ms

64 bytes from 172.16.102.62: icmp_seq=2 ttl=64 time=0.251 ms

--- collabn2-priv2.racattack ping statistics ---

2 packets transmitted, 2 received, 0% packet loss, time 1480ms

rtt min/avg/max/mdev = 0.251/0.587/0.924/0.337 ms

[root@collabn1 named]# ping collabn1-priv2

PING collabn1-priv2.racattack (172.16.102.61) 56(84) bytes of data.

64 bytes from 172.16.102.61: icmp_seq=1 ttl=64 time=0.019 ms

64 bytes from 172.16.102.61: icmp_seq=2 ttl=64 time=0.032 ms

--- collabn1-priv2.racattack ping statistics ---

2 packets transmitted, 2 received, 0% packet loss, time 1240ms

rtt min/avg/max/mdev = 0.019/0.025/0.032/0.008 ms

[root@collabn1 named]# ping collabn1-priv1

PING collabn1-priv1.racattack (172.16.101.51) 56(84) bytes of data.

64 bytes from 172.16.101.51: icmp_seq=1 ttl=64 time=0.017 ms

64 bytes from 172.16.101.51: icmp_seq=2 ttl=64 time=0.060 ms

--- collabn1-priv1.racattack ping statistics ---

2 packets transmitted, 2 received, 0% packet loss, time 1224ms

rtt min/avg/max/mdev = 0.017/0.038/0.060/0.022 ms

check the nodes that compose the cluster:

[root@collabn1 network-scripts]# olsnodes -s
collabn1 Active
collabn2 Active

[root@collabn1 network-scripts]# olsnodes -s

collabn1 Active

collabn2 Active

on all nodes, make a copy of the gpnp profile.xml (just in case, the oifcfg tool does the copy automatically)

$ cd $GRID_HOME/gpnp/`hostname`/profiles/peer/
$ cp -p profile.xml profile.xml.bk

1 2	$ cd $GRID_HOME/gpnp/`hostname`/profiles/peer/ $ cp -p profile.xml profile.xml.bk

List the available networks:

[root@collabn1 bin]# ./oifcfg iflist -p -n
eth0 192.168.78.0 PRIVATE 255.255.255.0
eth1 172.16.100.0 PRIVATE 255.255.255.0
eth1 169.254.0.0 UNKNOWN 255.255.0.0
eth2 172.16.101.0 PRIVATE 255.255.255.0
eth3 172.16.102.0 PRIVATE 255.255.255.0

[root@collabn1 bin]# ./oifcfg iflist -p -n

eth0 192.168.78.0 PRIVATE 255.255.255.0

eth1 172.16.100.0 PRIVATE 255.255.255.0

eth1 169.254.0.0 UNKNOWN 255.255.0.0

eth2 172.16.101.0 PRIVATE 255.255.255.0

eth3 172.16.102.0 PRIVATE 255.255.255.0

Get the current ip configuration for the interconnect:

[root@collabn1 bin]# ./oifcfg getif
eth0 192.168.78.0 global public
eth1 172.16.100.0 global cluster_interconnect

[root@collabn1 bin]# ./oifcfg getif

eth0 192.168.78.0 global public

eth1 172.16.100.0 global cluster_interconnect

one one node only, set the new interconnect interfaces:

[root@collabn1 network-scripts]# oifcfg setif -global eth2/172.16.101.0:cluster_interconnect
[root@collabn1 network-scripts]# oifcfg setif -global eth3/172.16.102.0:cluster_interconnect
[root@collabn1 network-scripts]# oifcfg getif
eth0 192.168.78.0 global public
eth1 172.16.100.0 global cluster_interconnect
eth2 172.16.101.0 global cluster_interconnect
eth3 172.16.102.0 global cluster_interconnect

[root@collabn1 network-scripts]# oifcfg setif -global eth2/172.16.101.0:cluster_interconnect

[root@collabn1 network-scripts]# oifcfg setif -global eth3/172.16.102.0:cluster_interconnect

[root@collabn1 network-scripts]# oifcfg getif

eth0 192.168.78.0 global public

eth1 172.16.100.0 global cluster_interconnect

eth2 172.16.101.0 global cluster_interconnect

eth3 172.16.102.0 global cluster_interconnect

check that the other nodes has received the new configuration:

[root@collabn2 bin]# ./oifcfg getif
eth0 192.168.78.0 global public
eth1 172.16.100.0 global cluster_interconnect
eth2 172.16.101.0 global cluster_interconnect
eth3 172.16.102.0 global cluster_interconnect

[root@collabn2 bin]# ./oifcfg getif

eth0 192.168.78.0 global public

eth1 172.16.100.0 global cluster_interconnect

eth2 172.16.101.0 global cluster_interconnect

eth3 172.16.102.0 global cluster_interconnect

Before deleting the old interface, it would be sensible to stop your cluster resources (in some cases, one of the nodes may be evicted), in any case the cluster must be restarted completely in order to get the new interfaces working.

Note: having three interfaces in a HAIP interconnect is perfectly working, HAIP works from 2 to 4 interfaces. I’m showing how to delete eth1 just for information!! 🙂

[root@collabn1 network-scripts]# oifcfg delif -global eth1/172.16.100.0
[root@collabn1 network-scripts]# oifcfg getif
eth0 192.168.78.0 global public
eth2 172.16.101.0 global cluster_interconnect
eth3 172.16.102.0 global cluster_interconnect

[root@collabn1 network-scripts]# oifcfg delif -global eth1/172.16.100.0

[root@collabn1 network-scripts]# oifcfg getif

eth0 192.168.78.0 global public

eth2 172.16.101.0 global cluster_interconnect

eth3 172.16.102.0 global cluster_interconnect

on all nodes, shutdown the CRS:

[root@collabn1 network-scripts]# crsctl stop crs
CRS-2791: Starting shutdown of Oracle High Availability Services-managed resources on 'collabn1'
...

[root@collabn1 network-scripts]# crsctl stop crs

CRS-2791: Starting shutdown of Oracle High Availability Services-managed resources on 'collabn1'

...

Now you can disable the old interface:

[root@collabn1 network-scripts]# ifdown eth1

1	[root@collabn1 network-scripts]# ifdown eth1

and modify the parameter ONBOOT=no inside the configuration script of eth1 interface.

Start the cluster again:

[root@collabn1 network-scripts]# crsctl start crs

1	[root@collabn1 network-scripts]# crsctl start crs

And check that the resources are up & running:

# crscst stat res -t
--------------------------------------------------------------------------------
NAME TARGET STATE SERVER STATE_DETAILS
--------------------------------------------------------------------------------
Local Resources
--------------------------------------------------------------------------------
ora.DATA.dg
ONLINE ONLINE collabn1
ONLINE ONLINE collabn2
ora.LISTENER.lsnr
ONLINE ONLINE collabn1
ONLINE ONLINE collabn2
ora.asm
ONLINE ONLINE collabn1 Started
ONLINE ONLINE collabn2 Started
ora.gsd
OFFLINE OFFLINE collabn1
OFFLINE OFFLINE collabn2
ora.net1.network
ONLINE ONLINE collabn1
ONLINE ONLINE collabn2
ora.ons
ONLINE ONLINE collabn1
ONLINE ONLINE collabn2
--------------------------------------------------------------------------------
Cluster Resources
--------------------------------------------------------------------------------
ora.LISTENER_SCAN1.lsnr
1 ONLINE ONLINE collabn2
ora.LISTENER_SCAN2.lsnr
1 ONLINE ONLINE collabn1
ora.LISTENER_SCAN3.lsnr
1 ONLINE ONLINE collabn1
ora.collabn1.vip
1 ONLINE ONLINE collabn1
ora.collabn2.vip
1 ONLINE ONLINE collabn2
ora.cvu
1 ONLINE ONLINE collabn1
ora.oc4j
1 ONLINE ONLINE collabn1
ora.orcl.db
1 ONLINE ONLINE collabn1 Open
2 ONLINE ONLINE collabn2 Open
ora.scan1.vip
1 ONLINE ONLINE collabn2
ora.scan2.vip
1 ONLINE ONLINE collabn1
ora.scan3.vip
1 ONLINE ONLINE collabn1

# crscst stat res -t

--------------------------------------------------------------------------------

NAME TARGET STATE SERVER STATE_DETAILS

--------------------------------------------------------------------------------

Local Resources

--------------------------------------------------------------------------------

ora.DATA.dg

ONLINE ONLINE collabn1

ONLINE ONLINE collabn2

ora.LISTENER.lsnr

ONLINE ONLINE collabn1

ONLINE ONLINE collabn2

ora.asm

ONLINE ONLINE collabn1 Started

ONLINE ONLINE collabn2 Started

ora.gsd

OFFLINE OFFLINE collabn1

OFFLINE OFFLINE collabn2

ora.net1.network

ONLINE ONLINE collabn1

ONLINE ONLINE collabn2

ora.ons

ONLINE ONLINE collabn1

ONLINE ONLINE collabn2

--------------------------------------------------------------------------------

Cluster Resources

--------------------------------------------------------------------------------

ora.LISTENER_SCAN1.lsnr

1 ONLINE ONLINE collabn2

ora.LISTENER_SCAN2.lsnr

1 ONLINE ONLINE collabn1

ora.LISTENER_SCAN3.lsnr

1 ONLINE ONLINE collabn1

ora.collabn1.vip

1 ONLINE ONLINE collabn1

ora.collabn2.vip

1 ONLINE ONLINE collabn2

ora.cvu

1 ONLINE ONLINE collabn1

ora.oc4j

1 ONLINE ONLINE collabn1

ora.orcl.db

1 ONLINE ONLINE collabn1 Open

2 ONLINE ONLINE collabn2 Open

ora.scan1.vip

1 ONLINE ONLINE collabn2

ora.scan2.vip

1 ONLINE ONLINE collabn1

ora.scan3.vip

1 ONLINE ONLINE collabn1

Testing the high availability

Disconnect cable from one of the two interfaces (virtually if you’re in virtualbox 🙂 )

Pay attention at the NO-CARRIER status (in eth2 in this example):

# ip l
1: lo: <LOOPBACK,UP,LOWER_UP> mtu 16436 qdisc noqueue state UNKNOWN
link/loopback 00:00:00:00:00:00 brd 00:00:00:00:00:00
2: eth0: <BROADCAST,MULTICAST,UP,LOWER_UP> mtu 1500 qdisc pfifo_fast state UP qlen 1000
link/ether 08:00:27:07:33:94 brd ff:ff:ff:ff:ff:ff
3: eth1: <BROADCAST,MULTICAST> mtu 1500 qdisc pfifo_fast state DOWN qlen 1000
link/ether 08:00:27:7f:b4:88 brd ff:ff:ff:ff:ff:ff
4: eth2: <NO-CARRIER,BROADCAST,MULTICAST,UP> mtu 1500 qdisc pfifo_fast state DOWN qlen 1000
link/ether 08:00:27:51:1d:78 brd ff:ff:ff:ff:ff:ff
5: eth3: <BROADCAST,MULTICAST,UP,LOWER_UP> mtu 1500 qdisc pfifo_fast state UP qlen 1000
link/ether 08:00:27:39:86:f2 brd ff:ff:ff:ff:ff:ff

# ip l

1: lo: <LOOPBACK,UP,LOWER_UP> mtu 16436 qdisc noqueue state UNKNOWN

link/loopback 00:00:00:00:00:00 brd 00:00:00:00:00:00

2: eth0: <BROADCAST,MULTICAST,UP,LOWER_UP> mtu 1500 qdisc pfifo_fast state UP qlen 1000

link/ether 08:00:27:07:33:94 brd ff:ff:ff:ff:ff:ff

3: eth1: <BROADCAST,MULTICAST> mtu 1500 qdisc pfifo_fast state DOWN qlen 1000

link/ether 08:00:27:7f:b4:88 brd ff:ff:ff:ff:ff:ff

4: eth2: <NO-CARRIER,BROADCAST,MULTICAST,UP> mtu 1500 qdisc pfifo_fast state DOWN qlen 1000

link/ether 08:00:27:51:1d:78 brd ff:ff:ff:ff:ff:ff

5: eth3: <BROADCAST,MULTICAST,UP,LOWER_UP> mtu 1500 qdisc pfifo_fast state UP qlen 1000

link/ether 08:00:27:39:86:f2 brd ff:ff:ff:ff:ff:ff

check that the CRS is still up & running:

# crsctl stat res -t
--------------------------------------------------------------------------------
NAME TARGET STATE SERVER STATE_DETAILS
--------------------------------------------------------------------------------
Local Resources
--------------------------------------------------------------------------------
ora.DATA.dg
ONLINE ONLINE collabn1
ONLINE ONLINE collabn2
ora.LISTENER.lsnr
ONLINE ONLINE collabn1
ONLINE ONLINE collabn2
ora.asm
ONLINE ONLINE collabn1 Started
ONLINE ONLINE collabn2 Started
ora.gsd
OFFLINE OFFLINE collabn1
OFFLINE OFFLINE collabn2
ora.net1.network
ONLINE ONLINE collabn1
ONLINE ONLINE collabn2
ora.ons
ONLINE ONLINE collabn1
ONLINE ONLINE collabn2
--------------------------------------------------------------------------------
Cluster Resources
--------------------------------------------------------------------------------
ora.LISTENER_SCAN1.lsnr
1 ONLINE ONLINE collabn2
ora.LISTENER_SCAN2.lsnr
1 ONLINE ONLINE collabn1
ora.LISTENER_SCAN3.lsnr
1 ONLINE ONLINE collabn1
ora.collabn1.vip
1 ONLINE ONLINE collabn1
ora.collabn2.vip
1 ONLINE ONLINE collabn2
ora.cvu
1 ONLINE ONLINE collabn1
ora.oc4j
1 ONLINE ONLINE collabn1
ora.orcl.db
1 ONLINE ONLINE collabn1 Open
2 ONLINE ONLINE collabn2 Open
ora.scan1.vip
1 ONLINE ONLINE collabn2
ora.scan2.vip
1 ONLINE ONLINE collabn1
ora.scan3.vip
1 ONLINE ONLINE collabn1

# crsctl stat res -t

--------------------------------------------------------------------------------

NAME TARGET STATE SERVER STATE_DETAILS

--------------------------------------------------------------------------------

Local Resources

--------------------------------------------------------------------------------

ora.DATA.dg

ONLINE ONLINE collabn1

ONLINE ONLINE collabn2

ora.LISTENER.lsnr

ONLINE ONLINE collabn1

ONLINE ONLINE collabn2

ora.asm

ONLINE ONLINE collabn1 Started

ONLINE ONLINE collabn2 Started

ora.gsd

OFFLINE OFFLINE collabn1

OFFLINE OFFLINE collabn2

ora.net1.network

ONLINE ONLINE collabn1

ONLINE ONLINE collabn2

ora.ons

ONLINE ONLINE collabn1

ONLINE ONLINE collabn2

--------------------------------------------------------------------------------

Cluster Resources

--------------------------------------------------------------------------------

ora.LISTENER_SCAN1.lsnr

1 ONLINE ONLINE collabn2

ora.LISTENER_SCAN2.lsnr

1 ONLINE ONLINE collabn1

ora.LISTENER_SCAN3.lsnr

1 ONLINE ONLINE collabn1

ora.collabn1.vip

1 ONLINE ONLINE collabn1

ora.collabn2.vip

1 ONLINE ONLINE collabn2

ora.cvu

1 ONLINE ONLINE collabn1

ora.oc4j

1 ONLINE ONLINE collabn1

ora.orcl.db

1 ONLINE ONLINE collabn1 Open

2 ONLINE ONLINE collabn2 Open

ora.scan1.vip

1 ONLINE ONLINE collabn2

ora.scan2.vip

1 ONLINE ONLINE collabn1

ora.scan3.vip

1 ONLINE ONLINE collabn1

The virtual interface eth2:1 as failed over on the second interface as eth3:2

eth3:1    Link encap:Ethernet  HWaddr 08:00:27:39:86:F2
          inet addr:169.254.185.134  Bcast:169.254.255.255  Mask:255.255.128.0
          UP BROADCAST RUNNING MULTICAST  MTU:1500  Metric:1

eth3:2    Link encap:Ethernet  HWaddr 08:00:27:39:86:F2
          inet addr:169.254.104.52  Bcast:169.254.127.255  Mask:255.255.128.0
          UP BROADCAST RUNNING MULTICAST  MTU:1500  Metric:1

eth3:1 Link encap:Ethernet HWaddr 08:00:27:39:86:F2

inet addr:169.254.185.134 Bcast:169.254.255.255 Mask:255.255.128.0

UP BROADCAST RUNNING MULTICAST MTU:1500 Metric:1

eth3:2 Link encap:Ethernet HWaddr 08:00:27:39:86:F2

inet addr:169.254.104.52 Bcast:169.254.127.255 Mask:255.255.128.0

UP BROADCAST RUNNING MULTICAST MTU:1500 Metric:1

After the cable is reconnected, the virtual interface is back on eth2:

eth2:1 Link encap:Ethernet HWaddr 08:00:27:51:1D:78
inet addr:169.254.104.52 Bcast:169.254.127.255 Mask:255.255.128.0
UP BROADCAST RUNNING MULTICAST MTU:1500 Metric:1

eth2:1 Link encap:Ethernet HWaddr 08:00:27:51:1D:78

inet addr:169.254.104.52 Bcast:169.254.127.255 Mask:255.255.128.0

UP BROADCAST RUNNING MULTICAST MTU:1500 Metric:1

Further information

For this post I’ve used a RAC version 11.2, but RAC 12c use the very same procedure.

You can discover more here about HAIP:

http://docs.oracle.com/cd/E11882_01/server.112/e10803/config_cw.htm#HABPT5279

And here about how to set it (beside this post!):

https://docs.oracle.com/cd/E11882_01/rac.112/e41959/admin.htm#CWADD90980

https://docs.oracle.com/cd/E11882_01/rac.112/e41959/oifcfg.htm#BCGGEFEI

Cheers

—

Ludo

Oracle RAC and Policy Managed Databases

Posted on July 10, 2013 by Ludovico

Some weeks ago I’ve commented a good post of Martin Bach (@MartinDBA on Twitter, make sure to follow him!)

http://martincarstenbach.wordpress.com/2013/06/17/an-introduction-to-policy-managed-databases-in-11-2-rac/

What I’ve realized by is that Policy Managed Databases are not widely used and there is a lot misunderstanding on how it works and some concerns about implementing it in production.

My current employer Trivadis (@Trivadis, make sure to call us if your database needs a health check :-)) use PMDs as best practice, so it’s worth to spend some words on it. Isn’t it?

Why Policy Managed Databases?

PMDs are an efficient way to manage and consolidate several databases and services with the least effort. They rely on Server Pools. Server pools are used to partition physically a big cluster into smaller groups of servers (Server Pool). Each pool have three main properties:

A minumim number of servers required to compose the group
A maximum number of servers
A priority that make a server pool more important than others

If the cluster loses a server, the following rules apply:

If a pool has less than min servers, a server is moved from a pool that has more than min servers, starting with the one with lowest priority.
If a pool has less than min servers and no other pools have more than min servers, the server is moved from the server with the lowest priority.
Poolss with higher priority may give servers to pools with lower priority if the min server property is honored.

This means that if a serverpool has the greatest priority, all other server pools can be reduced to satisfy the number of min servers.

Generally speaking, when creating a policy managed database (can be existent off course!) it is assigned to a server pool rather than a single server. The pool is seen as an abstract resource where you can put workload on.

If you read the definition of Cloud Computing given by the NIST (http://csrc.nist.gov/publications/nistpubs/800-145/SP800-145.pdf) you’ll find something similar:

Cloud computing is a model for enabling ubiquitous, convenient, on-demand network access to a shared
pool of configurable computing resources (e.g., networks, servers, storage, applications, and services) that
can be rapidly provisioned and released with minimal management effort or service provider interaction

There are some major benefits in using policy managed databases (that’s my solely opinion):

PMD instances are created/removed automatically. This means that you can add and remove nodes nodes to/from the server pools or the whole cluster, the underlying databases will be expanded or shrinked following the new topology.
Server Pools (that are the base for PMDs) allow to give different priorities to different groups of servers. This means that if correctly configured, you can loose several physical nodes without impacting your most critical applications and without reconfiguring the instances.
PMD are the base for Quality of Service management, a 11gR2 feature that does resource management cluster-wide to achieve predictable performances on critical applications/transactions. QOS is a really advanced topic so I warn you: do not use it without appropriate knowledge. Again, Trivadis has deep knowledge on it so you may want to contact us for a consulting service (and why not, perhaps I’ll try to blog about it in the future).
RAC One Node databases (RONDs?) can work beside PMDs to avoid instance proliferation for non critical applications.
Oracle is pushing it to achieve maximum flexibility for the Cloud, so it’s a trendy technology that’s cool to implement!
I’ll find some other reasons, for sure! 🙂

What changes in real-life DB administration?

Well, the concept of having a relation Server -> Instance disappears, so at the very beginning you’ll have to be prepared to something dynamic (but once configured, things don’t change often).

As Martin pointed out in his blog, you’ll need to configure server pools and think about pools of resources rather than individual configuration items.

The spfile doesn’t contain any information related to specific instances, so the parameters must be database-wide.

The oratab will contain only the dbname, not the instance name, and the dbname is present in oratab disregarding if the server belongs to a serverpool or another.

+ASM1:/oracle/grid/11.2.0.3:N           # line added by Agent
PMU:/oracle/db/11.2.0.3:N               # line added by Agent
TST:/oracle/db/11.2.0.3:N               # line added by Agent

+ASM1:/oracle/grid/11.2.0.3:N # line added by Agent

PMU:/oracle/db/11.2.0.3:N # line added by Agent

TST:/oracle/db/11.2.0.3:N # line added by Agent

Your scripts should take care of this.

Also, when connecting to your database, you should rely on services and access your database remotely rather than trying to figure out where the instances are running. But if you really need it you can get it:

# srvctl status database -d PMU
Instance PMU_4 is running on node node2
Instance PMU_2 is running on node node3
Instance PMU_3 is running on node node4
Instance PMU_5 is running on node node6
Instance PMU_1 is running on node node7
Instance PMU_6 is running on node node8

# srvctl status database -d PMU

Instance PMU_4 is running on node node2

Instance PMU_2 is running on node node3

Instance PMU_3 is running on node node4

Instance PMU_5 is running on node node6

Instance PMU_1 is running on node node7

Instance PMU_6 is running on node node8

An approach for the crontab: every DBA soon or late will need to schedule tasks within the crond. Since the RAC have multiple nodes, you don’t want to run the same script many times but rather choose which node will execute it.

My personal approach (every DBA has his personal preference) is to check the instance with cardinality 1 and match it with the current node. e.g.:

# [ `crsctl stat res ora.tst.db -k 1 | grep STATE=ONLINE | awk '{print $NF}'` == `uname -n` ]
# echo $?
0

# [ `crsctl stat res ora.tst.db -k 1 | grep STATE=ONLINE | awk '{print $NF}'` == `uname -n` ]
# echo $?
1

# [ `crsctl stat res ora.tst.db -k 1 | grep STATE=ONLINE | awk '{print $NF}'` == `uname -n` ]

# echo $?

# [ `crsctl stat res ora.tst.db -k 1 | grep STATE=ONLINE | awk '{print $NF}'` == `uname -n` ]

# echo $?

In the example, TST_1 is running on node1, so the first evaluation returns TRUE. The second evaluation is done after the node2, so it returns FALSE.

This trick can be used to have an identical crontab on every server and choose at the runtime if the local server is the preferred to run tasks for the specified database.

A proof of concept with Policy Managed Databases

My good colleague Jacques Kostic has given me the access to a enterprise-grade private lab so I can show you some “live operations”.

Let’s start with the actual topology: it’s an 8-node stretched RAC with ASM diskgroups with failgroups on the remote site.

This should be enough to show you some capabilities of server pools.

The Generic and Free server pools

After a clean installation, you’ll end up with two default server pools:

The Generic one will contain all non-PMDs (if you use only PMDs it will be empty). The Free one will own servers that are “spare”, when all server pools have reached the maximum size thus they’re not requiring more servers.

New server pools

Actually the cluster I’m working on has two serverpools already defined (PMU and TST):

(the node assignment in the graphic is not relevant here).

They have been created with a command like this one:

# srvctl add serverpool -g PMU -l 5 -u 6 -i 3

1	# srvctl add serverpool -g PMU -l 5 -u 6 -i 3

# srvctl add serverpool -g TST -l 2 -u 3 -i 2

1	# srvctl add serverpool -g TST -l 2 -u 3 -i 2

“srvctl -h ” is a good starting point to have a quick reference of the syntax.

You can check the status with:

# srvctl status serverpool
Server pool name: Free
Active servers count: 0
Server pool name: Generic
Active servers count: 0
Server pool name: PMU
Active servers count: 6
Server pool name: TST
Active servers count: 2

# srvctl status serverpool

Server pool name: Free

Active servers count: 0

Server pool name: Generic

Active servers count: 0

Server pool name: PMU

Active servers count: 6

Server pool name: TST

Active servers count: 2

and the configuration:

# srvctl config serverpool
Server pool name: Free
Importance: 0, Min: 0, Max: -1
Candidate server names:
Server pool name: Generic
Importance: 0, Min: 0, Max: -1
Candidate server names:
Server pool name: PMU
Importance: 3, Min: 5, Max: 6
Candidate server names:
Server pool name: TST
Importance: 2, Min: 2, Max: 3
Candidate server names:

# srvctl config serverpool

Server pool name: Free

Importance: 0, Min: 0, Max: -1

Candidate server names:

Server pool name: Generic

Importance: 0, Min: 0, Max: -1

Candidate server names:

Server pool name: PMU

Importance: 3, Min: 5, Max: 6

Candidate server names:

Server pool name: TST

Importance: 2, Min: 2, Max: 3

Candidate server names:

Modifying the configuration of serverpools

In this scenario, PMU is too big. The sum of minumum nodes is 2+5=7 nodes, so I have only one server that can be used for another server pool without falling below the minimum number of nodes.

I want to make some room to make another server pool composed of two or three nodes, so I reduce the serverpool PMU:

# srvctl modify serverpool -g PMU -l 3

1	# srvctl modify serverpool -g PMU -l 3

Notice that PMU maxsize is still 6, so I don’t have free servers yet.

# srvctl status database -d PMU
Instance PMU_4 is running on node node2
Instance PMU_2 is running on node node3
Instance PMU_3 is running on node node4
Instance PMU_5 is running on node node6
Instance PMU_1 is running on node node7
Instance PMU_6 is running on node node8

# srvctl status database -d PMU

Instance PMU_4 is running on node node2

Instance PMU_2 is running on node node3

Instance PMU_3 is running on node node4

Instance PMU_5 is running on node node6

Instance PMU_1 is running on node node7

Instance PMU_6 is running on node node8

So, if I try to create another serverpool I’m warned that some resources can be taken offline:

# srvctl add serverpool -g LUDO -l 2 -u 3 -i 1
PRCS-1009 : Failed to create server pool LUDO
PRCR-1071 : Failed to register or update server pool ora.LUDO
CRS-2736: The operation requires stopping resource 'ora.pmu.db' on server 'node8'
CRS-2736: The operation requires stopping resource 'ora.pmu.db' on server 'node3'
CRS-2737: Unable to register server pool 'ora.LUDO' as this will affect running resources, but the force option was not specified

# srvctl add serverpool -g LUDO -l 2 -u 3 -i 1

PRCS-1009 : Failed to create server pool LUDO

PRCR-1071 : Failed to register or update server pool ora.LUDO

CRS-2736: The operation requires stopping resource 'ora.pmu.db' on server 'node8'

CRS-2736: The operation requires stopping resource 'ora.pmu.db' on server 'node3'

CRS-2737: Unable to register server pool 'ora.LUDO' as this will affect running resources, but the force option was not specified

The clusterware proposes to stop 2 instances from the db pmu on the serverpool PMU because it can reduce from 6 to 3, but I have to confirm the operation with the flag -f.

Modifying the serverpool layout can take time if resources have to be started/stopped.

# srvctl status serverpool
Server pool name: Free
Active servers count: 0
Server pool name: Generic
Active servers count: 0
Server pool name: LUDO
Active servers count: 2
Server pool name: PMU
Active servers count: 4
Server pool name: TST
Active servers count: 2

# srvctl status serverpool

Server pool name: Free

Active servers count: 0

Server pool name: Generic

Active servers count: 0

Server pool name: LUDO

Active servers count: 2

Server pool name: PMU

Active servers count: 4

Server pool name: TST

Active servers count: 2

My new serverpool is finally composed by two nodes only, because I’ve set an importance of 1 (PMU wins as it has an importance of 3).

Inviting RAC One Node databases to the party

Now that I have some room on my new serverpool, I can start creating new databases.

With PMD I can add two types of databases: RAC or RACONDENODE. Depending on the choice, I’ll have a database running on ALL NODES OF THE SERVER POOL or on ONE NODE ONLY. This is a kind of limitation in my opinion, hope Oracle will improve it in the near future: would be great to specify the cardinality also at database level.

Creating a RAC One DB is as simple as selecting two radio box during in the dbca “standard” procedure:

The Server Pool can be created or you can specify an existent one (as in this lab):

I’ve created two new RAC One Node databases:

DB LUDO (service PRISM :-))
DB VICO (service CHEERS)

I’ve ended up with something like this:

--------------------------------------------------------------------------------
NAME           TARGET  STATE        SERVER                   STATE_DETAILS
--------------------------------------------------------------------------------
ora.ludo.db   <<<<< RAC ONE
      1        ONLINE  ONLINE       node8                    Open
ora.ludo.prism.svc
      1        ONLINE  ONLINE       node8
ora.pmu.db
      1        ONLINE  ONLINE       node7                    Open
      2        ONLINE  ONLINE       node4                    Open
      3        ONLINE  ONLINE       node5                    Open
      4        ONLINE  ONLINE       node6                    Open
ora.tst.db
      1        ONLINE  ONLINE       node1                    Open
      2        ONLINE  ONLINE       node2                    Open
ora.vico.cheers.svc
      1        ONLINE  ONLINE       node3
ora.vico.db  <<<< RAC ONE
      1        ONLINE  ONLINE       node3                    Open

--------------------------------------------------------------------------------

NAME TARGET STATE SERVER STATE_DETAILS

--------------------------------------------------------------------------------

ora.ludo.db <<<<< RAC ONE

1 ONLINE ONLINE node8 Open

ora.ludo.prism.svc

1 ONLINE ONLINE node8

ora.pmu.db

1 ONLINE ONLINE node7 Open

2 ONLINE ONLINE node4 Open

3 ONLINE ONLINE node5 Open

4 ONLINE ONLINE node6 Open

ora.tst.db

1 ONLINE ONLINE node1 Open

2 ONLINE ONLINE node2 Open

ora.vico.cheers.svc

1 ONLINE ONLINE node3

ora.vico.db <<<< RAC ONE

1 ONLINE ONLINE node3 Open

That can be represented with this picture:

RAC One Node databases can be managed as always with online relocation (it’s still called O-Motion?)

Losing the nodes

With this situation, what happens if I loose (stop) one node?

# crsctl stop cluster -n node8
CRS-2673: Attempting to stop 'ora.crsd' on 'node8'
CRS-2790: Starting shutdown of Cluster Ready Services-managed resources on 'node8'
CRS-2673: Attempting to stop 'ora.LISTENER.lsnr' on 'node8'
CRS-2673: Attempting to stop 'ora.ludo.prism.svc' on 'node8'
CRS-2677: Stop of 'ora.ludo.prism.svc' on 'node8' succeeded
CRS-2677: Stop of 'ora.LISTENER.lsnr' on 'node8' succeeded
CRS-2673: Attempting to stop 'ora.node8.vip' on 'node8'
CRS-2677: Stop of 'ora.node8.vip' on 'node8' succeeded
CRS-2672: Attempting to start 'ora.node8.vip' on 'node4'
CRS-2676: Start of 'ora.node8.vip' on 'node4' succeeded
CRS-2673: Attempting to stop 'ora.ludo.db' on 'node8'
CRS-2677: Stop of 'ora.ludo.db' on 'node8' succeeded
CRS-2672: Attempting to start 'ora.ludo.db' on 'node3'
CRS-2676: Start of 'ora.ludo.db' on 'node3' succeeded
CRS-2672: Attempting to start 'ora.ludo.prism.svc' on 'node3'
CRS-2676: Start of 'ora.ludo.prism.svc' on 'node3' succeeded
CRS-2673: Attempting to stop 'ora.GRID.dg' on 'node8'
CRS-2673: Attempting to stop 'ora.DATA.dg' on 'node8'
CRS-2673: Attempting to stop 'ora.FRA.dg' on 'node8'
CRS-2673: Attempting to stop 'ora.RECO.dg' on 'node8'
CRS-2677: Stop of 'ora.DATA.dg' on 'node8' succeeded
CRS-2677: Stop of 'ora.FRA.dg' on 'node8' succeeded
CRS-2677: Stop of 'ora.RECO.dg' on 'node8' succeeded
CRS-2677: Stop of 'ora.GRID.dg' on 'node8' succeeded
CRS-2673: Attempting to stop 'ora.asm' on 'node8'
CRS-2677: Stop of 'ora.asm' on 'node8' succeeded
CRS-2673: Attempting to stop 'ora.ons' on 'node8'
CRS-2677: Stop of 'ora.ons' on 'node8' succeeded
CRS-2673: Attempting to stop 'ora.net1.network' on 'node8'
CRS-2677: Stop of 'ora.net1.network' on 'node8' succeeded
CRS-2792: Shutdown of Cluster Ready Services-managed resources on 'node8' has completed
CRS-2677: Stop of 'ora.crsd' on 'node8' succeeded
CRS-2673: Attempting to stop 'ora.ctssd' on 'node8'
CRS-2673: Attempting to stop 'ora.evmd' on 'node8'
CRS-2673: Attempting to stop 'ora.asm' on 'node8'
CRS-2677: Stop of 'ora.evmd' on 'node8' succeeded
CRS-2677: Stop of 'ora.asm' on 'node8' succeeded
CRS-2673: Attempting to stop 'ora.cluster_interconnect.haip' on 'node8'
CRS-2677: Stop of 'ora.cluster_interconnect.haip' on 'node8' succeeded
CRS-2677: Stop of 'ora.ctssd' on 'node8' succeeded
CRS-2673: Attempting to stop 'ora.cssd' on 'node8'
CRS-2677: Stop of 'ora.cssd' on 'node8' succeeded

# crsctl stop cluster -n node8

CRS-2673: Attempting to stop 'ora.crsd' on 'node8'

CRS-2790: Starting shutdown of Cluster Ready Services-managed resources on 'node8'

CRS-2673: Attempting to stop 'ora.LISTENER.lsnr' on 'node8'

CRS-2673: Attempting to stop 'ora.ludo.prism.svc' on 'node8'

CRS-2677: Stop of 'ora.ludo.prism.svc' on 'node8' succeeded

CRS-2677: Stop of 'ora.LISTENER.lsnr' on 'node8' succeeded

CRS-2673: Attempting to stop 'ora.node8.vip' on 'node8'

CRS-2677: Stop of 'ora.node8.vip' on 'node8' succeeded

CRS-2672: Attempting to start 'ora.node8.vip' on 'node4'

CRS-2676: Start of 'ora.node8.vip' on 'node4' succeeded

CRS-2673: Attempting to stop 'ora.ludo.db' on 'node8'

CRS-2677: Stop of 'ora.ludo.db' on 'node8' succeeded

CRS-2672: Attempting to start 'ora.ludo.db' on 'node3'

CRS-2676: Start of 'ora.ludo.db' on 'node3' succeeded

CRS-2672: Attempting to start 'ora.ludo.prism.svc' on 'node3'

CRS-2676: Start of 'ora.ludo.prism.svc' on 'node3' succeeded

CRS-2673: Attempting to stop 'ora.GRID.dg' on 'node8'

CRS-2673: Attempting to stop 'ora.DATA.dg' on 'node8'

CRS-2673: Attempting to stop 'ora.FRA.dg' on 'node8'

CRS-2673: Attempting to stop 'ora.RECO.dg' on 'node8'

CRS-2677: Stop of 'ora.DATA.dg' on 'node8' succeeded

CRS-2677: Stop of 'ora.FRA.dg' on 'node8' succeeded

CRS-2677: Stop of 'ora.RECO.dg' on 'node8' succeeded

CRS-2677: Stop of 'ora.GRID.dg' on 'node8' succeeded

CRS-2673: Attempting to stop 'ora.asm' on 'node8'

CRS-2677: Stop of 'ora.asm' on 'node8' succeeded

CRS-2673: Attempting to stop 'ora.ons' on 'node8'

CRS-2677: Stop of 'ora.ons' on 'node8' succeeded

CRS-2673: Attempting to stop 'ora.net1.network' on 'node8'

CRS-2677: Stop of 'ora.net1.network' on 'node8' succeeded

CRS-2792: Shutdown of Cluster Ready Services-managed resources on 'node8' has completed

CRS-2677: Stop of 'ora.crsd' on 'node8' succeeded

CRS-2673: Attempting to stop 'ora.ctssd' on 'node8'

CRS-2673: Attempting to stop 'ora.evmd' on 'node8'

CRS-2673: Attempting to stop 'ora.asm' on 'node8'

CRS-2677: Stop of 'ora.evmd' on 'node8' succeeded

CRS-2677: Stop of 'ora.asm' on 'node8' succeeded

CRS-2673: Attempting to stop 'ora.cluster_interconnect.haip' on 'node8'

CRS-2677: Stop of 'ora.cluster_interconnect.haip' on 'node8' succeeded

CRS-2677: Stop of 'ora.ctssd' on 'node8' succeeded

CRS-2673: Attempting to stop 'ora.cssd' on 'node8'

CRS-2677: Stop of 'ora.cssd' on 'node8' succeeded

The node was belonging to the pool LUDO, however I have this situation right after:

# srvctl status serverpool
Server pool name: Free
Active servers count: 0
Server pool name: Generic
Active servers count: 0
Server pool name: LUDO
Active servers count: 2
Server pool name: PMU
Active servers count: 3
Server pool name: TST
Active servers count: 2

# srvctl status serverpool

Server pool name: Free

Active servers count: 0

Server pool name: Generic

Active servers count: 0

Server pool name: LUDO

Active servers count: 2

Server pool name: PMU

Active servers count: 3

Server pool name: TST

Active servers count: 2

A server has been taken from the pol PMU and given to the pool LUDO. This is because PMU was having one more server than his minimum server requirement.

Now I can loose one node at time, I’ll have the following situation:

1 node lost: PMU 3, TST 2, LUDO 2
2 nodes lost: PMU 3, TST 2, LUDO 1 (as PMU is already on min and has higher priority, LUDO is penalized because has the lowest priority)
3 nodes lost:PMU 3, TST 2, LUDO 0 (as LUDO has the lowest priority)
4 nodes lost: PMU 3, TST 1, LUDO 0
5 nodes lost: PMU 3, TST 0, LUDO 0

So, my hyper-super-critical application will still have three nodes to have plenty of resources to run even with a multiple physical failure, as it is the server pool with the highest priority and a minimum required server number of 3.

What I would ask to Santa if I’ll be on the Nice List (ad if Santa works at Redwood Shores)

Dear Santa, I would like:

To create databases with node cardinality, to have for example 2 instances in a 3 nodes server pool
Server Pools that are aware of the physical location when I use stretched clusters, so I could end up always with “at least one active instance per site”.

Think about it 😉

—

Ludovico

Dataguard check script for Real Application Clusters (MAA)

Posted on December 31, 2010 by Ludovico

Two years after my posts:
Quick Oracle Dataguard check script and More about Dataguard and how to check it I faced a whole new Dataguard between two Oracle Real Application Clusters, aka Oracle Maximum Availability Architecture (MAA).

This enviromnent is relying on Windows OS. Don’t know how this could be called “availability” but here we are. I revisited my scripts in a quick and very dirty way. Please consider that I did copy and paste to check the alignment once per thread, but it should be improved with some kind of iteration to check each thread in a more structured fashion.

#!D:\oracle\product\10.2.0\db_1\perl\5.8.3\bin\MSWin32-x86-multi-thread\perl.exe -w
use DBI;
use DBD::Oracle qw(:ora_session_modes);
# DB connection #
my $prod  = "prod";
my $stby = "stby";
my $prodh;
unless ($prodh = DBI-&gt;connect('dbi:Oracle:'.$prod, 
    'sys', 'strongpwd', 
    {PrintError=&gt;0, AutoCommit =&gt; 0,
    ora_session_mode =&gt; ORA_SYSDBA}))  {
print "Error connecting to DB: $DBI::errstr\n";
exit(1);
}
$prodh-&gt;{RaiseError}=1;

my $stbyh;
unless ($stbyh = DBI-&gt;connect('dbi:Oracle:'.$stby,
    'sys', 'strongpwd',
    {PrintError=&gt;0, AutoCommit =&gt; 0,
    ora_session_mode =&gt; ORA_SYSDBA}))  {
print "Error connecting to DB: $DBI::errstr\n";
$prodh-&gt;disconnect;
exit(1);
}
$stbyh-&gt;{RaiseError}=1;

my $sth;
### query stdby MRP0
$sth = $stbyh-&gt;prepare( &lt;&lt;EOSQL );
select thread#, SEQUENCE#, BLOCK#
    from gv\$managed_standby 
    where process='MRP0'
EOSQL
$sth-&gt;execute();
my ($mrpthread, $mrpsequence, $mrpblock) = $sth-&gt;fetchrow_array();
$sth-&gt;finish();

### query stdby RFS
$sth = $stbyh-&gt;prepare( &lt;&lt;EOSQL );
select thread#, SEQUENCE#, BLOCK#
    from gv\$managed_standby 
    where process='RFS' and client_process='LGWR' order by thread#
EOSQL
$sth-&gt;execute();
my ($rfsthread1, $rfssequence1, $rfsblock1) = $sth-&gt;fetchrow_array();
my ($rfsthread2, $rfssequence2, $rfsblock2) = $sth-&gt;fetchrow_array();
$sth-&gt;finish();

### query prod
$sth = $prodh-&gt;prepare( &lt;&lt;EOSQL );
select thread#, SEQUENCE#, BLOCK#
    from gv\$managed_standby
    where process='LNS' order by thread#
EOSQL
$sth-&gt;execute();
my ($pthread1, $psequence1, $pblock1) = $sth-&gt;fetchrow_array();
my ($pthread2, $psequence2, $pblock2) = $sth-&gt;fetchrow_array();
$sth-&gt;finish();


printf ("ENVIRONM  Thread Sequence   Block\n");
printf ("--------- ------ ---------- ----------\n");
printf ("PROD     LNS1  1 %10d %10d\n", $psequence1, $pblock1);
printf ("STANDBY  RFS1  1 %10d %10d\n", $rfssequence1, $rfsblock1);
printf ("PROD     LSN2  2 %10d %10d\n", $psequence2, $pblock2);
printf ("STANDBY  RFS2  2 %10d %10d\n", $rfssequence2, $rfsblock2);
printf ("STANDBY  MRP0  %d %10d %10d\n", $mrpthread, $mrpsequence, $mrpblock);

my $psequence;
my $pblock;
if ( $mrpthread == 1 ) {
$psequence=$psequence1;
$pblock=$pblock1;
} else {
$psequence=$psequence2;
$pblock=$pblock2;
}

$sth = $stbyh-&gt;prepare( &lt;&lt;EOSQL );
select nvl(sum(blocks),0)
+ $pblock - $mrpblock as BLOCK_GAP
from gv\$archived_log
where thread#=$mrpthread and sequence#
between $mrpsequence and $psequence
EOSQL
$sth-&gt;execute();
my ($mrpblockgap) = $sth-&gt;fetchrow_array();
$sth-&gt;finish();

$sth = $stbyh-&gt;prepare( &lt;&lt;EOSQL );
select nvl(sum(blocks),0)
+ $pblock1 - $rfsblock1 as BLOCK_GAP
from gv\$archived_log
where thread#=1 and sequence#
between $rfssequence1 and $psequence1
EOSQL
$sth-&gt;execute();
my ($rfsblockgap1) = $sth-&gt;fetchrow_array();
$sth-&gt;finish();

$sth = $stbyh-&gt;prepare( &lt;&lt;EOSQL );
select nvl(sum(blocks),0)
+ $pblock2 - $rfsblock2 as BLOCK_GAP
from gv\$archived_log
where thread#=2 and sequence#
between $rfssequence2 and $psequence2
EOSQL
$sth-&gt;execute();
my ($rfsblockgap2) = $sth-&gt;fetchrow_array();
$sth-&gt;finish();
printf ("\n\n%-10d blocks gap in TRANSMISSION\n", $rfsblockgap1+$rfsblockgap2);
printf ("%-10d blocks gap in APPLY (MRP0)\n", $mrpblockgap);

$stbyh-&gt;disconnect;
$prodh-&gt;disconnect;

100

101

102

103

104

105

106

107

108

109

110

111

112

113

114

115

116

#!D:\oracle\product\10.2.0\db_1\perl\5.8.3\bin\MSWin32-x86-multi-thread\perl.exe -w

use DBI;

use DBD::Oracle qw(:ora_session_modes);

# DB connection #

my $prod = "prod";

my $stby = "stby";

my $prodh;

unless ($prodh = DBI->connect('dbi:Oracle:'.$prod,

'sys', 'strongpwd',

{PrintError=>0, AutoCommit => 0,

ora_session_mode => ORA_SYSDBA})) {

print "Error connecting to DB: $DBI::errstr\n";

exit(1);

}

$prodh->{RaiseError}=1;

my $stbyh;

unless ($stbyh = DBI->connect('dbi:Oracle:'.$stby,

'sys', 'strongpwd',

{PrintError=>0, AutoCommit => 0,

ora_session_mode => ORA_SYSDBA})) {

print "Error connecting to DB: $DBI::errstr\n";

$prodh->disconnect;

exit(1);

}

$stbyh->{RaiseError}=1;

my $sth;

### query stdby MRP0

$sth = $stbyh->prepare( <<EOSQL );

select thread#, SEQUENCE#, BLOCK#

from gv\$managed_standby

where process='MRP0'

EOSQL

$sth->execute();

my ($mrpthread, $mrpsequence, $mrpblock) = $sth->fetchrow_array();

$sth->finish();

### query stdby RFS

$sth = $stbyh->prepare( <<EOSQL );

select thread#, SEQUENCE#, BLOCK#

from gv\$managed_standby

where process='RFS' and client_process='LGWR' order by thread#

EOSQL

$sth->execute();

my ($rfsthread1, $rfssequence1, $rfsblock1) = $sth->fetchrow_array();

my ($rfsthread2, $rfssequence2, $rfsblock2) = $sth->fetchrow_array();

$sth->finish();

### query prod

$sth = $prodh->prepare( <<EOSQL );

select thread#, SEQUENCE#, BLOCK#

from gv\$managed_standby

where process='LNS' order by thread#

EOSQL

$sth->execute();

my ($pthread1, $psequence1, $pblock1) = $sth->fetchrow_array();

my ($pthread2, $psequence2, $pblock2) = $sth->fetchrow_array();

$sth->finish();

printf ("ENVIRONM Thread Sequence Block\n");

printf ("--------- ------ ---------- ----------\n");

printf ("PROD LNS1 1 %10d %10d\n", $psequence1, $pblock1);

printf ("STANDBY RFS1 1 %10d %10d\n", $rfssequence1, $rfsblock1);

printf ("PROD LSN2 2 %10d %10d\n", $psequence2, $pblock2);

printf ("STANDBY RFS2 2 %10d %10d\n", $rfssequence2, $rfsblock2);

printf ("STANDBY MRP0 %d %10d %10d\n", $mrpthread, $mrpsequence, $mrpblock);

my $psequence;

my $pblock;

if ( $mrpthread == 1 ) {

$psequence=$psequence1;

$pblock=$pblock1;

} else {

$psequence=$psequence2;

$pblock=$pblock2;

}

$sth = $stbyh->prepare( <<EOSQL );

select nvl(sum(blocks),0)

+ $pblock - $mrpblock as BLOCK_GAP

from gv\$archived_log

where thread#=$mrpthread and sequence#

between $mrpsequence and $psequence

EOSQL

$sth->execute();

my ($mrpblockgap) = $sth->fetchrow_array();

$sth->finish();

$sth = $stbyh->prepare( <<EOSQL );

select nvl(sum(blocks),0)

+ $pblock1 - $rfsblock1 as BLOCK_GAP

from gv\$archived_log

where thread#=1 and sequence#

between $rfssequence1 and $psequence1

EOSQL

$sth->execute();

my ($rfsblockgap1) = $sth->fetchrow_array();

$sth->finish();

$sth = $stbyh->prepare( <<EOSQL );

select nvl(sum(blocks),0)

+ $pblock2 - $rfsblock2 as BLOCK_GAP

from gv\$archived_log

where thread#=2 and sequence#

between $rfssequence2 and $psequence2

EOSQL

$sth->execute();

my ($rfsblockgap2) = $sth->fetchrow_array();

$sth->finish();

printf ("\n\n%-10d blocks gap in TRANSMISSION\n", $rfsblockgap1+$rfsblockgap2);

printf ("%-10d blocks gap in APPLY (MRP0)\n", $mrpblockgap);

$stbyh->disconnect;

$prodh->disconnect;

Please foreward me every improvement you implement over my code: it would be nice to post it here.

Oracle RAC Standard Edition to achieve low cost and high performance

Posted on November 28, 2008 by Ludovico

I finished today to create a new production environment based on 2 Linux serverX86_64 and running Oracle RAC 10gR2. (I know, there is 11g right now, but I’m a conservative!)
Wheeew, I just spent a couple of hours applying all the recommended patches!
We choosed 2 nodes with a maximum of 2 multi-core processors each one so we can license Standard Edition instead of Enterprise Edition. 64bits addressing allow us to allocate many gigabytes of SGA. I’m starting with 5Gb but I think we’ll need more. And a set of 6x300Gb 15krpms disks (it can be expanded with more disks and more shelves).
This configuration keeps low the total cost of ownership but achieves best performance.
Due to disks layout, costs and needed usable storage, we had to configure one huge RAID5 on the SAN with multi-path. I decided anyway to create 2 ASM disk groups (ASM is mandatory for Standard Edition RAC), one for the DB, the second one for the recovery area. With spare disks we should have enough availability and even if it’s a RAID5 I saw good write performances (>150M/s).

Welcome new RAC, I hope we’ll feel good together!

JBOSS Cluster isolation and multicasting

Posted on November 24, 2008 by Ludovico

I configured two JBoss clusters in the same LAN: a production and a test environment.
I decided to configure every single cluster with a dedicate private LANs using a restricted netmask to isolate production and test connectivity, so I assigned
192.168.100.0/255.255.255.0 to test and
192.168.200.0/255.255.255.0 to production.
I configured Apache and mod_jk to loadbalance activities between cluster instances.

The page UsingMod_jk1.2WithJBoss (http://www.jboss.org/community/docs/DOC-12525) is a good tutorial to achieve this.

What problems should I expect?
JBoss uses UDP multicasting to replicate informations across cluster nodes: even if I isolate TCP traffic, JBoss will “ear” messages sent from other clusters and will log a lot of warnings like the following:

… WARN [NAKACK] […] discarded message from non-member ….

I had to change BOTH multicast ip address and port (attributes mcast_addr and mcast_port) in the following configuration files:

./deploy/jboss-web-cluster.sar/META-INF/jboss-service.xml

./deploy/jmx-console.war/WEB-INF/web.xml

./deploy/cluster-service.xml

./deploy/ejb3-clustered-sfsbcache-service.xml

./deploy/ejb3-entity-cache-service.xml

Good luck!