DBA survival BLOG

DBA stuff and Oracle Data Guard

Oracle Home Management – part 2: Common patching patterns

Posted on May 3, 2018 by Ludovico

(*) Multiple times in this blog post I refer to a problem with new Oracle Home installs and rollback scripts. The problem has been fixed with PSU Jan 2017, I did not notice it before, sorry. Thanks to Martin Berger for the information

Let’s see some common approaches to Oracle Home patching.

First, how patches are applied

No, I will not talk about how to use opatch 🙂 It is an overview of the “high-level” methods… when you have multiple servers and (eventually) multiple databases per server.

Worst approach (big bang)

1.Stop everything

2.In-place binaries patching

3.Database patching, “big bang” mode

4.Start everything

With this approach, you have a big downtime, a maintenance window hard to get (all applications are down at the same time), no control over a single database and no easy rollback in case your binaries get compromised/corrupted by the patch apply.

Another bad approach (new install and out-of-place patching)

1.Re-install binaries manually in a new path

2.Patch the new binaries

3.Stop, change OH, patch databases one by one

4.Decommission old binaries

This approach is much better than the previous one, but still has some pitfalls:

If you have many servers and environments: doing it frequently might be a challenge
Rollback scripts are not copied automatically: the datapatch will fail unless you copy them by hand (*)
New installs introduce potential human error, unless you use unattended install with your own scripts
Do you like to run opatch apply all the time, after all?

Better approach (software cloning)

This approach is very close to the previous one, with the exception that the new Oracle Home is not installed from scratch, but rather cloned from an existing one. This way, the rollback scripts used by the datapatch binary will be there and there will be no errors when patching the databases. (*)

The procedure for Oracle Home cloning is described in the Oracle Documentation, here.

Another cool thing is that you can clone Oracle Homes across different nodes, so that you might have the same patch level everywhere without repeating the tedious tasks of upgrading the opatch, patching the binaries, etc. etc.

But still, you have to identify which Oracle Home you have to clone and keep track of the latest version.

Best approach (Golden Images)

The best approach would consist in having a central repository for your software, where you store every version of your Oracle Homes, one for each patch level.

Having a central repository allows to install the software ONCE and use a “clone, patch and store it” strategy. You can, for example, use only one server to do all the patching and then distribute your software images to the different database servers.

This is the concept of Golden Images used by Rapid Home Provisioning that will be in the scope of my next blog post.

Second, which patches are applied

Now that we have seen some Oracle Home patching approaches, is it worth to know which patches are important in a patching strategy.

It is better that you get used to the differences between PSU/BP and RU/RUR, by reading this valuable post from Mike Dietrich:

https://mikedietrichde.com/2017/10/24/differences-psu-bp-ru-rur/

I will make the assumption that in every case, the critical patches should be applied quarterly, or at least once per year, in order to fix security bugs.

The conservative approach (stability and performance over improvements)

Prior to 12.2, in order to guarantee security and stability, the best approach was to apply only PSUs each quarter.

From 12.2, the most conservative approach is to apply the latest Release Update Review on top of the oldest as possible Release Update. Confusing? Things will be clearer when I’ll write about the 18c New Release Model in a few days…

The cowboy approach (improvements over stability and performance)

Sometimes Bundle Patches and Release Updates contain cool backports from the new releases; sometimes they contain just more bug fixes than the PSUs and RURs; sometimes they fix important stuff like disabling bad transformations that lead to wrong result bugs or other annoying bugs.

Personally, I prefer to include such improvements in my patching strategy: I regularly apply RU for releases >=12.2 and BP for releases <=12.1. Don’t call me cowboy, however 🙂

The incumbent approach (or why you cannot avoid one-offs)

It does not matter your patch frequency: sometimes you hit a bug, and the only solution is either to apply the one-off patch or the workaround, if available.

If you apply the one-off patch for a specific bug, from an Oracle Home maintenance point of view, it would be better to

apply the same one-off everywhere (read, all your Oracle Homes with the very same release), this makes your environment homogeneous.

use a clone of the Oracle Home with the one-off as basis to apply the release update and distribute it to the other servers.

Why?

Again, it is a problem with rollback scripts (*), with patch conflicts and also, of number of versions to maintain:Less paths, less error-prone!

There is, however, the alternative to one-offs: implementing the workaround instead of applying the patch. Most of the time the workaround consist in disabling “something” through parameters, or worse, hidden parameters (the underscore parameters that the support says you should not set, but advise to do all the time as workaround :-))

It might be a good idea to use the workaround instead of apply tha patch if you already know that the bug will be fixed in the next Release Update (for example), or that the workaround is so easy to implement that it is not worth to create another version of Oracle Home that will require special attention at the next quarter.

If you apply workarounds, anyway, be sure that you comment EXACTLY why, when and who, so you can decide to unset it at the next parameter review or maintenance… e.g.

alter system set "_px_groupby_pushdown"=off
  comment='Ludo, 03.05.16: W/A for bug 18499088' scope=both sid='*';

alter system set "_fix_control"='14033181:0','11843466:off','26664361:7','16732417:1','20243268:1' 
  comment='Ludo, 20.11.17: fixes of BP171017 + W/A bugs 21303294 24499054' scope=spfile sid='*';

alter system set "_px_groupby_pushdown"=off

comment='Ludo, 03.05.16: W/A for bug 18499088' scope=both sid='*';

alter system set "_fix_control"='14033181:0','11843466:off','26664361:7','16732417:1','20243268:1'

comment='Ludo, 20.11.17: fixes of BP171017 + W/A bugs 21303294 24499054' scope=spfile sid='*';

Makes sense?

Oracle Home Management – part 1: “Patch soon, patch often” vs. reality

Posted on May 1, 2018 by Ludovico

With this post, I am starting a new blog series about Oracle Database home management, provisioning, patching… Best (and worst) practices, common practices and blueprints from my point of view as consultant and, sometimes, as operational DBA.

I hope to find the time to continue (and finish) it 🙂

How often should you upgrade/patch?

Database patching and upgrading is not an easy task, but it is really important.

Many companies do not have a clear patching strategy, for several reasons.

Patching is time consuming
It is complex
It introduces some risks
It is not always really necesary
It leads to human errors

Oracle, of course, recommends to apply the patches quarterly, as soon as they are released. But the reality is that it is (still) very common to find customers that do not apply patches regularly.

Look at this:

$ opatch lspatches
26925218;OCW Patch Set Update : 12.1.0.2.180116 (26925218)
26925263;Database Bundle Patch : 12.1.0.2.180116 (26925263)
22243983;

OPatch succeeded.

$ cd $ORACLE_HOME/inventory
$ grep -r "bug description" * |  wc -l
1883
$ grep -r "bug description" * | grep -i "wrong result" | wc -l
56

$ opatch lspatches

26925218;OCW Patch Set Update : 12.1.0.2.180116 (26925218)

26925263;Database Bundle Patch : 12.1.0.2.180116 (26925263)

22243983;

OPatch succeeded.

$ cd $ORACLE_HOME/inventory

$ grep -r "bug description" * | wc -l

1883

$ grep -r "bug description" * | grep -i "wrong result" | wc -l

With January 2018 Bundle Patch, you can fix 1883 bugs, including 56 “wrong results” bugs! I hope I will talk more about this kind of bugs, but for now consider that if you are not patching often, you are taking serious risks, including putting at risk your data consistency.

I will not talk about bugs, upgrade procedures, new releases here… For this, I recommend to follow Mike Dietrich’s blog: Upgrade your Database – NOW!

I would like rather to talk, as the title of this blog series states, about the approaches of maintaining the Oracle Homes across your Oracle server farm.

Common worst practices in maintaining homes

Maintaining a plethora of Oracle Homes across different servers requires thoughtful planning. This is a non-exhaustive list of bad practices that I see from time to time.

Installing by hand every new Oracle Home
Applying different patch levels on Oracle Homes with the same path
Not tracking the installed patches
Having Oracle Home paths hard-coded in the operational scripts
Not minding about Oracle Home path naming convention
Not minding about Oracle Home internal names
Copying Oracle Homes without minding about the Central Inventory

All these worst practices lead to what I like to call “patching madness”… that monster that makes regular patching very difficult / impossible.

THIS IS A SITUATION THAT YOU NEED TO AVOID:

Server A
/u01/app/oracle/product/12.1.0            -> Home "OraHOme12C", contains clean 12.1.0.2

Server B
/u01/app/oracle/product/12.1.0.2          -> Home "OraHome1",   contains 12.1.0.2.PSU161018
/u01/app/oracle/product/12.1.0.2.BP170117 -> Home "OraHome2",   contains 12.1.0.2.BP170117

Server C
/u01/app/oracle/product/12.1.0            -> Home "OraHome1",   contains clean 12.1.0.1
/u01/app/oracle/product/12.1.0.2          -> Home "DBHome_1",   contains 12.1.0.2.BP170117

Server A

/u01/app/oracle/product/12.1.0 -> Home "OraHOme12C", contains clean 12.1.0.2

Server B

/u01/app/oracle/product/12.1.0.2 -> Home "OraHome1", contains 12.1.0.2.PSU161018

/u01/app/oracle/product/12.1.0.2.BP170117 -> Home "OraHome2", contains 12.1.0.2.BP170117

Server C

/u01/app/oracle/product/12.1.0 -> Home "OraHome1", contains clean 12.1.0.1

/u01/app/oracle/product/12.1.0.2 -> Home "DBHome_1", contains 12.1.0.2.BP170117

A better approach, would be starting having some naming conventions, e.g.:

Server A
/u01/app/oracle/product/12.1.0.2           -> Home "Ora12cR2",           contains clean 12.1.0.2

Server B
/u01/app/oracle/product/12.1.0.2.PSU161018 -> Home "Ora12cR2_PSU161018", contains 12.1.0.2.PSU161018
/u01/app/oracle/product/12.1.0.2.BP170117  -> Home "Ora12cR2_BP170117",  contains 12.1.0.2.BP170117

Server C
/u01/app/oracle/product/12.1.0.1           -> Home "Ora12cR1",           contains clean 12.1.0.1
/u01/app/oracle/product/12.1.0.2.BP170117  -> Home "Ora12cR2_BP170117",  contains 12.1.0.2.BP170117

Server A

/u01/app/oracle/product/12.1.0.2 -> Home "Ora12cR2", contains clean 12.1.0.2

Server B

/u01/app/oracle/product/12.1.0.2.PSU161018 -> Home "Ora12cR2_PSU161018", contains 12.1.0.2.PSU161018

/u01/app/oracle/product/12.1.0.2.BP170117 -> Home "Ora12cR2_BP170117", contains 12.1.0.2.BP170117

Server C

/u01/app/oracle/product/12.1.0.1 -> Home "Ora12cR1", contains clean 12.1.0.1

/u01/app/oracle/product/12.1.0.2.BP170117 -> Home "Ora12cR2_BP170117", contains 12.1.0.2.BP170117

In the next blog post, I will talk about common patching patterns and their pitfalls.

DBMS_AUDIT_MGMT.CLEAN_AUDIT_TRAIL not working on 12c? Here’s why…

Posted on April 27, 2018 by Ludovico

It is bad to realize, after a few years, that my customer’s Audit Cleanup procedures are not working properly for every database…

NOTE: The post is based on standard audit, not unified audit.

My customer developed a quite nice procedure for database housekeeping (including diag dest, OS audit trail, recyclebin, DB audit…)

But after some performance problems, I have come across the infamous sql_id 4ztz048yfq32s:

SELECT TO_CHAR(current_timestamp AT TIME ZONE 'GMT', 'YYYY-MM-DD HH24:MI:SS TZD') AS curr_timestamp, COUNT(username) AS failed_count, TO_CHAR(MIN(timestamp), 'yyyy-mm-dd hh24:mi:ss') AS first_occur_time, TO_CHAR(MAX(timestamp), 'yyyy-mm-dd hh24:mi:ss') AS last_occur_time
FROM sys.dba_audit_session
WHERE returncode != 0 AND timestamp >= current_timestamp - TO_DSINTERVAL('0 0:30:00')

SELECT TO_CHAR(current_timestamp AT TIME ZONE 'GMT', 'YYYY-MM-DD HH24:MI:SS TZD') AS curr_timestamp, COUNT(username) AS failed_count, TO_CHAR(MIN(timestamp), 'yyyy-mm-dd hh24:mi:ss') AS first_occur_time, TO_CHAR(MAX(timestamp), 'yyyy-mm-dd hh24:mi:ss') AS last_occur_time

FROM sys.dba_audit_session

WHERE returncode != 0 AND timestamp >= current_timestamp - TO_DSINTERVAL('0 0:30:00')

This SQL comes from the “Failed Logon Attempts” metric in Enterprise Manager.

I’ve checked the specific database, and the table SYS.AUD$ was containing way too many rows, dating before our purge time:

SQL> select min(timestamp) from dba_audit_session;

MIN(TIMESTAMP)
-------------------
04.02.2017 07:01:20

SQL>  select dbid, count(*) from aud$ group by dbid;

      DBID   COUNT(*)
---------- ----------
2416611527   35846477

SQL> select min(timestamp) from dba_audit_session;

MIN(TIMESTAMP)

-------------------

04.02.2017 07:01:20

SQL> select dbid, count(*) from aud$ group by dbid;

DBID COUNT(*)

---------- ----------

2416611527 35846477

The cleanup procedure does basically this:

SQL> begin
  2  dbms_audit_mgmt.set_last_archive_timestamp(audit_trail_type  => DBMS_AUDIT_MGMT.AUDIT_TRAIL_AUD_STD
  3                          ,last_archive_time => SYSTIMESTAMP-31);
  4  end;
  5  /

PL/SQL procedure successfully completed.

SQL> set timing on
SQL> begin
  2  dbms_audit_mgmt.clean_audit_trail(
  3    audit_trail_type => sys.dbms_audit_mgmt.AUDIT_TRAIL_AUD_STD,
  4    use_last_arch_timestamp => TRUE);
  5  end;
  6  /

PL/SQL procedure successfully completed.

Elapsed: 00:00:38.34

SQL> begin

2 dbms_audit_mgmt.set_last_archive_timestamp(audit_trail_type => DBMS_AUDIT_MGMT.AUDIT_TRAIL_AUD_STD

3 ,last_archive_time => SYSTIMESTAMP-31);

4 end;

5 /

PL/SQL procedure successfully completed.

SQL> set timing on

SQL> begin

2 dbms_audit_mgmt.clean_audit_trail(

3 audit_trail_type => sys.dbms_audit_mgmt.AUDIT_TRAIL_AUD_STD,

4 use_last_arch_timestamp => TRUE);

5 end;

6 /

PL/SQL procedure successfully completed.

Elapsed: 00:00:38.34

But despite a retention window of 31 days, the rows are still there:

SQL> select min(timestamp) from dba_audit_session;

MIN(TIMESTAMP)
-------------------
04.02.2017 07:01:20

Elapsed: 00:00:29.06

SQL> select min(timestamp) from dba_audit_session;

MIN(TIMESTAMP)

-------------------

04.02.2017 07:01:20

Elapsed: 00:00:29.06

(today is 27.04.2018, so the oldest records are more than 1 year old)

I’ve checked with ASH, the actual delete statement executed by the clean_audit_trail procedure is:

DELETE FROM SYS.AUD$ WHERE DBID = 2416611527 AND NTIMESTAMP# < to_timestamp('2017-02-04 05:01:10', 'YYYY-MM-DD HH24:MI:SS.FF') AND ROWNUM <= 140724603463440

1	DELETE FROM SYS.AUD$ WHERE DBID = 2416611527 AND NTIMESTAMP# < to_timestamp('2017-02-04 05:01:10', 'YYYY-MM-DD HH24:MI:SS.FF') AND ROWNUM <= 140724603463440

So, the DBID clause is OK, but the NTIMESTAMP# clause is not!

Why?

Long story long (hint, it’s a bug: 19958239)
Update 30.05.2018 the solution is explained in this Doc: 2068066.1, thanks John)

The cleanup metadata is stored into the view DBA_AUDIT_MGMT_LAST_ARCH_TS. Its structure in 11g was:

SQL> desc dba_audit_mgmt_last_arch_ts
 Name                                      Null?    Type
 ----------------------------------------- -------- ----------------------------
 AUDIT_TRAIL                                        VARCHAR2(20)
 RAC_INSTANCE                              NOT NULL NUMBER
 LAST_ARCHIVE_TS                                    TIMESTAMP(6) WITH TIME ZONE

SQL> desc dba_audit_mgmt_last_arch_ts

Name Null? Type

----------------------------------------- -------- ----------------------------

AUDIT_TRAIL VARCHAR2(20)

RAC_INSTANCE NOT NULL NUMBER

LAST_ARCHIVE_TS TIMESTAMP(6) WITH TIME ZONE

But in 12c, there are 2 new columns:

SQL> desc dba_audit_mgmt_last_arch_ts
 Name                                  Null?    Type
 ------------------------------------- -------- ----------------------------
 AUDIT_TRAIL                                    VARCHAR2(20)
 RAC_INSTANCE                          NOT NULL NUMBER
 LAST_ARCHIVE_TS                                TIMESTAMP(6) WITH TIME ZONE
 DATABASE_ID                           NOT NULL NUMBER
 CONTAINER_GUID                        NOT NULL VARCHAR2(33)

SQL> desc dba_audit_mgmt_last_arch_ts

Name Null? Type

------------------------------------- -------- ----------------------------

AUDIT_TRAIL VARCHAR2(20)

RAC_INSTANCE NOT NULL NUMBER

LAST_ARCHIVE_TS TIMESTAMP(6) WITH TIME ZONE

DATABASE_ID NOT NULL NUMBER

CONTAINER_GUID NOT NULL VARCHAR2(33)

When the database is upgraded from 11g to 12c, the two new columns are set to “0” by default.

SQL> select * from dba_audit_mgmt_last_arch_ts;

AUDIT_TRAIL                 RAC_INSTANCE LAST_ARCHIVE_TS                      DATABASE_ID CONTAINER_GUID
--------------------------- ------------ ------------------------------------ ----------- --------------------------------
STANDARD AUDIT TRAIL                   0 04-FEB-17 05.01.10.000000 AM +00:00            0 00000000000000000000000000000000
OS AUDIT TRAIL                         1 04-FEB-17 05.01.15.000000 AM +02:00            0 00000000000000000000000000000000

SQL> select * from dba_audit_mgmt_last_arch_ts;

AUDIT_TRAIL RAC_INSTANCE LAST_ARCHIVE_TS DATABASE_ID CONTAINER_GUID

--------------------------- ------------ ------------------------------------ ----------- --------------------------------

STANDARD AUDIT TRAIL 0 04-FEB-17 05.01.10.000000 AM +00:00 0 00000000000000000000000000000000

OS AUDIT TRAIL 1 04-FEB-17 05.01.15.000000 AM +02:00 0 00000000000000000000000000000000

But when the procedure DBMS_AUDIT_MGMT.SET_LAST_ARCHIVE_TIMESTAMP is executed, the actual dbid is used, and new lines appear:

SQL> select * from dba_audit_mgmt_last_arch_ts;

AUDIT_TRAIL                 RAC_INSTANCE LAST_ARCHIVE_TS                      DATABASE_ID CONTAINER_GUID
--------------------------- ------------ ------------------------------------ ----------- --------------------------------
STANDARD AUDIT TRAIL                   0 04-FEB-17 05.01.10.000000 AM +00:00            0 00000000000000000000000000000000
OS AUDIT TRAIL                         1 04-FEB-17 05.01.15.000000 AM +02:00            0 00000000000000000000000000000000
STANDARD AUDIT TRAIL                   0 27-MAR-18 12.29.55.000000 PM +00:00   2416611527 4A2962517EF2316FE0532296780AE383
OS AUDIT TRAIL                         1 27-MAR-18 12.20.06.000000 PM +02:00   2416611527 4A2962517EF2316FE0532296780AE383

SQL> select * from dba_audit_mgmt_last_arch_ts;

AUDIT_TRAIL RAC_INSTANCE LAST_ARCHIVE_TS DATABASE_ID CONTAINER_GUID

--------------------------- ------------ ------------------------------------ ----------- --------------------------------

STANDARD AUDIT TRAIL 0 04-FEB-17 05.01.10.000000 AM +00:00 0 00000000000000000000000000000000

OS AUDIT TRAIL 1 04-FEB-17 05.01.15.000000 AM +02:00 0 00000000000000000000000000000000

STANDARD AUDIT TRAIL 0 27-MAR-18 12.29.55.000000 PM +00:00 2416611527 4A2962517EF2316FE0532296780AE383

OS AUDIT TRAIL 1 27-MAR-18 12.20.06.000000 PM +02:00 2416611527 4A2962517EF2316FE0532296780AE383

It is clear now that the DELETE statement is not constructed properly. It should get the LAST_ARCHIVE_TS of the actual DBID being purged… but it takes the other one.

According to my tests, it does not use neither the correct timestamp for the dbid, nor get the oldest timestamp: it uses instead the timestamp of the first record found by the clause “WHERE AUDIT_TRAIL=’STANDARD AUDIT TRAIL'”. It depends on the physical location of the row in the table! Clearly a big mess… (PS, not sure 100%, but this is what I suppose)

So, I have tried to modify the archive time for DBID 0:

SQL> begin
  2  dbms_audit_mgmt.set_last_archive_timestamp(audit_trail_type  => DBMS_AUDIT_MGMT.AUDIT_TRAIL_AUD_STD
  3                          ,last_archive_time => SYSTIMESTAMP-31
  4                          ,database_id => 0
  5                          ,container_guid => '00000000000000000000000000000000');
  6  end;
  7
  8  /

PL/SQL procedure successfully completed.

SQL> select database_id, audit_trail, last_archive_ts from dba_audit_mgmt_last_arch_ts;

DATABASE_ID AUDIT_TRAIL                   LAST_ARCHIVE_TS
----------- ----------------------------- ----------------------------------------
          0 STANDARD AUDIT TRAIL          27-MAR-18 12.37.22.000000 PM +00:00
          0 OS AUDIT TRAIL                04-FEB-17 05.01.15.000000 AM +02:00
 2416611527 STANDARD AUDIT TRAIL          27-MAR-18 12.29.55.000000 PM +00:00
 2416611527 OS AUDIT TRAIL                27-MAR-18 12.20.06.000000 PM +02:00

SQL> begin

2 dbms_audit_mgmt.set_last_archive_timestamp(audit_trail_type => DBMS_AUDIT_MGMT.AUDIT_TRAIL_AUD_STD

3 ,last_archive_time => SYSTIMESTAMP-31

4 ,database_id => 0

5 ,container_guid => '00000000000000000000000000000000');

6 end;

8 /

PL/SQL procedure successfully completed.

SQL> select database_id, audit_trail, last_archive_ts from dba_audit_mgmt_last_arch_ts;

DATABASE_ID AUDIT_TRAIL LAST_ARCHIVE_TS

----------- ----------------------------- ----------------------------------------

0 STANDARD AUDIT TRAIL 27-MAR-18 12.37.22.000000 PM +00:00

0 OS AUDIT TRAIL 04-FEB-17 05.01.15.000000 AM +02:00

2416611527 STANDARD AUDIT TRAIL 27-MAR-18 12.29.55.000000 PM +00:00

2416611527 OS AUDIT TRAIL 27-MAR-18 12.20.06.000000 PM +02:00

Trying to execute the cleanup again, now leads to a better timestamp:

DELETE FROM SYS.AUD$ WHERE DBID = 2416611527 AND NTIMESTAMP# < to_timestamp('2018-03-27 12:37:22', 'YYYY-MM-DD HH24:MI:SS.FF') AND ROWNUM <= 140724603463440

1	DELETE FROM SYS.AUD$ WHERE DBID = 2416611527 AND NTIMESTAMP# < to_timestamp('2018-03-27 12:37:22', 'YYYY-MM-DD HH24:MI:SS.FF') AND ROWNUM <= 140724603463440

I have then tried to play a little bit with the DBA_AUDIT_MGMT_LAST_ARCH_TS view (and the underlying table DAM_LAST_ARCH_TS$).

First, I’ve faked the DBID:

SQL> update dba_audit_mgmt_last_arch_ts set database_id=2416611526 where database_id=0;

2 rows updated.

SQL> commit;

Commit complete.
SQL> select database_id, audit_trail, last_archive_ts from DBA_AUDIT_MGMT_LAST_ARCH_TS;

DATABASE_ID AUDIT_TRAIL                                                  LAST_ARCHIVE_TS
----------- ------------------------------------------------------------ ---------------------------------------------------------------------------
 2416611526 STANDARD AUDIT TRAIL                                         27-MAR-18 12.37.22.000000 PM +00:00
 2416611526 OS AUDIT TRAIL                                               04-FEB-17 05.01.15.000000 AM +02:00
 2416611527 STANDARD AUDIT TRAIL                                         27-MAR-18 12.29.55.000000 PM +00:00
 2416611527 OS AUDIT TRAIL                                               27-MAR-18 12.20.06.000000 PM +02:00

SQL> update dba_audit_mgmt_last_arch_ts set database_id=2416611526 where database_id=0;

2 rows updated.

SQL> commit;

Commit complete.

SQL> select database_id, audit_trail, last_archive_ts from DBA_AUDIT_MGMT_LAST_ARCH_TS;

DATABASE_ID AUDIT_TRAIL LAST_ARCHIVE_TS

----------- ------------------------------------------------------------ ---------------------------------------------------------------------------

2416611526 STANDARD AUDIT TRAIL 27-MAR-18 12.37.22.000000 PM +00:00

2416611526 OS AUDIT TRAIL 04-FEB-17 05.01.15.000000 AM +02:00

2416611527 STANDARD AUDIT TRAIL 27-MAR-18 12.29.55.000000 PM +00:00

2416611527 OS AUDIT TRAIL 27-MAR-18 12.20.06.000000 PM +02:00

Then, I have tried to increase the retention timestamp (500 days):

SQL> begin
  2  dbms_audit_mgmt.set_last_archive_timestamp(audit_trail_type  => DBMS_AUDIT_MGMT.AUDIT_TRAIL_AUD_STD
  3                          ,last_archive_time => SYSTIMESTAMP-500
  4                          ,database_id => 2416611526
  5                          ,container_guid => '00000000000000000000000000000000');
  6  end;
  7  /

PL/SQL procedure successfully completed.

SQL> select database_id, audit_trail, last_archive_ts from dba_audit_mgmt_last_arch_ts;

DATABASE_ID AUDIT_TRAIL                                                  LAST_ARCHIVE_TS
----------- ------------------------------------------------------------ ---------------------------------------------------------------------------
 2416611526 STANDARD AUDIT TRAIL                                         13-DEC-16 12.48.23.000000 PM +00:00
 2416611526 OS AUDIT TRAIL                                               04-FEB-17 05.01.15.000000 AM +02:00
 2416611527 STANDARD AUDIT TRAIL                                         27-MAR-18 12.29.55.000000 PM +00:00
 2416611527 OS AUDIT TRAIL                                               27-MAR-18 12.20.06.000000 PM +02:00

SQL> begin

2 dbms_audit_mgmt.set_last_archive_timestamp(audit_trail_type => DBMS_AUDIT_MGMT.AUDIT_TRAIL_AUD_STD

3 ,last_archive_time => SYSTIMESTAMP-500

4 ,database_id => 2416611526

5 ,container_guid => '00000000000000000000000000000000');

6 end;

7 /

PL/SQL procedure successfully completed.

SQL> select database_id, audit_trail, last_archive_ts from dba_audit_mgmt_last_arch_ts;

DATABASE_ID AUDIT_TRAIL LAST_ARCHIVE_TS

----------- ------------------------------------------------------------ ---------------------------------------------------------------------------

2416611526 STANDARD AUDIT TRAIL 13-DEC-16 12.48.23.000000 PM +00:00

2416611526 OS AUDIT TRAIL 04-FEB-17 05.01.15.000000 AM +02:00

2416611527 STANDARD AUDIT TRAIL 27-MAR-18 12.29.55.000000 PM +00:00

2416611527 OS AUDIT TRAIL 27-MAR-18 12.20.06.000000 PM +02:00

Finally, I have tried to purge the audit trail with both DBIDs:

SQL> begin
  2  dbms_audit_mgmt.clean_audit_trail(
  3    audit_trail_type => sys.dbms_audit_mgmt.AUDIT_TRAIL_AUD_STD,
  4    database_id =>   2416611526,
  5    use_last_arch_timestamp => TRUE);
  6  end;
  7  /

PL/SQL procedure successfully completed.

Elapsed: 00:00:45.89

SQL> begin
  2   dbms_audit_mgmt.clean_audit_trail(
  3    audit_trail_type => sys.dbms_audit_mgmt.AUDIT_TRAIL_AUD_STD,
  4    database_id =>   2416611527,
  5     use_last_arch_timestamp => TRUE);
  6  end
  7  ;
  8  /

PL/SQL procedure successfully completed.

Elapsed: 00:00:34.72

SQL> begin

2 dbms_audit_mgmt.clean_audit_trail(

3 audit_trail_type => sys.dbms_audit_mgmt.AUDIT_TRAIL_AUD_STD,

4 database_id => 2416611526,

5 use_last_arch_timestamp => TRUE);

6 end;

7 /

PL/SQL procedure successfully completed.

Elapsed: 00:00:45.89

SQL> begin

2 dbms_audit_mgmt.clean_audit_trail(

3 audit_trail_type => sys.dbms_audit_mgmt.AUDIT_TRAIL_AUD_STD,

4 database_id => 2416611527,

5 use_last_arch_timestamp => TRUE);

6 end

7 ;

8 /

PL/SQL procedure successfully completed.

Elapsed: 00:00:34.72

As I expected, in both cases the the cleanup generated the delete with the timestamp of the fake DBID:

-- clean audit trail for dbid 2416611526 
DELETE FROM SYS.AUD$ WHERE DBID = 2416611526 AND NTIMESTAMP# < to_timestamp('2016-12-13 12:48:23', 'YYYY-MM-DD HH24:MI:SS.FF') AND ROWNUM <= 140724603463440

-- clean audit trail for dbid 2416611527
DELETE FROM SYS.AUD$ WHERE DBID = 2416611527 AND NTIMESTAMP# < to_timestamp('2016-12-13 12:48:23', 'YYYY-MM-DD HH24:MI:SS.FF') AND ROWNUM <= 140724603463440

-- clean audit trail for dbid 2416611526

DELETE FROM SYS.AUD$ WHERE DBID = 2416611526 AND NTIMESTAMP# < to_timestamp('2016-12-13 12:48:23', 'YYYY-MM-DD HH24:MI:SS.FF') AND ROWNUM <= 140724603463440

-- clean audit trail for dbid 2416611527

DELETE FROM SYS.AUD$ WHERE DBID = 2416611527 AND NTIMESTAMP# < to_timestamp('2016-12-13 12:48:23', 'YYYY-MM-DD HH24:MI:SS.FF') AND ROWNUM <= 140724603463440

Is it possible to delete the unwanted records from the view DBA_AUDIT_MGMT_LAST_ARCH_TS?

Not only is possible, but I recommend it:

SQL> delete from dba_audit_mgmt_last_arch_ts where database_id=2416611526;

2 rows deleted.

SQL> commit;

Commit complete.

SQL>

SQL> delete from dba_audit_mgmt_last_arch_ts where database_id=2416611526;

2 rows deleted.

SQL> commit;

Commit complete.

SQL>

Afterwards, the timestamp in the where condition is correct and remains correct after subsequent executions of DBMS_AUDIT_MGMT.SET_LAST_ARCHIVE_TIMESTAMP.

Conclusions, IMPORTANT FOR THE DATABASE OPERATIONS:

The upgrade causes the unwanted lines with DBID=0 in the DBA_AUDIT_MGMT_LAST_ARCH_TS view.

Moreover, any duplicate changes the DBID: any subsequent execution of DBMS_AUDIT_MGMT.SET_LAST_ARCHIVE_TIMESTAMP in the duplicated database will lead to additional lines in the view.

This is what I plan to do now:

Whenever I upgrade from 11g to 12c, cleanup the data from DBA_AUDIT_MGMT_LAST_ARCH_TS and schedule the cleanup for DBID 0 as well
Whenever I duplicate a database, I execute a DELETE (without clauses) from DBA_AUDIT_MGMT_LAST_ARCH_T and a truncate of the table SYS.AUD$ (it is a duplicate, after all!)

HTH

BP and Patch 22652097: set optimizer_adaptive_statistics to FALSE explicitly or it might not work!

Posted on February 20, 2018 by Ludovico

Update 14.03.2018: After some exchanges with Nigel Bayliss, the behaviour described here has been filed as unpublished bug 27626925: OPTIMIZER ADAPTIVE STATS DEFAULT FALSE NOT HONORED WHEN ENABLED IN OCT OR JAN BP. It will be fixed starting with April’s bundle patch.

According to Nigel’s blog post:

The Oracle 12.1.0.2 October 2017 BP and the Adaptive Optimizer

if you installled the patch 22652097 prior to apply the Bundle Patch 171018, the BP apply in the database should recognize that the patch was already in place and keep it activated. This is done through the fix control 26664361.

When fix_control 26664361:0 -> Patch 22652097 is not enabled: the parameter optimizer_adaptive_features (OAF) works

When fix_control 26664361:1 -> Patch 22652097 is enabled; optimizer_adaptive_features is discarded and the two new parameters have the priority: optimizer_adaptive_plans (OAP) and optimizer_adaptive_statistics (OAS).

But at my customer, I had another behavior.

My patching story might be very similar to yours!

When I started upgrading my customer’s database to 12c in early 2015, I experienced very soon the infamous problems with SQL Plan Directives (SPD) and Adaptive Dynamic Sampling (ADS) that I described in my paper: ADAPTIVE FEATURES OR: HOW I LEARNED TO STOP WORRYING AND TROUBLESHOOT THE BOMB .

Early fixes

When I was new to the problem, the quick fix for the problematic applications was to set OAF to FALSE.

Later, I discovered some more details and decided to opt for setting:

_optimizer_dsdir_usage_control=0

1	_optimizer_dsdir_usage_control=0

In other cases, I disabled the specific directives that were causing problems.

But many databases did not have so many problems, and I left the defaults.

Patch 22652097 on top of BP170718

At some point, me and my customer decided to apply the fix 22652097, on top of BP170718 that was our current patch level at that time.

The patch installation on a test database was complaining about the optimizer_adaptive_feature set: this parameter was not used anymore. This issue is nicely explained by Flora in her post Patch 22652097 in 12.1 makes optimizer_adaptive_features parameter obsolete.

In order to apply that patch on the remaining databases, we did:

alter system reset optimizer_adaptive_features;
alter system reset “_optimizer_dsdir_usage_control”;
Applied the patch on binaries and datapatch on the databases.

The result at this point was that:

optimizer_adaptive_features was not set
optimizer_adaptive_plans was set to true
optimizer_adaptive_statistics was set to false.

It might seems superflous to say, but it’s not, the SQL Plan Directives were not used anymore: no Adaptice Dynamic Sampling and no performance problems.

Bundle Patch 180116

Three weeks ago, we installled the last Bundle Patch in order to fix some Grid Infrastructure problems, and the BP, as described in Nigel’s note (and Mike Dietrich and many other bloggers :-)) contains the patch 22652097.

According to Nigel’s post, the patch installation should have detected that the patch 22652097 was already there and activate it.

And indeed, after we applied the BP, the fix_control 26664361 was set to 1 (that means that the patch 22652097 is enabled). So we went live with this setup without additional checks.

One week later, we started experiencing performance problems again. I noticed immediately that the Adaptive Dynamic Sampling was very aggressive again, and the SQL Plan Directives used again.

But the fix was there AND ENABLED!

After a few tests, I realized that the SPD is not used anymore if I set optimizer_adaptive_statistics EXPLICITLY to false.

optimizer_adaptive_statistics must be set explicitly, the default does not work

And here’s the proof:

I use once again the great SPD example by Tim Hall (sorry Tim, it’s not the first time that I steal your work 🙂 ) . You can find here:

SQL Plan Directives in Oracle Database 12c Release 1 (12.1)

After applying the BP, I have the default parameter, not set explicitly, and the fix_control enabled:

SQL> select value from v$system_fix_control where bugno = 26664361;

     VALUE
----------
         1

SQL> select name, value, isdefault, ismodified from v$parameter where name='optimizer_adaptive_statistics';  
  
NAME                                    VALUE                          ISDEFAULT ISMODIFIED  
---------------------------------------- ------------------------------ --------- ----------------------------------------  
optimizer_adaptive_statistics            FALSE                          TRUE      FALSE

SQL> select value from v$system_fix_control where bugno = 26664361;

VALUE

----------

SQL> select name, value, isdefault, ismodified from v$parameter where name='optimizer_adaptive_statistics';

NAME VALUE ISDEFAULT ISMODIFIED

---------------------------------------- ------------------------------ --------- ----------------------------------------

optimizer_adaptive_statistics FALSE TRUE FALSE

If I run the test statement (again, find it here https://oracle-base.com/articles/12c/sql-plan-directives-12cr1) the directives are used:

SQL> SELECT /*+ GATHER_PLAN_STATISTICS */  
      *  
  2  FROM  tab1  
WHERE  gender = 'M'  
AND    has_y_chromosome = 'Y';  
  
SET LINESIZE 200 PAGESIZE 100  
  
...  
  
10 rows selected.  
  
SQL> SELECT * FROM TABLE(DBMS_XPLAN.display_cursor(format => 'allstats last'));  
  
PLAN_TABLE_OUTPUT  
--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------  
SQL_ID  5t8y8p5mpb99j, child number 0  
-------------------------------------  
SELECT /*+ GATHER_PLAN_STATISTICS */        * FROM  tab1 WHERE  gender  
= 'M' AND    has_y_chromosome = 'Y'  
  
Plan hash value: 1552452781  
  
-----------------------------------------------------------------------------------------------------------------  
| Id  | Operation                          | Name            | Starts | E-Rows | A-Rows |  A-Time  | Buffers |  
-----------------------------------------------------------------------------------------------------------------  
|  0 | SELECT STATEMENT                    |                |      1 |        |    10 |00:00:00.01 |      4 |  
|*  1 |  TABLE ACCESS BY INDEX ROWID BATCHED| TAB1            |      1 |    10 |    10 |00:00:00.01 |      4 |  
|*  2 |  INDEX RANGE SCAN                  | TAB1_GENDER_IDX |      1 |    10 |    10 |00:00:00.01 |      2 |  
-----------------------------------------------------------------------------------------------------------------  
  
Predicate Information (identified by operation id):  
---------------------------------------------------  
  
  1 - filter("HAS_Y_CHROMOSOME"='Y')  
  2 - access("GENDER"='M')  
  
Note  
-----  
  - dynamic statistics used: dynamic sampling (level=2)  
  - 2 Sql Plan Directives used for this statement  
      
      
    26 rows selected.

SQL> SELECT /*+ GATHER_PLAN_STATISTICS */

2 FROM tab1

WHERE gender = 'M'

AND has_y_chromosome = 'Y';

SET LINESIZE 200 PAGESIZE 100

...

10 rows selected.

SQL> SELECT * FROM TABLE(DBMS_XPLAN.display_cursor(format => 'allstats last'));

PLAN_TABLE_OUTPUT

--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------

SQL_ID 5t8y8p5mpb99j, child number 0

-------------------------------------

SELECT /*+ GATHER_PLAN_STATISTICS */ * FROM tab1 WHERE gender

= 'M' AND has_y_chromosome = 'Y'

Plan hash value: 1552452781

-----------------------------------------------------------------------------------------------------------------

-----------------------------------------------------------------------------------------------------------------

| 0 | SELECT STATEMENT | | 1 | | 10 |00:00:00.01 | 4 |

|* 1 | TABLE ACCESS BY INDEX ROWID BATCHED| TAB1 | 1 | 10 | 10 |00:00:00.01 | 4 |

|* 2 | INDEX RANGE SCAN | TAB1_GENDER_IDX | 1 | 10 | 10 |00:00:00.01 | 2 |

-----------------------------------------------------------------------------------------------------------------

Predicate Information (identified by operation id):

---------------------------------------------------

1 - filter("HAS_Y_CHROMOSOME"='Y')

2 - access("GENDER"='M')

Note

-----

- dynamic statistics used: dynamic sampling (level=2)

- 2 Sql Plan Directives used for this statement

26 rows selected.

but then I set the parameter explicitly:

SQL> alter system flush shared_pool;  
  
System altered.  
  
SQL> alter system set optimizer_adaptive_statistics=false;  
  
System altered.  
  
SQL> select name, value, isdefault, ismodified from v$parameter where name='optimizer_adaptive_statistics';  
  
NAME                                     VALUE                          ISDEFAULT ISMODIFIED  
---------------------------------------- ------------------------------ --------- ----------------------------------------  
optimizer_adaptive_statistics            FALSE                          TRUE      MODIFIED

SQL> alter system flush shared_pool;

System altered.

SQL> alter system set optimizer_adaptive_statistics=false;

System altered.

SQL> select name, value, isdefault, ismodified from v$parameter where name='optimizer_adaptive_statistics';

NAME VALUE ISDEFAULT ISMODIFIED

---------------------------------------- ------------------------------ --------- ----------------------------------------

optimizer_adaptive_statistics FALSE TRUE MODIFIED

and the SPD usage (and consequently, ADS), are gone:

SQL> SELECT /*+ GATHER_PLAN_STATISTICS */  
       *  
FROM   tab1  
WHERE  gender = 'M'  
AND    has_y_chromosome = 'Y';  
  
SET LINESIZE 200 PAGESIZE 100  
  
        ID G H  
---------- - -  
         1 M Y  
         2 M Y  
         3 M Y  
         4 M Y  
         5 M Y  
         6 M Y  
         7 M Y  
         8 M Y  
         9 M Y  
        10 M Y  
  
10 rows selected.  
  
SQL> SELECT * FROM TABLE(DBMS_XPLAN.display_cursor(format => 'allstats last'));  
  
PLAN_TABLE_OUTPUT  
--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------  
SQL_ID  5t8y8p5mpb99j, child number 0  
-------------------------------------  
SELECT /*+ GATHER_PLAN_STATISTICS */        * FROM   tab1 WHERE  gender  
= 'M' AND    has_y_chromosome = 'Y'  
  
Plan hash value: 1552452781  
  
-----------------------------------------------------------------------------------------------------------------  
| Id  | Operation                           | Name            | Starts | E-Rows | A-Rows |   A-Time   | Buffers |  
-----------------------------------------------------------------------------------------------------------------  
|   0 | SELECT STATEMENT                    |                 |      1 |        |     10 |00:00:00.01 |       4 |  
|*  1 |  TABLE ACCESS BY INDEX ROWID BATCHED| TAB1            |      1 |     25 |     10 |00:00:00.01 |       4 |  
|*  2 |   INDEX RANGE SCAN                  | TAB1_GENDER_IDX |      1 |     50 |     10 |00:00:00.01 |       2 |  
-----------------------------------------------------------------------------------------------------------------  
  
Predicate Information (identified by operation id):  
---------------------------------------------------  
  
   1 - filter("HAS_Y_CHROMOSOME"='Y')  
   2 - access("GENDER"='M')  
      
      
    21 rows selected.

SQL> SELECT /*+ GATHER_PLAN_STATISTICS */

FROM tab1

WHERE gender = 'M'

AND has_y_chromosome = 'Y';

SET LINESIZE 200 PAGESIZE 100

ID G H

---------- - -

1 M Y

2 M Y

3 M Y

4 M Y

5 M Y

6 M Y

7 M Y

8 M Y

9 M Y

10 M Y

10 rows selected.

SQL> SELECT * FROM TABLE(DBMS_XPLAN.display_cursor(format => 'allstats last'));

PLAN_TABLE_OUTPUT

SQL_ID 5t8y8p5mpb99j, child number 0

-------------------------------------

SELECT /*+ GATHER_PLAN_STATISTICS */ * FROM tab1 WHERE gender

= 'M' AND has_y_chromosome = 'Y'

Plan hash value: 1552452781

-----------------------------------------------------------------------------------------------------------------

-----------------------------------------------------------------------------------------------------------------

| 0 | SELECT STATEMENT | | 1 | | 10 |00:00:00.01 | 4 |

|* 1 | TABLE ACCESS BY INDEX ROWID BATCHED| TAB1 | 1 | 25 | 10 |00:00:00.01 | 4 |

|* 2 | INDEX RANGE SCAN | TAB1_GENDER_IDX | 1 | 50 | 10 |00:00:00.01 | 2 |

-----------------------------------------------------------------------------------------------------------------

Predicate Information (identified by operation id):

---------------------------------------------------

1 - filter("HAS_Y_CHROMOSOME"='Y')

2 - access("GENDER"='M')

21 rows selected.

Conclusion

Set the parameter EXPLICITLY when you apply the BP that contains the fix.

And ALWAYS test the behavior!

You can check how many statements use the dynamic sampling by following this short blog post by Dominic Brooks:

Which of my sql statements are using dynamic sampling?

HTH

My own Dbvisit Replicate integration with Grid Infrastructure

Posted on October 30, 2017 by Ludovico

I am helping my customer for a PoC of Dbvisit Replicate as a logical replication tool. I will not discuss (at least, not in this post) about the capabilities of the tool itself, its configuration or the caveats that you should beware of when you do logical replication. Instead, I will concentrate on how we will likely integrate it in the current environment.

My role in this PoC is to make sure that the tool will be easy to operate from the operational point of view, and the database operations, here, are supported by Oracle Grid Infrastructure and cold failover clusters.

Note: there are official Dbvisit online resources about how to configure Dbvisit Replicate in a cluster. I aim to complement those informations, not copy them.

Quick overview

If you know Dbvisit replicate, skip this paragraph.

There are three main components of Dbvisit Replicate: The FETCHER, the MINE and the APPLY processes. The FETCHER gets the redo stream from the source and sends it to the MINE process. The MINE process elaborates the redo streams and converts it in proprietary transaction log files (named plog). The APPLY process gets the plog files and applies the transactions on the destination database.

From an architectural point of view, MINE and APPLY do not need to run close to the databases that are part of the configuration. The FETCHER process, by opposite, needs to be local to the source database online log files (and archived logs).

Because the MINE process is the most resource intensive, it is not convenient to run it where the databases reside, as it might consume precious CPU resources that are licensed for Oracle Database. So, first step in this PoC: the FETCHER processes will run on the cluster, while MINE and APPLY will run on a dedicated Virtual Machine.

Clustering considerations

the FETCHER does NOT need to run on the server of the source database: having access to the online logs through the ASM instance is enough
to avoid SPoF, the fetcher should be a cluster resource that can relocate without problems
to simplify the configuration, the FETCHER configuration and the Dbvisit binaries should be on a shared filesystem (the FETCHER does not persist any data, just the logs)
the destination database might be literally anywhere: the APPLY connects via SQL*Net, so a correct name resolution and routing to the destination database are enough

so the implementation steps are:

create a shared filesystem
install dbvisit in the shared filesystem
create the Dbvisit Replicate configuration on the dedicated VM
copy the configuration files on the cluster
prepare an action script
configure the resource
test!

Convention over configuration: the importance of a strong naming convention

Before starting the implementation, I decided to put all the caveats related to the FETCHER resource relocation on paper:

Where will the configuration files reside? Dbvisit has an important variable: the Configuration Name. All the operations are done by passing a configuration file named /{PATH}/{CONFIG_NAME}/{CONFIG_NAME}-{PROCESS_TYPE}.ddc to the dbvrep binary. So, I decided to put ALL the configuration directories under the same path: given the Configuration Name, I will always be able to get the configuration file path.
How will the configuration files relocate from one node to the other? Easy here: they won’t. I will use an ACFS filesystem
How can I link the cluster resource with its configuration name? Easy again: I call my resources dbvrep.CONFIGNAME.PROCESS_TYPE. e.g. dbvrep.FROM_A_TO_B.fetcher
How will I manage the need to use a new version of dbvisit in the future? Old and new versions must coexist: Instead of using external configuration files, I will just use a custom resource attribute named DBVREP_HOME inside my resource type definition. (see later)
What port number should I use? Of course, many fetchers started on different servers should not have conflicts. This is something that might be either planned or made dynamic. I will opt for the first one. But instead of getting the port number inside the Dbvisit configuration, I will use a custom resource attribute: DBVREP_PORT.

Considerations on the FETCHER listen address

This requires a dedicated paragraph. The Dbvisit documentation suggest to create a VIP, bind on the VIP address and create a dependency between the FETCHER resource and the VIP. Here is where my configuration will differ.

Having a separate VIP per FETCHER resource might, potentially, lead to dozens of VIPs in the cluster. Everything will depend on the success of the PoC and on how many internal clients will decide to ask for such implementation. Many VIPs == many interactions with network admins for address reservation, DNS configurations, etc. Long story short, it might slow down the creation and maintenance of new configurations.

Instead, each FETCHER will listen to the local server address, and the action script will take care of:

getting the current host name
getting the current ASM instance
changing the settings of the specific Dbvisit Replicate configuration (ASM instance and FETCHER listen address)
starting the FETCHER

Implementation

Now that all the caveats and steps are clear, I can show how I implemented it:

Create a shared filesystem

asmcmd volcreate -G ACFS -s 10G dbvisit --column 1
/sbin/mkfs -t acfs /dev/asm/dbvisit-293
sudo /u01/app/grid/product/12.1.0.2/grid/bin/srvctl add filesystem -d /dev/asm/dbvisit-293 -m /u02/data/oracle/dbvisit -u oracle -fstype ACFS -autostart ALWAYS
srvctl start filesystem -d /dev/asm/dbvisit-293

asmcmd volcreate -G ACFS -s 10G dbvisit --column 1

/sbin/mkfs -t acfs /dev/asm/dbvisit-293

sudo /u01/app/grid/product/12.1.0.2/grid/bin/srvctl add filesystem -d /dev/asm/dbvisit-293 -m /u02/data/oracle/dbvisit -u oracle -fstype ACFS -autostart ALWAYS

srvctl start filesystem -d /dev/asm/dbvisit-293

Install dbvisit in the shared filesystem

out of scope!

1	out of scope!

Create the Dbvisit Replicate configuration on the dedicated VM

out of scope!

1	out of scope!

Copy the configuration files from the Dbvisit VM to the cluster

scp /u02/data/oracle/dbvisit/FROM_A_TO_B/FROM_A_TO_B-FETCHER.ddc \ 
 cluster-scan:/u02/data/oracle/dbvisit/FROM_A_TO_B

1 2	scp /u02/data/oracle/dbvisit/FROM_A_TO_B/FROM_A_TO_B-FETCHER.ddc \ cluster-scan:/u02/data/oracle/dbvisit/FROM_A_TO_B

Prepare an action script

$ cat dbvrep.sh
#!/bin/ksh
########################################
# Name   : dbvrep.sh
# Author : Ludovico Caldara, Trivadis AG

# the DBVISIT FETCHER process needs to know 2 attributes: DBVREP_HOME and DBVREP_PORT.
# If you want to call the action script directly set:
# _CRS_NAME=<resource name in format dbvrep.CONFIGNAME.fetcher>
# _CRS_DBVREP_HOME=<dbvrep installation path>
# _CRS_DBVREP_PORT=<listening port>

DBVREP_RES_NAME=${_CRS_NAME}
DBVREP_CONFIG_NAME=`echo $DBVREP_RES_NAME | awk -F. '{print $2}'`

# MINE, FETCHER or APPLY?
DBVREP_PROCESS_TYPE=`echo $DBVREP_RES_NAME | awk -F. '{print toupper($3)}'`

DBVREP_HOME=${_CRS_DBVREP_HOME}
DBVREP=${DBVREP_HOME}/dbvrep
DBVREP_PORT=${_CRS_DBVREP_PORT}
DBVREP_CONFIG_PATH=/u02/data/oracle/dbvisit

DBVREP_CONFIG_FILE=${DBVREP_CONFIG_PATH}/${DBVREP_CONFIG_NAME}/${DBVREP_CONFIG_NAME}-${DBVREP_PROCESS_TYPE}.ddc

function F_verify_dbvrep_up {
        ps -eaf | grep "[d]bvrep ${DBVREP_PROCESS_TYPE} $DBVREP_CONFIG_NAME" > /dev/null
        if [ $? -eq 0 ] ; then
                echo "OK"
        else
                echo "KO"
                exit 1
        fi
}

ACTION="${1}"
case "$ACTION" in

        'start')
        LOCAL_ASM="+"`ps -eaf | grep [a]sm_pmon | awk -F+ '{print $NF}'`;

        if [ "${DBVREP_PROCESS_TYPE}" == "FETCHER" ] ; then
                $DBVREP --daemon --ddcfile ${DBVREP_CONFIG_FILE} --silent <<EOF
set FETCHER.FETCHER_REMOTE_INTERFACE=${HOSTNAME}:${DBVREP_PORT}
set FETCHER.FETCHER_LISTEN_INTERFACE=${HOSTNAME}:${DBVREP_PORT}
set FETCHER.MINE_ASM=${LOCAL_ASM}
start FETCHER
EOF
        fi
;;

        'stop')
        $DBVREP --daemon --ddcfile ${DBVREP_CONFIG_FILE} shutdown ${DBVREP_PROCESS_TYPE}

;;

        'check')
        F_verify_dbvrep_up
;;

        'clean')
        sleep 1
        exit 0
;;

        *)
usage
;;

esac

$ cat dbvrep.sh

#!/bin/ksh

########################################

# Name : dbvrep.sh

# Author : Ludovico Caldara, Trivadis AG

# the DBVISIT FETCHER process needs to know 2 attributes: DBVREP_HOME and DBVREP_PORT.

# If you want to call the action script directly set:

# _CRS_NAME=<resource name in format dbvrep.CONFIGNAME.fetcher>

# _CRS_DBVREP_HOME=<dbvrep installation path>

# _CRS_DBVREP_PORT=<listening port>

DBVREP_RES_NAME=${_CRS_NAME}

DBVREP_CONFIG_NAME=`echo $DBVREP_RES_NAME | awk -F. '{print $2}'`

# MINE, FETCHER or APPLY?

DBVREP_PROCESS_TYPE=`echo $DBVREP_RES_NAME | awk -F. '{print toupper($3)}'`

DBVREP_HOME=${_CRS_DBVREP_HOME}

DBVREP=${DBVREP_HOME}/dbvrep

DBVREP_PORT=${_CRS_DBVREP_PORT}

DBVREP_CONFIG_PATH=/u02/data/oracle/dbvisit

DBVREP_CONFIG_FILE=${DBVREP_CONFIG_PATH}/${DBVREP_CONFIG_NAME}/${DBVREP_CONFIG_NAME}-${DBVREP_PROCESS_TYPE}.ddc

function F_verify_dbvrep_up {

ps -eaf | grep "[d]bvrep ${DBVREP_PROCESS_TYPE} $DBVREP_CONFIG_NAME" > /dev/null

if [ $? -eq 0 ] ; then

echo "OK"

else

echo "KO"

exit 1

}

ACTION="${1}"

case "$ACTION" in

'start')

LOCAL_ASM="+"`ps -eaf | grep [a]sm_pmon | awk -F+ '{print $NF}'`;

if [ "${DBVREP_PROCESS_TYPE}" == "FETCHER" ] ; then

$DBVREP --daemon --ddcfile ${DBVREP_CONFIG_FILE} --silent <<EOF

set FETCHER.FETCHER_REMOTE_INTERFACE=${HOSTNAME}:${DBVREP_PORT}

set FETCHER.FETCHER_LISTEN_INTERFACE=${HOSTNAME}:${DBVREP_PORT}

set FETCHER.MINE_ASM=${LOCAL_ASM}

start FETCHER

EOF

;;

'stop')

$DBVREP --daemon --ddcfile ${DBVREP_CONFIG_FILE} shutdown ${DBVREP_PROCESS_TYPE}

;;

'check')

F_verify_dbvrep_up

;;

'clean')

sleep 1

exit 0

;;

usage

;;

esac

Configure the resource

$ cat dbvrep.type
ATTRIBUTE=ACTION_SCRIPT
DEFAULT_VALUE=/path_to_action_script/dbvrep.ksh
TYPE=STRING
FLAGS=CONFIG

ATTRIBUTE=SCRIPT_TIMEOUT
DEFAULT_VALUE=120
TYPE=INT
FLAGS=CONFIG

ATTRIBUTE=DBVREP_PORT
DEFAULT_VALUE=
TYPE=INT
FLAGS=CONFIG

ATTRIBUTE=DBVREP_HOME
DEFAULT_VALUE=/u02/data/oracle/dbvisit/replicate
TYPE=STRING
FLAGS=CONFIG

ATTRIBUTE=SERVER_POOLS
DEFAULT_VALUE=*
TYPE=STRING
FLAGS=CONFIG|HOTMOD

ATTRIBUTE=START_DEPENDENCIES
DEFAULT_VALUE=hard() weak(type:ora.listener.type,global:type:ora.scan_listener.type) pullup()
TYPE=STRING
FLAGS=CONFIG

ATTRIBUTE=STOP_DEPENDENCIES
DEFAULT_VALUE=hard()
TYPE=STRING
FLAGS=CONFIG


ATTRIBUTE=RESTART_ATTEMPTS
DEFAULT_VALUE=2
TYPE=INT
FLAGS=CONFIG

ATTRIBUTE=CHECK_INTERVAL
DEFAULT_VALUE=60
TYPE=INT
FLAGS=CONFIG

ATTRIBUTE=FAILURE_THRESHOLD
DEFAULT_VALUE=2
TYPE=INT
FLAGS=CONFIG

ATTRIBUTE=UPTIME_THRESHOLD
DEFAULT_VALUE=8h
TYPE=STRING
FLAGS=CONFIG

ATTRIBUTE=FAILURE_INTERVAL
DEFAULT_VALUE=3600
TYPE=INT
FLAGS=CONFIG

$ crsctl add type dbvrep.type -basetype cluster_resource -file dbvrep.type
$ crsctl add resource dbvrep.FROM_A_TO_B.fetcher -type dbvrep.type \
  -attr "START_DEPENDENCIES=hard(db.source) pullup:always(db.source),STOP_DEPENDENCIES=hard(db.source),DBVREP_PORT=7901"

$ cat dbvrep.type

ATTRIBUTE=ACTION_SCRIPT

DEFAULT_VALUE=/path_to_action_script/dbvrep.ksh

TYPE=STRING

FLAGS=CONFIG

ATTRIBUTE=SCRIPT_TIMEOUT

DEFAULT_VALUE=120

TYPE=INT

FLAGS=CONFIG

ATTRIBUTE=DBVREP_PORT

DEFAULT_VALUE=

TYPE=INT

FLAGS=CONFIG

ATTRIBUTE=DBVREP_HOME

DEFAULT_VALUE=/u02/data/oracle/dbvisit/replicate

TYPE=STRING

FLAGS=CONFIG

ATTRIBUTE=SERVER_POOLS

DEFAULT_VALUE=*

TYPE=STRING

FLAGS=CONFIG|HOTMOD

ATTRIBUTE=START_DEPENDENCIES

DEFAULT_VALUE=hard() weak(type:ora.listener.type,global:type:ora.scan_listener.type) pullup()

TYPE=STRING

FLAGS=CONFIG

ATTRIBUTE=STOP_DEPENDENCIES

DEFAULT_VALUE=hard()

TYPE=STRING

FLAGS=CONFIG

ATTRIBUTE=RESTART_ATTEMPTS

DEFAULT_VALUE=2

TYPE=INT

FLAGS=CONFIG

ATTRIBUTE=CHECK_INTERVAL

DEFAULT_VALUE=60

TYPE=INT

FLAGS=CONFIG

ATTRIBUTE=FAILURE_THRESHOLD

DEFAULT_VALUE=2

TYPE=INT

FLAGS=CONFIG

ATTRIBUTE=UPTIME_THRESHOLD

DEFAULT_VALUE=8h

TYPE=STRING

FLAGS=CONFIG

ATTRIBUTE=FAILURE_INTERVAL

DEFAULT_VALUE=3600

TYPE=INT

FLAGS=CONFIG

$ crsctl add type dbvrep.type -basetype cluster_resource -file dbvrep.type

$ crsctl add resource dbvrep.FROM_A_TO_B.fetcher -type dbvrep.type \

-attr "START_DEPENDENCIES=hard(db.source) pullup:always(db.source),STOP_DEPENDENCIES=hard(db.source),DBVREP_PORT=7901"

Test!

$ crsctl start res dbvrep.FROM_A_TO_B.fetcher
CRS-2672: Attempting to start 'dbvrep.FROM_A_TO_B.fetcher' on 'server1'
CRS-2676: Start of 'dbvrep.FROM_A_TO_B.fetcher' on 'server1' succeeded

..in the logs..
2017-10-30 15:24:34.992478 :    AGFW:1127589632: {1:30181:30166} Agent received the message: RESOURCE_START[dbvrep.FROM_A_TO_B.fetcher 1 1] ID 4098:5175912
2017-10-30 15:24:34.992512 :    AGFW:1127589632: {1:30181:30166} Preparing START command for: dbvrep.FROM_A_TO_B.fetcher 1 1
2017-10-30 15:24:34.992521 :    AGFW:1127589632: {1:30181:30166} dbvrep.FROM_A_TO_B.fetcher 1 1 state changed from: OFFLINE to: STARTING
2017-10-30 15:24:34.993195 :CLSDYNAM:1106577152: [dbvrep.FROM_A_TO_B.fetcher]{1:30181:30166} [start] Executing action script: dbvrep.ksh[start]
2017-10-30 15:24:41.254703 :CLSDYNAM:1106577152: [dbvrep.FROM_A_TO_B.fetcher]{1:30181:30166} [start] Variable FETCHER_REMOTE_INTERFACE set to server1:7901 for process
2017-10-30 15:24:41.254726 :CLSDYNAM:1106577152: [dbvrep.FROM_A_TO_B.fetcher]{1:30181:30166} [start] FETCHER.
2017-10-30 15:24:41.354916 :CLSDYNAM:1106577152: [dbvrep.FROM_A_TO_B.fetcher]{1:30181:30166} [start] Variable FETCHER_LISTEN_INTERFACE set to server1:7901 for process
2017-10-30 15:24:41.354935 :CLSDYNAM:1106577152: [dbvrep.FROM_A_TO_B.fetcher]{1:30181:30166} [start] FETCHER.
2017-10-30 15:24:41.405052 :CLSDYNAM:1106577152: [dbvrep.FROM_A_TO_B.fetcher]{1:30181:30166} [start] Variable MINE_ASM set to +ASM1 for process FETCHER.
2017-10-30 15:24:41.605423 :CLSDYNAM:1106577152: [dbvrep.FROM_A_TO_B.fetcher]{1:30181:30166} [start] Starting process FETCHER...started
2017-10-30 15:24:41.655660 :    AGFW:1106577152: {1:30181:30166} Command: start for resource: dbvrep.FROM_A_TO_B.fetcher 1 1 completed with status: SUCCESS
2017-10-30 15:24:41.656100 :CLSDYNAM:1081362176: [dbvrep.FROM_A_TO_B.fetcher]{1:30181:30166} [check] Executing action script: dbvrep.ksh[check]
2017-10-30 15:24:41.658242 :    AGFW:1127589632: {1:30181:30166} Agent sending reply for: RESOURCE_START[dbvrep.FROM_A_TO_B.fetcher 1 1] ID 4098:5175912
2017-10-30 15:24:41.908256 :CLSDYNAM:1081362176: [dbvrep.FROM_A_TO_B.fetcher]{1:30181:30166} [check] OK
2017-10-30 15:24:41.908440 :    AGFW:1127589632: {1:30181:30166} dbvrep.FROM_A_TO_B.fetcher 1 1 state changed from: STARTING to: ONLINE
2017-10-30 15:24:41.908486 :    AGFW:1127589632: {1:30181:30166} Started implicit monitor for [dbvrep.FROM_A_TO_B.fetcher 1 1] interval=60000 delay=60000
2017-10-30 15:24:41.908696 :    AGFW:1127589632: {1:30181:30166} Agent sending last reply for: RESOURCE_START[dbvrep.FROM_A_TO_B.fetcher 1 1] ID 4098:5175912


$ crsctl stop res dbvrep.FROM_A_TO_B.fetcher
CRS-2673: Attempting to stop 'dbvrep.FROM_A_TO_B.fetcher' on 'server1'
CRS-2677: Stop of 'dbvrep.FROM_A_TO_B.fetcher' on 'server1' succeeded

..in the logs..
2017-10-30 15:22:14.891730 :    AGFW:1127589632: {1:30181:30156} Agent received the message: RESOURCE_STOP[dbvrep.FROM_A_TO_B.fetcher 1 1] ID 4099:5175818
2017-10-30 15:22:14.891762 :    AGFW:1127589632: {1:30181:30156} Preparing STOP command for: dbvrep.FROM_A_TO_B.fetcher 1 1
2017-10-30 15:22:14.891772 :    AGFW:1127589632: {1:30181:30156} dbvrep.FROM_A_TO_B.fetcher 1 1 state changed from: ONLINE to: STOPPING
2017-10-30 15:22:14.892400 :CLSDYNAM:1091868416: [dbvrep.FROM_A_TO_B.fetcher]{1:30181:30156} [stop] Executing action script: dbvrep.ksh[stop]
2017-10-30 15:22:20.957375 :CLSDYNAM:1091868416: [dbvrep.FROM_A_TO_B.fetcher]{1:30181:30156} [stop] DDC loaded from database (458 variables).
2017-10-30 15:22:21.007939 :CLSDYNAM:1091868416: [dbvrep.FROM_A_TO_B.fetcher]{1:30181:30156} [stop] Dbvisit Replicate version 2.9.04
2017-10-30 15:22:21.007963 :CLSDYNAM:1091868416: [dbvrep.FROM_A_TO_B.fetcher]{1:30181:30156} [stop] Copyright (C) Dbvisit Software Limited. All rights reserved.
2017-10-30 15:22:21.007976 :CLSDYNAM:1091868416: [dbvrep.FROM_A_TO_B.fetcher]{1:30181:30156} [stop] DDC file
2017-10-30 15:22:21.007994 :CLSDYNAM:1091868416: [dbvrep.FROM_A_TO_B.fetcher]{1:30181:30156} [stop] /u02/data/oracle/dbvisit/FROM_A_TO_B/FROM_A_TO_B
2017-10-30 15:22:21.008005 :CLSDYNAM:1091868416: [dbvrep.FROM_A_TO_B.fetcher]{1:30181:30156} [stop] -FETCHER.ddc loaded.
2017-10-30 15:22:21.108340 :CLSDYNAM:1091868416: [dbvrep.FROM_A_TO_B.fetcher]{1:30181:30156} [stop] Dbvisit Replicate FETCHER process shutting down.
2017-10-30 15:22:21.108361 :CLSDYNAM:1091868416: [dbvrep.FROM_A_TO_B.fetcher]{1:30181:30156} [stop] OK-0: Completed successfully.
2017-10-30 15:22:45.747531 :    AGFW:1091868416: {1:30181:30156} Command: stop for resource: dbvrep.FROM_A_TO_B.fetcher 1 1 completed with status: SUCCESS
2017-10-30 15:22:45.747898 :    AGFW:1127589632: {1:30181:30156} Agent sending reply for: RESOURCE_STOP[dbvrep.FROM_A_TO_B.fetcher 1 1] ID 4099:5175818
2017-10-30 15:22:45.747902 :CLSDYNAM:1123387136: [dbvrep.FROM_A_TO_B.fetcher]{1:30181:30156} [check] Executing action script: dbvrep.ksh[check]
2017-10-30 15:22:45.949702 :CLSDYNAM:1123387136: [dbvrep.FROM_A_TO_B.fetcher]{1:30181:30156} [check] KO
2017-10-30 15:22:45.949913 :    AGFW:1127589632: {1:30181:30156} dbvrep.FROM_A_TO_B.fetcher 1 1 state changed from: STOPPING to: OFFLINE
2017-10-30 15:22:45.950014 :    AGFW:1127589632: {1:30181:30156} Agent sending last reply for: RESOURCE_STOP[dbvrep.dbvrep.FROM_A_TO_B.fetcher 1 1] ID 4098:5175818

$ crsctl start res dbvrep.FROM_A_TO_B.fetcher

CRS-2672: Attempting to start 'dbvrep.FROM_A_TO_B.fetcher' on 'server1'

CRS-2676: Start of 'dbvrep.FROM_A_TO_B.fetcher' on 'server1' succeeded

..in the logs..

2017-10-30 15:24:34.992478 : AGFW:1127589632: {1:30181:30166} Agent received the message: RESOURCE_START[dbvrep.FROM_A_TO_B.fetcher 1 1] ID 4098:5175912

2017-10-30 15:24:34.992512 : AGFW:1127589632: {1:30181:30166} Preparing START command for: dbvrep.FROM_A_TO_B.fetcher 1 1

2017-10-30 15:24:34.992521 : AGFW:1127589632: {1:30181:30166} dbvrep.FROM_A_TO_B.fetcher 1 1 state changed from: OFFLINE to: STARTING

2017-10-30 15:24:34.993195 :CLSDYNAM:1106577152: [dbvrep.FROM_A_TO_B.fetcher]{1:30181:30166} [start] Executing action script: dbvrep.ksh[start]

2017-10-30 15:24:41.254703 :CLSDYNAM:1106577152: [dbvrep.FROM_A_TO_B.fetcher]{1:30181:30166} [start] Variable FETCHER_REMOTE_INTERFACE set to server1:7901 for process

2017-10-30 15:24:41.254726 :CLSDYNAM:1106577152: [dbvrep.FROM_A_TO_B.fetcher]{1:30181:30166} [start] FETCHER.

2017-10-30 15:24:41.354916 :CLSDYNAM:1106577152: [dbvrep.FROM_A_TO_B.fetcher]{1:30181:30166} [start] Variable FETCHER_LISTEN_INTERFACE set to server1:7901 for process

2017-10-30 15:24:41.354935 :CLSDYNAM:1106577152: [dbvrep.FROM_A_TO_B.fetcher]{1:30181:30166} [start] FETCHER.

2017-10-30 15:24:41.405052 :CLSDYNAM:1106577152: [dbvrep.FROM_A_TO_B.fetcher]{1:30181:30166} [start] Variable MINE_ASM set to +ASM1 for process FETCHER.

2017-10-30 15:24:41.605423 :CLSDYNAM:1106577152: [dbvrep.FROM_A_TO_B.fetcher]{1:30181:30166} [start] Starting process FETCHER...started

2017-10-30 15:24:41.655660 : AGFW:1106577152: {1:30181:30166} Command: start for resource: dbvrep.FROM_A_TO_B.fetcher 1 1 completed with status: SUCCESS

2017-10-30 15:24:41.656100 :CLSDYNAM:1081362176: [dbvrep.FROM_A_TO_B.fetcher]{1:30181:30166} [check] Executing action script: dbvrep.ksh[check]

2017-10-30 15:24:41.658242 : AGFW:1127589632: {1:30181:30166} Agent sending reply for: RESOURCE_START[dbvrep.FROM_A_TO_B.fetcher 1 1] ID 4098:5175912

2017-10-30 15:24:41.908256 :CLSDYNAM:1081362176: [dbvrep.FROM_A_TO_B.fetcher]{1:30181:30166} [check] OK

2017-10-30 15:24:41.908440 : AGFW:1127589632: {1:30181:30166} dbvrep.FROM_A_TO_B.fetcher 1 1 state changed from: STARTING to: ONLINE

2017-10-30 15:24:41.908486 : AGFW:1127589632: {1:30181:30166} Started implicit monitor for [dbvrep.FROM_A_TO_B.fetcher 1 1] interval=60000 delay=60000

2017-10-30 15:24:41.908696 : AGFW:1127589632: {1:30181:30166} Agent sending last reply for: RESOURCE_START[dbvrep.FROM_A_TO_B.fetcher 1 1] ID 4098:5175912

$ crsctl stop res dbvrep.FROM_A_TO_B.fetcher

CRS-2673: Attempting to stop 'dbvrep.FROM_A_TO_B.fetcher' on 'server1'

CRS-2677: Stop of 'dbvrep.FROM_A_TO_B.fetcher' on 'server1' succeeded

..in the logs..

2017-10-30 15:22:14.891730 : AGFW:1127589632: {1:30181:30156} Agent received the message: RESOURCE_STOP[dbvrep.FROM_A_TO_B.fetcher 1 1] ID 4099:5175818

2017-10-30 15:22:14.891762 : AGFW:1127589632: {1:30181:30156} Preparing STOP command for: dbvrep.FROM_A_TO_B.fetcher 1 1

2017-10-30 15:22:14.891772 : AGFW:1127589632: {1:30181:30156} dbvrep.FROM_A_TO_B.fetcher 1 1 state changed from: ONLINE to: STOPPING

2017-10-30 15:22:14.892400 :CLSDYNAM:1091868416: [dbvrep.FROM_A_TO_B.fetcher]{1:30181:30156} [stop] Executing action script: dbvrep.ksh[stop]

2017-10-30 15:22:20.957375 :CLSDYNAM:1091868416: [dbvrep.FROM_A_TO_B.fetcher]{1:30181:30156} [stop] DDC loaded from database (458 variables).

2017-10-30 15:22:21.007939 :CLSDYNAM:1091868416: [dbvrep.FROM_A_TO_B.fetcher]{1:30181:30156} [stop] Dbvisit Replicate version 2.9.04

2017-10-30 15:22:21.007976 :CLSDYNAM:1091868416: [dbvrep.FROM_A_TO_B.fetcher]{1:30181:30156} [stop] DDC file

2017-10-30 15:22:21.007994 :CLSDYNAM:1091868416: [dbvrep.FROM_A_TO_B.fetcher]{1:30181:30156} [stop] /u02/data/oracle/dbvisit/FROM_A_TO_B/FROM_A_TO_B

2017-10-30 15:22:21.008005 :CLSDYNAM:1091868416: [dbvrep.FROM_A_TO_B.fetcher]{1:30181:30156} [stop] -FETCHER.ddc loaded.

2017-10-30 15:22:21.108340 :CLSDYNAM:1091868416: [dbvrep.FROM_A_TO_B.fetcher]{1:30181:30156} [stop] Dbvisit Replicate FETCHER process shutting down.

2017-10-30 15:22:21.108361 :CLSDYNAM:1091868416: [dbvrep.FROM_A_TO_B.fetcher]{1:30181:30156} [stop] OK-0: Completed successfully.

2017-10-30 15:22:45.747531 : AGFW:1091868416: {1:30181:30156} Command: stop for resource: dbvrep.FROM_A_TO_B.fetcher 1 1 completed with status: SUCCESS

2017-10-30 15:22:45.747898 : AGFW:1127589632: {1:30181:30156} Agent sending reply for: RESOURCE_STOP[dbvrep.FROM_A_TO_B.fetcher 1 1] ID 4099:5175818

2017-10-30 15:22:45.747902 :CLSDYNAM:1123387136: [dbvrep.FROM_A_TO_B.fetcher]{1:30181:30156} [check] Executing action script: dbvrep.ksh[check]

2017-10-30 15:22:45.949702 :CLSDYNAM:1123387136: [dbvrep.FROM_A_TO_B.fetcher]{1:30181:30156} [check] KO

2017-10-30 15:22:45.949913 : AGFW:1127589632: {1:30181:30156} dbvrep.FROM_A_TO_B.fetcher 1 1 state changed from: STOPPING to: OFFLINE

2017-10-30 15:22:45.950014 : AGFW:1127589632: {1:30181:30156} Agent sending last reply for: RESOURCE_STOP[dbvrep.dbvrep.FROM_A_TO_B.fetcher 1 1] ID 4098:5175818

Also the relocation worked as expected: when the settings are modified through:

set FETCHER.FETCHER_REMOTE_INTERFACE=${HOSTNAME}:${DBVREP_PORT}
set FETCHER.FETCHER_LISTEN_INTERFACE=${HOSTNAME}:${DBVREP_PORT}
set FETCHER.MINE_ASM=${LOCAL_ASM}

set FETCHER.FETCHER_REMOTE_INTERFACE=${HOSTNAME}:${DBVREP_PORT}

set FETCHER.FETCHER_LISTEN_INTERFACE=${HOSTNAME}:${DBVREP_PORT}

set FETCHER.MINE_ASM=${LOCAL_ASM}

The MINE process get the change dynamically, so no need to restart it.

Last consideration

Adding a hard dependency between the DB and the FETCHER will require to stop the DB with the force option or to always stop the fetcher before the database. Also, the start of the DB will pullup the FETCHER (pullup:always) and the opposite as well. We will consider furtherly if we will use this dependency or if we will manage it differently (e.g. through the action script).

The hard dependency declared without the global keyword, will always start the fetcher on the server where the database runs. This is not required, but it might be nice to see the fetcher on the same node. Again, a consideration that we will discuss furtherly.

HTH

—

Ludovico

Get the Most out of Oracle Data Guard – The material

Posted on September 29, 2017 by Ludovico

Here we go: as usual, the feedback that I usually get after my talks (specifically, after POUG High Five conference), is if I will share my demo scripts and material.

Sadly, the demos I am doing for my presentation “Get the most out of Oracle Data Guard” are quite tied to an environment built for the purpose of the demos. So, do not expect to get scripts easy to use as is, but rather to get some ideas beyond the demo themselves.

I hope they will help to get the whole picture.

Of course, if you need to implement a cloning strategy based on Data Guard or any other solution that I describe in this post, please feel free to contact me, I will be glad to help you implement it in your environment.

Slides

Demo 1

Video:

Scripts:

#!/bin/bash

function tt () {
  title=$@
  pad=$(printf '%0.1s' "-"{1..60})
  echo
  echo
  echo $pad
  echo "- $title"
  echo $pad
}

. .bash_profile

PAUSE=/home/oracle/pause.sh
SYSPWD=Vagrant1_

clear

sid sour_ludo


sudo sed -i -e '/sour-s/d' /var/named/trivadistraining.com
sudo sed -i '$ a\
sour-s1 IN CNAME ludo01\
sour-s2 IN CNAME ludo01' /var/named/trivadistraining.com

sudo systemctl reload named.service

tt "Naming resolution"
tnsping sour_smart

nslookup sour-s1
nslookup sour-s2


$PAUSE

tt "Connect to sour_smart in another terminal"

$PAUSE
clear

tt "Creating Data Guard Configuration resolution"
dgmgrl -echo <<EOF
  connect sys/$SYSPWD
  show configuration;
EOF

$PAUSE
dgmgrl -echo <<EOF
  connect sys/$SYSPWD
  create configuration sour as primary database is sour_ludo connect identifier is sour_ludo.trivadistraining.com;
  add database sour_vico as connect identifier is sour_vico.trivadistraining.com;
  enable database sour_vico;
  enable configuration;
  host sleep 5;
  show configuration;
EOF

$PAUSE
clear

tt "Modifying the DNS configuration"

sudo sed -i -e '/sour-s2/d' /var/named/trivadistraining.com

sudo sed -i '$ a\
sour-s2 IN CNAME vico01' /var/named/trivadistraining.com

sudo systemctl reload named.service

tt "Naming resolution"
tnsping sour_smart

nslookup sour-s1
nslookup sour-s2

$PAUSE
clear
tt "Switchover to sour_vico"
dgmgrl -echo <<EOF
  connect sys/$SYSPWD
  switchover to sour_vico;
EOF

$PAUSE
tt "Did the session fail over?"
$PAUSE

clear

tt "Modifying the DNS configuration"

sudo sed -i -e '/sour-s1/d' /var/named/trivadistraining.com

sudo sed -i '$ a\
sour-s1 IN CNAME vico01' /var/named/trivadistraining.com

sudo systemctl reload named.service

tt "Naming resolution"
tnsping sour_smart

nslookup sour-s1
nslookup sour-s2

$PAUSE

tt "Removing Data Guard configuration"

dgmgrl -echo <<EOF
  connect sys/$SYSPWD
  remove configuration;
  show configuration;
EOF

100

101

102

103

104

105

106

107

108

109

110

111

112

113

114

115

116

#!/bin/bash

function tt () {

title=$@

pad=$(printf '%0.1s' "-"{1..60})

echo

echo $pad

echo "- $title"

echo $pad

}

. .bash_profile

PAUSE=/home/oracle/pause.sh

SYSPWD=Vagrant1_

clear

sid sour_ludo

sudo sed -i -e '/sour-s/d' /var/named/trivadistraining.com

sudo sed -i '$ a\

sour-s1 IN CNAME ludo01\

sour-s2 IN CNAME ludo01' /var/named/trivadistraining.com

sudo systemctl reload named.service

tt "Naming resolution"

tnsping sour_smart

nslookup sour-s1

nslookup sour-s2

$PAUSE

tt "Connect to sour_smart in another terminal"

$PAUSE

clear

tt "Creating Data Guard Configuration resolution"

dgmgrl -echo <<EOF

connect sys/$SYSPWD

show configuration;

EOF

$PAUSE

dgmgrl -echo <<EOF

connect sys/$SYSPWD

create configuration sour as primary database is sour_ludo connect identifier is sour_ludo.trivadistraining.com;

add database sour_vico as connect identifier is sour_vico.trivadistraining.com;

enable database sour_vico;

enable configuration;

host sleep 5;

show configuration;

EOF

$PAUSE

clear

tt "Modifying the DNS configuration"

sudo sed -i -e '/sour-s2/d' /var/named/trivadistraining.com

sudo sed -i '$ a\

sour-s2 IN CNAME vico01' /var/named/trivadistraining.com

sudo systemctl reload named.service

tt "Naming resolution"

tnsping sour_smart

nslookup sour-s1

nslookup sour-s2

$PAUSE

clear

tt "Switchover to sour_vico"

dgmgrl -echo <<EOF

connect sys/$SYSPWD

switchover to sour_vico;

EOF

$PAUSE

tt "Did the session fail over?"

$PAUSE

clear

tt "Modifying the DNS configuration"

sudo sed -i -e '/sour-s1/d' /var/named/trivadistraining.com

sudo sed -i '$ a\

sour-s1 IN CNAME vico01' /var/named/trivadistraining.com

sudo systemctl reload named.service

tt "Naming resolution"

tnsping sour_smart

nslookup sour-s1

nslookup sour-s2

$PAUSE

tt "Removing Data Guard configuration"

dgmgrl -echo <<EOF

connect sys/$SYSPWD

remove configuration;

show configuration;

EOF

Demo 2

Video:

Scripts:

#!/bin/bash

function tt () {
  title=$@
  pad=$(printf '%0.1s' "-"{1..60})
  echo
  echo
  echo $pad
  echo "- $title"
  echo $pad
}

. .bash_profile

clear

sid stout_vico
SYSPWD=Vagrant1_

PAUSE=/home/oracle/pause.sh

tt "Current configuration"
dgmgrl -echo <<EOF
  connect sys/$SYSPWD
  show configuration;
EOF

$PAUSE

clear

tt "Instance and redo apply status"
sqlplus / as sysdba <<EOF
  select instance_name, status from v\$instance;
  select db_unique_name, database_role from v\$database;
  select process, status, client_process, sequence#, block#, delay_mins from v\$managed_standby order by process;
EOF

$PAUSE
clear 
tt "Inserting something in the primary"
sqlplus ludo/ludo@stout_ludo <<EOF
  DROP TABLE demo1;
  CREATE TABLE demo1 ( id NUMBER GENERATED AS IDENTITY 
     , foo DATE DEFAULT (sysdate)
     , CONSTRAINT demo1_pk PRIMARY KEY (id)
  );

  INSERT INTO demo1 (foo) VALUES(sysdate);
  INSERT INTO demo1 (foo) VALUES(sysdate);
  INSERT INTO demo1 (foo) VALUES(sysdate);
  INSERT INTO demo1 (foo) VALUES(sysdate);
  INSERT INTO demo1 (foo) VALUES(sysdate);
  COMMIT;
  ALTER SESSION SET NLS_DATE_FORMAT='YYYY-MM-DD HH24:MI:SS';
  SELECT * FROM demo1 ORDER BY id;
  exit
EOF


$PAUSE
clear
tt "Converting physical standby to snapshot standby"
dgmgrl -echo <<EOF
  connect sys/$SYSPWD
  show configuration;
  convert database stout_vico to snapshot standby;
  show configuration;
EOF


$PAUSE
tt "Let's check the alert log (another window)"

$PAUSE
clear
tt "Instance and redo apply status"
sqlplus / as sysdba <<EOF
  SELECT instance_name, status FROM v\$instance;
  SELECT db_unique_name, database_role FROM v\$database;
  set lines 180
  col name for a80
  SELECT scn, name FROM v\$restore_point;
  SELECT process, status, client_process, sequence#, block#, delay_mins FROM v\$managed_standby ORDER BY process;
  set feedback off
  SELECT process, status, client_process, sequence#, block#, delay_mins FROM v\$managed_standby WHERE client_process='LGWR';
  EXEC dbms_lock.sleep(2);
  SELECT process, status, client_process, sequence#, block#, delay_mins FROM v\$managed_standby WHERE client_process='LGWR';
  EXEC dbms_lock.sleep(2);
  SELECT process, status, client_process, sequence#, block#, delay_mins FROM v\$managed_standby WHERE client_process='LGWR';
EOF


$PAUSE
clear
tt "Let's do something in the PRIMARY database!"
sqlplus ludo/ludo@stout_ludo <<EOF
  ALTER TABLE demo1 ADD test VARCHAR(20) DEFAULT ('PRIMARY'); 
  INSERT INTO demo1 (foo) VALUES(sysdate);
  INSERT INTO demo1 (foo) VALUES(sysdate);
  COMMIT;
  ALTER SESSION SET NLS_DATE_FORMAT='YYYY-MM-DD HH24:MI:SS';
  SELECT * FROM demo1 ORDER BY id;
  exit
EOF


$PAUSE
clear
tt "Let's do something in the snapshot standby!"
sqlplus ludo/ludo@stout_vico <<EOF
  ALTER TABLE demo1 ADD test VARCHAR(20) DEFAULT ('STANDBY'); 
  INSERT INTO demo1 (foo) VALUES(sysdate);
  INSERT INTO demo1 (foo) VALUES(sysdate);
  COMMIT;
  ALTER SESSION SET NLS_DATE_FORMAT='YYYY-MM-DD HH24:MI:SS';
  SELECT * FROM demo1 ORDER BY id;
  exit
EOF

$PAUSE
clear

tt "Convert back to physical standby"
dgmgrl -echo <<EOF
  connect sys/$SYSPWD
  show configuration;
  convert database stout_vico to physical standby;
  show configuration;
EOF

$PAUSE
clear
tt "Instance and redo apply status"
sqlplus / as sysdba <<EOF
  SELECT instance_name, status FROM v\$instance;
  SELECT db_unique_name, database_role FROM v\$database;
  set lines 180
  col name for a80
  SELECT scn, name FROM v\$restore_point;
  SELECT process, status, client_process, sequence#, block#, delay_mins FROM v\$managed_standby ORDER BY process;
EOF

100

101

102

103

104

105

106

107

108

109

110

111

112

113

114

115

116

117

118

119

120

121

122

123

124

125

126

127

128

129

130

131

132

133

134

135

136

137

138

139

140

141

142

#!/bin/bash

function tt () {

title=$@

pad=$(printf '%0.1s' "-"{1..60})

echo

echo $pad

echo "- $title"

echo $pad

}

. .bash_profile

clear

sid stout_vico

SYSPWD=Vagrant1_

PAUSE=/home/oracle/pause.sh

tt "Current configuration"

dgmgrl -echo <<EOF

connect sys/$SYSPWD

show configuration;

EOF

$PAUSE

clear

tt "Instance and redo apply status"

sqlplus / as sysdba <<EOF

select instance_name, status from v\$instance;

select db_unique_name, database_role from v\$database;

select process, status, client_process, sequence#, block#, delay_mins from v\$managed_standby order by process;

EOF

$PAUSE

clear

tt "Inserting something in the primary"

sqlplus ludo/ludo@stout_ludo <<EOF

DROP TABLE demo1;

CREATE TABLE demo1 ( id NUMBER GENERATED AS IDENTITY

, foo DATE DEFAULT (sysdate)

, CONSTRAINT demo1_pk PRIMARY KEY (id)

);

INSERT INTO demo1 (foo) VALUES(sysdate);

COMMIT;

ALTER SESSION SET NLS_DATE_FORMAT='YYYY-MM-DD HH24:MI:SS';

SELECT * FROM demo1 ORDER BY id;

exit

EOF

$PAUSE

clear

tt "Converting physical standby to snapshot standby"

dgmgrl -echo <<EOF

connect sys/$SYSPWD

show configuration;

convert database stout_vico to snapshot standby;

show configuration;

EOF

$PAUSE

tt "Let's check the alert log (another window)"

$PAUSE

clear

tt "Instance and redo apply status"

sqlplus / as sysdba <<EOF

SELECT instance_name, status FROM v\$instance;

SELECT db_unique_name, database_role FROM v\$database;

set lines 180

col name for a80

SELECT scn, name FROM v\$restore_point;

SELECT process, status, client_process, sequence#, block#, delay_mins FROM v\$managed_standby ORDER BY process;

set feedback off

SELECT process, status, client_process, sequence#, block#, delay_mins FROM v\$managed_standby WHERE client_process='LGWR';

EXEC dbms_lock.sleep(2);

SELECT process, status, client_process, sequence#, block#, delay_mins FROM v\$managed_standby WHERE client_process='LGWR';

EXEC dbms_lock.sleep(2);

SELECT process, status, client_process, sequence#, block#, delay_mins FROM v\$managed_standby WHERE client_process='LGWR';

EOF

$PAUSE

clear

tt "Let's do something in the PRIMARY database!"

sqlplus ludo/ludo@stout_ludo <<EOF

ALTER TABLE demo1 ADD test VARCHAR(20) DEFAULT ('PRIMARY');

INSERT INTO demo1 (foo) VALUES(sysdate);

COMMIT;

ALTER SESSION SET NLS_DATE_FORMAT='YYYY-MM-DD HH24:MI:SS';

SELECT * FROM demo1 ORDER BY id;

exit

EOF

$PAUSE

clear

tt "Let's do something in the snapshot standby!"

sqlplus ludo/ludo@stout_vico <<EOF

ALTER TABLE demo1 ADD test VARCHAR(20) DEFAULT ('STANDBY');

INSERT INTO demo1 (foo) VALUES(sysdate);

COMMIT;

ALTER SESSION SET NLS_DATE_FORMAT='YYYY-MM-DD HH24:MI:SS';

SELECT * FROM demo1 ORDER BY id;

exit

EOF

$PAUSE

clear

tt "Convert back to physical standby"

dgmgrl -echo <<EOF

connect sys/$SYSPWD

show configuration;

convert database stout_vico to physical standby;

show configuration;

EOF

$PAUSE

clear

tt "Instance and redo apply status"

sqlplus / as sysdba <<EOF

SELECT instance_name, status FROM v\$instance;

SELECT db_unique_name, database_role FROM v\$database;

set lines 180

col name for a80

SELECT scn, name FROM v\$restore_point;

SELECT process, status, client_process, sequence#, block#, delay_mins FROM v\$managed_standby ORDER BY process;

EOF

Demo 3

Video:

Scripts:

Preparation:

#!/bin/bash

NUM=`echo $$ | cut -c 1-4`
export NEWNAME=${1:-poug$NUM}
export ORACLE_SID=$NEWNAME

export ORACLE_HOME=/u01/app/oracle/product/12.2.0.1/dbhome_1

[[ -L /u02/$NEWNAME ]] && rm $/u02/$NEWNAME
ln -s /u02/acfs/.ACFS/snaps/$NEWNAME /u02/$NEWNAME

set -x
$ORACLE_HOME/bin/srvctl add database -db $NEWNAME -oraclehome $ORACLE_HOME -dbtype SINGLE -instance $NEWNAME -spfile /u02/$NEWNAME/spfile$NEWNAME.ora -dbname $NEWNAME -policy MANUAL -acfspath "/u02/acfs,/u02/fra" -node $HOSTNAME

set +x

#!/bin/bash

NUM=`echo $$ | cut -c 1-4`

export NEWNAME=${1:-poug$NUM}

export ORACLE_SID=$NEWNAME

export ORACLE_HOME=/u01/app/oracle/product/12.2.0.1/dbhome_1

[[ -L /u02/$NEWNAME ]] && rm $/u02/$NEWNAME

ln -s /u02/acfs/.ACFS/snaps/$NEWNAME /u02/$NEWNAME

set -x

$ORACLE_HOME/bin/srvctl add database -db $NEWNAME -oraclehome $ORACLE_HOME -dbtype SINGLE -instance $NEWNAME -spfile /u02/$NEWNAME/spfile$NEWNAME.ora -dbname $NEWNAME -policy MANUAL -acfspath "/u02/acfs,/u02/fra" -node $HOSTNAME

set +x

snap_acfs.pl

#!/u01/app/oracle/tvdtoolbox/tvdperl-Linux-x86-64-02.04.00-05.08.04/bin/tvd_perl
#
# Purpose..........: Create a new snapshot with rotating name
# 
# snap_acfs.pl 
#        -p <parent> : name of the parent snapshot
#        -n <name>   : prefix of the snapshot
#        -s <suffix> : optional, use "weekday" to have the day name as suffix (Sun - Sat)
#
# e.g. snap_acfs.pl -p stout -n stout  -s "weekday"
#      will clone from /u02/acfs/.ACFS/snaps/stout
#                   to /u02/acfs/.ACFS/snaps/stout.Tue (or whatever the day is)
#      
# e.g. snap_acfs.pl -n stout -p stout.Mon 
#      will clone from /u02/acfs/.ACFS/snaps/stout.Mon
#                   to /u02/acfs/.ACFS/snaps/stout
#      
# e.g. snap_acfs.pl -n stout2 -p stout
#      will clone from /u02/acfs/.ACFS/snaps/stout
#                   to /u02/acfs/.ACFS/snaps/stout2
#      
# EXISTING SNAPSHOT WILL BE DROPPED!!
#
#
#

use strict;
use File::Copy;
use Net::SMTP;
use Sys::Hostname;
use Getopt::Std 'getopts';
use File::Basename;

my $CloneDIR;                             # predefine rootDir variable
BEGIN {
  use FindBin qw($Bin);                   # get the current path of script
  use Cwd 'abs_path';
  $CloneDIR    = abs_path("$Bin/..");     # get the absolut rood path to clone directory
}

my $CloneLOGDir = $CloneDIR."/log";       # LOG Directory
my $baseACFS = "/u02/acfs/";
my $ORA_CRS_HOME = "/u01/app/grid/12.2.0.1";
my $acfsutil = "/usr/sbin/acfsutil";
my $basename    = basename($0, ".pl");
my $ParentSnapName;
my $ParentSnap=0; ## no parent snapshots by default
my $PrefixName;
my $NewName;
my $SuffixName;
my %opts;
my $MountPoint;
my $SnapCreate;

################################################################################
#  Main
################################################################################
my $StartDate = localtime;
&DoMsg ("Start of $basename.pl");
unless ( open (MAINLOG, ">>$CloneLOGDir/$basename.log") ) {
	&DoMsg ("Can't open Main Logfile $CloneLOGDir/$basename.log");
    exit 1;
}

# Process command line arguments
if  ( ! defined @ARGV ) { &Usage; exit 1; } 
getopts('n:p:s:b:', \%opts);

if ($opts{"p"}) {
   $ParentSnapName    = lc($opts{"p"});
} else {
   &DoMsg ("Parent snapshot name not given!");
   &Usage;
   exit 1;
}
if ($opts{"n"}) {
   $PrefixName    = lc($opts{"n"});
} else {
   &DoMsg ("New snapshot prefix not given! Defaults to ${ParentSnapName}");
   $PrefixName    = "${ParentSnapName}";
}

if ($opts{"s"}) {
   $SuffixName    = lc($opts{"s"});
   if ( $SuffixName eq "weekday" ) {
      $SuffixName    = lc(&getWeekDay);
   }
   $SuffixName  = "." . $SuffixName;
} else {
   $SuffixName = "";
}

$NewName = "${PrefixName}${SuffixName}";


&DoMsg ("Parent: $ParentSnapName");
&DoMsg ("Prefix: $PrefixName");
&DoMsg ("Suffix: $SuffixName");
&DoMsg ("New Name: $NewName");


$MountPoint = $baseACFS;
$SnapCreate = "$acfsutil snap create -w -p $ParentSnapName $NewName $MountPoint";
&DoMsg ("Create Command: $SnapCreate ");


my $cmd = "$acfsutil snap info $NewName $MountPoint";
&DoMsg ($cmd);
open( CMD, $cmd . " |");
&DoMsg (join("", <CMD>));
close CMD;
if ( $? != 0 ) {
   &DoMsg ("Snapshot $NewName does not exist inside mount point $MountPoint. Continuing.");
} else {
   &DoMsg ("Snapshot $NewName already exists inside mount point $MountPoint. Now it will be deleted.");
   $cmd = "$acfsutil snap delete $NewName $MountPoint";
   &DoMsg ($cmd);
   open( CMD, $cmd . " |");
   &DoMsg (join("", <CMD>));
   close CMD;
   if ( $? != 0 ) {
      &DoMsg ("Cannot delete Snapshot $NewName in mount point $MountPoint. Script will exit.");
      exit 1;
   }
}

&DoMsg ("Creating the new snapshot:");
&DoMsg ($SnapCreate);
open( CMD, $SnapCreate . " |");
&DoMsg (join("", <CMD>));
close CMD;
if ( $? != 0 ) {
   &DoMsg ("Cannot create Snapshot $NewName in mount point $MountPoint. Script will exit.");
   exit 1;
} #else {
   #&DoMsg ("Current snapshots:");
   #open( CMD, "$acfsutil snap info $MountPoint |");
   #&DoMsg (join("", <CMD>));
   #close CMD;
#}



#-------------------------------------------------------------------------------
# DoMsg
#
# PURPOSE    : echo with timestamp YYYY-MM-DD_H24:MI:SS
# PARAMS     : $*: the messages
# GLOBAL VARS: none
#-------------------------------------------------------------------------------   
sub DoMsg {

   my $msg = shift;
   my $timestamp = &getTimestamp;
   
   print ("$timestamp $msg\n");
   if (fileno(MAINLOG)) {print MAINLOG "$timestamp $msg\n";}
}


#-------------------------------------------------------------------------------
# getTimestamp
#
# PURPOSE    : returns timestamp in different formats
# PARAMS     : format_parm
# GLOBAL VARS: none
#-------------------------------------------------------------------------------
sub getTimestamp {
   #
   # Format 1:  dd-mm-yyyy_hh24:mi:ss
   # Format 2:  dd.mm.yyyy_hh24miss
   # Format 3:  dd.mm.yyyy
   # Format 4:  hh24:mi:ss
   # Rest:      dd.mm.yyyy hh24:mi:ss  (default)
   #
   my $Parm = shift;
   my $date;
   my $date2;
   my $heure;
   my $heure2;
   my ($sec,$min,$hour,$mday,$mon,$year,$wday,$yday,$isdst);

   if ( length($Parm) > 1 ) {
      ($sec,$min,$hour,$mday,$mon,$year,$wday,$yday,$isdst) = localtime($Parm);
   }
   else {
      ($sec,$min,$hour,$mday,$mon,$year,$wday,$yday,$isdst) = localtime;
   }
   
   $date = (sprintf "%2.0d",($mday)).".".(sprintf "%2.0d",($mon+1)).".".($year+1900);
   $date =~ s/ /0/g;
   $date2 = (sprintf "%2.0d",($mday))."-".(sprintf "%2.0d",($mon+1))."-".($year+1900);
   $date2 =~ s/ /0/g;
   $heure = (sprintf "%2.0d",($hour)).":".(sprintf "%2.0d",($min)).":".(sprintf "%2.0d",($sec));
   $heure =~ s/ /0/g;
   $heure2 = (sprintf "%2.0d",($hour)).(sprintf "%2.0d",($min)).(sprintf "%2.0d",($sec));
   $heure2 =~ s/ /0/g;
   
   if    ($Parm eq "1") { return ($date2."_".$heure) }
   elsif ($Parm eq "2") { return ($date."_".$heure2) }
   elsif ($Parm eq "3") { return ($date) }
   elsif ($Parm eq "4") { return ($heure) }
   else { return ($date." ".$heure) };

}


#-------------------------------------------------------------------------------
# getWeekDay
#
# PURPOSE    : returns weekday (Sun - Sat)
# GLOBAL VARS: none
#-------------------------------------------------------------------------------
sub getWeekDay{
   my @date = split(" ", localtime(time));
   my $day = $date[0];
   return ($day);
}


#-------------------------------------------------------------------------------
# Usage
#
# PURPOSE    : print the Usage
# PARAMS     : none
# GLOBAL VARS: none
#-------------------------------------------------------------------------------
sub Usage {

   print <<EOF
   
Usage:  $basename -b <base>  [Optional Arguments]
          -p <parent> : name of the parent snapshot
       
           Optional Arguments:
          -n <prefix_name> : prefix of the new snapshot name (defaults to parent.18H)
          -s <suffix>      : use "weekday" to have the day name as suffix (Sun - Sat)


 e.g. snap_acfs.pl -p scprod -n stout  -s "weekday"
      will clone from /u02/acfs/.ACFS\snaps\stout
                   to /u02/acfs/.ACFS\snaps\stout.Tue (or whatever the day is)
      
 e.g. snap_acfs.pl -n stout -p stout.Mon 
      will clone from /u02/acfs/.ACFS\snaps\stout.Mon
                   to /u02/acfs/.ACFS\snaps\stout
      
 e.g. snap_acfs.pl -n stout2 -p stout
      will clone from /u02/acfs/.ACFS\snaps\stout
                   to /u02/acfs/.ACFS\snaps\stout2
           
  EXISTING SNAPSHOT WILL BE DROPPED!!
EOF

}

100

101

102

103

104

105

106

107

108

109

110

111

112

113

114

115

116

117

118

119

120

121

122

123

124

125

126

127

128

129

130

131

132

133

134

135

136

137

138

139

140

141

142

143

144

145

146

147

148

149

150

151

152

153

154

155

156

157

158

159

160

161

162

163

164

165

166

167

168

169

170

171

172

173

174

175

176

177

178

179

180

181

182

183

184

185

186

187

188

189

190

191

192

193

194

195

196

197

198

199

200

201

202

203

204

205

206

207

208

209

210

211

212

213

214

215

216

217

218

219

220

221

222

223

224

225

226

227

228

229

230

231

232

233

234

235

236

237

238

239

240

241

242

243

244

245

246

247

248

249

250

251

252

253

254

255

#!/u01/app/oracle/tvdtoolbox/tvdperl-Linux-x86-64-02.04.00-05.08.04/bin/tvd_perl

# Purpose..........: Create a new snapshot with rotating name

# snap_acfs.pl

# -p <parent> : name of the parent snapshot

# -n <name> : prefix of the snapshot

# -s <suffix> : optional, use "weekday" to have the day name as suffix (Sun - Sat)

# e.g. snap_acfs.pl -p stout -n stout -s "weekday"

# will clone from /u02/acfs/.ACFS/snaps/stout

# to /u02/acfs/.ACFS/snaps/stout.Tue (or whatever the day is)

# e.g. snap_acfs.pl -n stout -p stout.Mon

# will clone from /u02/acfs/.ACFS/snaps/stout.Mon

# to /u02/acfs/.ACFS/snaps/stout

# e.g. snap_acfs.pl -n stout2 -p stout

# will clone from /u02/acfs/.ACFS/snaps/stout

# to /u02/acfs/.ACFS/snaps/stout2

# EXISTING SNAPSHOT WILL BE DROPPED!!

use strict;

use File::Copy;

use Net::SMTP;

use Sys::Hostname;

use Getopt::Std 'getopts';

use File::Basename;

my $CloneDIR; # predefine rootDir variable

BEGIN {

use FindBin qw($Bin); # get the current path of script

use Cwd 'abs_path';

$CloneDIR = abs_path("$Bin/.."); # get the absolut rood path to clone directory

}

my $CloneLOGDir = $CloneDIR."/log"; # LOG Directory

my $baseACFS = "/u02/acfs/";

my $ORA_CRS_HOME = "/u01/app/grid/12.2.0.1";

my $acfsutil = "/usr/sbin/acfsutil";

my $basename = basename($0, ".pl");

my $ParentSnapName;

my $ParentSnap=0; ## no parent snapshots by default

my $PrefixName;

my $NewName;

my $SuffixName;

my %opts;

my $MountPoint;

my $SnapCreate;

################################################################################

# Main

################################################################################

my $StartDate = localtime;

&DoMsg ("Start of $basename.pl");

unless ( open (MAINLOG, ">>$CloneLOGDir/$basename.log") ) {

&DoMsg ("Can't open Main Logfile $CloneLOGDir/$basename.log");

exit 1;

}

# Process command line arguments

if ( ! defined @ARGV ) { &Usage; exit 1; }

getopts('n:p:s:b:', \%opts);

if ($opts{"p"}) {

$ParentSnapName = lc($opts{"p"});

} else {

&DoMsg ("Parent snapshot name not given!");

&Usage;

exit 1;

}

if ($opts{"n"}) {

$PrefixName = lc($opts{"n"});

} else {

&DoMsg ("New snapshot prefix not given! Defaults to ${ParentSnapName}");

$PrefixName = "${ParentSnapName}";

}

if ($opts{"s"}) {

$SuffixName = lc($opts{"s"});

if ( $SuffixName eq "weekday" ) {

$SuffixName = lc(&getWeekDay);

}

$SuffixName = "." . $SuffixName;

} else {

$SuffixName = "";

}

$NewName = "${PrefixName}${SuffixName}";

&DoMsg ("Parent: $ParentSnapName");

&DoMsg ("Prefix: $PrefixName");

&DoMsg ("Suffix: $SuffixName");

&DoMsg ("New Name: $NewName");

$MountPoint = $baseACFS;

$SnapCreate = "$acfsutil snap create -w -p $ParentSnapName $NewName $MountPoint";

&DoMsg ("Create Command: $SnapCreate ");

my $cmd = "$acfsutil snap info $NewName $MountPoint";

&DoMsg ($cmd);

open( CMD, $cmd . " |");

&DoMsg (join("", <CMD>));

close CMD;

if ( $? != 0 ) {

&DoMsg ("Snapshot $NewName does not exist inside mount point $MountPoint. Continuing.");

} else {

&DoMsg ("Snapshot $NewName already exists inside mount point $MountPoint. Now it will be deleted.");

$cmd = "$acfsutil snap delete $NewName $MountPoint";

&DoMsg ($cmd);

open( CMD, $cmd . " |");

&DoMsg (join("", <CMD>));

close CMD;

if ( $? != 0 ) {

&DoMsg ("Cannot delete Snapshot $NewName in mount point $MountPoint. Script will exit.");

exit 1;

}

&DoMsg ("Creating the new snapshot:");

&DoMsg ($SnapCreate);

open( CMD, $SnapCreate . " |");

&DoMsg (join("", <CMD>));

close CMD;

if ( $? != 0 ) {

&DoMsg ("Cannot create Snapshot $NewName in mount point $MountPoint. Script will exit.");

exit 1;

} #else {

#&DoMsg ("Current snapshots:");

#open( CMD, "$acfsutil snap info $MountPoint |");

#&DoMsg (join("", <CMD>));

#close CMD;

#-------------------------------------------------------------------------------

# DoMsg

# PURPOSE : echo with timestamp YYYY-MM-DD_H24:MI:SS

# PARAMS : $*: the messages

# GLOBAL VARS: none

#-------------------------------------------------------------------------------

sub DoMsg {

my $msg = shift;

my $timestamp = &getTimestamp;

print ("$timestamp $msg\n");

if (fileno(MAINLOG)) {print MAINLOG "$timestamp $msg\n";}

}

#-------------------------------------------------------------------------------

# getTimestamp

# PURPOSE : returns timestamp in different formats

# PARAMS : format_parm

# GLOBAL VARS: none

#-------------------------------------------------------------------------------

sub getTimestamp {

# Format 1: dd-mm-yyyy_hh24:mi:ss

# Format 2: dd.mm.yyyy_hh24miss

# Format 3: dd.mm.yyyy

# Format 4: hh24:mi:ss

# Rest: dd.mm.yyyy hh24:mi:ss (default)

my $Parm = shift;

my $date;

my $date2;

my $heure;

my $heure2;

my ($sec,$min,$hour,$mday,$mon,$year,$wday,$yday,$isdst);

if ( length($Parm) > 1 ) {

($sec,$min,$hour,$mday,$mon,$year,$wday,$yday,$isdst) = localtime($Parm);

}

else {

($sec,$min,$hour,$mday,$mon,$year,$wday,$yday,$isdst) = localtime;

}

$date = (sprintf "%2.0d",($mday)).".".(sprintf "%2.0d",($mon+1)).".".($year+1900);

$date =~ s/ /0/g;

$date2 = (sprintf "%2.0d",($mday))."-".(sprintf "%2.0d",($mon+1))."-".($year+1900);

$date2 =~ s/ /0/g;

$heure = (sprintf "%2.0d",($hour)).":".(sprintf "%2.0d",($min)).":".(sprintf "%2.0d",($sec));

$heure =~ s/ /0/g;

$heure2 = (sprintf "%2.0d",($hour)).(sprintf "%2.0d",($min)).(sprintf "%2.0d",($sec));

$heure2 =~ s/ /0/g;

if ($Parm eq "1") { return ($date2."_".$heure) }

elsif ($Parm eq "2") { return ($date."_".$heure2) }

elsif ($Parm eq "3") { return ($date) }

elsif ($Parm eq "4") { return ($heure) }

else { return ($date." ".$heure) };

}

#-------------------------------------------------------------------------------

# getWeekDay

# PURPOSE : returns weekday (Sun - Sat)

# GLOBAL VARS: none

#-------------------------------------------------------------------------------

sub getWeekDay{

my @date = split(" ", localtime(time));

my $day = $date[0];

return ($day);

}

#-------------------------------------------------------------------------------

# Usage

# PURPOSE : print the Usage

# PARAMS : none

# GLOBAL VARS: none

#-------------------------------------------------------------------------------

sub Usage {

print <<EOF

Usage: $basename -b <base> [Optional Arguments]

-p <parent> : name of the parent snapshot

Optional Arguments:

-n <prefix_name> : prefix of the new snapshot name (defaults to parent.18H)

-s <suffix> : use "weekday" to have the day name as suffix (Sun - Sat)

e.g. snap_acfs.pl -p scprod -n stout -s "weekday"

will clone from /u02/acfs/.ACFS\snaps\stout

to /u02/acfs/.ACFS\snaps\stout.Tue (or whatever the day is)

e.g. snap_acfs.pl -n stout -p stout.Mon

will clone from /u02/acfs/.ACFS\snaps\stout.Mon

to /u02/acfs/.ACFS\snaps\stout

e.g. snap_acfs.pl -n stout2 -p stout

will clone from /u02/acfs/.ACFS\snaps\stout

to /u02/acfs/.ACFS\snaps\stout2

EXISTING SNAPSHOT WILL BE DROPPED!!

EOF

}

snap_databasae.pl

#!/u01/app/oracle/tvdtoolbox/tvdperl-Linux-x86-64-02.04.00-05.08.04/bin/tvd_perl
#
# Purpose..........: Create a new snapshot of a standby database by apply-off, backup controlfile to trace, copy init, acfs snap, apply-on
# 
# snap_database.pl 
#        -b <base>
#        -n <name>   : prefix of the snapshot
#        -s <suffix> : optional, use "weekday" to have the day name as suffix (Sun - Sat)
#
# e.g. snap_database.pl -b stout -n stout_save  -s "weekday"
#      will clone from /u02/acfs/.ACFS/snaps/stout
#                   to /u02/acfs/.ACFS/snaps/stout_save.Tue (or whatever the day is)
#      
# EXISTING SNAPSHOT WILL BE DROPPED!!
#

#use strict;
use File::Copy;
use Net::SMTP;
use Sys::Hostname;
use Getopt::Std 'getopts';
use File::Basename;
use DBI;
use DBD::Oracle qw(:ora_session_modes);

my $CloneDIR;                             # predefine rootDir variable
BEGIN {
  use FindBin qw($Bin);                   # get the current path of script
  use Cwd 'abs_path';
  $CloneDIR    = abs_path("$Bin/..");     # get the absolut rood path to clone directory
}

my $CloneLOGDir = $CloneDIR."/log";       # LOG Directory
my $baseACFS = "/u02/acfs";
my $basename    = basename($0, ".pl");
my $PrefixName;
my $BaseDB;
my $SuffixName;
my $SnapshotName;
my %opts;
my $dbh;
my $db_create_file_dest;
my $db_unique_name;
my $cmd;
my $syspwd="Vagrant1_";
my $SnapError=0;
my $SnapDir;
my $ControlfileTrace = "control.trc";
my $ORACLE_HOME = "/u01/app/oracle/product/12.2.0.1/dbhome_1";
my $InitName = "init.ora";
my $warnings = 0;

################################################################################
#  Main
################################################################################
my $StartDate = localtime;
&DoMsg ("Start of $basename.pl");
unless ( open (MAINLOG, ">>$CloneLOGDir/$basename.log") ) {
	&DoMsg ("Can't open Main Logfile $CloneLOGDir/$basename.log");
    exit 1;
}

# Process command line arguments
if  ( ! defined @ARGV ) { &Usage; exit 1; } 
getopts('b:n:s:', \%opts);

if ($opts{"b"}) {
   $BaseDB = lc($opts{"b"});
} else {
   &DoMsg ("Base DB not given!");
   &Usage;
   exit 1;
}
if ($opts{"n"}) {
   $PrefixName    = lc($opts{"n"});
} else {
   $PrefixName    = "${BaseDB}_save";
}
if ($opts{"s"}) {
   $SuffixName    = lc($opts{"s"});
   if ( $SuffixName eq "weekday" ) {
      $SuffixName    = lc(&getWeekDay);
   }
   $SuffixName  = "." . $SuffixName;
} else {
   $SuffixName = "";
}

$SnapshotName = "${PrefixName}${SuffixName}";


&DoMsg ("Base: $BaseDB");
&DoMsg ("SnapshotName: $SnapshotName");

&ConnectDB ;

### checking that the database is mounted and physical standby

my $DBstatus= &QueryOneValue('select status from v$instance');
unless ( $DBstatus eq "MOUNTED" ) {
   &DoMsg ("Database is not in MOUNTED status, this is unexpected. Exiting.");
   exit 1
}

my $DBrole= &QueryOneValue('SELECT database_role FROM v$database');
unless ( $DBrole eq "PHYSICAL STANDBY" ) {
   &DoMsg ("Database role is not PHYSICAL STANDBY, this is unexpected. Exiting.");
   exit 1
}


$db_create_file_dest= &QueryOneValue(qq{SELECT value FROM v\$parameter2 WHERE name='db_create_file_dest'});
 &DoMsg ("db_create_file_dest: $db_create_file_dest");

$db_unique_name= &QueryOneValue(qq{SELECT value FROM v\$parameter2 WHERE name='db_unique_name'});
 &DoMsg ("db_unique_name: $db_unique_name");

#unless ($dbh->do(qq{ALTER SESSION SYNC WITH PRIMARY}) ) {
#   &DoMsg ("Error in syncing the session with the primary");
#   $warnings++;
#}

$cmd = qq{dgmgrl -echo sys/$syspwd "edit database $db_unique_name set state=\\\"APPLY-OFF\\\";"};
&DoMsg ($cmd);
open( CMD, $cmd . " |");
&DoMsg (join("",<CMD>));
close (CMD);
my $a=$?;
#if ( $? != 0 ) {
#   &DoMsg ("Error in stopping apply on standby $BaseDB. Exiting.");
#   exit 1
#}


$cmd = $CloneDIR."/bin/snap_acfs.pl -p $BaseDB -n $SnapshotName";
&DoMsg($cmd);
open( CMD, $cmd . " |");
print (join("", <CMD>)); ## only print here as it logs and echoes its time as well
close CMD;
#if ( $? != 0 ) {
#   # track if error in creating the snapshot: we continue and do the apply-on anyway!
#   $SnapError=1;
#}

$SnapDir = $baseACFS . "/.ACFS/snaps/" . $SnapshotName;
$ControlfileTrace = $SnapDir . "/" . $ControlfileTrace;
$InitName = $SnapDir . "/" . $InitName;

unless ($dbh->do(qq{ ALTER DATABASE BACKUP CONTROLFILE TO TRACE AS '$ControlfileTrace' REUSE RESETLOGS}) ) {
   &DoMsg ("Error in taking the controlfile trace $ControlfileTrace.");
   $warnings++;
}

unless ($dbh->do(qq{ CREATE PFILE='$InitName' FROM SPFILE }) ) {
   &DoMsg ("Error in creating the pfile $InitName.");
   $warnings++;
}

$cmd = qq{dgmgrl -echo sys/$syspwd "edit database $db_unique_name set state=\"APPLY-ON\""};
&DoMsg ($cmd);
open( CMD, $cmd . " |");
&DoMsg (join("", <CMD>));
close CMD;
#if ( $? != 0 ) {
#   &DoMsg ("Error in starting apply on standby $BaseDB. MANUAL INTERVENTION REQUIRED");
#   exit 1
#}

if ( $SnapError == 1 ) {
	&DoMsg ("There was an error in creating the snapshot. Exiting.");
        exit 1;
}



if ( $warnings != 0 ) {
   &DoMsg("There have been some warnings, but the procedure completed.");
} else {
   &DoMsg("The procedure completed successfully.");
}

&DisconnectDB ;


#-------------------------------------------------------------------------------
# DoMsg
#
# PURPOSE    : echo with timestamp YYYY-MM-DD_H24:MI:SS
# PARAMS     : $*: the messages
# GLOBAL VARS: none
#-------------------------------------------------------------------------------   
sub DoMsg {

   my $msg = shift;
   my $timestamp = &getTimestamp;
   
   print ("$timestamp $msg\n");
   if (fileno(MAINLOG)) {print MAINLOG "$timestamp $msg\n";}
}


#-------------------------------------------------------------------------------
# getTimestamp
#
# PURPOSE    : returns timestamp in different formats
# PARAMS     : format_parm
# GLOBAL VARS: none
#-------------------------------------------------------------------------------
sub getTimestamp {
   #
   # Format 1:  dd-mm-yyyy_hh24:mi:ss
   # Format 2:  dd.mm.yyyy_hh24miss
   # Format 3:  dd.mm.yyyy
   # Format 4:  hh24:mi:ss
   # Rest:      dd.mm.yyyy hh24:mi:ss  (default)
   #
   my $Parm = shift;
   my $date;
   my $date2;
   my $heure;
   my $heure2;
   my ($sec,$min,$hour,$mday,$mon,$year,$wday,$yday,$isdst);

   if ( length($Parm) > 1 ) {
      ($sec,$min,$hour,$mday,$mon,$year,$wday,$yday,$isdst) = localtime($Parm);
   }
   else {
      ($sec,$min,$hour,$mday,$mon,$year,$wday,$yday,$isdst) = localtime;
   }
   
   $date = (sprintf "%2.0d",($mday)).".".(sprintf "%2.0d",($mon+1)).".".($year+1900);
   $date =~ s/ /0/g;
   $date2 = (sprintf "%2.0d",($mday))."-".(sprintf "%2.0d",($mon+1))."-".($year+1900);
   $date2 =~ s/ /0/g;
   $heure = (sprintf "%2.0d",($hour)).":".(sprintf "%2.0d",($min)).":".(sprintf "%2.0d",($sec));
   $heure =~ s/ /0/g;
   $heure2 = (sprintf "%2.0d",($hour)).(sprintf "%2.0d",($min)).(sprintf "%2.0d",($sec));
   $heure2 =~ s/ /0/g;
   
   if    ($Parm eq "1") { return ($date2."_".$heure) }
   elsif ($Parm eq "2") { return ($date."_".$heure2) }
   elsif ($Parm eq "3") { return ($date) }
   elsif ($Parm eq "4") { return ($heure) }
   else { return ($date." ".$heure) };

}


#-------------------------------------------------------------------------------
# getWeekDay
#
# PURPOSE    : returns weekday (Sun - Sat)
# GLOBAL VARS: none
#-------------------------------------------------------------------------------
sub getWeekDay{
   my @date = split(" ", localtime(time));
   my $day = $date[0];
   return ($day);
}



#-------------------------------------------------------------------------------
# Usage
#
# PURPOSE    : print the Usage
# PARAMS     : none
# GLOBAL VARS: none
#-------------------------------------------------------------------------------
sub Usage {

   print <<EOF
   
Usage:  $basename -b <base>  [Optional Arguments]
           -b <base>       : name of the base database
       
        Purpose:
          Create a new snapshot of a standby database by apply-off, acfs snap, backup controlfile to trace, copy init, apply-on.

        Optional Arguments:
          -n <prefix_name> : prefix of the new snapshot name
          -s <suffix>      : use "weekday" to have the day name as suffix (Sun - Sat)

        examples:
            snap_database.pl -b stout -n stout.18h  -s "weekday"
            will clone from /u02/acfs/.ACFS/snaps/stout
                         to /u02/acfs/.ACFS/snaps/stout.18h.Tue (or whatever the day is)

      
            $basename -b stout -s "weekday"
            will clone from /u02/acfs/.ACFS/snaps/stout
                         to /u02/acfs/.ACFS/snaps/stout_save.Wed  (or whatever the day is)
      
  EXISTING SNAPSHOT WILL BE DROPPED!!

EOF

}


sub ConnectDB {

   # DB connection #
   $ENV{ORACLE_SID}=$BaseDB;
   $ENV{ORACLE_HOME}=$ORACLE_HOME;
   delete $ENV{TWO_TASK};

   &DoMsg ("Connecting to DB $BaseDB");
   unless ($dbh = DBI->connect('dbi:Oracle:', "sys", $syspwd, {PrintError=>0, AutoCommit => 0, ora_session_mode => ORA_SYSDBA}))  {
      &DoMsg ("Error connecting to DB: ". $DBI::errstr);
      exit(1);
   }

   #&DoMsg ("Connected to DB $BaseDB");

}

sub QueryOneValue {

   my $sth;
   my $query = shift;

   unless ($sth = $dbh->prepare ($query)) {
      &DoMsg ("Error preparing statement $query: ".$dbh->errstr);
   }
   $sth->execute;
   my ($result) = $sth->fetchrow_array;

   return $result;
}

sub DisconnectDB {
   $dbh->disconnect;
}

100

101

102

103

104

105

106

107

108

109

110

111

112

113

114

115

116

117

118

119

120

121

122

123

124

125

126

127

128

129

130

131

132

133

134

135

136

137

138

139

140

141

142

143

144

145

146

147

148

149

150

151

152

153

154

155

156

157

158

159

160

161

162

163

164

165

166

167

168

169

170

171

172

173

174

175

176

177

178

179

180

181

182

183

184

185

186

187

188

189

190

191

192

193

194

195

196

197

198

199

200

201

202

203

204

205

206

207

208

209

210

211

212

213

214

215

216

217

218

219

220

221

222

223

224

225

226

227

228

229

230

231

232

233

234

235

236

237

238

239

240

241

242

243

244

245

246

247

248

249

250

251

252

253

254

255

256

257

258

259

260

261

262

263

264

265

266

267

268

269

270

271

272

273

274

275

276

277

278

279

280

281

282

283

284

285

286

287

288

289

290

291

292

293

294

295

296

297

298

299

300

301

302

303

304

305

306

307

308

309

310

311

312

313

314

315

316

317

318

319

320

321

322

323

324

325

326

327

328

329

330

331

332

333

334

#!/u01/app/oracle/tvdtoolbox/tvdperl-Linux-x86-64-02.04.00-05.08.04/bin/tvd_perl

# Purpose..........: Create a new snapshot of a standby database by apply-off, backup controlfile to trace, copy init, acfs snap, apply-on

# snap_database.pl

# -b <base>

# -n <name> : prefix of the snapshot

# -s <suffix> : optional, use "weekday" to have the day name as suffix (Sun - Sat)

# e.g. snap_database.pl -b stout -n stout_save -s "weekday"

# will clone from /u02/acfs/.ACFS/snaps/stout

# to /u02/acfs/.ACFS/snaps/stout_save.Tue (or whatever the day is)

# EXISTING SNAPSHOT WILL BE DROPPED!!

#use strict;

use File::Copy;

use Net::SMTP;

use Sys::Hostname;

use Getopt::Std 'getopts';

use File::Basename;

use DBI;

use DBD::Oracle qw(:ora_session_modes);

my $CloneDIR; # predefine rootDir variable

BEGIN {

use FindBin qw($Bin); # get the current path of script

use Cwd 'abs_path';

$CloneDIR = abs_path("$Bin/.."); # get the absolut rood path to clone directory

}

my $CloneLOGDir = $CloneDIR."/log"; # LOG Directory

my $baseACFS = "/u02/acfs";

my $basename = basename($0, ".pl");

my $PrefixName;

my $BaseDB;

my $SuffixName;

my $SnapshotName;

my %opts;

my $dbh;

my $db_create_file_dest;

my $db_unique_name;

my $cmd;

my $syspwd="Vagrant1_";

my $SnapError=0;

my $SnapDir;

my $ControlfileTrace = "control.trc";

my $ORACLE_HOME = "/u01/app/oracle/product/12.2.0.1/dbhome_1";

my $InitName = "init.ora";

my $warnings = 0;

################################################################################

# Main

################################################################################

my $StartDate = localtime;

&DoMsg ("Start of $basename.pl");

unless ( open (MAINLOG, ">>$CloneLOGDir/$basename.log") ) {

&DoMsg ("Can't open Main Logfile $CloneLOGDir/$basename.log");

exit 1;

}

# Process command line arguments

if ( ! defined @ARGV ) { &Usage; exit 1; }

getopts('b:n:s:', \%opts);

if ($opts{"b"}) {

$BaseDB = lc($opts{"b"});

} else {

&DoMsg ("Base DB not given!");

&Usage;

exit 1;

}

if ($opts{"n"}) {

$PrefixName = lc($opts{"n"});

} else {

$PrefixName = "${BaseDB}_save";

}

if ($opts{"s"}) {

$SuffixName = lc($opts{"s"});

if ( $SuffixName eq "weekday" ) {

$SuffixName = lc(&getWeekDay);

}

$SuffixName = "." . $SuffixName;

} else {

$SuffixName = "";

}

$SnapshotName = "${PrefixName}${SuffixName}";

&DoMsg ("Base: $BaseDB");

&DoMsg ("SnapshotName: $SnapshotName");

&ConnectDB ;

### checking that the database is mounted and physical standby

my $DBstatus= &QueryOneValue('select status from v$instance');

unless ( $DBstatus eq "MOUNTED" ) {

&DoMsg ("Database is not in MOUNTED status, this is unexpected. Exiting.");

exit 1

}

my $DBrole= &QueryOneValue('SELECT database_role FROM v$database');

unless ( $DBrole eq "PHYSICAL STANDBY" ) {

&DoMsg ("Database role is not PHYSICAL STANDBY, this is unexpected. Exiting.");

exit 1

}

$db_create_file_dest= &QueryOneValue(qq{SELECT value FROM v\$parameter2 WHERE name='db_create_file_dest'});

&DoMsg ("db_create_file_dest: $db_create_file_dest");

$db_unique_name= &QueryOneValue(qq{SELECT value FROM v\$parameter2 WHERE name='db_unique_name'});

&DoMsg ("db_unique_name: $db_unique_name");

#unless ($dbh->do(qq{ALTER SESSION SYNC WITH PRIMARY}) ) {

# &DoMsg ("Error in syncing the session with the primary");

# $warnings++;

$cmd = qq{dgmgrl -echo sys/$syspwd "edit database $db_unique_name set state=\\\"APPLY-OFF\\\";"};

&DoMsg ($cmd);

open( CMD, $cmd . " |");

&DoMsg (join("",<CMD>));

close (CMD);

my $a=$?;

#if ( $? != 0 ) {

# &DoMsg ("Error in stopping apply on standby $BaseDB. Exiting.");

# exit 1

$cmd = $CloneDIR."/bin/snap_acfs.pl -p $BaseDB -n $SnapshotName";

&DoMsg($cmd);

open( CMD, $cmd . " |");

print (join("", <CMD>)); ## only print here as it logs and echoes its time as well

close CMD;

#if ( $? != 0 ) {

# # track if error in creating the snapshot: we continue and do the apply-on anyway!

# $SnapError=1;

$SnapDir = $baseACFS . "/.ACFS/snaps/" . $SnapshotName;

$ControlfileTrace = $SnapDir . "/" . $ControlfileTrace;

$InitName = $SnapDir . "/" . $InitName;

unless ($dbh->do(qq{ ALTER DATABASE BACKUP CONTROLFILE TO TRACE AS '$ControlfileTrace' REUSE RESETLOGS}) ) {

&DoMsg ("Error in taking the controlfile trace $ControlfileTrace.");

$warnings++;

}

unless ($dbh->do(qq{ CREATE PFILE='$InitName' FROM SPFILE }) ) {

&DoMsg ("Error in creating the pfile $InitName.");

$warnings++;

}

$cmd = qq{dgmgrl -echo sys/$syspwd "edit database $db_unique_name set state=\"APPLY-ON\""};

&DoMsg ($cmd);

open( CMD, $cmd . " |");

&DoMsg (join("", <CMD>));

close CMD;

#if ( $? != 0 ) {

# &DoMsg ("Error in starting apply on standby $BaseDB. MANUAL INTERVENTION REQUIRED");

# exit 1

if ( $SnapError == 1 ) {

&DoMsg ("There was an error in creating the snapshot. Exiting.");

exit 1;

}

if ( $warnings != 0 ) {

&DoMsg("There have been some warnings, but the procedure completed.");

} else {

&DoMsg("The procedure completed successfully.");

}

&DisconnectDB ;

#-------------------------------------------------------------------------------

# DoMsg

# PURPOSE : echo with timestamp YYYY-MM-DD_H24:MI:SS

# PARAMS : $*: the messages

# GLOBAL VARS: none

#-------------------------------------------------------------------------------

sub DoMsg {

my $msg = shift;

my $timestamp = &getTimestamp;

print ("$timestamp $msg\n");

if (fileno(MAINLOG)) {print MAINLOG "$timestamp $msg\n";}

}

#-------------------------------------------------------------------------------

# getTimestamp

# PURPOSE : returns timestamp in different formats

# PARAMS : format_parm

# GLOBAL VARS: none

#-------------------------------------------------------------------------------

sub getTimestamp {

# Format 1: dd-mm-yyyy_hh24:mi:ss

# Format 2: dd.mm.yyyy_hh24miss

# Format 3: dd.mm.yyyy

# Format 4: hh24:mi:ss

# Rest: dd.mm.yyyy hh24:mi:ss (default)

my $Parm = shift;

my $date;

my $date2;

my $heure;

my $heure2;

my ($sec,$min,$hour,$mday,$mon,$year,$wday,$yday,$isdst);

if ( length($Parm) > 1 ) {

($sec,$min,$hour,$mday,$mon,$year,$wday,$yday,$isdst) = localtime($Parm);

}

else {

($sec,$min,$hour,$mday,$mon,$year,$wday,$yday,$isdst) = localtime;

}

$date = (sprintf "%2.0d",($mday)).".".(sprintf "%2.0d",($mon+1)).".".($year+1900);

$date =~ s/ /0/g;

$date2 = (sprintf "%2.0d",($mday))."-".(sprintf "%2.0d",($mon+1))."-".($year+1900);

$date2 =~ s/ /0/g;

$heure = (sprintf "%2.0d",($hour)).":".(sprintf "%2.0d",($min)).":".(sprintf "%2.0d",($sec));

$heure =~ s/ /0/g;

$heure2 = (sprintf "%2.0d",($hour)).(sprintf "%2.0d",($min)).(sprintf "%2.0d",($sec));

$heure2 =~ s/ /0/g;

if ($Parm eq "1") { return ($date2."_".$heure) }

elsif ($Parm eq "2") { return ($date."_".$heure2) }

elsif ($Parm eq "3") { return ($date) }

elsif ($Parm eq "4") { return ($heure) }

else { return ($date." ".$heure) };

}

#-------------------------------------------------------------------------------

# getWeekDay

# PURPOSE : returns weekday (Sun - Sat)

# GLOBAL VARS: none

#-------------------------------------------------------------------------------

sub getWeekDay{

my @date = split(" ", localtime(time));

my $day = $date[0];

return ($day);

}

#-------------------------------------------------------------------------------

# Usage

# PURPOSE : print the Usage

# PARAMS : none

# GLOBAL VARS: none

#-------------------------------------------------------------------------------

sub Usage {

print <<EOF

Usage: $basename -b <base> [Optional Arguments]

-b <base> : name of the base database

Purpose:

Create a new snapshot of a standby database by apply-off, acfs snap, backup controlfile to trace, copy init, apply-on.

Optional Arguments:

-n <prefix_name> : prefix of the new snapshot name

-s <suffix> : use "weekday" to have the day name as suffix (Sun - Sat)

examples:

snap_database.pl -b stout -n stout.18h -s "weekday"

will clone from /u02/acfs/.ACFS/snaps/stout

to /u02/acfs/.ACFS/snaps/stout.18h.Tue (or whatever the day is)

$basename -b stout -s "weekday"

will clone from /u02/acfs/.ACFS/snaps/stout

to /u02/acfs/.ACFS/snaps/stout_save.Wed (or whatever the day is)

EXISTING SNAPSHOT WILL BE DROPPED!!

EOF

}

sub ConnectDB {

# DB connection #

$ENV{ORACLE_SID}=$BaseDB;

$ENV{ORACLE_HOME}=$ORACLE_HOME;

delete $ENV{TWO_TASK};

&DoMsg ("Connecting to DB $BaseDB");

unless ($dbh = DBI->connect('dbi:Oracle:', "sys", $syspwd, {PrintError=>0, AutoCommit => 0, ora_session_mode => ORA_SYSDBA})) {

&DoMsg ("Error connecting to DB: ". $DBI::errstr);

exit(1);

}

#&DoMsg ("Connected to DB $BaseDB");

}

sub QueryOneValue {

my $sth;

my $query = shift;

unless ($sth = $dbh->prepare ($query)) {

&DoMsg ("Error preparing statement $query: ".$dbh->errstr);

}

$sth->execute;

my ($result) = $sth->fetchrow_array;

return $result;

}

sub DisconnectDB {

$dbh->disconnect;

}

clone_from_snap.pl

#!/u01/app/oracle/tvdtoolbox/tvdperl-Linux-x86-64-02.04.00-05.08.04/bin/tvd_perl

use File::Copy;
use File::Path qw(mkpath rmtree);
use Net::SMTP;
use Sys::Hostname;
use Getopt::Std 'getopts';
use File::Basename;
use DBI;
use DBD::Oracle qw(:ora_session_modes);

my $CloneDIR;                             # predefine rootDir variable
BEGIN {
  use FindBin qw($Bin);                   # get the current path of script
  use Cwd 'abs_path';
  $CloneDIR    = abs_path("$Bin/..");     # get the absolut rood path to clone directory
}

my $CloneLOGDir = $CloneDIR."/log";       # LOG Directory
my $baseACFS = "/u02/acfs";
my $basename    = basename($0, ".pl");
my $BaseDB;
my $SnapshotName;
my $DestDB;
my $DestPath; # contains the final snapshot destination
my $oraenv = '/usr/local/bin/oraenv';
my $crsctl = '/u01/app/grid/12.2.0.1/bin/crsctl';
my $ORACLE_HOME = '/u01/app/oracle/product/12.2.0.1/dbhome_1';
my %opts;
my $dbh;
my $db_create_file_dest;
my $db_unique_name;
my $cmd;
my $SnapError=0;
my $SnapDir;
my $ControlfileTrace = "control.trc";
my $InitName = "init.ora";
my $warnings = 0;
my $foo;
my $dbUniqueName;

################################################################################
#  Main
################################################################################
my $StartDate = localtime;
&DoMsg ("Start of $basename.pl");
unless ( open (MAINLOG, ">>$CloneLOGDir/$basename.log") ) {
	&DoMsg ("Can't open Main Logfile $CloneLOGDir/$basename.log");
    exit 1;
}

# b: base db
# u: source database db_unique_name. if empty, will try to get it dynamically
# s: snapshot name
# d: destination name

# Process command line arguments
if  ( ! defined @ARGV ) { &Usage; exit 1; } 
getopts('b:s:d:u:', \%opts);

if ($opts{"b"}) {
   $BaseDB = $opts{"b"};
} else {
   &DoMsg ("Base DB not given!");
   &Usage;
   exit 1;
}
if ($opts{"s"}) {
   $SnapshotName = $opts{"s"};
} else {
   &DoMsg ("Snapshot Name not given!");
   &ListSnapshots;
   exit 1;
}
if ($opts{"d"}) {
   $DestDB = $opts{"d"};
} else {
   &DoMsg ("Dest DB not given!");
   &Usage;
   exit 1;
}


if ($opts{"u"}) {
   $dbUniqueName = $opts{"u"};
} else {
   &DoMsg ("db_unique_name not given, try to get it dynamically");
   
   &ConnectDB ;
   $dbUniqueName= &QueryOneValue(qq{SELECT value FROM v\$parameter2 WHERE name='db_unique_name'});
   &DisconnectDB ;
}

# show the parameters
&DoMsg ("Base: $BaseDB");
&DoMsg ("SnapshotName: $SnapshotName");
&DoMsg ("Dest: $DestDB");
&DoMsg ("db_unique_name: $dbUniqueName");


# try to get the ORACLE_HOME of the resource
my $cmd = "$crsctl status resource ora.".$DestDB.".db -f";
&DoMsg ($cmd);
open( CMD, $cmd . " |");
my @output = <CMD>;
close CMD;
#if ( $? != 0 ) {
#   &DoMsg ("Destination database does not exist, please configure it with srvctl");
#   exit 1;
#} 
foreach (@output) {
   chomp($_);
   if ($_ =~ /^ORACLE_HOME=/) {
      ($foo, $ORACLE_HOME) = split (/=/);
      $ENV{ORACLE_HOME}=$ORACLE_HOME;
      &DoMsg ("OH = $ORACLE_HOME");
   }
} 

# try to get the status of the resource using srvctl
my $cmd = "$ORACLE_HOME/bin/srvctl status database -d $DestDB";
&DoMsg ($cmd);
open( CMD, $cmd . " |");
&DoMsg (join("", <CMD>));
close CMD;
#if ( $? != 0 ) {
#   &DoMsg ("Destination database does not exist, please configure it");
#   exit 1;
#} 

# try to stop the dest db (will ignore errors)
my $cmd = "$ORACLE_HOME/bin/srvctl stop database -d $DestDB -o abort -f";
&DoMsg ($cmd);
open( CMD, $cmd . " |");
&DoMsg (join("", <CMD>));
close CMD;


# drop/recreate the snapshot using snap_acfs.pl
$cmd = "tvd_perl ".$CloneDIR."/bin/snap_acfs.pl -p $SnapshotName -n $DestDB";
&DoMsg($cmd);
open( CMD, $cmd . " |");
print (join("", <CMD>)); ## only print here as it logs and echoes its time as well
close CMD;
#if ( $? != 0 ) {
#   &DoMsg("Error creating the new snapshot for $DestDB. Exiting.");
#   exit(1);
#}

$DestPath = $baseACFS . '/.ACFS/snaps/' . $DestDB;
$ControlfileTrace = $DestPath.'/'.$ControlfileTrace;
$InitName = $DestPath.'/'.$InitName;

&DoMsg("Control file trace: $ControlfileTrace");
&DoMsg("Init file: $InitName");

### remove old archives, redo_logs and control files!
rmtree($baseACFS . '/fra/' . $DestDB , 1, 1 );
mkpath($baseACFS . '/fra/' . $DestDB );

## HERE WE HAVE THE CONTROL AND INIT READY TO BE MODIFIED

open(FILE, "<$ControlfileTrace");
my @ControlLines = <FILE>;
close(FILE);

# sed controlfile
my @NewControlLines;
push(@NewControlLines,"SET ECHO ON;\n");
push(@NewControlLines,"WHENEVER SQLERROR EXIT FAILURE;\n");
push(@NewControlLines,"CREATE SPFILE FROM PFILE='$InitName';\n");

foreach(@ControlLines) {
   # change the snapshot name in the paths
   $_ =~ s/u02\/$BaseDB/u02\/$DestDB/gi;
   # change the db_unique_name in the REDO paths
   $_ =~ s/fra\/$dbUniqueName/fra\/$DestDB/gi;


   # change the dbname in the create controlfile line
   $_ =~ s/CREATE CONTROLFILE.*$/CREATE CONTROLFILE REUSE SET DATABASE "$DestDB" RESETLOGS NOARCHIVELOG/;
   # everything after and including "recover database" can be skipped
   if ($_ =~ /^RECOVER DATABASE /) {
      last;
   }
   print ($_);
   push(@NewControlLines, $_);
}
push(@NewControlLines,"ALTER DATABASE OPEN RESETLOGS;\n");
push(@NewControlLines,"ALTER TABLESPACE TEMP ADD TEMPFILE SIZE 1G;\n");
push(@NewControlLines,"SELECT status FROM v\$instance;\n");
push(@NewControlLines,"QUIT;\n");

# write the new controlfile:
open(FILE, ">$ControlfileTrace");
print FILE @NewControlLines;
close(FILE);

# delete old controlfile
# no more necessary, deleted above  unlink ($DestPath.'/control01.ctl');

# sed init file
open(FILE, "<$InitName");
my @InitLines = <FILE>;
close(FILE);

@InitLines = grep(!/^$BaseDB/i, @InitLines);
@InitLines = grep(!/^\*\.db_name/, @InitLines);
@InitLines = grep(!/^\*\.db_unique_name/, @InitLines);
@InitLines = grep(!/^\*\.dispatchers/, @InitLines);
@InitLines = grep(!/^\*\.audit_file_dest/, @InitLines);
@InitLines = grep(!/^\*\.fal_server/, @InitLines);
@InitLines = grep(!/^\*\.fal_client/, @InitLines);
@InitLines = grep(!/^\*\.log_archive_config/, @InitLines);
@InitLines = grep(!/^\*\.log_archive_dest/, @InitLines);
@InitLines = grep(!/^\*\.memory_target/, @InitLines);
@InitLines = grep(!/^\*\.sga_target/, @InitLines);
@InitLines = grep(!/^\*\.pga_aggregate_target/, @InitLines);
@InitLines = grep(!/^\*\.service_names/, @InitLines);
@InitLines = grep(!/^\*\.dg_broker_start/, @InitLines);

my @NewInitLines;
foreach(@InitLines ) {
   # change only the snapshot name in the paths
   $_ =~ s/u02\/$BaseDB/u02\/$DestDB/gi;
   $_ =~ s/fra\/$dbUniqueName/fra\/$DestDB/gi;
   print ($_);
   push(@NewInitLines, $_);
}   

push(@NewInitLines, "*.db_name='$DestDB'\n");
push(@NewInitLines, "*.db_unique_name='$DestDB'\n");
push(@NewInitLines, "*.dispatchers='(PROTOCOL=TCP)(SERVICE=${DestDB}XDB)'\n");
push(@NewInitLines, "*.log_archive_dest_1='location=USE_DB_RECOVERY_FILE_DEST'\n");
push(@NewInitLines, "*.sga_target=1G\n");
push(@NewInitLines, "*.pga_aggregate_target=100M\n");
push(@NewInitLines, "*.service_names='$DestDB'\n");
#push(@NewInitLines, "*.\n");

# write the new init file
open(FILE, ">$InitName");
print FILE @NewInitLines;
close(FILE);

$ENV{ORACLE_SID}=$DestDB;
$cmd = "$ORACLE_HOME/bin/sqlplus / as sysdba \@$ControlfileTrace";
&DoMsg($cmd);
open( CMD, $cmd . " |");
print (join("", <CMD>)); ## only print here as it logs and echoes its time as well
close CMD;
#if ( $? != 0 ) {
#   &DoMsg("Error creating the new snapshot for $DestDB. Exiting.");
#   exit(1);
#}

&DoMsg("New database snapshot $DestDB created successfully!");
&DoMsg("Starting using srvctl:");

my $cmd = "$ORACLE_HOME/bin/srvctl start database -d $DestDB";
&DoMsg ($cmd);
open( CMD, $cmd . " |");
&DoMsg (join("", <CMD>));
close CMD;
#if ( $? != 0 ) {
#   &DoMsg ("Destination database cannot be started using srvctl");
#   exit 1;
#} 

# 

#-------------------------------------------------------------------------------
# DoMsg
#
# PURPOSE    : echo with timestamp YYYY-MM-DD_H24:MI:SS
# PARAMS     : $*: the messages
# GLOBAL VARS: none
#-------------------------------------------------------------------------------   
sub DoMsg {

   my $msg = shift;
   my $timestamp = &getTimestamp;
   
   print ("$timestamp $msg\n");
   if (fileno(MAINLOG)) {print MAINLOG "$timestamp $msg\n";}
}


#-------------------------------------------------------------------------------
# getTimestamp
#
# PURPOSE    : returns timestamp in different formats
# PARAMS     : format_parm
# GLOBAL VARS: none
#-------------------------------------------------------------------------------
sub getTimestamp {
   #
   # Format 1:  dd-mm-yyyy_hh24:mi:ss
   # Format 2:  dd.mm.yyyy_hh24miss
   # Format 3:  dd.mm.yyyy
   # Format 4:  hh24:mi:ss
   # Rest:      dd.mm.yyyy hh24:mi:ss  (default)
   #
   my $Parm = shift;
   my $date;
   my $date2;
   my $heure;
   my $heure2;
   my ($sec,$min,$hour,$mday,$mon,$year,$wday,$yday,$isdst);

   if ( length($Parm) > 1 ) {
      ($sec,$min,$hour,$mday,$mon,$year,$wday,$yday,$isdst) = localtime($Parm);
   }
   else {
      ($sec,$min,$hour,$mday,$mon,$year,$wday,$yday,$isdst) = localtime;
   }
   
   $date = (sprintf "%2.0d",($mday)).".".(sprintf "%2.0d",($mon+1)).".".($year+1900);
   $date =~ s/ /0/g;
   $date2 = (sprintf "%2.0d",($mday))."-".(sprintf "%2.0d",($mon+1))."-".($year+1900);
   $date2 =~ s/ /0/g;
   $heure = (sprintf "%2.0d",($hour)).":".(sprintf "%2.0d",($min)).":".(sprintf "%2.0d",($sec));
   $heure =~ s/ /0/g;
   $heure2 = (sprintf "%2.0d",($hour)).(sprintf "%2.0d",($min)).(sprintf "%2.0d",($sec));
   $heure2 =~ s/ /0/g;
   
   if    ($Parm eq "1") { return ($date2."_".$heure) }
   elsif ($Parm eq "2") { return ($date."_".$heure2) }
   elsif ($Parm eq "3") { return ($date) }
   elsif ($Parm eq "4") { return ($heure) }
   else { return ($date." ".$heure) };

}


#-------------------------------------------------------------------------------
# getWeekDay
#
# PURPOSE    : returns weekday (Sun - Sat)
# GLOBAL VARS: none
#-------------------------------------------------------------------------------
sub getWeekDay{
   my @date = split(" ", localtime(time));
   my $day = $date[0];
   return ($day);
}


#-------------------------------------------------------------------------------
# callSQLPLUS
#
# PURPOSE    : calls the rman utility
# PARAMS     : rman script name
# GLOBAL VARS: ReturnStatus, LogFile
#-------------------------------------------------------------------------------
#sub callSQLPLUS {
#    my $script = shift;
#	open( SQL, "$ORACLE_HOME/bin/sqlplus /nolog  \@$script |");  
#    &DoMsg (join("", <SQL>));
#    if ( $? != 0 ) { $rc = 1; } # RC if last call create an error
#    close SQL;
#}



#-------------------------------------------------------------------------------
# Usage
#
# PURPOSE    : print the Usage
# PARAMS     : none
# GLOBAL VARS: none
#-------------------------------------------------------------------------------
sub Usage {

   print <<EOF
   
Usage:  $basename -b <base>  [Optional Arguments]
           -b <base>       : db_name of the source database 
           -d <base>       : name of the destination database
           -s <snapshot>   : name of the snapshot to be used

        Purpose:
          Create a new snapshot of a standby database by apply-off, backup controlfile to trace, copy init, acfs snap, apply-on.


        Optional Arguments:
           -u <db_unique_name>   : name of the db_unique_name of the source database. if not specified, it will be taked from the source db, but it must be mounted!
                                   this parameter is used only for pattern replacement inside control file trace and init file.

        examples:
            $basename -b stout -s stout_save.Wed -d poug2648
            will clone stout from snapshot $baseACFS/.ACFS/snaps/stout_save.Wed to poug2648 
      
  THE EXISTING DESTINATION DATABASE SNAPSHOT WILL BE DROPPED!!
EOF

}


sub ConnectDB {

   # DB connection #
   $ENV{ORACLE_HOME}=$ORACLE_HOME;
   $ENV{ORACLE_SID}=$BaseDB;
   delete $ENV{TWO_TASK};

   &DoMsg ("Connecting to DB $BaseDB");
   &DoMsg ("OH: $ORACLE_HOME");
   &DoMsg ("SID: $BaseDB");
   unless ($dbh = DBI->connect('dbi:Oracle:', "sys", "Vagrant1_", {PrintError=>0, AutoCommit => 0, ora_session_mode => ORA_SYSDBA}))  {
      &DoMsg ("Error connecting to DB: ". $DBI::errstr);
      exit(1);
   }

   #&DoMsg ("Connected to DB $BaseDB");

}

sub QueryOneValue {

   my $sth;
   my $query = shift;

   unless ($sth = $dbh->prepare ($query)) {
      &DoMsg ("Error preparing statement $query: ".$dbh->errstr);
   }
   $sth->execute;
   my ($result) = $sth->fetchrow_array;

   return $result;
}

sub DisconnectDB {
   $dbh->disconnect;
}

100

101

102

103

104

105

106

107

108

109

110

111

112

113

114

115

116

117

118

119

120

121

122

123

124

125

126

127

128

129

130

131

132

133

134

135

136

137

138

139

140

141

142

143

144

145

146

147

148

149

150

151

152

153

154

155

156

157

158

159

160

161

162

163

164

165

166

167

168

169

170

171

172

173

174

175

176

177

178

179

180

181

182

183

184

185

186

187

188

189

190

191

192

193

194

195

196

197

198

199

200

201

202

203

204

205

206

207

208

209

210

211

212

213

214

215

216

217

218

219

220

221

222

223

224

225

226

227

228

229

230

231

232

233

234

235

236

237

238

239

240

241

242

243

244

245

246

247

248

249

250

251

252

253

254

255

256

257

258

259

260

261

262

263

264

265

266

267

268

269

270

271

272

273

274

275

276

277

278

279

280

281

282

283

284

285

286

287

288

289

290

291

292

293

294

295

296

297

298

299

300

301

302

303

304

305

306

307

308

309

310

311

312

313

314

315

316

317

318

319

320

321

322

323

324

325

326

327

328

329

330

331

332

333

334

335

336

337

338

339

340

341

342

343

344

345

346

347

348

349

350

351

352

353

354

355

356

357

358

359

360

361

362

363

364

365

366

367

368

369

370

371

372

373

374

375

376

377

378

379

380

381

382

383

384

385

386

387

388

389

390

391

392

393

394

395

396

397

398

399

400

401

402

403

404

405

406

407

408

409

410

411

412

413

414

415

416

417

418

419

420

421

422

423

424

425

426

427

428

429

430

431

432

433

434

#!/u01/app/oracle/tvdtoolbox/tvdperl-Linux-x86-64-02.04.00-05.08.04/bin/tvd_perl

use File::Copy;

use File::Path qw(mkpath rmtree);

use Net::SMTP;

use Sys::Hostname;

use Getopt::Std 'getopts';

use File::Basename;

use DBI;

use DBD::Oracle qw(:ora_session_modes);

my $CloneDIR; # predefine rootDir variable

BEGIN {

use FindBin qw($Bin); # get the current path of script

use Cwd 'abs_path';

$CloneDIR = abs_path("$Bin/.."); # get the absolut rood path to clone directory

}

my $CloneLOGDir = $CloneDIR."/log"; # LOG Directory

my $baseACFS = "/u02/acfs";

my $basename = basename($0, ".pl");

my $BaseDB;

my $SnapshotName;

my $DestDB;

my $DestPath; # contains the final snapshot destination

my $oraenv = '/usr/local/bin/oraenv';

my $crsctl = '/u01/app/grid/12.2.0.1/bin/crsctl';

my $ORACLE_HOME = '/u01/app/oracle/product/12.2.0.1/dbhome_1';

my %opts;

my $dbh;

my $db_create_file_dest;

my $db_unique_name;

my $cmd;

my $SnapError=0;

my $SnapDir;

my $ControlfileTrace = "control.trc";

my $InitName = "init.ora";

my $warnings = 0;

my $foo;

my $dbUniqueName;

################################################################################

# Main

################################################################################

my $StartDate = localtime;

&DoMsg ("Start of $basename.pl");

unless ( open (MAINLOG, ">>$CloneLOGDir/$basename.log") ) {

&DoMsg ("Can't open Main Logfile $CloneLOGDir/$basename.log");

exit 1;

}

# b: base db

# u: source database db_unique_name. if empty, will try to get it dynamically

# s: snapshot name

# d: destination name

# Process command line arguments

if ( ! defined @ARGV ) { &Usage; exit 1; }

getopts('b:s:d:u:', \%opts);

if ($opts{"b"}) {

$BaseDB = $opts{"b"};

} else {

&DoMsg ("Base DB not given!");

&Usage;

exit 1;

}

if ($opts{"s"}) {

$SnapshotName = $opts{"s"};

} else {

&DoMsg ("Snapshot Name not given!");

&ListSnapshots;

exit 1;

}

if ($opts{"d"}) {

$DestDB = $opts{"d"};

} else {

&DoMsg ("Dest DB not given!");

&Usage;

exit 1;

}

if ($opts{"u"}) {

$dbUniqueName = $opts{"u"};

} else {

&DoMsg ("db_unique_name not given, try to get it dynamically");

&ConnectDB ;

$dbUniqueName= &QueryOneValue(qq{SELECT value FROM v\$parameter2 WHERE name='db_unique_name'});

&DisconnectDB ;

}

# show the parameters

&DoMsg ("Base: $BaseDB");

&DoMsg ("SnapshotName: $SnapshotName");

&DoMsg ("Dest: $DestDB");

&DoMsg ("db_unique_name: $dbUniqueName");

# try to get the ORACLE_HOME of the resource

my $cmd = "$crsctl status resource ora.".$DestDB.".db -f";

&DoMsg ($cmd);

open( CMD, $cmd . " |");

my @output = <CMD>;

close CMD;

#if ( $? != 0 ) {

# &DoMsg ("Destination database does not exist, please configure it with srvctl");

# exit 1;

foreach (@output) {

chomp($_);

if ($_ =~ /^ORACLE_HOME=/) {

($foo, $ORACLE_HOME) = split (/=/);

$ENV{ORACLE_HOME}=$ORACLE_HOME;

&DoMsg ("OH = $ORACLE_HOME");

}

# try to get the status of the resource using srvctl

my $cmd = "$ORACLE_HOME/bin/srvctl status database -d $DestDB";

&DoMsg ($cmd);

open( CMD, $cmd . " |");

&DoMsg (join("", <CMD>));

close CMD;

#if ( $? != 0 ) {

# &DoMsg ("Destination database does not exist, please configure it");

# exit 1;

# try to stop the dest db (will ignore errors)

my $cmd = "$ORACLE_HOME/bin/srvctl stop database -d $DestDB -o abort -f";

&DoMsg ($cmd);

open( CMD, $cmd . " |");

&DoMsg (join("", <CMD>));

close CMD;

# drop/recreate the snapshot using snap_acfs.pl

$cmd = "tvd_perl ".$CloneDIR."/bin/snap_acfs.pl -p $SnapshotName -n $DestDB";

&DoMsg($cmd);

open( CMD, $cmd . " |");

print (join("", <CMD>)); ## only print here as it logs and echoes its time as well

close CMD;

#if ( $? != 0 ) {

# &DoMsg("Error creating the new snapshot for $DestDB. Exiting.");

# exit(1);

$DestPath = $baseACFS . '/.ACFS/snaps/' . $DestDB;

$ControlfileTrace = $DestPath.'/'.$ControlfileTrace;

$InitName = $DestPath.'/'.$InitName;

&DoMsg("Control file trace: $ControlfileTrace");

&DoMsg("Init file: $InitName");

### remove old archives, redo_logs and control files!

rmtree($baseACFS . '/fra/' . $DestDB , 1, 1 );

mkpath($baseACFS . '/fra/' . $DestDB );

## HERE WE HAVE THE CONTROL AND INIT READY TO BE MODIFIED

open(FILE, "<$ControlfileTrace");

my @ControlLines = <FILE>;

close(FILE);

# sed controlfile

my @NewControlLines;

push(@NewControlLines,"SET ECHO ON;\n");

push(@NewControlLines,"WHENEVER SQLERROR EXIT FAILURE;\n");

push(@NewControlLines,"CREATE SPFILE FROM PFILE='$InitName';\n");

foreach(@ControlLines) {

# change the snapshot name in the paths

$_ =~ s/u02\/$BaseDB/u02\/$DestDB/gi;

# change the db_unique_name in the REDO paths

$_ =~ s/fra\/$dbUniqueName/fra\/$DestDB/gi;

# change the dbname in the create controlfile line

$_ =~ s/CREATE CONTROLFILE.*$/CREATE CONTROLFILE REUSE SET DATABASE "$DestDB" RESETLOGS NOARCHIVELOG/;

# everything after and including "recover database" can be skipped

if ($_ =~ /^RECOVER DATABASE /) {

last;

}

print ($_);

push(@NewControlLines, $_);

}

push(@NewControlLines,"ALTER DATABASE OPEN RESETLOGS;\n");

push(@NewControlLines,"ALTER TABLESPACE TEMP ADD TEMPFILE SIZE 1G;\n");

push(@NewControlLines,"SELECT status FROM v\$instance;\n");

push(@NewControlLines,"QUIT;\n");

# write the new controlfile:

open(FILE, ">$ControlfileTrace");

print FILE @NewControlLines;

close(FILE);

# delete old controlfile

# no more necessary, deleted above unlink ($DestPath.'/control01.ctl');

# sed init file

open(FILE, "<$InitName");

my @InitLines = <FILE>;

close(FILE);

@InitLines = grep(!/^$BaseDB/i, @InitLines);

@InitLines = grep(!/^\*\.db_name/, @InitLines);

@InitLines = grep(!/^\*\.db_unique_name/, @InitLines);

@InitLines = grep(!/^\*\.dispatchers/, @InitLines);

@InitLines = grep(!/^\*\.audit_file_dest/, @InitLines);

@InitLines = grep(!/^\*\.fal_server/, @InitLines);

@InitLines = grep(!/^\*\.fal_client/, @InitLines);

@InitLines = grep(!/^\*\.log_archive_config/, @InitLines);

@InitLines = grep(!/^\*\.log_archive_dest/, @InitLines);

@InitLines = grep(!/^\*\.memory_target/, @InitLines);

@InitLines = grep(!/^\*\.sga_target/, @InitLines);

@InitLines = grep(!/^\*\.pga_aggregate_target/, @InitLines);

@InitLines = grep(!/^\*\.service_names/, @InitLines);

@InitLines = grep(!/^\*\.dg_broker_start/, @InitLines);

my @NewInitLines;

foreach(@InitLines ) {

# change only the snapshot name in the paths

$_ =~ s/u02\/$BaseDB/u02\/$DestDB/gi;

$_ =~ s/fra\/$dbUniqueName/fra\/$DestDB/gi;

print ($_);

push(@NewInitLines, $_);

}

push(@NewInitLines, "*.db_name='$DestDB'\n");

push(@NewInitLines, "*.db_unique_name='$DestDB'\n");

push(@NewInitLines, "*.dispatchers='(PROTOCOL=TCP)(SERVICE=${DestDB}XDB)'\n");

push(@NewInitLines, "*.log_archive_dest_1='location=USE_DB_RECOVERY_FILE_DEST'\n");

push(@NewInitLines, "*.sga_target=1G\n");

push(@NewInitLines, "*.pga_aggregate_target=100M\n");

push(@NewInitLines, "*.service_names='$DestDB'\n");

#push(@NewInitLines, "*.\n");

# write the new init file

open(FILE, ">$InitName");

print FILE @NewInitLines;

close(FILE);

$ENV{ORACLE_SID}=$DestDB;

$cmd = "$ORACLE_HOME/bin/sqlplus / as sysdba \@$ControlfileTrace";

&DoMsg($cmd);

open( CMD, $cmd . " |");

print (join("", <CMD>)); ## only print here as it logs and echoes its time as well

close CMD;

#if ( $? != 0 ) {

# &DoMsg("Error creating the new snapshot for $DestDB. Exiting.");

# exit(1);

&DoMsg("New database snapshot $DestDB created successfully!");

&DoMsg("Starting using srvctl:");

my $cmd = "$ORACLE_HOME/bin/srvctl start database -d $DestDB";

&DoMsg ($cmd);

open( CMD, $cmd . " |");

&DoMsg (join("", <CMD>));

close CMD;

#if ( $? != 0 ) {

# &DoMsg ("Destination database cannot be started using srvctl");

# exit 1;

#-------------------------------------------------------------------------------

# DoMsg

# PURPOSE : echo with timestamp YYYY-MM-DD_H24:MI:SS

# PARAMS : $*: the messages

# GLOBAL VARS: none

#-------------------------------------------------------------------------------

sub DoMsg {

my $msg = shift;

my $timestamp = &getTimestamp;

print ("$timestamp $msg\n");

if (fileno(MAINLOG)) {print MAINLOG "$timestamp $msg\n";}

}

#-------------------------------------------------------------------------------

# getTimestamp

# PURPOSE : returns timestamp in different formats

# PARAMS : format_parm

# GLOBAL VARS: none

#-------------------------------------------------------------------------------

sub getTimestamp {

# Format 1: dd-mm-yyyy_hh24:mi:ss

# Format 2: dd.mm.yyyy_hh24miss

# Format 3: dd.mm.yyyy

# Format 4: hh24:mi:ss

# Rest: dd.mm.yyyy hh24:mi:ss (default)

my $Parm = shift;

my $date;

my $date2;

my $heure;

my $heure2;

my ($sec,$min,$hour,$mday,$mon,$year,$wday,$yday,$isdst);

if ( length($Parm) > 1 ) {

($sec,$min,$hour,$mday,$mon,$year,$wday,$yday,$isdst) = localtime($Parm);

}

else {

($sec,$min,$hour,$mday,$mon,$year,$wday,$yday,$isdst) = localtime;

}

$date = (sprintf "%2.0d",($mday)).".".(sprintf "%2.0d",($mon+1)).".".($year+1900);

$date =~ s/ /0/g;

$date2 = (sprintf "%2.0d",($mday))."-".(sprintf "%2.0d",($mon+1))."-".($year+1900);

$date2 =~ s/ /0/g;

$heure = (sprintf "%2.0d",($hour)).":".(sprintf "%2.0d",($min)).":".(sprintf "%2.0d",($sec));

$heure =~ s/ /0/g;

$heure2 = (sprintf "%2.0d",($hour)).(sprintf "%2.0d",($min)).(sprintf "%2.0d",($sec));

$heure2 =~ s/ /0/g;

if ($Parm eq "1") { return ($date2."_".$heure) }

elsif ($Parm eq "2") { return ($date."_".$heure2) }

elsif ($Parm eq "3") { return ($date) }

elsif ($Parm eq "4") { return ($heure) }

else { return ($date." ".$heure) };

}

#-------------------------------------------------------------------------------

# getWeekDay

# PURPOSE : returns weekday (Sun - Sat)

# GLOBAL VARS: none

#-------------------------------------------------------------------------------

sub getWeekDay{

my @date = split(" ", localtime(time));

my $day = $date[0];

return ($day);

}

#-------------------------------------------------------------------------------

# callSQLPLUS

# PURPOSE : calls the rman utility

# PARAMS : rman script name

# GLOBAL VARS: ReturnStatus, LogFile

#-------------------------------------------------------------------------------

#sub callSQLPLUS {

# my $script = shift;

# open( SQL, "$ORACLE_HOME/bin/sqlplus /nolog \@$script |");

# &DoMsg (join("", <SQL>));

# if ( $? != 0 ) { $rc = 1; } # RC if last call create an error

# close SQL;

#-------------------------------------------------------------------------------

# Usage

# PURPOSE : print the Usage

# PARAMS : none

# GLOBAL VARS: none

#-------------------------------------------------------------------------------

sub Usage {

print <<EOF

Usage: $basename -b <base> [Optional Arguments]

-b <base> : db_name of the source database

-d <base> : name of the destination database

-s <snapshot> : name of the snapshot to be used

Purpose:

Create a new snapshot of a standby database by apply-off, backup controlfile to trace, copy init, acfs snap, apply-on.

Optional Arguments:

-u <db_unique_name> : name of the db_unique_name of the source database. if not specified, it will be taked from the source db, but it must be mounted!

this parameter is used only for pattern replacement inside control file trace and init file.

examples:

$basename -b stout -s stout_save.Wed -d poug2648

will clone stout from snapshot $baseACFS/.ACFS/snaps/stout_save.Wed to poug2648

THE EXISTING DESTINATION DATABASE SNAPSHOT WILL BE DROPPED!!

EOF

}

sub ConnectDB {

# DB connection #

$ENV{ORACLE_HOME}=$ORACLE_HOME;

$ENV{ORACLE_SID}=$BaseDB;

delete $ENV{TWO_TASK};

&DoMsg ("Connecting to DB $BaseDB");

&DoMsg ("OH: $ORACLE_HOME");

&DoMsg ("SID: $BaseDB");

unless ($dbh = DBI->connect('dbi:Oracle:', "sys", "Vagrant1_", {PrintError=>0, AutoCommit => 0, ora_session_mode => ORA_SYSDBA})) {

&DoMsg ("Error connecting to DB: ". $DBI::errstr);

exit(1);

}

#&DoMsg ("Connected to DB $BaseDB");

}

sub QueryOneValue {

my $sth;

my $query = shift;

unless ($sth = $dbh->prepare ($query)) {

&DoMsg ("Error preparing statement $query: ".$dbh->errstr);

}

$sth->execute;

my ($result) = $sth->fetchrow_array;

return $result;

}

sub DisconnectDB {

$dbh->disconnect;

}

Cheers

—

Ludovico

12.1.0.2 Bundle Patch 170718 breaks Data Guard and Duplicate from active database

Posted on September 14, 2017 by Ludovico

Recently my customer patched its 12.1.0.2 databases with the Bundle Patch 170718 on the new servers (half of the customer’s environment). The old servers are still on 161018 Bundle Patch.

We realized that we could not move anymore the databases from the old servers to the new ones because the duplicate from active database was failing with this error:

RMAN-00571: ===========================================================
RMAN-00569: =============== ERROR MESSAGE STACK FOLLOWS ===============
RMAN-00571: ===========================================================
RMAN-03002: failure of Duplicate Db command at 09/11/2017 15:59:32
RMAN-05501: aborting duplication of target database
RMAN-03015: error occurred in stored script Memory Script
RMAN-03009: failure of backup command on prmy1 channel at 09/11/2017 15:59:32
ORA-17629: Cannot connect to the remote database server
ORA-17630: Mismatch in the remote file protocol version client 2 server 3

RMAN-00571: ===========================================================

RMAN-00569: =============== ERROR MESSAGE STACK FOLLOWS ===============

RMAN-00571: ===========================================================

RMAN-03002: failure of Duplicate Db command at 09/11/2017 15:59:32

RMAN-05501: aborting duplication of target database

RMAN-03015: error occurred in stored script Memory Script

RMAN-03009: failure of backup command on prmy1 channel at 09/11/2017 15:59:32

ORA-17629: Cannot connect to the remote database server

ORA-17630: Mismatch in the remote file protocol version client 2 server 3

The last lines shows the same error that Franck blogged about some months ago.

Oracle 12.2 had introduced incompatibility with previous releases in remote file transfer via SQL*Net. At least this is what it seems. According to Oracle, this is due to a bugfix present in Oracle 12.2

Now, the bundle patch that we installed on BP 170718 contains the same bugfix (Patch for bug 18633374).

So, the incompatibility happens now between databases of the same “Major Release” (12.1.0.2).

There are two possible workarounds:

Apply the same patch level on both sides (BP170718 in my case)
Apply just the patch 18633374 on top of your current PSU/DBBP (a merge might be necessary).

We used the second approach and now we can setup Data Guard again to move our databases without downtime:

oracle@oldserver $ opatch lspatches
18633374;   <<<<<< FIX!
24340679;DATABASE BUNDLE PATCH: 12.1.0.2.161018 (24340679)

oracle@newserver $ opatch lspatches
22652097;
22243983;
25869760;DATABASE BUNDLE PATCH: 12.1.0.2.170718 (25869760)

oracle@oldserver $ opatch lspatches

18633374; <<<<<< FIX!

24340679;DATABASE BUNDLE PATCH: 12.1.0.2.161018 (24340679)

oracle@newserver $ opatch lspatches

22652097;

22243983;

25869760;DATABASE BUNDLE PATCH: 12.1.0.2.170718 (25869760)

HTH

—

Ludovico

Which Oracle Databases use most CPU on my server?

Posted on May 24, 2017 by Ludovico

Assumptions

You have many (hundreds) of instances and more than a couple of servers
One of your servers have high CPU Load
You have Enterprise Manager 12c but the Database Load does not filter by server
You want to have an historical representation of the user CPU utilization, per instance

Getting the data from the EM Repository

With the following query, connected to the SYSMAN schema of your EM repository, you can get the hourly max() and/or avg() of user CPU by instance and time.

SELECT entity_name,
  ROUND(collection_time,'HH') AS colltime,
  ROUND(avg_value,2)/16*100   AS avgv, -- 16 is my number of CPU
  ROUND(max_value,2)/16*100   AS maxv  -- same here
FROM gc$metric_values_hourly mv
JOIN em_targets t
ON (t.target_name         =mv.entity_name)
WHERE t.host_name         ='myserver1'  -- myserver1 is the server that has high CPU Usage
AND mv.metric_column_name = 'user_cpu_time_cnt' -- let's get the user cpu time
AND collection_time>sysdate-14  -- for the lase 14 days
ORDER BY entity_name,
  ROUND(collection_time,'HH');

SELECT entity_name,

ROUND(collection_time,'HH') AS colltime,

ROUND(avg_value,2)/16*100 AS avgv, -- 16 is my number of CPU

ROUND(max_value,2)/16*100 AS maxv -- same here

FROM gc$metric_values_hourly mv

JOIN em_targets t

ON (t.target_name =mv.entity_name)

WHERE t.host_name ='myserver1' -- myserver1 is the server that has high CPU Usage

AND mv.metric_column_name = 'user_cpu_time_cnt' -- let's get the user cpu time

AND collection_time>sysdate-14 -- for the lase 14 days

ORDER BY entity_name,

ROUND(collection_time,'HH');

Suppose you select just the max value: the result will be similar to this:

ENTITY_ COLLTIME          MAXV
------- ----------------  ------
mydbone	10.05.2017 16:00  0.3125
mydbone	10.05.2017 17:00  0.1875
mydbone	10.05.2017 18:00  0.1875
mydbone	10.05.2017 19:00  0.1875
mydbone	10.05.2017 20:00  0.25
mydbone	10.05.2017 21:00  0.125
mydbone	10.05.2017 22:00  0.125
mydbone	10.05.2017 23:00  0.125
mydbone	11.05.2017 00:00  0.1875
mydbone	11.05.2017 01:00  0.125
mydbone	11.05.2017 02:00  0.1875
mydbone	11.05.2017 03:00  0.1875
....                      
mydbone	23.05.2017 20:00  0.125
mydbone	23.05.2017 21:00  0.125
mydbone	23.05.2017 22:00  0.125
mydbone	23.05.2017 23:00  0.0625
mydbtwo	10.05.2017 16:00  0.3125
mydbtwo	10.05.2017 17:00  0.25
mydbtwo	10.05.2017 18:00  0.1875
mydbtwo	10.05.2017 19:00  0.1875
mydbtwo	10.05.2017 20:00  0.3125
mydbtwo	10.05.2017 21:00  0.125
mydbtwo	10.05.2017 22:00  0.125
mydbtwo	10.05.2017 23:00  0.125
.....                     
mydbtwo	14.05.2017 19:00  0.125
mydbtwo	14.05.2017 20:00  0.125
mydbtwo	14.05.2017 21:00  0.125
mydbtwo	14.05.2017 22:00  0.125
mydbtwo	14.05.2017 23:00  0.125
dbthree	10.05.2017 16:00  1.1875
dbthree	10.05.2017 17:00  0.6875
dbthree	10.05.2017 18:00  0.625
dbthree	10.05.2017 19:00  0.5625
dbthree	10.05.2017 20:00  0.8125
dbthree	10.05.2017 21:00  0.5
dbthree	10.05.2017 22:00  0.4375
dbthree	10.05.2017 23:00  0.4375
...

ENTITY_ COLLTIME MAXV

------- ---------------- ------

mydbone 10.05.2017 16:00 0.3125

mydbone 10.05.2017 17:00 0.1875

mydbone 10.05.2017 18:00 0.1875

mydbone 10.05.2017 19:00 0.1875

mydbone 10.05.2017 20:00 0.25

mydbone 10.05.2017 21:00 0.125

mydbone 10.05.2017 22:00 0.125

mydbone 10.05.2017 23:00 0.125

mydbone 11.05.2017 00:00 0.1875

mydbone 11.05.2017 01:00 0.125

mydbone 11.05.2017 02:00 0.1875

mydbone 11.05.2017 03:00 0.1875

....

mydbone 23.05.2017 20:00 0.125

mydbone 23.05.2017 21:00 0.125

mydbone 23.05.2017 22:00 0.125

mydbone 23.05.2017 23:00 0.0625

mydbtwo 10.05.2017 16:00 0.3125

mydbtwo 10.05.2017 17:00 0.25

mydbtwo 10.05.2017 18:00 0.1875

mydbtwo 10.05.2017 19:00 0.1875

mydbtwo 10.05.2017 20:00 0.3125

mydbtwo 10.05.2017 21:00 0.125

mydbtwo 10.05.2017 22:00 0.125

mydbtwo 10.05.2017 23:00 0.125

.....

mydbtwo 14.05.2017 19:00 0.125

mydbtwo 14.05.2017 20:00 0.125

mydbtwo 14.05.2017 21:00 0.125

mydbtwo 14.05.2017 22:00 0.125

mydbtwo 14.05.2017 23:00 0.125

dbthree 10.05.2017 16:00 1.1875

dbthree 10.05.2017 17:00 0.6875

dbthree 10.05.2017 18:00 0.625

dbthree 10.05.2017 19:00 0.5625

dbthree 10.05.2017 20:00 0.8125

dbthree 10.05.2017 21:00 0.5

dbthree 10.05.2017 22:00 0.4375

dbthree 10.05.2017 23:00 0.4375

...

Putting it into excel

There are one million ways to do something more reusable than excel (like rrdtool scripts, gnuplot, R, name it), but Excel is just right for most people out there (including me when I feel lazy).

Configure an Oracle Client and add the ODBC data source to the EM repository:

Open Excel, go to “Data” – “Connections” and add a new connection:
- Search…
- New Source
- DSN ODBC
Select your new ODBC data source, user, password
Uncheck “Connection to a specific table”
Give a name and click Finish
On the DSN -> Properties -> Definition, enter the SQL text I have provided previously

The result should be something similar: ( but much longer :-))

Pivoting the results

Create e new sheet and name it “pivot”, Click on “Create Pivot Table”, select your data and your dimensions:

The result:

Creating the Graph

Now that the data is correctly formatted, it’s easyy to add a graph:

just select the entire pivot table and create a new stacked area graph.

The result will be similar to this:

With such graph, it is easy to spot which databases consumed most CPU on the system in a defined period, and to track the progress if you start a “performance campaign”.

For example, you can see that the “green” and “red” databases were consuming constantly some CPU up to 17.05.2017 and then some magic solved the CPU problem for those instances.

It is also quite convenient for checking the results of new instance caging settings…

The resulting CPU will not necessarily be 100%: the SYS CPU time is not included, as well as the user CPU of all the other processes that are either not DB or not monitored with Enterprise Manager.

HTH

—

Ludovico

Another problem with “KSV master wait” and “ASM file metadata operation”

Posted on March 24, 2017 by Ludovico

My customer today tried to do a duplicate on a cluster. When preparing the auxiliary instance, she noticed that the startup nomount was hanging forever: Nothing in the alert, nothing in the trace files.

Because the database and the spfile were stored inside ASM, I’ve been quite suspicious…

The ASM trace files had the following entries:

kfgbDiscoverNow: called for group 1/0x9f5bfe53 (ACFS)

*** 2017-03-24 12:42:13.327
2017-03-24 12:42:13.327: [    GPNP]clsgpnp_dbmsGetItem_profile: [at clsgpnp_dbms.c:345] Result: (0) CLSGPNP_OK. (:GPNP00401:)got ASM-Profile.DiscoveryString='/dev/mapper/asm_*,/dev/asm_*'

*** 2017-03-24 12:42:15.386
kfgbTryFn: failed to acquire DG.1.3 for kfgbRefreshNow (of group 1/0x9f5bfe53)

*** 2017-03-24 12:42:18.387
kfgbTryFn: failed to acquire DG.1.3 for kfgbRefreshNow (of group 1/0x9f5bfe53)

*** 2017-03-24 12:42:21.393
kfgbTryFn: failed to acquire DG.1.3 for kfgbRefreshNow (of group 1/0x9f5bfe53)

*** 2017-03-24 12:42:24.398
kfgbTryFn: failed to acquire DG.1.3 for kfgbRefreshNow (of group 1/0x9f5bfe53)

*** 2017-03-24 12:42:27.403
kfgbTryFn: failed to acquire DG.1.3 for kfgbRefreshNow (of group 1/0x9f5bfe53)

kfgbDiscoverNow: called for group 1/0x9f5bfe53 (ACFS)

*** 2017-03-24 12:42:13.327

2017-03-24 12:42:13.327: [ GPNP]clsgpnp_dbmsGetItem_profile: [at clsgpnp_dbms.c:345] Result: (0) CLSGPNP_OK. (:GPNP00401:)got ASM-Profile.DiscoveryString='/dev/mapper/asm_*,/dev/asm_*'

*** 2017-03-24 12:42:15.386

kfgbTryFn: failed to acquire DG.1.3 for kfgbRefreshNow (of group 1/0x9f5bfe53)

*** 2017-03-24 12:42:18.387

kfgbTryFn: failed to acquire DG.1.3 for kfgbRefreshNow (of group 1/0x9f5bfe53)

*** 2017-03-24 12:42:21.393

kfgbTryFn: failed to acquire DG.1.3 for kfgbRefreshNow (of group 1/0x9f5bfe53)

*** 2017-03-24 12:42:24.398

kfgbTryFn: failed to acquire DG.1.3 for kfgbRefreshNow (of group 1/0x9f5bfe53)

*** 2017-03-24 12:42:27.403

kfgbTryFn: failed to acquire DG.1.3 for kfgbRefreshNow (of group 1/0x9f5bfe53)

The ASM instance had the following sessions waiting:

SQL>  select inst_id, sid, serial#, status, event, wait_class, wait_time, logon_time , program, machine from gv$session where wait_class!='Idle' order by sid;

INST_ID  SID SERIAL# STATUS  EVENT                        WAIT_CLASS WAIT_TIME LOGON_TIME          PROGRAM                             MACHINE
------- ---- ------- ------- ---------------------------- ---------- --------- ------------------- ----------------------------------- --------
      2   36   41916 ACTIVE  ASM file metadata operation  Other              0 24.03.2017 13:47:28 oracle@clusrv02 (O001)              clusrv02
      2  266   64885 ACTIVE  KSV master wait              Other              0 24.03.2017 13:47:25 oracletorcl01v@clusrv02 (TNS V1-V3) clusrv02
      1  483   63446 ACTIVE  KSV master wait              Other              0 24.03.2017 13:31:14 oracletorcl01v@clusrv01 (TNS V1-V3) clusrv01
      1  497   31202 ACTIVE  ASM file metadata operation  Other              0 24.03.2017 13:39:07 oracletorcl01v@clusrv01 (TNS V1-V3) clusrv01
      3  708     484 ACTIVE  ASM file metadata operation  Other              0 24.03.2017 12:38:56 OMS                                 omssrv01

SQL> select inst_id, sid, serial#, status, event, wait_class, wait_time, logon_time , program, machine from gv$session where wait_class!='Idle' order by sid;

INST_ID SID SERIAL# STATUS EVENT WAIT_CLASS WAIT_TIME LOGON_TIME PROGRAM MACHINE

------- ---- ------- ------- ---------------------------- ---------- --------- ------------------- ----------------------------------- --------

2 36 41916 ACTIVE ASM file metadata operation Other 0 24.03.2017 13:47:28 oracle@clusrv02 (O001) clusrv02

2 266 64885 ACTIVE KSV master wait Other 0 24.03.2017 13:47:25 oracletorcl01v@clusrv02 (TNS V1-V3) clusrv02

1 483 63446 ACTIVE KSV master wait Other 0 24.03.2017 13:31:14 oracletorcl01v@clusrv01 (TNS V1-V3) clusrv01

1 497 31202 ACTIVE ASM file metadata operation Other 0 24.03.2017 13:39:07 oracletorcl01v@clusrv01 (TNS V1-V3) clusrv01

3 708 484 ACTIVE ASM file metadata operation Other 0 24.03.2017 12:38:56 OMS omssrv01

OMS?

Around 12:38:56, another colleague in the office added a disk to one of the disk groups, through Enterprise Manager 12c!

But there were no rebalance operations:

SQL> select * from gv$asm_operation;

no rows selected

SQL> select * from gv$asm_operation;

no rows selected

It’s not the first time that I hit this type of problems. Sadly, sometimes it requires a full restart of the cluster or of ASM (because of different bugs).

This time, however, I have tried to kill only the foreground sessions waiting on “ASM file metadata operation”, starting with the one coming from the OMS.

Surprisingly, after killing that session, everything was fine again:

-- on +ASM3
SQL> alter system kill session '708,484';

System altered.

SQL>

SQL>  select inst_id, sid, serial#, status, event, wait_class, wait_time, logon_time , program, machine from gv$session where wait_class!='Idle' order by sid;

no rows selected

SQL>

-- on +ASM3

SQL> alter system kill session '708,484';

System altered.

SQL>

SQL> select inst_id, sid, serial#, status, event, wait_class, wait_time, logon_time , program, machine from gv$session where wait_class!='Idle' order by sid;

no rows selected

SQL>

I never add disks via OMS (I’m a sqlplus guy ;-)) , I wonder what went wrong with it 🙂

—

Ludovico

RMAN Catalog Housekeeping: how to purge the old incarnations

Posted on February 21, 2017 by Ludovico

First, let me apologize because every post in my blog starts with a disclaimer… but sometimes it is really necessary. 😉

Disclaimer: this blog post contains PL/SQL code that deletes incarnations from your RMAN recovery catalog. Please DON’T use it unless you deeply understand what you are doing, as it can compromise your backup and recovery strategy.

Small introduction

You may have a central RMAN catalog that stores all the backup metadata for your databases. If it is the case, you will have a database entry for each of your databases and a new incarnation entry for each duplicate, incomplete recovery or flashback (or whatever).

You should also have a delete strategy that deletes the obsolete backups from either your DISK or SBT_TAPE media. If you have old incarnations, however, after some time you will notice that their information never goes away from your catalog, and you may end up soon or later to do some housekeeping. But there is nothing more tedious than checking and deleting the incarnations one by one, especially if you have average big numbers like this catalog:

SQL> SELECT count(*) FROM db;

  COUNT(*)
----------
      1843

SQL> SELECT count(*) FROM dbinc;

  COUNT(*)
----------
      3870

SQL> SELECT count(*) FROM bdf;

  COUNT(*)
----------
   4130959

SQL> SELECT count(*) FROM brl;


  COUNT(*)
----------
  14876291

SQL> SELECT count(*) FROM db;

COUNT(*)

----------

1843

SQL> SELECT count(*) FROM dbinc;

COUNT(*)

----------

3870

SQL> SELECT count(*) FROM bdf;

COUNT(*)

----------

4130959

SQL> SELECT count(*) FROM brl;

COUNT(*)

----------

14876291

Where db, dbinc, bdf and brl contain reslectively the registered databases, incarnations, datafile backups and archivelog backups.

Different incarnations?

Consider the following query:

col dbinc_key_ for a60
set pages 100 lines 200
SELECT lpad(' ',2*(level-1))
  || TO_CHAR(DBINC_KEY) AS DBINC_KEY_,
  db_key,
  db_name,
  TO_CHAR(reset_time,'YYYY-MM-DD HH24:MI:SS'),
  dbinc_status
FROM rman.dbinc
  START WITH PARENT_DBINC_KEY IS NULL
  CONNECT BY prior DBINC_KEY   = PARENT_DBINC_KEY
ORDER BY db_name , db_key, level
;

col dbinc_key_ for a60

set pages 100 lines 200

SELECT lpad(' ',2*(level-1))

|| TO_CHAR(DBINC_KEY) AS DBINC_KEY_,

db_key,

db_name,

TO_CHAR(reset_time,'YYYY-MM-DD HH24:MI:SS'),

dbinc_status

FROM rman.dbinc

START WITH PARENT_DBINC_KEY IS NULL

CONNECT BY prior DBINC_KEY = PARENT_DBINC_KEY

ORDER BY db_name , db_key, level

;

You can run it safely: it returns the list of incarnations hierarchically connected to their parent, by database name, key and level.

Then you have several types of behaviors:

Normal databases (created once, never restored or flashed back) will have just one or two incarnations (it depends on how they are created):

DBINC_KEY                      DB_KEY DB_NAME  TO_CHAR(RESET_TIME, DBINC_ST
-------------------------- ---------- -------- ------------------- --------
104547357                   104546534 VxxxxxxP 2010-09-05 05:49:10 PARENT
  104546535                 104546534 VxxxxxxP 2012-01-18 09:31:01 CURRENT

DBINC_KEY DB_KEY DB_NAME TO_CHAR(RESET_TIME, DBINC_ST

-------------------------- ---------- -------- ------------------- --------

104547357 104546534 VxxxxxxP 2010-09-05 05:49:10 PARENT

104546535 104546534 VxxxxxxP 2012-01-18 09:31:01 CURRENT

They are usually the ones that you may want to keep in your catalog, unless the database no longer exist: in this case perhaps you omitted the deletion from the catalog when you have dropped your database?

Flashed back databases (flashed back multiple times) will have as many incarnations as the number of flashbacks, but all connected with the incarnation prior to the flashback:

DBINC_KEY                                                        DB_KEY DB_NAME  TO_CHAR(RESET_TIME, DBINC_ST
------------------------------------------------------------ ---------- -------- ------------------- --------
1164696351                                                   1164696336 VxxxxxxD 2014-07-07 05:38:47 PARENT
  1164696337                                                 1164696336 VxxxxxxD 2014-12-10 07:39:14 PARENT
    1328815631                                               1164696336 VxxxxxxD 2016-05-12 08:22:23 PARENT
      1329299866                                             1164696336 VxxxxxxD 2016-05-13 13:02:35 PARENT
        1329299867                                           1164696336 VxxxxxxD 2016-05-13 14:05:53 PARENT
          1329299833                                         1164696336 VxxxxxxD 2016-05-13 18:26:59 PARENT
            1331239226                                       1164696336 VxxxxxxD 2016-05-17 08:09:04 PARENT
              1331395102                                     1164696336 VxxxxxxD 2016-05-17 13:20:17 PARENT
                1331815030                                   1164696336 VxxxxxxD 2016-05-18 07:32:13 PARENT
                  1331814966                                 1164696336 VxxxxxxD 2016-05-18 10:57:45 PARENT
                    1387023006                               1164696336 VxxxxxxD 2016-07-13 09:33:05 PARENT
                      1407484366                             1164696336 VxxxxxxD 2016-09-09 13:25:31 PARENT
                        1419007163                           1164696336 VxxxxxxD 2016-10-17 14:32:59 PARENT
                          1436430179                         1164696336 VxxxxxxD 2016-12-12 15:13:55 PARENT
                            1436430034                       1164696336 VxxxxxxD 2016-12-12 16:28:57 PARENT
                              1437118959                     1164696336 VxxxxxxD 2016-12-14 14:48:53 PARENT
                                1437365509                   1164696336 VxxxxxxD 2016-12-15 09:45:00 PARENT
                                  1437365456                 1164696336 VxxxxxxD 2016-12-15 11:11:06 PARENT
                                    1437484026               1164696336 VxxxxxxD 2016-12-15 13:26:37 PARENT
                                      1437483983             1164696336 VxxxxxxD 2016-12-15 17:21:11 PARENT
                                        1437822754           1164696336 VxxxxxxD 2016-12-16 12:07:46 CURRENT

DBINC_KEY DB_KEY DB_NAME TO_CHAR(RESET_TIME, DBINC_ST

------------------------------------------------------------ ---------- -------- ------------------- --------

1164696351 1164696336 VxxxxxxD 2014-07-07 05:38:47 PARENT

1164696337 1164696336 VxxxxxxD 2014-12-10 07:39:14 PARENT

1328815631 1164696336 VxxxxxxD 2016-05-12 08:22:23 PARENT

1329299866 1164696336 VxxxxxxD 2016-05-13 13:02:35 PARENT

1329299867 1164696336 VxxxxxxD 2016-05-13 14:05:53 PARENT

1329299833 1164696336 VxxxxxxD 2016-05-13 18:26:59 PARENT

1331239226 1164696336 VxxxxxxD 2016-05-17 08:09:04 PARENT

1331395102 1164696336 VxxxxxxD 2016-05-17 13:20:17 PARENT

1331815030 1164696336 VxxxxxxD 2016-05-18 07:32:13 PARENT

1331814966 1164696336 VxxxxxxD 2016-05-18 10:57:45 PARENT

1387023006 1164696336 VxxxxxxD 2016-07-13 09:33:05 PARENT

1407484366 1164696336 VxxxxxxD 2016-09-09 13:25:31 PARENT

1419007163 1164696336 VxxxxxxD 2016-10-17 14:32:59 PARENT

1436430179 1164696336 VxxxxxxD 2016-12-12 15:13:55 PARENT

1436430034 1164696336 VxxxxxxD 2016-12-12 16:28:57 PARENT

1437118959 1164696336 VxxxxxxD 2016-12-14 14:48:53 PARENT

1437365509 1164696336 VxxxxxxD 2016-12-15 09:45:00 PARENT

1437365456 1164696336 VxxxxxxD 2016-12-15 11:11:06 PARENT

1437484026 1164696336 VxxxxxxD 2016-12-15 13:26:37 PARENT

1437483983 1164696336 VxxxxxxD 2016-12-15 17:21:11 PARENT

1437822754 1164696336 VxxxxxxD 2016-12-16 12:07:46 CURRENT

Here, despite you have several incarnations, they all belong to the same database (same DB_KEY and DBID), then you must also keep it inside the recovery catalog.

Non-production databases that are frequently refreshed from the production database (via duplicate) will have several incarnations with different DBIDs and DB_KEY:

DBINC_KEY                   DB_KEY DB_NAME  TO_CHAR(RESET_TIME, DBINC_ST
----------------------- ---------- -------- ------------------- --------
1173852671              1173852633 VxxxxxxV 2014-07-07 05:38:47 PARENT
  1173852635            1173852633 VxxxxxxV 2015-01-16 07:29:01 PARENT
    1188550385          1173852633 VxxxxxxV 2015-03-16 16:06:00 CURRENT
1220896058              1220896027 VxxxxxxV 2015-02-27 16:25:13 PARENT
  1220896028            1220896027 VxxxxxxV 2015-07-17 08:06:00 CURRENT
1233975755              1233975724 VxxxxxxV 2015-02-27 16:25:13 PARENT
  1233975725            1233975724 VxxxxxxV 2015-09-10 11:23:53 CURRENT
1244346785              1244346754 VxxxxxxV 2015-02-27 16:25:13 PARENT
  1244346755            1244346754 VxxxxxxV 2015-10-23 07:46:34 CURRENT
1281775847              1281775816 VxxxxxxV 2015-02-27 16:25:13 PARENT
  1281775817            1281775816 VxxxxxxV 2016-02-08 09:44:03 CURRENT
1317447322              1317447257 VxxxxxxV 2015-02-27 16:25:13 PARENT
  1317447258            1317447257 VxxxxxxV 2016-04-07 12:20:56 CURRENT
1323527351              1323527316 VxxxxxxV 2015-02-27 16:25:13 PARENT
  1323527317            1323527316 VxxxxxxV 2016-04-29 10:09:12 CURRENT
1437346753              1437346718 VxxxxxxV 2015-02-27 16:25:13 PARENT
  1437346719            1437346718 VxxxxxxV 2016-12-12 13:33:31 CURRENT

DBINC_KEY DB_KEY DB_NAME TO_CHAR(RESET_TIME, DBINC_ST

----------------------- ---------- -------- ------------------- --------

1173852671 1173852633 VxxxxxxV 2014-07-07 05:38:47 PARENT

1173852635 1173852633 VxxxxxxV 2015-01-16 07:29:01 PARENT

1188550385 1173852633 VxxxxxxV 2015-03-16 16:06:00 CURRENT

1220896058 1220896027 VxxxxxxV 2015-02-27 16:25:13 PARENT

1220896028 1220896027 VxxxxxxV 2015-07-17 08:06:00 CURRENT

1233975755 1233975724 VxxxxxxV 2015-02-27 16:25:13 PARENT

1233975725 1233975724 VxxxxxxV 2015-09-10 11:23:53 CURRENT

1244346785 1244346754 VxxxxxxV 2015-02-27 16:25:13 PARENT

1244346755 1244346754 VxxxxxxV 2015-10-23 07:46:34 CURRENT

1281775847 1281775816 VxxxxxxV 2015-02-27 16:25:13 PARENT

1281775817 1281775816 VxxxxxxV 2016-02-08 09:44:03 CURRENT

1317447322 1317447257 VxxxxxxV 2015-02-27 16:25:13 PARENT

1317447258 1317447257 VxxxxxxV 2016-04-07 12:20:56 CURRENT

1323527351 1323527316 VxxxxxxV 2015-02-27 16:25:13 PARENT

1323527317 1323527316 VxxxxxxV 2016-04-29 10:09:12 CURRENT

1437346753 1437346718 VxxxxxxV 2015-02-27 16:25:13 PARENT

1437346719 1437346718 VxxxxxxV 2016-12-12 13:33:31 CURRENT

This is usually the most frequent case: here you want to delete the old incarnations, but only as far as there are no backups attached to them that are still in the recovery window.

You may also have orphaned incarnations:

DBINC_KEY                                                        DB_KEY DB_NAME  TO_CHAR(RESET_TIME, DBINC_ST
------------------------------------------------------------ ---------- -------- ------------------- --------
1262827482                                                   1262827435 TxxxxxxT 2014-07-07 05:38:47 PARENT
  1262827436                                                 1262827435 TxxxxxxT 2016-01-05 12:15:22 PARENT
    1267262014                                               1262827435 TxxxxxxT 2016-01-19 09:15:47 PARENT
      1267290962                                             1262827435 TxxxxxxT 2016-01-19 11:09:05 PARENT
        1284933855                                           1262827435 TxxxxxxT 2016-02-11 11:18:52 PARENT
          1299685647                                         1262827435 TxxxxxxT 2016-02-23 13:40:18 ORPHAN
          1299837528                                         1262827435 TxxxxxxT 2016-02-23 17:08:36 CURRENT
          1299767977                                         1262827435 TxxxxxxT 2016-02-23 15:34:13 ORPHAN
          1298269640                                         1262827435 TxxxxxxT 2016-02-22 13:16:46 ORPHAN
            1299517249                                       1262827435 TxxxxxxT 2016-02-23 10:37:29 ORPHAN

DBINC_KEY DB_KEY DB_NAME TO_CHAR(RESET_TIME, DBINC_ST

------------------------------------------------------------ ---------- -------- ------------------- --------

1262827482 1262827435 TxxxxxxT 2014-07-07 05:38:47 PARENT

1262827436 1262827435 TxxxxxxT 2016-01-05 12:15:22 PARENT

1267262014 1262827435 TxxxxxxT 2016-01-19 09:15:47 PARENT

1267290962 1262827435 TxxxxxxT 2016-01-19 11:09:05 PARENT

1284933855 1262827435 TxxxxxxT 2016-02-11 11:18:52 PARENT

1299685647 1262827435 TxxxxxxT 2016-02-23 13:40:18 ORPHAN

1299837528 1262827435 TxxxxxxT 2016-02-23 17:08:36 CURRENT

1299767977 1262827435 TxxxxxxT 2016-02-23 15:34:13 ORPHAN

1298269640 1262827435 TxxxxxxT 2016-02-22 13:16:46 ORPHAN

1299517249 1262827435 TxxxxxxT 2016-02-23 10:37:29 ORPHAN

In this case, again, it depends whether the DBID and DB_KEY are the same as the current incarnation or not.

What do you need to delete?

Basically:

Incarnations of databases that no longer exist
Incarnations of existing databases where the database has a more recent current incarnation, only if there are no backups still in the retention window

How to do it?

In order to be sure 100% that you can delete an incarnation, you have to verify that there are no recent backups (for instance, no backups more rercent than the current recovery window for that database). If the database does not have a specified recovery window but rather a default “CONFIGURE RETENTION POLICY TO REDUNDANCY 1; # default”, it is a bit more problematic… in this case let’s assume that we consider “old” an incarnation that does not backup since 1 year (365 days), ok?

Getting the last backup of each database

Sadly, there is not a single table where you can verify that. You have to collect the information from several tables. I think bdf, al, cdf, bs would suffice in most cases.

When you delete an incarnation you specify a db_key: you have to get the last backup for each db_key, with queries like this:

SELECT dbinc_key
     ,max(completion_time) max_al_time
  FROM al
    GROUP by dbinc_key;

SELECT dbinc_key

,max(completion_time) max_al_time

FROM al

GROUP by dbinc_key;

Putting together all the tables:

WITH
   incs AS (
      SELECT lpad(' ',2*(level-1))|| to_char(dbinc_key) AS dbinc_key_
	     ,dbinc_key
         ,db_key
	     ,db_name
	     ,reset_time
	     ,dbinc_status
      FROM rman.dbinc
        START WITH parent_dbinc_key IS NULL
      CONNECT BY PRIOR dbinc_key   = parent_dbinc_key
        ORDER BY db_name, db_key, level
    ),
   mbdf AS (
      SELECT dbinc_key
	     ,max(completion_time) max_bdf_time
	  FROM bdf
	     GROUP by dbinc_key
   ),
   mbrl AS (
      SELECT dbinc_key
	     ,max(next_time) max_brl_time
	  FROM brl
	     GROUP by dbinc_key
   ),
   mal AS (
      SELECT dbinc_key
	     ,max(completion_time) max_al_time
	  FROM al
	     GROUP by dbinc_key
   ),
   mcdf AS (
      SELECT dbinc_key
	     ,max(completion_time) max_cdf_time
	  FROM cdf
	     GROUP by dbinc_key
   ),
   mbs AS (
      SELECT db_key
	     ,max(completion_time) max_bs_time
	  FROM bs
	     GROUP by db_key
   )
SELECT incs.db_key, db.db_id, db.REG_DB_UNIQUE_NAME AS db_uq_name , incs.db_name, dbinc_status
  ,greatest(
     nvl(max_bdf_time,to_date('1970-01-01','YYYY-MM-DD')),
	 nvl(max_brl_time,to_date('1970-01-01','YYYY-MM-DD')),
	 nvl(max_al_time,to_date('1970-01-01','YYYY-MM-DD')),
	 nvl(max_cdf_time,to_date('1970-01-01','YYYY-MM-DD')),
	 nvl(max_bs_time,to_date('1970-01-01','YYYY-MM-DD'))
	 ) AS last_bck
FROM incs
  JOIN db ON (db.db_key=incs.db_key)
  LEFT OUTER JOIN mbdf ON (incs.dbinc_key=mbdf.dbinc_key)
  LEFT OUTER JOIN mcdf ON (incs.dbinc_key=mcdf.dbinc_key)
  LEFT OUTER JOIN mbrl ON (incs.dbinc_key=mbrl.dbinc_key)
  LEFT OUTER JOIN mal ON (incs.dbinc_key=mal.dbinc_key)
  LEFT OUTER JOIN mbs ON (incs.db_key=mbs.db_key)
;

WITH

incs AS (

SELECT lpad(' ',2*(level-1))|| to_char(dbinc_key) AS dbinc_key_

,dbinc_key

,db_key

,db_name

,reset_time

,dbinc_status

FROM rman.dbinc

START WITH parent_dbinc_key IS NULL

CONNECT BY PRIOR dbinc_key = parent_dbinc_key

ORDER BY db_name, db_key, level

mbdf AS (

SELECT dbinc_key

,max(completion_time) max_bdf_time

FROM bdf

GROUP by dbinc_key

mbrl AS (

SELECT dbinc_key

,max(next_time) max_brl_time

FROM brl

GROUP by dbinc_key

mal AS (

SELECT dbinc_key

,max(completion_time) max_al_time

FROM al

GROUP by dbinc_key

mcdf AS (

SELECT dbinc_key

,max(completion_time) max_cdf_time

FROM cdf

GROUP by dbinc_key

mbs AS (

SELECT db_key

,max(completion_time) max_bs_time

FROM bs

GROUP by db_key

)

SELECT incs.db_key, db.db_id, db.REG_DB_UNIQUE_NAME AS db_uq_name , incs.db_name, dbinc_status

,greatest(

nvl(max_bdf_time,to_date('1970-01-01','YYYY-MM-DD')),

nvl(max_brl_time,to_date('1970-01-01','YYYY-MM-DD')),

nvl(max_al_time,to_date('1970-01-01','YYYY-MM-DD')),

nvl(max_cdf_time,to_date('1970-01-01','YYYY-MM-DD')),

nvl(max_bs_time,to_date('1970-01-01','YYYY-MM-DD'))

) AS last_bck

FROM incs

JOIN db ON (db.db_key=incs.db_key)

LEFT OUTER JOIN mbdf ON (incs.dbinc_key=mbdf.dbinc_key)

LEFT OUTER JOIN mcdf ON (incs.dbinc_key=mcdf.dbinc_key)

LEFT OUTER JOIN mbrl ON (incs.dbinc_key=mbrl.dbinc_key)

LEFT OUTER JOIN mal ON (incs.dbinc_key=mal.dbinc_key)

LEFT OUTER JOIN mbs ON (incs.db_key=mbs.db_key)

;

Getting the recovery window

The configuration information for each database is stored inside the conf table, but the retention information is stored in a VARCHAR2, either ‘TO RECOVERY WINDOW OF % DAYS’ or ‘TO REDUNDANCY %’

You need to convert it to a number when the retention policy is recovery windows, otherwise you default it to 365 days wher the redundancy is used. You can add a column and a join to the query:

-- new column in the projection
,nvl(to_number(substr(c.value,23,length(c.value)-27)),365) as days

-- new join in the "from"
LEFT OUTER JOIN conf c ON (c.db_key=incs.db_key AND c.NAME = 'RETENTION POLICY' AND value LIKE 'TO RECOVERY WINDOW OF %')

-- new column in the projection

,nvl(to_number(substr(c.value,23,length(c.value)-27)),365) as days

-- new join in the "from"

LEFT OUTER JOIN conf c ON (c.db_key=incs.db_key AND c.NAME = 'RETENTION POLICY' AND value LIKE 'TO RECOVERY WINDOW OF %')

and eventually, either display if it the incarnation is no more used or filter by usage:

-- display if the incarnation is still used
,CASE WHEN
     greatest(
     nvl(max_bdf_time,to_date('1970-01-01','YYYY-MM-DD')),
	 nvl(max_brl_time,to_date('1970-01-01','YYYY-MM-DD')),
	 nvl(max_al_time,to_date('1970-01-01','YYYY-MM-DD')),
	 nvl(max_cdf_time,to_date('1970-01-01','YYYY-MM-DD')),
	 nvl(max_bs_time,to_date('1970-01-01','YYYY-MM-DD'))
	 ) < (sysdate - nvl(to_number(substr(c.value,23,length(c.value)-27)),365))
	 THEN 'OLD ONE!'
	 ELSE 'USED'
  END AS USED

-- or filter it
WHERE greatest(
     nvl(max_bdf_time,to_date('1970-01-01','YYYY-MM-DD')),
	 nvl(max_brl_time,to_date('1970-01-01','YYYY-MM-DD')),
	 nvl(max_al_time,to_date('1970-01-01','YYYY-MM-DD')),
	 nvl(max_cdf_time,to_date('1970-01-01','YYYY-MM-DD')),
	 nvl(max_bs_time,to_date('1970-01-01','YYYY-MM-DD'))
	 ) < (sysdate - nvl(to_number(substr(c.value,23,length(c.value)-27)),365))

-- display if the incarnation is still used

,CASE WHEN

greatest(

nvl(max_bdf_time,to_date('1970-01-01','YYYY-MM-DD')),

nvl(max_brl_time,to_date('1970-01-01','YYYY-MM-DD')),

nvl(max_al_time,to_date('1970-01-01','YYYY-MM-DD')),

nvl(max_cdf_time,to_date('1970-01-01','YYYY-MM-DD')),

nvl(max_bs_time,to_date('1970-01-01','YYYY-MM-DD'))

) < (sysdate - nvl(to_number(substr(c.value,23,length(c.value)-27)),365))

THEN 'OLD ONE!'

ELSE 'USED'

END AS USED

-- or filter it

WHERE greatest(

nvl(max_bdf_time,to_date('1970-01-01','YYYY-MM-DD')),

nvl(max_brl_time,to_date('1970-01-01','YYYY-MM-DD')),

nvl(max_al_time,to_date('1970-01-01','YYYY-MM-DD')),

nvl(max_cdf_time,to_date('1970-01-01','YYYY-MM-DD')),

nvl(max_bs_time,to_date('1970-01-01','YYYY-MM-DD'))

) < (sysdate - nvl(to_number(substr(c.value,23,length(c.value)-27)),365))

Delete the incarnations!

You can delete the incarnations with this procedure:

BEGIN
  dbms_rcvcat.unregisterdatabase(DB_KEY=>:db_key, DB_ID=>:db_id);
END;

BEGIN

dbms_rcvcat.unregisterdatabase(DB_KEY=>:db_key, DB_ID=>:db_id);

END;

This procedure will raise an exception (-20001, ‘Database not found’) when a database does not exist anymore (either already deleted by this procedure or by another session), so you need to handle it.

Putting all together:

col db_uq_name for a12
col dbinc_key_ for a30
set pages 100 lines 200
set serveroutput on
DECLARE

  e_dbatabase_not_found EXCEPTION;
  PRAGMA EXCEPTION_INIT (e_dbatabase_not_found, -20001);

  CURSOR c_old_incarnations IS
  WITH
   incs AS (
      SELECT lpad(' ',2*(level-1))|| to_char(dbinc_key) AS dbinc_key_
             ,dbinc_key
         ,db_key
             ,db_name
             ,reset_time
             ,dbinc_status
      FROM rman.dbinc
        START WITH parent_dbinc_key IS NULL
      CONNECT BY PRIOR dbinc_key   = parent_dbinc_key
        ORDER BY db_name, db_key, level
    ),
   mbdf AS (
      SELECT dbinc_key
             ,max(completion_time) max_bdf_time
          FROM bdf
             GROUP by dbinc_key
   ),
   mbrl AS (
      SELECT dbinc_key
             ,max(next_time) max_brl_time
          FROM brl
             GROUP by dbinc_key
   ),
   mal AS (
      SELECT dbinc_key
             ,max(completion_time) max_al_time
          FROM al
             GROUP by dbinc_key
   ),
   mcdf AS (
      SELECT dbinc_key
             ,max(completion_time) max_cdf_time
          FROM cdf
             GROUP by dbinc_key
   ),
   mbs AS (
      SELECT db_key
             ,max(completion_time) max_bs_time
          FROM bs
             GROUP by db_key
   )
  SELECT distinct incs.db_key, db.db_id, db.REG_DB_UNIQUE_NAME AS db_uq_name , incs.db_name
  ,greatest(
     nvl(max_bdf_time,to_date('1970-01-01','YYYY-MM-DD')),
         nvl(max_brl_time,to_date('1970-01-01','YYYY-MM-DD')),
         nvl(max_al_time,to_date('1970-01-01','YYYY-MM-DD')),
         nvl(max_cdf_time,to_date('1970-01-01','YYYY-MM-DD')),
         nvl(max_bs_time,to_date('1970-01-01','YYYY-MM-DD'))
         ) AS last_bck
  ,CASE WHEN
     greatest(
     nvl(max_bdf_time,to_date('1970-01-01','YYYY-MM-DD')),
         nvl(max_brl_time,to_date('1970-01-01','YYYY-MM-DD')),
         nvl(max_al_time,to_date('1970-01-01','YYYY-MM-DD')),
         nvl(max_cdf_time,to_date('1970-01-01','YYYY-MM-DD')),
         nvl(max_bs_time,to_date('1970-01-01','YYYY-MM-DD'))
         ) < (sysdate - nvl(to_number(substr(c.value,23,length(c.value)-27)),365))
         THEN 'OLD ONE!'
         ELSE 'USED'
  END AS USED
FROM incs
  JOIN db ON (db.db_key=incs.db_key)
  LEFT OUTER JOIN mbdf ON (incs.dbinc_key=mbdf.dbinc_key)
  LEFT OUTER JOIN mcdf ON (incs.dbinc_key=mcdf.dbinc_key)
  LEFT OUTER JOIN mbrl ON (incs.dbinc_key=mbrl.dbinc_key)
  LEFT OUTER JOIN mal ON (incs.dbinc_key=mal.dbinc_key)
  LEFT OUTER JOIN mbs ON (incs.db_key=mbs.db_key)
  LEFT OUTER JOIN conf c ON (c.db_key=incs.db_key AND c.NAME = 'RETENTION POLICY' AND value LIKE 'TO RECOVERY WINDOW OF %')
 WHERE 1=1
 AND greatest(
     nvl(max_bdf_time,to_date('1970-01-01','YYYY-MM-DD')),
         nvl(max_brl_time,to_date('1970-01-01','YYYY-MM-DD')),
         nvl(max_al_time ,to_date('1970-01-01','YYYY-MM-DD')),
         nvl(max_cdf_time,to_date('1970-01-01','YYYY-MM-DD')),
         nvl(max_bs_time,to_date('1970-01-01','YYYY-MM-DD'))
         ) < (sysdate - nvl(to_number(substr(c.value,23,length(c.value)-27)),365))
  order by 4,3, 5
  ;

   r_old_incarnation    c_old_incarnations%ROWTYPE;

   BEGIN
        OPEN c_old_incarnations;
        LOOP
                FETCH c_old_incarnations INTO r_old_incarnation;
                EXIT WHEN  c_old_incarnations%NOTFOUND;

                dbms_output.put('Purging db: ' || r_old_incarnation.db_name);
                dbms_output.put('       IncKey: ' || r_old_incarnation.db_key);
                dbms_output.put('       DBID: ' || r_old_incarnation.db_id);
                dbms_output.put_line('  Last BCK: ' || to_char(r_old_incarnation.last_bck,'YYYY-MM-DD'));
                BEGIN
                   dbms_rcvcat.unregisterdatabase(DB_KEY => r_old_incarnation.db_key, DB_ID => r_old_incarnation.db_id);
                EXCEPTION
                    WHEN e_dbatabase_not_found THEN
                    dbms_output.put_line('Database already unregistered');
                END;
        END LOOP;

        CLOSE c_old_incarnations;
	
END;
/

100

101

102

103

104

105

106

107

108

109

110

111

112

113

114

115

col db_uq_name for a12

col dbinc_key_ for a30

set pages 100 lines 200

set serveroutput on

DECLARE

e_dbatabase_not_found EXCEPTION;

PRAGMA EXCEPTION_INIT (e_dbatabase_not_found, -20001);

CURSOR c_old_incarnations IS

WITH

incs AS (

SELECT lpad(' ',2*(level-1))|| to_char(dbinc_key) AS dbinc_key_

,dbinc_key

,db_key

,db_name

,reset_time

,dbinc_status

FROM rman.dbinc

START WITH parent_dbinc_key IS NULL

CONNECT BY PRIOR dbinc_key = parent_dbinc_key

ORDER BY db_name, db_key, level

mbdf AS (

SELECT dbinc_key

,max(completion_time) max_bdf_time

FROM bdf

GROUP by dbinc_key

mbrl AS (

SELECT dbinc_key

,max(next_time) max_brl_time

FROM brl

GROUP by dbinc_key

mal AS (

SELECT dbinc_key

,max(completion_time) max_al_time

FROM al

GROUP by dbinc_key

mcdf AS (

SELECT dbinc_key

,max(completion_time) max_cdf_time

FROM cdf

GROUP by dbinc_key

mbs AS (

SELECT db_key

,max(completion_time) max_bs_time

FROM bs

GROUP by db_key

)

SELECT distinct incs.db_key, db.db_id, db.REG_DB_UNIQUE_NAME AS db_uq_name , incs.db_name

,greatest(

nvl(max_bdf_time,to_date('1970-01-01','YYYY-MM-DD')),

nvl(max_brl_time,to_date('1970-01-01','YYYY-MM-DD')),

nvl(max_al_time,to_date('1970-01-01','YYYY-MM-DD')),

nvl(max_cdf_time,to_date('1970-01-01','YYYY-MM-DD')),

nvl(max_bs_time,to_date('1970-01-01','YYYY-MM-DD'))

) AS last_bck

,CASE WHEN

greatest(

nvl(max_bdf_time,to_date('1970-01-01','YYYY-MM-DD')),

nvl(max_brl_time,to_date('1970-01-01','YYYY-MM-DD')),

nvl(max_al_time,to_date('1970-01-01','YYYY-MM-DD')),

nvl(max_cdf_time,to_date('1970-01-01','YYYY-MM-DD')),

nvl(max_bs_time,to_date('1970-01-01','YYYY-MM-DD'))

) < (sysdate - nvl(to_number(substr(c.value,23,length(c.value)-27)),365))

THEN 'OLD ONE!'

ELSE 'USED'

END AS USED

FROM incs

JOIN db ON (db.db_key=incs.db_key)

LEFT OUTER JOIN mbdf ON (incs.dbinc_key=mbdf.dbinc_key)

LEFT OUTER JOIN mcdf ON (incs.dbinc_key=mcdf.dbinc_key)

LEFT OUTER JOIN mbrl ON (incs.dbinc_key=mbrl.dbinc_key)

LEFT OUTER JOIN mal ON (incs.dbinc_key=mal.dbinc_key)

LEFT OUTER JOIN mbs ON (incs.db_key=mbs.db_key)

LEFT OUTER JOIN conf c ON (c.db_key=incs.db_key AND c.NAME = 'RETENTION POLICY' AND value LIKE 'TO RECOVERY WINDOW OF %')

WHERE 1=1

AND greatest(

nvl(max_bdf_time,to_date('1970-01-01','YYYY-MM-DD')),

nvl(max_brl_time,to_date('1970-01-01','YYYY-MM-DD')),

nvl(max_al_time ,to_date('1970-01-01','YYYY-MM-DD')),

nvl(max_cdf_time,to_date('1970-01-01','YYYY-MM-DD')),

nvl(max_bs_time,to_date('1970-01-01','YYYY-MM-DD'))

) < (sysdate - nvl(to_number(substr(c.value,23,length(c.value)-27)),365))

order by 4,3, 5

;

r_old_incarnation c_old_incarnations%ROWTYPE;

BEGIN

OPEN c_old_incarnations;

LOOP

FETCH c_old_incarnations INTO r_old_incarnation;

EXIT WHEN c_old_incarnations%NOTFOUND;

dbms_output.put('Purging db: ' || r_old_incarnation.db_name);

dbms_output.put(' IncKey: ' || r_old_incarnation.db_key);

dbms_output.put(' DBID: ' || r_old_incarnation.db_id);

dbms_output.put_line(' Last BCK: ' || to_char(r_old_incarnation.last_bck,'YYYY-MM-DD'));

BEGIN

dbms_rcvcat.unregisterdatabase(DB_KEY => r_old_incarnation.db_key, DB_ID => r_old_incarnation.db_id);

EXCEPTION

WHEN e_dbatabase_not_found THEN

dbms_output.put_line('Database already unregistered');

END;

END LOOP;

CLOSE c_old_incarnations;

END;

I have used this procedure today for the first time and it worked like a charm.

However, if you have any adjustment or suggestion, don’t hesitate to comment it 🙂

HTH