In-memory Columnar Store hands-on

As I’ve written in my previous post, the inmemory_size parameter is static, so you need to restart your instance to activate it or to change its size. Let’s try to set it to 600M.
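
Something along these lines does it (a sketch; the in-memory area is carved out of the SGA, so make sure sga_target is large enough to accommodate it):

    -- static parameter: change it in the spfile, then bounce the instance
    ALTER SYSTEM SET inmemory_size = 600M SCOPE = SPFILE;
    SHUTDOWN IMMEDIATE
    STARTUP
    -- check the size actually allocated
    SHOW PARAMETER inmemory_size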

 

First interesting thing: the value has been rounded up to 608M, so the area seems to be allocated in chunks of 16M (to be verified).

Which views can you select for further information?

V$IM_SEGMENTS gives some information about the segments that have a columnar version, including the segment size, the actual memory allocated, the population status and other compression indicators.

The other views help understand the various memory chunks and the status for each column in the segment.
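
For a quick check, a query like this against V$IM_SEGMENTS does the job (column list trimmed to the most interesting ones):

    SELECT owner, segment_name, populate_status,
           bytes, inmemory_size, bytes_not_populated
      FROM v$im_segments;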

Let’s create a table with a few records:

The table is very simple: it’s a Cartesian product of two “all_tables” views.
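
A sketch of the statement (the table name BIGTAB is just a placeholder):

    -- Cartesian product of ALL_TABLES with itself: a few million rows,
    -- enough to get a segment of several hundred MB
    CREATE TABLE bigtab AS
      SELECT t1.*
        FROM all_tables t1, all_tables t2;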

Let’s also create an index on it:
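
For example (the indexed columns are an assumption; any reasonably selective columns will do):

    CREATE INDEX bigtab_idx ON bigtab (owner, table_name);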

The table uses 621M and the index 192M.
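
The sizes can be read from DBA_SEGMENTS:

    SELECT segment_name, ROUND(bytes/1024/1024) AS size_mb
      FROM dba_segments
     WHERE segment_name IN ('BIGTAB', 'BIGTAB_IDX');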

How long does it take to do a full table scan almost from disk?
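
A timed full scan after flushing the buffer cache gives a rough idea (a test-only sketch: never flush the buffer cache on a system you care about):

    ALTER SYSTEM FLUSH BUFFER_CACHE;
    SET TIMING ON
    SELECT /*+ FULL(t) */ COUNT(*), MAX(owner) FROM bigtab t;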

15 seconds! OK, this virtual machine is on an external 5400 RPM drive… 🙁

Once the table is fully cached in the buffer cache, the elapsed time progressively drops to about 1 second.

There is no inmemory segment yet:
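
A quick check against V$IM_SEGMENTS confirms it:

    SELECT owner, segment_name FROM v$im_segments;
    -- no rows selected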

You have to specify it at table level:
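
A simple ALTER TABLE does it; adding a priority tells Oracle to populate the segment without waiting for the first access (BIGTAB is the placeholder name used above):

    ALTER TABLE bigtab INMEMORY;
    -- or, to have it populated proactively:
    ALTER TABLE bigtab INMEMORY PRIORITY HIGH;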

The actual creation of the columnar store takes a while, especially if you don’t specify a high priority for it. You may have to query the table before the columnar store appears, and its population also takes some time and increases the overall load on the database (on my VBox VM, the performance overhead of the columnar store population is NOT negligible).

Once the in-memory store is created, the optimizer is ready to use it:
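
The new access path shows up in the execution plan as TABLE ACCESS INMEMORY FULL (output trimmed to the relevant line):

    EXPLAIN PLAN FOR
      SELECT /*+ FULL(t) */ COUNT(*), MAX(owner) FROM bigtab t;
    SELECT * FROM TABLE(dbms_xplan.display);
    -- ...
    -- |   2 |  TABLE ACCESS INMEMORY FULL| BIGTAB |
    -- ...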

The previous query now takes half the time on the first attempt!

The columnar store for the whole table uses 23M, versus 621M on disk: the compression ratio is very good, especially compared to the non-compressed 192M index created earlier!
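
The figures can be read, once again, from V$IM_SEGMENTS:

    SELECT segment_name,
           ROUND(bytes/1024/1024)         AS disk_mb,
           ROUND(inmemory_size/1024/1024) AS inmem_mb,
           ROUND(bytes/inmemory_size, 1)  AS comp_ratio
      FROM v$im_segments;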

 

This is a very short example. The result here (a 2x improvement) is influenced by several factors, but it is safe to assume that under “normal” production conditions the gain will be much higher in almost all cases.
I just wanted to demonstrate that the in-memory columnar store is space-efficient and really does provide higher speed out of the box.

Now that you know about it, can you live without it? 😛

Oracle Database 12c in-memory option, a quick overview

Oracle Database 12.1.0.2 is finally out, and as we all knew in advance, it contains the new in-memory option.

I think that, despite its cost ($23k per processor), this is another great improvement! 🙂

Substantial savings!

This new feature is not to be confused with TimesTen. In-Memory is a feature that enables a new memory area inside the SGA, used to hold a columnar-organized copy of segments entirely in memory. Columnar stores organize data as columns instead of rows; they are ideal for queries that involve a few columns over many rows, e.g. analytic reports, but they also work great for ad-hoc queries that cannot make use of existing indexes.

Columnar stores don’t replace traditional indexes for data integrity or fast single-row lookups, but they can replace many additional indexes created for the sole purpose of reporting. Hence, while on one side it may seem a waste of memory, on the other side using In-Memory can lead to substantial memory savings thanks to all the indexes that no longer have a reason to exist.

Let’s take an example of a table (in RED) with nine indexes (other colors).

[Image inmem_table_indexes: the table (red) with its nine indexes]

If you try to imagine all the blocks in the buffer cache, you may think about something like this:

[Image inmem_sga1: table and index blocks filling the buffer cache]

Now, with the in-memory columnar store, you can get rid of many indexes: they were created just for reporting and are now superseded by the performance of the new feature:

[Image inmem_no_indexes: the table after dropping the reporting-only indexes]

 

In this case, you’re not only saving blocks on disk, but also in the buffer cache, making room for the in-memory area. With the columnar store, the compression factor may easily allow your entire table to fit in the same space that was previously required by a few query-specific indexes. So you’ll have the buffer cache with traditional row-organized blocks (red, yellow, light and dark blue) and the separate in-memory area with a columnar copy of the segment (gray).

[Image inmem_sga2: row-organized blocks in the buffer cache plus the columnar copy in the in-memory area]

The in-memory store doesn’t make use of undo segments or the redo buffer, so you’re also saving undo block buffers and physical I/O!

 

The added value

In my opinion this option will get much more attention from customers than Multitenant, for a very simple reason.

How many customers (as a percentage) would pay to achieve better consolidation of hundreds of databases? A few.

How many would pay, or are already paying, for better performance of critical applications? Almost all the customers I know!

 

Internal mechanisms

In-memory is enabled on a per-segment basis: you can specify a table or a partition to be stored in-memory.
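
The attribute goes directly on the segment, for example (table and partition names are just placeholders):

    ALTER TABLE sales INMEMORY;
    ALTER TABLE sales MODIFY PARTITION sales_2014 INMEMORY;
    ALTER TABLE sales NO INMEMORY;   -- removes the columnar copy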

Each column is organized in separate chunks of memory called In-Memory Compression Units (IMCUs). The number of IMCUs required for each column may vary.

Each IMCU contains the data of the column and a journal used to guarantee read consistency with the blocks in the buffer cache. The data is not modified on the fly in the IMCU: the row it refers to is marked as stale in the journal stored inside the IMCU itself. When the stale data grows above a certain threshold, the space efficiency of the columnar store decreases and the in-memory coordinator process ([imco]) may force a re-population of the store.
Re-population may also occur after manual intervention or at instance startup: because the store is memory-only, the data actually needs to be populated from disk.

Whether the data is populated immediately after startup or not depends on the priority specified for the segment. The higher the priority, the sooner the segment is populated in-memory. The priority attribute also drives which segments survive in-memory in case of “in-memory pressure”. Sadly, the inmemory_size parameter that specifies the size of the in-memory area is static, and an instance restart is required to change it; that’s why you need to plan the size carefully before activating it. There is a compression advisor for in-memory that can help with this.
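
Priority and compression level are part of the same INMEMORY clause, for example:

    ALTER TABLE sales INMEMORY MEMCOMPRESS FOR QUERY HIGH PRIORITY HIGH;
    -- priorities:  NONE (default), LOW, MEDIUM, HIGH, CRITICAL
    -- compression: NO MEMCOMPRESS, MEMCOMPRESS FOR DML,
    --              MEMCOMPRESS FOR QUERY LOW/HIGH, FOR CAPACITY LOW/HIGH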

Conclusion

In this post you’ve seen a short introduction to In-Memory. I hope to publish another post with a few practical examples very soon.

Exciting News from Oracle Open World 2013

I’m back at work now, safe and sound, after the week in San Francisco.

It’s time to sit down and try to pull together some thoughts about what I’ve experienced and done.

I’ll start with the new announcements, which are what matters most to most people, and leave my personal experience for my next post.

 

 

In-memory Database Option

Oracle has announced the In-Memory option for the Oracle Database. This feature will store data simultaneously in the traditional row-based format and in a new in-memory columnar format, to serve both analytics and OLTP workloads optimally AT THE SAME TIME. Because the column-based storage is redundant, it will work without a logging mechanism, so the overhead will be minimal. The marketing message claims “ungodly speed”: 100x faster queries for analytics and 2x faster queries in OLTP environments.

By separating analytics and OLTP with different storage formats, the indexes on the row-based version of the table can be reduced to make transactions faster, getting rid of the analytical indexes thanks to the columnar format, which is already optimized for that kind of workload. The activation of the option will be transparent to the applications.

How will it be activated?

Now my considerations:

  • [evil] Will this option make your database faster than putting it on an actual Exadata?
  • It will be an option, so it will cost extra money on top of the Enterprise Edition
  • [I guess] it will be released with 12cR2, because such a big change cannot be introduced simply with a patch set. So I think we won’t see it before the end of 2014
  • And, uh, Maria Colgan has given up the Product Management of the Cost Based Optimizer to become the Product Manager of the In-Memory option. Tom Kyte will take ownership of the CBO.

 

M6-32 Big Memory Machine

I’ve paid much less attention to this announcement. The new big super-hyper machine engineered by Oracle will have:

  • 1024 DIMMS
  • 32TB of DRAM
  • 12 cores per processor
  • 96 threads per processor

This huge memory machine can be connected through InfiniBand to an Exadata to rely on its storage cells.

But it will cost $3M, so it’s not really intended for SMBs or for the average DBA; that’s why I don’t care too much about it…

 

Oracle Database Backup, Logging, Recovery Appliance

Only 8 minutes of the keynote to introduce this appliance, which is really hot, IMHO. This… oh my… let’s call it ODBLRA, is a backup appliance (based on the same HW as Exadata) capable of receiving the redo log stream over SQL*Net, the same way it’s done with Data Guard, except that instead of a standby database you’ll have an appliance capable of storing the whole redo stream of your entire DB farm and keeping a real-time backup of your transactions. That’s it: no transactions lost between two backup archives, and no need for hundreds of Data Guard setups or network filesystems as secondary destinations to make your redo stream safer.

I guess it will host an RMAN-aware engine that can create incrementally updated backups, so you can almost forget about full backups. You can leverage an existing tape infrastructure to offload the appliance if it starts getting full.

Your ODBLRA can also replicate your backups to another appliance hosted on the Oracle Cloud: ODBLRAaaS!  🙂

To conclude, Oracle is pushing for bigger, dedicated, specialized SPARC machines instead of relying on commodity hardware…

 

Oracle Multi-tenant Self-Service Provisioning

There’s a new APEX application, now in BETA, that can be downloaded from the Oracle Multitenant page and provides self-service provisioning of databases in a Multitenant architecture. It’s worth a try… if you plan to introduce the Multitenant option in your environment!

 

All products in the Cloud

Oracle now offers (as a preview) its Database, Middleware and Applications as a Service in its public cloud. For a DBA, the following can be of interest:

The Storage aaS uses Java and REST APIs (OpenStack Swift) for block-level access to the storage.

The Computing aaS allows you to scale the computing power to follow your needs.

The Database aaS is the standard, full-featured Oracle Database (in the cloud!), 11gR2 or 12c, in all editions (SE, SE1, EE). You can choose among five different sizes, up to 17 cores and 256GB of RAM, and among the following formulas:

  • Single Schema (3 sizes: 5, 20 or 50GB, with prices from $175/month to $2,000/month)
  • Basic Database (user-managed, single-instance preconfigured databases only with a local EM)
  • Managed Database (single-instance with managed backups & PITR, managed quarterly apply of critical patches)
  • Premium Managed Database (fully managed RAC, with optional DG or Active DG, PDB and upgrades)

My considerations:

  • Oracle releases this cloud offering with a significant delay compared to its competitors
  • It’s still in preview and there’s no information about the billing model. Depending on that, it can be more or less attractive.
  • As with other cloud services, the performance will be acceptable only when putting the whole stack into the same cloud (WebLogic, DB, etc.)

 

Oracle on Azure

Microsoft is starting to offer preconfigured Oracle platforms, Database and WebLogic, on Azure, on both Linux and Windows systems. I haven’t seen the price list yet, but IMHO Azure has been around for a long time now and appears to be a reliable and settled alternative compared to Oracle Cloud. Nice move, Microsoft; I think it deserves special attention.

 

Keynotes recordings

You can see the full keynote recordings here:

Oracle OpenWorld Keynote Highlights

Larry Ellison — Oracle OpenWorld Keynote 9-22-2013

Oracle OpenWorld General Session 2013: Database

Kurian and Fowler — Oracle OpenWorld Keynote 9-24-2013

 

Will these announcements change your life? Let me know…

…and stay tuned, I’ll come back soon with a new post about my “real” week at the Open World and why I’ve loved it.

Ludovico