July, 2014 - DBA survival BLOG

As I’ve written in my previous post, the inmemory_size parameter is static, so you need to restart your instance to activate it or change its size. Let’s try to set it at 600M.

SQL> show parameter inmem

NAME TYPE VALUE
------------------------------------ ----------- ------------------------------
inmemory_clause_default              string
inmemory_force                       string      DEFAULT
inmemory_query                       string      ENABLE
inmemory_size                        big integer 0

SQL> alter system set inmemory_size=600M scope=spfile;

SQL> shutdown

...

SQL> startup

...

SQL> show parameter inmem

NAME TYPE VALUE
------------------------------------ ----------- ------------------------------
inmemory_clause_default              string
inmemory_force                       string      DEFAULT
inmemory_query                       string      ENABLE
inmemory_size                        big integer 608M

SQL> show parameter inmem

NAME TYPE VALUE

------------------------------------ ----------- ------------------------------

inmemory_clause_default string

inmemory_force string DEFAULT

inmemory_query string ENABLE

inmemory_size big integer 0

SQL> alter system set inmemory_size=600M scope=spfile;

SQL> shutdown

...

SQL> startup

...

SQL> show parameter inmem

NAME TYPE VALUE

------------------------------------ ----------- ------------------------------

inmemory_clause_default string

inmemory_force string DEFAULT

inmemory_query string ENABLE

inmemory_size big integer 608M

First interesting thing: it has been rounded to 608M so it works in chunks of 16M. (to be verified)

Which views can you select for further information?

SQL> select view_name from dba_views where view_name like 'V_$IM%';

VIEW_NAME
--------------------------------------------------------------------------------
V_$IM_SEGMENTS_DETAIL
V_$IM_SEGMENTS
V_$IM_USER_SEGMENTS
V_$IM_TBS_EXT_MAP
V_$IM_SEG_EXT_MAP
V_$IM_HEADER
V_$IM_COL_CU
V_$IM_SMU_HEAD
V_$IM_SMU_CHUNK
V_$IM_COLUMN_LEVEL

10 rows selected

SQL> select view_name from dba_views where view_name like 'V_$IM%';

VIEW_NAME

--------------------------------------------------------------------------------

V_$IM_SEGMENTS_DETAIL

V_$IM_SEGMENTS

V_$IM_USER_SEGMENTS

V_$IM_TBS_EXT_MAP

V_$IM_SEG_EXT_MAP

V_$IM_HEADER

V_$IM_COL_CU

V_$IM_SMU_HEAD

V_$IM_SMU_CHUNK

V_$IM_COLUMN_LEVEL

10 rows selected

V$IM_SEGMENTS gives a few information about the segments that have a columnar version, including the segment size, the actual memory allocated, the population status and other compression indicators.

The other views help understand the various memory chunks and the status for each column in the segment.

Let’s create a table with a few records:

SQL> create table ludovico.tinmem
as
select
a.OWNER ,
a.TABLE_NAME ,
b.owner owner2,
b.table_name table_name2,
a.TABLESPACE_NAME ,
a.STATUS ,
a.PCT_FREE ,
a.PCT_USED ,
a.INI_TRANS ,
a.MAX_TRANS ,
a.INITIAL_EXTENT ,
a.NEXT_EXTENT ,
a.MIN_EXTENTS ,
a.MAX_EXTENTS ,
a.PCT_INCREASE ,
a.FREELISTS ,
a.FREELIST_GROUPS ,
a.LOGGING
22 from all_tables a, all_tables b;

Table created.

SQL> select count(*) from ludovico.tinmem;

COUNT(*)
----------
5470921

SQL>

SQL> create table ludovico.tinmem

select

a.OWNER ,

a.TABLE_NAME ,

b.owner owner2,

b.table_name table_name2,

a.TABLESPACE_NAME ,

a.STATUS ,

a.PCT_FREE ,

a.PCT_USED ,

a.INI_TRANS ,

a.MAX_TRANS ,

a.INITIAL_EXTENT ,

a.NEXT_EXTENT ,

a.MIN_EXTENTS ,

a.MAX_EXTENTS ,

a.PCT_INCREASE ,

a.FREELISTS ,

a.FREELIST_GROUPS ,

a.LOGGING

22 from all_tables a, all_tables b;

Table created.

SQL> select count(*) from ludovico.tinmem;

COUNT(*)

----------

5470921

SQL>

The table is very simple, it’s a cartesian of two “all_tables” views.

Let’s also create an index on it:

SQL> create index ludovico.tinmem_ix1 on ludovico.tinmem (table_name, pct_increase);

 SQL> select segment_name, bytes/1024/1024 from dba_segments where owner='LUDOVICO';

SEGMENT_NAME      BYTES/1024/1024
----------------- ---------------
TINMEM                        621
TINMEM_IX1                    192

SQL> create index ludovico.tinmem_ix1 on ludovico.tinmem (table_name, pct_increase);

SQL> select segment_name, bytes/1024/1024 from dba_segments where owner='LUDOVICO';

SEGMENT_NAME BYTES/1024/1024

----------------- ---------------

TINMEM 621

TINMEM_IX1 192

The table uses 621M and the index 192M.

How long does it take to do a full table scan almost from disk?

SQL> select distinct tablespace_name from ludovico.tinmem order by 1;

TABLESPACE_NAME
------------------------------
SYSAUX
SYSTEM
USERS

Elapsed: 00:00:15.05

SQL> select distinct tablespace_name from ludovico.tinmem order by 1;

TABLESPACE_NAME

------------------------------

SYSAUX

SYSTEM

USERS

Elapsed: 00:00:15.05

15 seconds! Ok, I’m this virtual machine is on an external drive 5400 RPM… 🙁

Once the table is fully cached in the buffer cache, the query performance progressively improves to ~1 sec.

SQL> r
1* select distinct tablespace_name from ludovico.tinmem order by 1

TABLESPACE_NAME
------------------------------
SYSAUX
SYSTEM
USERS


Elapsed: 00:00:01.42
SQL> r
1* select distinct tablespace_name from ludovico.tinmem order by 1

TABLESPACE_NAME
------------------------------
SYSAUX
SYSTEM
USERS


Elapsed: 00:00:00.99

SQL> r

1* select distinct tablespace_name from ludovico.tinmem order by 1

TABLESPACE_NAME

------------------------------

SYSAUX

SYSTEM

USERS

Elapsed: 00:00:01.42

SQL> r

1* select distinct tablespace_name from ludovico.tinmem order by 1

TABLESPACE_NAME

------------------------------

SYSAUX

SYSTEM

USERS

Elapsed: 00:00:00.99

There is no inmemory segment yet:

SQL> SELECT OWNER, SEGMENT_NAME, INMEMORY_PRIORITY, INMEMORY_COMPRESSION

2 FROM V$IM_SEGMENTS; 

no rows selected

SQL> SELECT OWNER, SEGMENT_NAME, INMEMORY_PRIORITY, INMEMORY_COMPRESSION

2 FROM V$IM_SEGMENTS;

no rows selected

You have to specify it at table level:

SQL> alter table ludovico.tinmem inmemory;

Table altered.

SQL> SELECT OWNER, SEGMENT_NAME, INMEMORY_PRIORITY, INMEMORY_COMPRESSION
FROM V$IM_SEGMENTS; 2

OWNER
--------------------------------------------------------------------------------
SEGMENT_NAME
--------------------------------------------------------------------------------
INMEMORY INMEMORY_COMPRESS
-------- -----------------
LUDOVICO
TINMEM
HIGH     FOR QUERY

SQL> alter table ludovico.tinmem inmemory;

Table altered.

SQL> SELECT OWNER, SEGMENT_NAME, INMEMORY_PRIORITY, INMEMORY_COMPRESSION

FROM V$IM_SEGMENTS; 2

OWNER

--------------------------------------------------------------------------------

SEGMENT_NAME

--------------------------------------------------------------------------------

INMEMORY INMEMORY_COMPRESS

-------- -----------------

LUDOVICO

TINMEM

HIGH FOR QUERY

The actual creation of the columnar store takes a while, especially if you don’t specify to create it with high priority. You may have to query the table before seeing the columnar store and its population will also take some time and increase the overall load of the database (on my VBox VM, the performance overhead of columnar store population is NOT negligible).

Once the in-memory store created, the optimizer is ready to use it:

SQL> explain plan for select distinct tablespace_name from ludovico.tinmem order by 1;

Explained.


SQL> SELECT PLAN_TABLE_OUTPUT FROM TABLE(DBMS_XPLAN.DISPLAY());

PLAN_TABLE_OUTPUT
------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
Plan hash value: 1243998285

--------------------------------------------------------------------------------------
| Id  | Operation                   | Name   | Rows  | Bytes | Cost (%CPU)| Time     |
--------------------------------------------------------------------------------------
|   0 | SELECT STATEMENT            |        |     1 |    13 | 26132   (2)| 00:00:02 |
|   1 |  SORT UNIQUE                |        |     1 |    13 | 25993   (1)| 00:00:02 |
|   2 |   TABLE ACCESS INMEMORY FULL| TINMEM |  5470K|    67M|    26 (100)| 00:00:01 |
--------------------------------------------------------------------------------------

9 rows selected.

SQL> explain plan for select distinct tablespace_name from ludovico.tinmem order by 1;

Explained.

SQL> SELECT PLAN_TABLE_OUTPUT FROM TABLE(DBMS_XPLAN.DISPLAY());

PLAN_TABLE_OUTPUT

------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------

Plan hash value: 1243998285

--------------------------------------------------------------------------------------

--------------------------------------------------------------------------------------

| 0 | SELECT STATEMENT | | 1 | 13 | 26132 (2)| 00:00:02 |

| 1 | SORT UNIQUE | | 1 | 13 | 25993 (1)| 00:00:02 |

| 2 | TABLE ACCESS INMEMORY FULL| TINMEM | 5470K| 67M| 26 (100)| 00:00:01 |

--------------------------------------------------------------------------------------

9 rows selected.

The previous query now takes half the time on the first attempt!

SQL> select distinct tablespace_name from ludovico.tinmem order by 1;

TABLESPACE_NAME
------------------------------
SYSAUX
SYSTEM
USERS

Elapsed: 00:00:00.50

SQL> select distinct tablespace_name from ludovico.tinmem order by 1;

TABLESPACE_NAME

------------------------------

SYSAUX

SYSTEM

USERS

Elapsed: 00:00:00.50

The columnar store for the whole table uses 23M out of 621M, so the compression ratio is very good compared to the non-compressed index previously created!

SQL> select OWNER, SEGMENT_NAME, SEGMENT_TYPE, INMEMORY_SIZE, BYTES, BYTES_NOT_POPULATED, INMEMORY_PRIORITY, INMEMORY_DISTRIBUTE, INMEMORY_COMPRESSION

2 from V$IM_SEGMENTS;

OWNER    SEGMENT_NAME SEGMENT_TYPE INMEMORY_SIZE BYTES     BYTES_NOT_POPULATED INMEMORY INMEMORY_DISTRI INMEMORY_COMPRESS
-------- ------------ ------------ ------------- --------- ------------------- -------- --------------- -----------------
LUDOVICO TINMEM       TABLE        23527424      651165696 0                   HIGH     AUTO DISTRIBUTE FOR QUERY

Elapsed: 00:00:00.07

SQL> select OWNER, SEGMENT_NAME, SEGMENT_TYPE, INMEMORY_SIZE, BYTES, BYTES_NOT_POPULATED, INMEMORY_PRIORITY, INMEMORY_DISTRIBUTE, INMEMORY_COMPRESSION

2 from V$IM_SEGMENTS;

OWNER SEGMENT_NAME SEGMENT_TYPE INMEMORY_SIZE BYTES BYTES_NOT_POPULATED INMEMORY INMEMORY_DISTRI INMEMORY_COMPRESS

-------- ------------ ------------ ------------- --------- ------------------- -------- --------------- -----------------

LUDOVICO TINMEM TABLE 23527424 651165696 0 HIGH AUTO DISTRIBUTE FOR QUERY

Elapsed: 00:00:00.07

This is a very short example. The result here (2x improvement) is influenced by several factors. It is safe to think that with “normal” production conditions the gain will be much higher in almost all the cases.
I just wanted to demonstrate that in-memory columnar store is space efficient and really provides higher speed out of the box.

Now that you know about it, can you live without? 😛

Oracle Database 12.1.0.2 is finally out, and as we all knew in advance, it contains the new in-memory option.

I think that, despite its cost ($23k per processor), this is another great improvement! 🙂

Consistent savings!

This new feature is not to be confused with Times Ten. In-memory is a feature that enable a new memory area inside the SGA that is used to contain a columnar organized copy of segments entirely in memory. Columnar stores organize the data as columns instead of rows and they are ideal for queries that involve a few columns on many rows, e.g. for analytic reports, but they work great also for all extemporary queries that cannot make use of existing indexes.

Columnar stores don’t replace traditional indexes for data integrity or fast single-row look-ups, but they can replace many additional indexes created for the solely purpose of reporting. Hence, if from one side it seems a waste of memory, on the other side using in-memory can lead to consistent memory savings due to all the indexes that have no more reason to exist.

Let’s take an example of a table (in RED) with nine indexes (other colors).

If you try to imagine all the blocks in the buffer cache, you may think about something like this:

Now, with the in-memory columnar store, you can get the rid of many indexes because they’ve been created just for reporting and they are now superseded by the performance of the new feature:

In this case, you’re not only saving blocks on disk, but also in the buffer cache, making room for the in-memory area. With columnar store, the compression factor may allow to easily fit your entire table in the same space that was previously required for a few, query-specific indexes. So you’ll have the buffer cache with traditional row-organized blocks (red, yellow, light and dark blue) and the separate in-memory area with a columnar copy of the segment (gray).

The in-memory store doesn’t make use of undo segments and redo buffer, so you’re also saving undo block buffers and physical I/O!

The added value

In my opinion this option will have much more attention from the customers than Multitenant for a very simple reason.

How many customers (in percentage) would pay to achieve better consolidation of hundreds of databases? A few.

How many would pay or are already paying for having better performance for critical applications? Almost all the customers I know!

Internal mechanisms

In-memory is enabled on a per-segment basis: you can specify a table or a partition to be stored in-memory.

Each column is organized in separate chunks of memory called In Memory Compression Units (IMCU). The number of IMCUs required for each column may vary.

Each IMCU contains the data of the column and a journal used to guarantee read consistency with the blocks in the buffer cache. The data is not modified on the fly in the IMCU, but the row it refers to is marked as stale in a journal that is stored inside the IMCU itself. When the stale data grows above a certain threshold the space efficiency of the columnar store decreases and the in-memory coordinator process ([imco]) may force a re-population of the store.
Re-population may also occur after manual intervention or at the instance startup: because it is memory-only, the data actually need to be populated in the in-memory store from disk.

Whether the data is populated immediately after the startup or not, it actually depends on the priority specified for the specific segment. The higher the priority, the sooner the segment will be populated in-memory. The priority attribute also drives which segments would survive in-memory in case of “in-memory pressure”. Sadly, the parameter inmemory_size that specifies the size of the in-memory area is static and an instance restart is required in order to change it, that’s why you need to plan carefully the size prior to its activation. There is a compression advisor for in-memory that can help out on this.

Conclusion

In this post you’ve seen a small introduction about in-memory. I hope I can publish very soon another post with a few practical examples.

DBA survival BLOG

DBA stuff and Oracle Data Guard

Monthly Archives: July 2014

In-memory Columnar Store hands-on

Oracle Database 12c in-memory option, a quick overview