
13. Partitioning

Darl Kuhn (Morrison, CO, USA) and Thomas Kyte (Denver, CO, USA)

Partitioning is the process of physically breaking a table or index into many smaller, more manageable pieces. As far as the application accessing the database is concerned, there is logically only one table or one index, but physically that table or index may comprise many dozens of physical partitions. Each partition is an independent object that may be manipulated either by itself or as part of the larger object.

Note

Partitioning is an extra-cost option of the Enterprise Edition of the Oracle database. It is not available in the Standard Edition.

In this chapter, we will investigate why you might consider using partitioning. The reasons range from increased availability of data to reduced administrative (DBA) burdens and, in certain situations, increased performance. Once you have a good understanding of the reasons for using partitioning, we’ll look at how you may partition tables and their corresponding indexes. The goal of this discussion is not to teach you the details of administering partitions, but rather to present a practical guide to implementing your applications with partitions.

We will also discuss the important fact that partitioning of tables and indexes is not a guaranteed “fast = true” setting for the database. It has been my experience that many developers and DBAs believe that increased performance is an automatic side effect of partitioning an object. Partitioning is just a tool, and one of three things will happen when you partition an index or table: the application using these partitioned tables might run slower, might run faster, or might not be affected one way or the other. I put forth that if you just apply partitioning without understanding how it works and how your application can make use of it, then the odds are you will negatively impact performance by just turning it on.

Lastly, we’ll investigate a very common use of partitions in today’s world: supporting a large online audit trail in OLTP and other operational systems. We’ll discuss how to incorporate partitioning and segment space compression to efficiently store online a large audit trail and provide the ability to archive old records out of this audit trail with minimal work.

Partitioning Overview

Partitioning facilitates the management of very large tables and indexes using divide and conquer logic. Partitioning introduces the concept of a partition key that is used to segregate data based on a certain range value, a list of specific values, or the value of a hash function. If I were to put the benefits of partitioning in some sort of order, it would be as follows:
  1. Increases availability of data: This attribute is applicable to all system types, be they OLTP or warehouse systems by nature.

  2. Eases administration of large segments by breaking them into smaller, more manageable pieces: Performing administrative operations on a 100GB table, such as a reorganization to remove migrated rows or to reclaim “whitespace” left in the table after a purge of old information, would be much more onerous than performing the same operation ten times on individual 10GB table partitions. Additionally, using partitions, we might be able to conduct a purge routine without leaving whitespace behind at all, removing the need for a reorganization entirely!

  3. Improves the performance of certain queries: This is mainly beneficial in a large warehouse environment where we can use partitioning to eliminate large ranges of data from consideration, avoiding accessing this data at all. This will not be as applicable in a transactional system, since we are accessing small volumes of data in that system already.

  4. May reduce contention on high-volume OLTP systems by spreading out modifications across many separate partitions: If you have a segment experiencing high contention, turning it into many segments could have the side effect of reducing that contention proportionally.

Let’s take a look at each of these potential benefits of using partitioning.

Increased Availability

Increased availability derives from the independence of each partition. The availability (or lack thereof) of a single partition in an object does not mean the object itself is unavailable. The optimizer is aware of the partitioning scheme that is in place and will remove unreferenced partitions from the query plan accordingly. If a single partition is unavailable in a large object, and your query can eliminate this partition from consideration, then Oracle will successfully process the query.

To demonstrate this increased availability, we’ll set up a hash partitioned table with two partitions, each in a separate tablespace. We’ll create an EMP table with EMPNO as the partition key. In this case, this structure means that for each row inserted into this table, the value of the EMPNO column is hashed to determine the partition (and hence the tablespace) into which the row will be placed. First, we create two tablespaces (P1 and P2) and then a partitioned table with two partitions (PART_1 and PART_2), with one partition in each tablespace:

Note

The tablespaces in this example use Oracle Managed Files with the initialization parameter DB_CREATE_FILE_DEST set to /opt/oracle/oradata.

$ sqlplus eoda/foo@PDB1
SQL> create tablespace p1 datafile size 1m autoextend on next 1m;
Tablespace created.
SQL> create tablespace p2 datafile size 1m autoextend on next 1m;
Tablespace created.
SQL> CREATE TABLE emp
    ( empno   int,
      ename   varchar2(20)
    )
    PARTITION BY HASH (empno)
    ( partition part_1 tablespace p1,
      partition part_2 tablespace p2
    )
    /
Table created.
Next, we insert some data into the EMP table and then, using the partition-extended table name, inspect the contents of each partition:
SQL> insert into emp select empno, ename from scott.emp;
14 rows created.
SQL> select * from emp partition(part_1);
     EMPNO ENAME
---------- --------------------
      7369 SMITH
      7499 ALLEN
      7654 MARTIN
      7698 BLAKE
      7782 CLARK
      7839 KING
      7876 ADAMS
      7934 MILLER
8 rows selected.
SQL> select * from emp partition(part_2);
     EMPNO ENAME
---------- --------------------
      7521 WARD
      7566 JONES
      7788 SCOTT
      7844 TURNER
      7900 JAMES
      7902 FORD
6 rows selected.

You should note that the data is somewhat randomly assigned. That is by design here. Using hash partitioning, we are asking Oracle to randomly—but hopefully evenly—distribute our data across many partitions. We cannot control the partition into which data goes; Oracle decides that based on the hash of the partition key value. Later, when we look at range and list partitioning, we’ll see how we can control what partitions receive which data.

Now, we take one of the tablespaces offline (simulating, e.g., a disk failure), thus making unavailable the data in that partition:
SQL> alter tablespace p1 offline;
Tablespace altered.
Next, we run a query that hits every partition, and we see that this query fails:
SQL> select * from emp;
select * from emp
              *
ERROR at line 1:
ORA-00376: file 22 cannot be read at this time
ORA-01110: data file 22:
'/opt/oracle/oradata/CDB/C217E68DF48779E1E0530101007F73B9/datafile/o1_mf_p1_jc8b
g9nm_.dbf'
However, a query that does not access the offline tablespace will function as normal; Oracle will eliminate the offline partition from consideration. I use a bind variable in this particular example just to demonstrate that even though Oracle does not know at query optimization time which partition will be accessed, it is nonetheless able to perform this elimination at runtime:
SQL> variable n number
SQL> exec :n := 7844;
PL/SQL procedure successfully completed.
SQL> select * from emp where empno = :n;
     EMPNO ENAME
---------- --------------------
      7844 TURNER

In summary, when the optimizer can eliminate partitions from the plan, it will. This fact increases availability for those applications that use the partition key in their queries.

Partitions also increase availability by reducing downtime. If you have a 100GB table, for example, and it is partitioned into 50 2GB partitions, then you can recover from errors that much faster. If one of the 2GB partitions is damaged, the time to recover is now the time it takes to restore and recover a 2GB partition, not a 100GB table. So availability is increased in two ways:
  • Partition elimination by the optimizer means that many users may never even notice that some of the data was unavailable.

  • Downtime is reduced in the event of an error because of the significantly reduced amount of work that is required to recover.
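To make the second point concrete: because each partition lives in its own segment (and, in our example, its own tablespace), media recovery can be scoped to just the damaged piece. A hedged RMAN sketch, assuming backups exist and that the damaged partition resides by itself in tablespace P1 as in the example above, might look like this:
$ rman target /
RMAN> sql 'alter tablespace p1 offline immediate';
RMAN> restore tablespace p1;
RMAN> recover tablespace p1;
RMAN> sql 'alter tablespace p1 online';
While this runs, queries that eliminate the P1 partition continue to work against the rest of the table.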

Reduced Administrative Burden

The administrative burden relief is derived from the fact that performing operations on small objects is inherently easier, faster, and less resource intensive than performing the same operation on a large object.

For example, say you have a 10GB index in your database. If you need to rebuild this index and it is not partitioned, then you will have to rebuild the entire 10GB index as a single unit of work. While it is true that you could rebuild the index online, it requires a huge number of resources to completely rebuild an entire 10GB index. You’ll need at least 10GB of free storage elsewhere to hold a second copy of the index while both copies exist, you’ll need a temporary transaction log table to record the changes made against the base table during the time you spend rebuilding the index, and so on. On the other hand, if the index itself had been partitioned into ten 1GB partitions, then you could rebuild each index partition individually, one by one. Now you need ten percent of the free space you needed previously. Likewise, the individual index rebuilds will each be much faster (ten times faster, perhaps), so far fewer transactional changes occurring during an online index rebuild need to be merged into the new index, and so on.
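As a minimal sketch (assuming a hypothetical locally partitioned index named BIG_IDX with ten partitions), the piecewise rebuild would just be a series of statements such as:
SQL> alter index big_idx rebuild partition part_1;
SQL> alter index big_idx rebuild partition part_2;
...
SQL> alter index big_idx rebuild partition part_10;
Each statement needs scratch space proportional to a single partition only, and each may optionally be performed ONLINE, just as with a nonpartitioned index.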

Also, consider what happens in the event of a system or software failure just before completing the rebuilding of a 10GB index. The entire effort is lost. By breaking the problem down and partitioning the index into 1GB partitions, at most you would lose ten percent of the total work required to rebuild the index.

Last, but not least, it may be that you need to rebuild only ten percent of the total aggregate index—for example, only the “newest” data (the active data) is subject to this reorganization, and all of the “older” data (which is relatively static) remains unaffected.

Another example could be that you discover 50 percent of the rows in your table are “migrated” rows (see Chapter 10 for details on migrated rows), and you would like to fix this. Having a partitioned table will facilitate the operation. To “fix” migrated rows, you must typically rebuild the object—in this case, a table. If you have one 100GB table, you will need to perform this operation in one very large chunk serially, using ALTER TABLE MOVE. On the other hand, if you have 25 partitions, each 4GB in size, then you can rebuild each partition one by one. Alternatively, if you are doing this during off-hours and have ample resources, you can even do the ALTER TABLE MOVE statements in parallel, in separate sessions, potentially reducing the amount of time the whole operation will take. Virtually everything you can do to a nonpartitioned object, you can do to an individual partition of a partitioned object. You might even discover that your migrated rows are concentrated in a very small subset of your partitions; hence, you could rebuild one or two partitions instead of the entire table.

Here is a quick example demonstrating the rebuild of a table with many migrated rows. Both BIG_TABLE1 and BIG_TABLE2 were created from a 10,000,000-row instance of BIG_TABLE (see the “Setting Up Your Environment” section at the beginning of the book for the BIG_TABLE creation script). BIG_TABLE1 is a regular, nonpartitioned table, whereas BIG_TABLE2 is a hash partitioned table in eight partitions (we’ll cover hash partitioning in detail in a subsequent section; suffice it to say, it distributed the data rather evenly into eight partitions). This example creates two tablespaces and then creates the two tables:
SQL> create tablespace big1 datafile size 1500M;
Tablespace created.
SQL> create tablespace big2 datafile size 1500m;
Tablespace created.
SQL> create table big_table1
    ( ID, OWNER, OBJECT_NAME, SUBOBJECT_NAME,
      OBJECT_ID, DATA_OBJECT_ID,
      OBJECT_TYPE, CREATED, LAST_DDL_TIME,
      TIMESTAMP, STATUS, TEMPORARY,
      GENERATED, SECONDARY )
    tablespace big1
    as
    select ID, OWNER, OBJECT_NAME, SUBOBJECT_NAME,
         OBJECT_ID, DATA_OBJECT_ID,
         OBJECT_TYPE, CREATED, LAST_DDL_TIME,
         TIMESTAMP, STATUS, TEMPORARY,
         GENERATED, SECONDARY
    from big_table;
Table created.
SQL> create table big_table2
    ( ID, OWNER, OBJECT_NAME, SUBOBJECT_NAME,
      OBJECT_ID, DATA_OBJECT_ID,
      OBJECT_TYPE, CREATED, LAST_DDL_TIME,
      TIMESTAMP, STATUS, TEMPORARY,
      GENERATED, SECONDARY )
    partition by hash(id)
    (partition part_1 tablespace big2,
     partition part_2 tablespace big2,
   partition part_3 tablespace big2,
   partition part_4 tablespace big2,
   partition part_5 tablespace big2,
   partition part_6 tablespace big2,
   partition part_7 tablespace big2,
   partition part_8 tablespace big2
  )
  as
  select ID, OWNER, OBJECT_NAME, SUBOBJECT_NAME,
         OBJECT_ID, DATA_OBJECT_ID,
         OBJECT_TYPE, CREATED, LAST_DDL_TIME,
         TIMESTAMP, STATUS, TEMPORARY,
         GENERATED, SECONDARY
    from big_table;
Table created.
Now, each of those tables is in its own tablespace, so we can easily query the data dictionary to see the allocated and free space in each tablespace:
SQL> select b.tablespace_name,
           mbytes_alloc,
           mbytes_free
      from ( select round(sum(bytes)/1024/1024) mbytes_free,
                    tablespace_name
               from dba_free_space
              group by tablespace_name ) a,
           ( select round(sum(bytes)/1024/1024) mbytes_alloc,
                    tablespace_name
             from dba_data_files
            group by tablespace_name ) b
   where a.tablespace_name (+) = b.tablespace_name
     and b.tablespace_name in ('BIG1','BIG2')
  /
TABLESPACE_NAME                MBYTES_ALLOC MBYTES_FREE
------------------------------ ------------ -----------
BIG1                                   1500         219
BIG2                                   1500         219
BIG1 and BIG2 are about 1500MB in size, and each has about 219MB of free space. We’ll try to rebuild the first table, BIG_TABLE1:
SQL> alter table big_table1 move;
alter table big_table1 move
*
ERROR at line 1:
ORA-01652: unable to extend temp segment by 1024 in tablespace BIG1
This fails—we need sufficient free space in tablespace BIG1 to hold an entire copy of BIG_TABLE1 at the same time as the old copy is there—in short, we need about two times the storage for a short period (maybe more, maybe less—it depends on the resulting size of the rebuilt table). We now attempt the same operation on BIG_TABLE2:
SQL> alter table big_table2 move;
alter table big_table2 move
            *
ERROR at line 1:
ORA-14511: cannot perform operation on a partitioned object
This is Oracle telling us we can’t do the MOVE operation on the table; we must perform the operation on each partition of the table instead. We can move (hence rebuild and reorganize) each partition one by one:
SQL> alter table big_table2 move partition part_1;
Table altered.
SQL> alter table big_table2 move partition part_2;
Table altered.
SQL> alter table big_table2 move partition part_3;
Table altered.
SQL> alter table big_table2 move partition part_4;
Table altered.
SQL> alter table big_table2 move partition part_5;
Table altered.
SQL> alter table big_table2 move partition part_6;
Table altered.
SQL> alter table big_table2 move partition part_7;
Table altered.
SQL> alter table big_table2 move partition part_8;
Table altered.

Each individual move only needs sufficient free space to hold a copy of one-eighth of the data! Therefore, these commands succeed given the same amount of free space as we had before. We need significantly fewer temporary resources, and, further, if the system fails (e.g., due to a power outage) after we move PART_4 but before PART_5 has finished moving, we won’t lose all of the work performed. The first four partitions would still be moved when the system recovers, and we may resume processing at partition PART_5.

Some may look at that and say, “Wow, eight statements—that is a lot of typing,” and it’s true that this sort of thing would be unreasonable if you had hundreds of partitions (or more). Fortunately, it is very easy to script a solution, and the previous eight statements would become simply
SQL> begin
        for x in ( select partition_name
                     from user_tab_partitions
                    where table_name = 'BIG_TABLE2' )
        loop
            execute immediate
            'alter table big_table2 move partition ' ||
             x.partition_name;
        end loop;
  end;
  /
PL/SQL procedure successfully completed.

All of the information you need is there in the Oracle data dictionary, and most sites that have implemented partitioning also have a series of stored procedures they use to make managing large numbers of partitions easy. Additionally, many GUI tools such as Enterprise Manager have the built-in capability to perform these operations as well, without your needing to type in the individual commands.

Another factor to consider with regard to partitions and administration is the use of sliding windows of data in data warehousing and archiving. In many cases, you need to keep data online that spans the last N units of time. For example, say you need to keep the last 12 months or the last 5 years online. Without partitions, this is generally a massive INSERT followed by a massive DELETE, resulting in two massive transactions. Lots of DML, and lots of redo and undo generated. Now with partitions, you can simply do the following:
  1. Load a separate table with the new month’s (or year’s, or whatever) data.

  2. Index the table fully. (These steps could even be done in another instance and transported to this database.)

  3. Attach this newly loaded and indexed table onto the end of the partitioned table using a fast DDL command: ALTER TABLE EXCHANGE PARTITION (a brief sketch follows below).

  4. Detach the oldest partition off the other end of the partitioned table.

So, you can now very easily support extremely large objects containing time-sensitive information. The old data can easily be removed from the partitioned table and simply dropped if you do not need it, or it can be archived off elsewhere. New data can be loaded into a separate table, so as to not affect the partitioned table until the loading, indexing, and so on is complete. We will take a look at a complete example of a sliding window later.
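Here is a minimal sketch of steps 3 and 4, with hypothetical object names throughout (a SALES table range partitioned by quarter, a staging table SALES_2022_Q1 holding the newly loaded data, and an empty table SALES_OLD to receive the aged-out quarter):
SQL> alter table sales add partition p_2022_q1
       values less than (to_date('01/04/2022','dd/mm/yyyy'));
SQL> alter table sales exchange partition p_2022_q1
       with table sales_2022_q1
       including indexes without validation;
SQL> alter table sales exchange partition p_2018_q1
       with table sales_old
       including indexes without validation;
SQL> alter table sales drop partition p_2018_q1;
Each EXCHANGE is a data dictionary operation; no data is physically copied or moved, which is why the technique scales to very large volumes.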

In short, partitioning can make what would otherwise be daunting, or in some cases unfeasible, operations as easy as they are in a small database.

Enhanced Statement Performance

The third general (potential) benefit of partitioning is in the area of enhanced statement (SELECT, INSERT, UPDATE, DELETE, MERGE) performance. We’ll take a look at two classes of statements—those that modify information and those that just read information—and discuss what benefits we might expect from partitioning in each case.

Parallel DML

Statements that modify data in the database may have the potential to perform parallel DML (PDML). During PDML, Oracle uses many threads or processes to perform your INSERT, UPDATE, DELETE, or MERGE instead of a single serial process. On a multi-CPU machine with plenty of I/O bandwidth, the potential increase in speed may be large for mass DML operations.
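As a quick sketch (the table name, degree of parallelism, and predicate here are purely hypothetical), parallel DML must be explicitly enabled at the session level before Oracle will consider it:
SQL> alter session enable parallel dml;
Session altered.
SQL> update /*+ parallel(sales, 8) */ sales
       set status = 'ARCHIVED'
     where sale_date < to_date('01/01/2020','dd/mm/yyyy');
SQL> commit;
Without the ALTER SESSION, the modification portion of such a statement runs serially, even if the query portion is parallelized.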

Note

We will cover parallel operations in more detail in Chapter 14.

Query Performance

In the area of strictly read query performance (SELECT statements), partitioning comes into play with two types of specialized operations:
  • Partition elimination: Some partitions of data are not considered in the processing of the query. We have already seen an example of partition elimination.

  • Parallel operations: Examples of this are parallel full table scans and parallel index range scans.

However, the benefit you can derive from these depends very much on the type of system you are using.

OLTP Systems

You should not look toward partitions as a way to massively improve query performance in an OLTP system. In fact, in a traditional OLTP system, you must apply partitioning with care so as not to negatively affect runtime performance. In a traditional OLTP system, most queries are expected to return virtually instantaneously, and most of the retrievals from the database are expected to be via very small index range scans. Therefore, the main performance benefits of partitioning listed previously would not come into play. Partition elimination is useful where you have full scans of large objects, because it allows you to avoid full scanning large pieces of an object. However, in an OLTP environment, you are not full scanning large objects (if you are, you have a serious design flaw). Even if you partition your indexes, any increase in performance achieved by scanning a smaller index will be minuscule—if you actually achieve an increase in speed at all. If some of your queries use an index and they cannot eliminate all but one partition from consideration, you may find your queries actually run slower after partitioning since you now have 5, 10, or 20 small indexes to probe, instead of 1 larger index. We will investigate this in much more detail later when we look at the types of partitioned indexes available to us.

As for parallel operations, as we’ll investigate in more detail in the next chapter, you do not want to do a parallel query in an OLTP system. You would reserve your use of parallel operations for the DBA to perform rebuilds, create indexes, gather statistics, and so on. The fact is that in an OLTP system, your queries should already be characterized by very fast index accesses, and partitioning will not speed that up very much, if at all. This does not mean that you should avoid partitioning for OLTP; it means that you shouldn’t expect partitioning to offer massive improvements in performance. Most OLTP applications are not able to take advantage of the times where partitioning is able to enhance query performance, but you can still benefit from the other possible partitioning benefits: administrative ease, higher availability, and reduced contention.

Data Warehouse Systems

In a data warehouse/decision support system, partitioning is not only a great administrative tool but something that will speed up processing. For example, you may have a large table on which you need to perform an ad hoc query. You always do the ad hoc query by sales quarter, as each sales quarter contains hundreds of thousands of records, and you have millions of online records. So, you want to query a relatively small slice of the entire data set, but it is not really feasible to index it based on the sales quarter. This index would point to hundreds of thousands of records, and doing the index range scan in this way would be terrible (refer to Chapter 11 for more details on this). A full table scan is called for to process many of your queries, but you end up having to scan millions of records, most of which won’t apply to your query. Using an intelligent partitioning scheme, you can segregate the data by quarter such that when you query the data for any given quarter, you will full scan only that quarter’s data. This is the best of all possible solutions.
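A minimal sketch of the idea, using a hypothetical SALES table range partitioned by quarter:
SQL> CREATE TABLE sales
     ( sale_date  date,
       amount     number
     )
     PARTITION BY RANGE (sale_date)
     ( PARTITION q1_2022 VALUES LESS THAN (to_date('01/04/2022','dd/mm/yyyy')),
       PARTITION q2_2022 VALUES LESS THAN (to_date('01/07/2022','dd/mm/yyyy')),
       PARTITION q3_2022 VALUES LESS THAN (to_date('01/10/2022','dd/mm/yyyy')),
       PARTITION q4_2022 VALUES LESS THAN (to_date('01/01/2023','dd/mm/yyyy'))
     );
SQL> select sum(amount)
       from sales
      where sale_date >= to_date('01/04/2022','dd/mm/yyyy')
        and sale_date <  to_date('01/07/2022','dd/mm/yyyy');
The WHERE clause allows the optimizer to prove that only partition Q2_2022 can contain qualifying rows, so the full scan touches one quarter’s worth of data instead of the entire table.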

In addition, in a data warehouse/decision support system environment, parallel query is used frequently. Here, operations such as parallel index range scans or parallel fast full index scans are not only meaningful but also beneficial to us. We want to maximize our use of all available resources, and parallel query is a way to do it. So, in this environment, partitioning stands a very good chance of speeding up processing.

Reduced Contention in an OLTP System

The last general benefit area for partitioning is potentially increasing concurrency by decreasing contention in an OLTP system. Partitions can be used to spread the modifications of a single table out over many physical partitions. The idea is if you have a segment experiencing high contention, turning it into many segments could have the side effect of reducing that contention proportionally.

For example, instead of having a single table segment with a single index segment, you might have 20 table partitions and 20 index partitions. It could be like having 20 tables instead of 1 (and 20 indexes instead of 1); hence, contention would be decreased for this shared resource during modifications.
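A minimal sketch of that idea, with hypothetical names and a power-of-two partition count (more on why a power of two shortly):
SQL> create table trades
     ( trade_id   number,
       trade_data varchar2(30)
     )
     partition by hash (trade_id)
     partitions 16;
SQL> create index trades_idx on trades(trade_id) local;
Inserts and updates are now spread over 16 table segments and 16 index segments instead of concentrating on a single segment of each type.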

Table Partitioning Schemes

There are currently nine methods by which you can partition tables in Oracle:
  • Range partitioning: You may specify ranges of data that should be stored together. For example, everything that has a timestamp within the month of Jan-2014 will be stored in partition 1, everything with a timestamp within Feb-2014 in partition 2, and so on. This is probably the most commonly used partitioning mechanism in Oracle.

  • Hash partitioning: You saw this in the first example in this chapter. A column (or columns) has a hash function applied to it, and the row will be placed into a partition according to the value of this hash.

  • List partitioning: You specify a discrete set of values, which determines the data that should be stored together. For example, you could specify that rows with a STATUS column value in ( 'A', 'M', 'Z' ) go into partition 1, those with a STATUS value in ( 'D', 'P', 'Q' ) go into partition 2, and so on.

  • Interval partitioning: This is very similar to range partitioning with the exception that the database itself can create new partitions as data arrives. With traditional range partitioning, the DBA was tasked with pre-creating partitions to hold every possible data value, for now and into the future. This typically meant that a DBA was tasked with creating partitions on a schedule—to hold next month’s or next week’s data. With interval partitioning, the database itself will create partitions as new data arrives that doesn’t fit into any existing partition based on a rule specified by the DBA.

  • Reference partitioning: This allows a child table in a parent/child relationship enforced by a foreign key to inherit the partitioning scheme of the parent table. This makes it possible to equipartition a child table with its parent table without having to denormalize the data model. In the past, a table could only be partitioned based on attributes it physically stored; reference partitioning in effect allows you to partition a table based on attributes from its parent table.

  • Interval reference partitioning: As the name implies, this is a combination of interval and reference partitioning. It allows for the automatic adding of partitions to parent/child reference partitioned tables.

  • Virtual column partitioning: This allows partitioning on an expression based on one or more existing columns of the table. The expression is stored as metadata only.

  • Composite partitioning: This is a combination of range, hash, and list partitioning. It allows you to first apply one partitioning scheme to some data and then, within each resulting partition, have that partition subdivided into subpartitions using some other partitioning scheme.

  • System partitioning: The application determines which partition a row is explicitly inserted into. This partitioning type has limited uses and won’t be covered in this chapter; we only mention it here to complete the list of partition types that Oracle supports. For more details on system partitioning, see the Oracle Database Data Cartridge Developer’s Guide.

In the following sections, we’ll look at the benefits of each type of partitioning and at the differences between them. We’ll also look at when to apply which schemes to different application types. This section is not intended to present a comprehensive demonstration of the syntax of partitioning and all of the available options. Rather, the examples are simple and illustrative and designed to give you an overview of how partitioning works and how the different types of partitioning are designed to function.

Note

For full details on partitioning syntax, I refer you to either the Oracle Database SQL Language Reference manual or to the Oracle Database Administrator’s Guide. Additionally, the Oracle Database VLDB and Partitioning Guide and Oracle Database Data Warehousing Guide are both excellent sources of information on the partitioning options and are must-reads for anyone planning to implement partitioning.

Range Partitioning

The first type we will look at is a range partitioned table . The following CREATE TABLE statement creates a range partitioned table using the column RANGE_KEY_COLUMN. All data with a RANGE_KEY_COLUMN strictly less than 01-JAN-2021 will be placed into the partition PART_1, and all data with a value strictly less than 01-JAN-2022 (and greater than or equal to 01-JAN-2021) will go into partition PART_2. Any data not satisfying either of those conditions (e.g., a row with a RANGE_KEY_COLUMN value of 01-JAN-2022 or greater) will fail upon insertion, as it cannot be mapped to a partition:
$ sqlplus eoda/foo@PDB1
SQL> CREATE TABLE range_example
    ( range_key_column date NOT NULL,
      data             varchar2(20)
    )
    PARTITION BY RANGE (range_key_column)
    ( PARTITION part_1 VALUES LESS THAN
           (to_date('01/01/2021','dd/mm/yyyy')),
      PARTITION part_2 VALUES LESS THAN
           (to_date('01/01/2022','dd/mm/yyyy'))
  );
Table created.
Note

We are using the date format DD/MM/YYYY in the CREATE TABLE statement to make this international. If we used a format of DD-MON-YYYY, then the CREATE TABLE would fail with ORA-01843: not a valid month if the abbreviation of January was not Jan on your system. The NLS_LANGUAGE setting would affect this. I have used the three-character month abbreviation in the text and inserts, however, to avoid any ambiguity as to which component is the day and which is the month.

Figure 13-1 shows that Oracle will inspect the value of the RANGE_KEY_COLUMN and, based on that value, insert it into one of the two partitions.
Figure 13-1

Range partition insert example

The rows inserted were specifically chosen with the goal of demonstrating that the partition range is strictly less than and not less than or equal to. We first insert the value 15-DEC-2020, which will definitely go into partition PART_1. We also insert a row with a date/time that is one second before 01-JAN-2021—that row will also go into partition PART_1 since that is less than 01-JAN-2021. However, the next insert of midnight on 01-JAN-2021 goes into partition PART_2 because that date/time is not strictly less than the partition range boundary for PART_1. The last row obviously belongs in partition PART_2 since it is greater than or equal to the partition range boundary for PART_1 and less than the partition range boundary for PART_2.
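In SQL, those four inserts would look something like the following (the DATA values themselves are immaterial):
SQL> insert into range_example ( range_key_column, data )
     values ( to_date( '15/12/2020', 'dd/mm/yyyy' ), 'application data...' );
SQL> insert into range_example ( range_key_column, data )
     values ( to_date( '31/12/2020 23:59:59', 'dd/mm/yyyy hh24:mi:ss' ), 'application data...' );
SQL> insert into range_example ( range_key_column, data )
     values ( to_date( '01/01/2021', 'dd/mm/yyyy' ), 'application data...' );
SQL> insert into range_example ( range_key_column, data )
     values ( to_date( '31/12/2021 23:59:59', 'dd/mm/yyyy hh24:mi:ss' ), 'application data...' );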

We can confirm that this is the case by performing SELECT statements from the individual partitions:
SQL> select to_char(range_key_column,'dd-mon-yyyy hh24:mi:ss')
          from range_example partition (part_1);
TO_CHAR(RANGE_KEY_COLUMN,'DD-
-----------------------------
15-dec-2020 00:00:00
31-dec-2020 23:59:59
SQL> select to_char(range_key_column,'dd-mon-yyyy hh24:mi:ss')
      from range_example partition (part_2);
TO_CHAR(RANGE_KEY_COLUMN,'DD-
-----------------------------
01-jan-2021 00:00:00
31-dec-2021 23:59:59
You might be wondering what would happen if you inserted a date that fell outside of the upper bound. The answer is that Oracle would raise an error:
SQL> insert into range_example
    ( range_key_column, data )
    values
    ( to_date( '01-jan-2022 00:00:00',
               'dd-mon-yyyy hh24:mi:ss' ),
      'application data...' );
insert into range_example
            *
ERROR at line 1:
ORA-14400: inserted partition key does not map to any partition
There are two approaches to the preceding situation: one is to use interval partitioning, described later; the other is to use a catch-all partition, which we’ll demonstrate now. Suppose you want to segregate 2020 and 2021 dates into their separate partitions as we have, but you want all other dates to go into a third partition. With range partitioning, you can do this using the MAXVALUE clause, which looks like this:
SQL> drop table range_example purge;
SQL> CREATE TABLE range_example
    ( range_key_column date,
      data             varchar2(20)
    )
    PARTITION BY RANGE (range_key_column)
    ( PARTITION part_1 VALUES LESS THAN
           (to_date('01/01/2021','dd/mm/yyyy')),
      PARTITION part_2 VALUES LESS THAN
           (to_date('01/01/2022','dd/mm/yyyy')),
    PARTITION part_3 VALUES LESS THAN
         (MAXVALUE)
  );
Table created.

Now when you insert a row into that table, it will go into one of the three partitions—no row will be rejected, since partition PART_3 can take any value of RANGE_KEY_COLUMN that doesn’t go into PART_1 or PART_2 (even null values of the RANGE_KEY_COLUMN will be inserted into this new partition).

Hash Partitioning

When hash partitioning a table, Oracle will apply a hash function to the partition key to determine in which of the N partitions the data should be placed. Oracle recommends that N be a number that is a power of 2 (2, 4, 8, 16, and so on) to achieve the best overall distribution, and we’ll see shortly that this is absolutely good advice.

How Hash Partitioning Works

Hash partitioning is designed to achieve a good spread of data across many different devices (disks) or just to segregate data out into more manageable chunks. The hash key chosen for a table should be a column or set of columns that are unique or at least have as many distinct values as possible to provide for a good spread of the rows across partitions. If you choose a column that has only four values, and you use two partitions, then all the rows could quite easily end up hashing to the same partition, obviating the goal of partitioning in the first place!

We will create a hash partitioned table with two partitions in this case. We will use a column named HASH_KEY_COLUMN as our partition key. Oracle will take the value in this column and determine the partition this row will be stored in by hashing that value:
$ sqlplus eoda/foo@PDB1
SQL> CREATE TABLE hash_example
    ( hash_key_column   date,
      data              varchar2(20)
    )
    PARTITION BY HASH (hash_key_column)
    ( partition part_1 tablespace p1,
      partition part_2 tablespace p2
    );
Table created.
Figure 13-2 shows that Oracle will inspect the value in the HASH_KEY_COLUMN, hash it, and determine which of the two partitions a given row will appear in.
Figure 13-2

Hash partition insert example

As noted earlier, hash partitioning gives you no control over which partition a row ends up in. Oracle applies the hash function, and the outcome of that hash determines where the row goes. If you want a specific row to go into partition PART_1 for whatever reason, you should not—in fact, you cannot—use hash partitioning. The row will go into whatever partition the hash function says to put it in. If you change the number of hash partitions, the data will be redistributed over all of the partitions (adding or removing a partition to a hash partitioned table will cause all of the data to be rewritten, as every row may now belong in a different partition).

Hash partitioning is most useful when you have a large table, such as the one shown in the “Reduced Administrative Burden” section, and you would like to divide and conquer it. Rather than manage one large table, you would like to have 8 or 16 smaller tables to manage. Hash partitioning is also useful to increase availability to some degree, as demonstrated in the “Increased Availability” section; the temporary loss of a single hash partition permits access to all of the remaining partitions. Some users may be affected, but there is a good chance that many will not be. Additionally, the unit of recovery is much smaller now. You do not have a single large table to restore and recover; you have a fraction of that table to recover. Lastly, hash partitioning is useful in high update contention environments, as mentioned in the “Reduced Contention in an OLTP System” section. Instead of having a single hot segment, we can hash partition a segment into 16 pieces, each of which is now receiving modifications.

Hash Partition Using Powers of Two

I mentioned earlier that the number of partitions should be a power of two. This is easily observed to be true. To demonstrate, we’ll set up a stored procedure to automate the creation of a hash partitioned table with N partitions (N will be a parameter). This procedure will construct a dynamic query to retrieve the counts of rows by partition and then display the counts and a simple histogram of the counts by partition. Lastly, it will open this query and let us see the results. This procedure starts with the hash table creation. We will use a table named T:
SQL> create or replace
    procedure hash_proc
              ( p_nhash in number,
                p_cursor out sys_refcursor )
    authid current_user
    as
        l_text     long;
        l_template long :=
               'select $POS$ oc, ''p$POS$'' pname, count(*) cnt ' ||
               'from t partition ( $PNAME$ ) union all ';
      table_or_view_does_not_exist exception;
      pragma exception_init( table_or_view_does_not_exist, -942 );
  begin
      begin
          execute immediate 'drop table t';
      exception when table_or_view_does_not_exist
          then null;
      end;
      execute immediate '
      CREATE TABLE t ( id )
      partition by hash(id)
      partitions ' || p_nhash || '
      as
      select rownum
        from all_objects';
Next, we will dynamically construct a query to retrieve the count of rows by partition. It does this using the template query defined earlier. For each partition, we’ll gather the count using the partition-extended table name and union all of the counts together:
      for x in ( select partition_name pname,
                        PARTITION_POSITION pos
                   from user_tab_partitions
                  where table_name = 'T'
                  order by partition_position )
      loop
          l_text := l_text ||
                    replace(
                    replace(l_template,
                          '$POS$', x.pos),
                          '$PNAME$', x.pname );
      end loop;
Now, we’ll take that query and select out the partition position (PNAME) and the count of rows in that partition (CNT). Using RPAD, we’ll construct a rather rudimentary but effective histogram:
      open p_cursor for
         'select pname, cnt,
            substr( rpad(''*'',30*round( cnt/max(cnt)over(),2),''*''),1,30) hg
            from (' || substr( l_text, 1, length(l_text)-11 ) || ')
           order by oc';
  end;
  /
Procedure created.
If we run this with an input of 4, for four hash partitions, we would expect to see output similar to the following:
SQL> variable x refcursor
SQL> set autoprint on
SQL> exec hash_proc( 4, :x );
PL/SQL procedure successfully completed.
PN        CNT HG
-- ---------- ------------------------------
p1      12141 *****************************
p2      12178 *****************************
p3      12417 ******************************
p4      12105 *****************************
The simple histogram depicted shows a nice, even distribution of data over each of the four partitions. Each has close to the same number of rows in it. However, if we simply go from four to five hash partitions, we’ll see the following:
SQL> exec hash_proc( 5, :x );
PL/SQL procedure successfully completed.
PN        CNT HG
-- ---------- ------------------------------
p1       6102 **************
p2      12180 *****************************
p3      12419 ******************************
p4      12106 *****************************
p5       6040 **************
This histogram points out that the first and last partitions have just half as many rows as the interior partitions. The data is not very evenly distributed at all. We’ll see the trend continue for six and seven hash partitions:
SQL> exec hash_proc( 6, :x );
PL/SQL procedure successfully completed.
PN        CNT HG
-- ---------- ------------------------------
p1       6104 **************
p2       6175 ***************
p3      12420 ******************************
p4      12106 *****************************
p5       6040 **************
p6       6009 **************
6 rows selected.
SQL> exec hash_proc( 7, :x );
PL/SQL procedure successfully completed.
PN        CNT HG
-- ---------- ------------------------------
p1       6105 ***************
p2       6176 ***************
p3       6161 ***************
p4      12106 ******************************
p5       6041 ***************
p6       6010 ***************
p7       6263 ***************
7 rows selected.
As soon as we get back to a number of hash partitions that is a power of two, we achieve the goal of even distribution once again:
SQL> exec hash_proc( 8, :x );
PL/SQL procedure successfully completed.
PN        CNT HG
-- ---------- ------------------------------
p1       6106 *****************************
p2       6178 *****************************
p3       6163 *****************************
p4       6019 ****************************
p5       6042 ****************************
p6       6010 ****************************
p7       6264 ******************************
p8       6089 *****************************
8 rows selected.

If we continue this experiment up to 16 partitions, we would see the same effects for the 9th through the 15th partitions—a skewing of the data to the interior partitions, away from the edges—and then upon hitting the 16th partition, you would see a flattening out again. The same would be true again up to 32 partitions, and then 64, and so on. This example just points out the importance of using a power of two as the number of hash partitions.

List Partitioning

List partitioning provides the ability to specify in which partition a row will reside, based on discrete lists of values. It is often useful to be able to partition by some code, such as a state or region code. For example, we might want to pull together in a single partition all records for people in the states of Maine (ME), New Hampshire (NH), Vermont (VT), and Massachusetts (MA), since those states are located next to or near each other, and our application queries data by geographic region. Similarly, we might want to group together Connecticut (CT), Rhode Island (RI), and New York (NY).

We can’t use a range partition, since the range for the first partition would be MA through VT, and the second range would be CT through RI. Those ranges overlap. We can’t use hash partitioning since we can’t control which partition any given row goes into; the built-in hash function provided by Oracle does that. With list partitioning, we can accomplish this custom partitioning scheme easily:
$ sqlplus eoda/foo@PDB1
SQL> create table list_example
    ( state_cd   varchar2(2),
      data       varchar2(20)
    )
    partition by list(state_cd)
    ( partition part_1 values ( 'ME', 'NH', 'VT', 'MA' ),
      partition part_2 values ( 'CT', 'RI', 'NY' )
    );
Table created.
Figure 13-3 shows that Oracle will inspect the STATE_CD column and, based on its value, place the row into the correct partition.
Figure 13-3

List partition insert example

As we saw for range partitioning, if we try to insert a value that isn’t specified in the list partition, Oracle will raise an appropriate error back to the client application. In other words, a list partitioned table without a DEFAULT partition will implicitly impose a constraint much like a check constraint on the table:
SQL> insert into list_example values ( 'VA', 'data' );
insert into list_example values ( 'VA', 'data' )
            *
ERROR at line 1:
ORA-14400: inserted partition key does not map to any partition
If we want to segregate these seven states into their separate partitions, as we have, but have all remaining state codes (or, in fact, any other row that happens to be inserted that doesn’t have one of these seven codes) go into a third partition, then we can use the VALUES ( DEFAULT ) clause. Here, we’ll alter the table to add this partition (we could use this in the CREATE TABLE statement as well):
SQL> alter table list_example add partition part_3 values ( DEFAULT );
Table altered.
SQL> insert into list_example values ( 'VA', 'data' );
1 row created.
All values that are not explicitly in our list of values will go here. A word of caution on the use of DEFAULT: Once a list partitioned table has a DEFAULT partition, you cannot add any more partitions to it, so
SQL> alter table list_example add partition part_4 values( 'CA', 'NM' );
alter table list_example
            *
ERROR at line 1:
ORA-14323: cannot add partition when DEFAULT partition exists

We would have to remove the DEFAULT partition, then add PART_4, and then put the DEFAULT partition back. The reason behind this is that the DEFAULT partition could have had rows with the list partition key value of CA or NM—they would not belong in the DEFAULT partition after adding PART_4.
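An alternative that preserves the rows already sitting in the DEFAULT partition is to split it rather than drop and re-create it. A minimal sketch against the same LIST_EXAMPLE table might look like this:
SQL> alter table list_example
       split partition part_3 values ( 'CA', 'NM' )
       into ( partition part_4, partition part_3 );
Any CA or NM rows that were in the old DEFAULT partition are moved into PART_4; everything else stays in PART_3, which remains the DEFAULT partition.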

Interval Partitioning

Interval partitioning is very similar to range partitioning described previously—in fact, it starts with a range partitioned table but adds a rule (the interval) to the definition so the database knows how to add partitions in the future. The goal of interval partitioning is to create new partitions for data if, and only if, data exists for a given partition, and only when that data arrives in the database. In other words, it removes the need to pre-create partitions; the arriving data itself causes the partition to be created as it is inserted. To use interval partitioning, you start with a range partitioned table without a MAXVALUE partition and specify an interval that the database adds to the table’s upper bound (its highest value) to create each new range. The table must be range partitioned on a single column that permits adding a NUMBER or INTERVAL type to it (e.g., a table partitioned by a VARCHAR2 field cannot be interval partitioned; there is nothing you can add to a VARCHAR2). You can use interval partitioning with any suitable existing range partitioned table; that is, you can ALTER an existing range partitioned table to be interval partitioned, or you can create one with the CREATE TABLE command.
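Converting a suitable existing range partitioned table is a one-line DDL. As a minimal sketch (reusing the RANGE_EXAMPLE table from earlier and assuming its MAXVALUE catch-all partition has not been added, since an interval partitioned table cannot have one):
SQL> alter table range_example set interval (numtoyminterval(1,'year'));
From that point on, inserts above the highest existing range boundary cause the database to create yearly partitions automatically; ALTER TABLE range_example SET INTERVAL () turns the behavior back off.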

For example, suppose you had a range partitioned table that said “anything strictly less than 01-JAN-2021” (data in the year 2020 and before) goes into partition P1—and that was it. So it had one partition for all data in the year 2020 and before. If you attempted to insert data for the year 2022 into the table, the insert would fail as demonstrated previously in the section on range partitioning. With interval partitioning, you can create a table and specify both a range (strictly less than 01-JAN-2021) and an interval—say one month in duration—and the database would create monthly partitions (a partition capable of holding exactly one month’s worth of data) as the data arrived. The database would not pre-create all possible partitions because that would not be practical. But, as each row arrived, the database would see whether the partition for the month in question existed. The database would create the partition if needed. Here is an example of the syntax:
$ sqlplus eoda/foo@PDB1
SQL> create table audit_trail
    ( ts    timestamp,
      data  varchar2(30)
    )
    partition by range(ts)
    interval (numtoyminterval(1,'month'))
    store in (users, example )
    (partition p0 values less than
     (to_date('01-01-1900','dd-mm-yyyy'))
  );
Table created.
Note

You might have a question in your mind, especially if you just finished reading the previous chapter on datatypes. You can see we are partitioning by a TIMESTAMP, and we are adding an INTERVAL of one month to it. In Chapter 12, we saw how adding an INTERVAL of one month to a TIMESTAMP that fell on January 31 would raise an error, since there is no February 31. Will the same issue happen with interval partitioning? The answer is yes; if you attempt to use a date such as '29-01-1990' (any day of the month after 28 would suffice), you will receive an error "ORA-14767: Cannot specify this interval with existing high bounds". The database will not permit you to use a boundary value that is not safe to add the interval to.

On lines 8 and 9, you see the range partitioning scheme for this table; it starts with a single empty partition that would contain any data prior to 01-JAN-1900. Presumably, since the table holds an audit trail, this partition will remain small and empty forever. It is a mandatory partition and is referred to as the transitional partition. All data with a value strictly less than this transitional partition’s high value will be range partitioned, using traditional range partitioning. Only data created above the transitional partition’s high value will use interval partitioning. If we query the data dictionary, we can see what has been created so far:
SQL> select a.partition_name, a.tablespace_name, a.high_value,
           decode( a.interval, 'YES', b.interval ) interval
     from user_tab_partitions a, user_part_tables b
     where a.table_name = 'AUDIT_TRAIL'
     and a.table_name = b.table_name
     order by a.partition_position;
PARTITION_ TABLESPACE HIGH_VALUE                      INTERVAL
---------- ---------- ------------------------------- --------------------
P0         USERS      TIMESTAMP' 1900-01-01 00:00:00'

So far, we have just the single partition, and it is not an INTERVAL partition, as shown by the empty INTERVAL column. Rather, it is just a regular RANGE partition right now; it will hold anything strictly less than 01-JAN-1900.

Looking at the CREATE TABLE statement again, we can see the new interval partitioning–specific information on lines 6 and 7:
interval (numtoyminterval(1,'month'))
store in (users, example )

On line 6, we have the actual interval specification of NUMTOYMINTERVAL (1,'MONTH'). Our goal was to store monthly partitions—a new partition for each month’s worth of data—a very common goal. By using a date that is safe to add a month to (refer to Chapter 12 for why adding a month to a timestamp can be error-prone in some cases)—the first of the month—we can have the database create monthly partitions on the fly, as data arrives, for us.

On line 7, we have specifics: store in (users,example). This allows us to tell the database where to create these new partitions—what tablespaces to use. As the database figures out what partitions it wants to create, it uses this list to decide what tablespace to create each partition in. This allows the DBA to control the maximum desired tablespace size: they might not want a single 500GB tablespace, but they would be comfortable with ten 50GB tablespaces. In that case, they would set up ten tablespaces and allow the database to use all ten to create partitions. Let’s insert a row of data now and see what happens:
SQL> insert into audit_trail (ts,data) values ( to_timestamp('27-feb-2020','dd-mon-yyyy'), 'xx' );
1 row created.
SQL> select a.partition_name, a.tablespace_name, a.high_value,
           decode( a.interval, 'YES', b.interval ) interval
      from user_tab_partitions a, user_part_tables b
     where a.table_name = 'AUDIT_TRAIL'
       and a.table_name = b.table_name
     order by a.partition_position;
PARTITION_ TABLESPACE HIGH_VALUE                      INTERVAL
---------- ---------- ------------------------------- ---------------------
P0         USERS      TIMESTAMP' 1900-01-01 00:00:00'
SYS_P1623  USERS      TIMESTAMP' 2020-03-01 00:00:00' NUMTOYMINTERVAL(1,'MONTH')
If you recall from the Range Partitioning section, you would expect that INSERT to fail. However, since we are using interval partitioning, it succeeds and, in fact, creates a new partition SYS_P1623. The HIGH_VALUE for this partition is 01-MAR-2020, which, if we were using range partitioning, would imply anything strictly less than 01-MAR-2020 and greater than or equal to 01-JAN-1900 would go into this partition, but since we have an interval, the rules are different. When the interval is set, the range for this partition is anything greater than or equal to HIGH_VALUE - INTERVAL and strictly less than the HIGH_VALUE. So, this partition would have the range of
SQL> select TIMESTAMP' 2020-03-01 00:00:00'-NUMTOYMINTERVAL(1,'MONTH') greater_than_eq_to,
         TIMESTAMP' 2020-03-01 00:00:00' strictly_less_than
      from dual;
GREATER_THAN_EQ_TO               STRICTLY_LESS_THAN
-------------------------------- --------------------------------
01-FEB-20 12.00.00.000000000 AM  01-MAR-20 12.00.00.000000000 AM
That is—all of the data for the month of February 2020. If we insert another row in some other month, as follows, we can see that another partition, SYS_P1624, is added that contains all of the data for the month of June 2020:
SQL> insert into audit_trail (ts,data) values ( to_date('25-jun-2020','dd-mon-yyyy'), 'xx' );
1 row created.
SQL> select a.partition_name, a.tablespace_name, a.high_value,
           decode( a.interval, 'YES', b.interval ) interval
     from user_tab_partitions a, user_part_tables b
     where a.table_name = 'AUDIT_TRAIL'
     and a.table_name = b.table_name
     order by a.partition_position;
PARTITION_ TABLESPACE HIGH_VALUE                      INTERVAL
---------- ---------- ------------------------------- --------------------
P0         USERS      TIMESTAMP' 1900-01-01 00:00:00'
SYS_P1623  USERS      TIMESTAMP' 2020-03-01 00:00:00' NUMTOYMINTERVAL(1,'MONTH')
SYS_P1624  USERS      TIMESTAMP' 2020-07-01 00:00:00' NUMTOYMINTERVAL(1,'MONTH')
You might be looking at this output and asking why everything is in the USERS tablespace. We clearly asked for the data to be spread out over the USERS tablespace and the EXAMPLE tablespace, so why is everything in a single tablespace? It has to do with the fact that when the database is figuring out what partition the data goes into, it is also computing which tablespace it would go into. Since each of our partitions is an even number of months away from each other and we are using just two tablespaces, we end up using the same tablespace over and over. If we only loaded “every other month” into this table, we would end up using only a single tablespace. We can see that the EXAMPLE tablespace can be used by adding some row that is an “odd” number of months away from our existing data:
SQL> insert into audit_trail (ts,data) values ( to_date('15-mar-2020','dd-mon-yyyy'), 'xx' );
1 row created.
SQL> select a.partition_name, a.tablespace_name, a.high_value,
           decode( a.interval, 'YES', b.interval ) interval
      from user_tab_partitions a, user_part_tables b
     where a.table_name = 'AUDIT_TRAIL'
       and a.table_name = b.table_name
     order by a.partition_position;
PARTITION_ TABLESPACE HIGH_VALUE                      INTERVAL
---------- ---------- ------------------------------- --------------------
P0         USERS      TIMESTAMP' 1900-01-01 00:00:00'
SYS_P1623  USERS      TIMESTAMP' 2020-03-01 00:00:00' NUMTOYMINTERVAL(1,'MONTH')
SYS_P1625  EXAMPLE    TIMESTAMP' 2020-04-01 00:00:00' NUMTOYMINTERVAL(1,'MONTH')
SYS_P1624  USERS      TIMESTAMP' 2020-07-01 00:00:00' NUMTOYMINTERVAL(1,'MONTH')

Now we have used the EXAMPLE tablespace. This new partition was slid in between the two existing partitions and will contain all of our March 2020 data.

You might be asking, “What happens if I roll back at this point?” If we were to roll back, it should be obvious that the AUDIT_TRAIL rows we just inserted would go away:
SQL> select * from audit_trail;
TS                                  DATA
----------------------------------- ------------------------------
27-FEB-20 12.00.00.000000 AM        xx
15-MAR-20 12.00.00.000000 AM        xx
25-JUN-20 12.00.00.000000 AM        xx
SQL> rollback;
Rollback complete.
SQL> select * from audit_trail;
no rows selected
But what isn’t clear immediately is what would happen to the partitions we added: do they stay or will they go away as well? A quick query will verify that they will stay:
SQL> select a.partition_name, a.tablespace_name, a.high_value,
           decode( a.interval, 'YES', b.interval ) interval
     from user_tab_partitions a, user_part_tables b
     where a.table_name = 'AUDIT_TRAIL'
     and a.table_name = b.table_name
     order by a.partition_position;
PARTITION_ TABLESPACE HIGH_VALUE                      INTERVAL
---------- ---------- ------------------------------- ---------------------
P0         USERS      TIMESTAMP' 1900-01-01 00:00:00'
SYS_P1623  USERS      TIMESTAMP' 2020-03-01 00:00:00' NUMTOYMINTERVAL(1,'MONTH')
SYS_P1625  EXAMPLE    TIMESTAMP' 2020-04-01 00:00:00' NUMTOYMINTERVAL(1,'MONTH')
SYS_P1624  USERS      TIMESTAMP' 2020-07-01 00:00:00' NUMTOYMINTERVAL(1,'MONTH')

As soon as they are created, they are committed and visible. These partitions are created using a recursive transaction, a transaction executed separate and distinct from any transaction you might already be performing. When we went to insert the row and the database discovered that the partition we needed did not exist, the database immediately started a new transaction, updated the data dictionary to reflect the new partition’s existence, and committed its work. It must do this, or there would be severe contention (serialization) on many inserts as other transactions would have to wait for us to commit to be able to see this new partition. Therefore, this DDL is done outside of your existing transaction, and the partitions will persist.

You might have noticed that the database names the partition for us; SYS_P1625 is the name of the newest partition. The names are neither sortable nor very meaningful in the sense most people would be used to. They show the order in which the partitions were added to the table (although you cannot rely on that always being true; it is subject to change) but not much else. Normally, in a range partitioned table, the DBA would have named the partition using some naming scheme and in most cases would have made the partition names sortable. For example, the February data would be in a partition named PART_2020_02 (using a format of PART_yyyy_mm), March would be in PART_2020_03, and so on. With interval partitioning, you have no control over the partition names as they are created, but you can easily rename them afterward if you like. For example, we could query out the HIGH_VALUE string and, using dynamic SQL, convert it into nicely formatted, meaningful names. We can do this because we understand how we’d like the names formatted; the database does not. For example:
SQL> declare
        l_str varchar2(4000);
    begin
        for x in ( select a.partition_name, a.tablespace_name, a.high_value
                     from user_tab_partitions a
                    where a.table_name = 'AUDIT_TRAIL'
                      and a.interval = 'YES'
                      and a.partition_name like 'SYS\_P%' escape '\' )
        loop
          execute immediate
          'select to_char( ' || x.high_value ||
              '-numtodsinterval(1,''second''), ''"PART_"yyyy_mm'' ) from dual'
             into l_str;
          execute immediate
          'alter table audit_trail rename partition "' ||
              x.partition_name || '" to "' || l_str || '"';
      end loop;
  end;
 /
PL/SQL procedure successfully completed.
So, what we’ve done is take the HIGH_VALUE and subtract one second from it. We know that the HIGH_VALUE represents the strictly less than value, so one second before its value would be a value in the range. Once we have that, we applied the format "PART_"yyyy_mm to the resulting TIMESTAMP and get a string such as PART_2020_03 for March 2020. We use that string in a rename command, and now our data dictionary looks like this:
SQL> select a.partition_name, a.tablespace_name, a.high_value,
           decode( a.interval, 'YES', b.interval ) interval
      from user_tab_partitions a, user_part_tables b
     where a.table_name = 'AUDIT_TRAIL'
       and a.table_name = b.table_name
     order by a.partition_position;
PARTITION_NAME TABLESPACE HIGH_VALUE                      INTERVAL
-------------- ---------- ------------------------------- -----------------
P0             USERS      TIMESTAMP' 1900-01-01 00:00:00'
PART_2020_02   USERS      TIMESTAMP' 2020-03-01 00:00:00' NUMTOYMINTERVAL(1,'MONTH')
PART_2020_03   EXAMPLE    TIMESTAMP' 2020-04-01 00:00:00' NUMTOYMINTERVAL(1,'MONTH')
PART_2020_06   USERS      TIMESTAMP' 2020-07-01 00:00:00' NUMTOYMINTERVAL(1,'MONTH')

We would just run that script every now and then to rename any newly added partitions to keep the nice naming convention in place. Bear in mind, to avoid any SQL injection issues (we are using string concatenation, not bind variables; we cannot use bind variables in DDL), we would want to keep this script as an anonymous block or as an invoker’s rights routine if we decide to make it a stored procedure. That will prevent others from running SQL in our schema as if they were us, which could be a disaster.
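
If we did decide to package that rename logic as a stored procedure, a minimal sketch might look like the following (the procedure name RENAME_AUDIT_PARTITIONS is ours; the AUTHID CURRENT_USER clause is what makes it an invoker's rights routine, so callers operate only with their own privileges):
SQL> create or replace procedure rename_audit_partitions
        authid current_user  -- invoker's rights: runs with the caller's privileges
     as
        l_str varchar2(4000);
     begin
        for x in ( select partition_name, high_value
                     from user_tab_partitions
                    where table_name = 'AUDIT_TRAIL'
                      and interval = 'YES'
                      and partition_name like 'SYS\_P%' escape '\' )
        loop
          execute immediate
          'select to_char( ' || x.high_value ||
              '-numtodsinterval(1,''second''), ''"PART_"yyyy_mm'' ) from dual'
             into l_str;
          execute immediate
          'alter table audit_trail rename partition "' ||
              x.partition_name || '" to "' || l_str || '"';
        end loop;
     end;
     /
Procedure created.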

Reference Partitioning

Reference partitioning addresses the issue of parent/child equipartitioning; that is, when you need the child table to be partitioned in such a manner that each child table partition has a one-to-one relationship with a parent table partition. This is important in situations such as a data warehouse where you want to keep a specific amount of data online (say the last five years’ worth of ORDER information) and need to ensure the related child data (the ORDER_LINE_ITEMS data) is online as well. In this classic example, the ORDERS table would typically have a column ORDER_DATE, making it easy to partition by month and thus keep the last five years of data online. As time advances, you would just have next month’s partition available for loading, and you would drop the oldest partition. However, when you consider the ORDER_LINE_ITEMS table, you can see you would have a problem. It does not have the ORDER_DATE column, and there is nothing else in the ORDER_LINE_ITEMS table to partition it by; therefore, there is no easy way to purge old information from it or load new information into it.

In the past, prior to reference partitioning, developers would have to denormalize the data, in effect copying the ORDER_DATE attribute from the parent table ORDERS into the child ORDER_LINE_ITEMS table. This presented the typical problems of data redundancy: increased storage overhead, increased data loading resources, cascading update issues (if you modify the parent, you have to ensure you update all copies of the parent data), and so on. Additionally, if you enabled foreign key constraints in the database (as you should), you would discover that you lost the ability to truncate or drop old partitions in the parent table. For example, let’s set up the conventional ORDERS and ORDER_LINE_ITEMS tables starting with the ORDERS table:
 $ sqlplus eoda/foo@PDB1
SQL> create table orders
    (
      order#      number primary key,
      order_date  date,
      data       varchar2(30)
    )
    enable row movement
    PARTITION BY RANGE (order_date)
    (
    PARTITION part_2020 VALUES LESS THAN (to_date('01-01-2021','dd-mm-yyyy')) ,
    PARTITION part_2021 VALUES LESS THAN (to_date('01-01-2022','dd-mm-yyyy'))
  );
Table created.
SQL> insert into orders values (1, to_date( '01-jun-2020', 'dd-mon-yyyy' ), 'xxx' );
1 row created.
SQL> insert into orders values (2, to_date( '01-jun-2021', 'dd-mon-yyyy' ), 'xxx' );
1 row created.
And now we’ll create the ORDER_LINE_ITEMS table—with a bit of data pointing to the ORDERS table:
SQL> create table order_line_items
    (
      order#      number,
      line#       number,
      order_date  date, -- manually copied from ORDERS!
      data       varchar2(30),
      constraint c1_pk primary key(order#,line#),
      constraint c1_fk_p foreign key(order#) references orders
    )
  enable row movement
  PARTITION BY RANGE (order_date)
  (
    PARTITION part_2020 VALUES LESS THAN (to_date('01-01-2021','dd-mm-yyyy')) ,
    PARTITION part_2021 VALUES LESS THAN (to_date('01-01-2022','dd-mm-yyyy'))
  );
Table created.
SQL> insert into order_line_items values ( 1, 1, to_date( '01-jun-2020', 'dd-mon-yyyy' ), 'yyy' );
1 row created.
SQL> insert into order_line_items values ( 2, 1, to_date( '01-jun-2021', 'dd-mon-yyyy' ), 'yyy' );
1 row created.
Now, if we were to drop the ORDER_LINE_ITEMS partition containing 2020 data, you know and I know that the corresponding ORDERS partition for 2020 could be dropped as well, without violating the referential integrity constraint. You and I know it, but the database is not aware of that fact:
SQL> alter table order_line_items drop partition part_2020;
Table altered.
SQL> alter table orders drop partition part_2020;
alter table orders drop partition part_2020
*
ERROR at line 1:
ORA-02266: unique/primary keys in table referenced by enabled foreign keys

So, not only is the approach of denormalizing the data cumbersome, resource intensive, and potentially damaging to our data integrity, it prevents us from doing something we frequently need to do when administering partitioned tables: purging old information.

Enter reference partitioning. With reference partitioning, a child table will inherit the partitioning scheme of its parent table without having to denormalize the partitioning key, and it allows the database to understand that the child table is equipartitioned with the parent table. That is, we’ll be able to drop or truncate a parent table partition and have the corresponding child table partition dropped or truncated automatically.

The simple syntax to reimplement our previous example could be as follows. We’ll reuse the existing parent table ORDERS and just truncate that table:
SQL> drop table order_line_items cascade constraints;
Table dropped.
SQL> truncate table orders;
Table truncated.
SQL> insert into orders values ( 1, to_date( '01-jun-2020', 'dd-mon-yyyy' ), 'xxx' );
1 row created.
SQL> insert into orders values ( 2, to_date( '01-jun-2021', 'dd-mon-yyyy' ), 'xxx' );
1 row created.
And create a new child table:
SQL> create table order_line_items
    (
      order#      number,
      line#       number,
      data       varchar2(30),
      constraint c1_pk primary key(order#,line#),
      constraint c1_fk_p foreign key(order#) references orders
    )
    enable row movement
  partition by reference(c1_fk_p);
Table created.
SQL> insert into order_line_items values ( 1, 1, 'yyy' );
1 row created.
SQL> insert into order_line_items values ( 2, 1, 'yyy' );
1 row created.
The magic is on the last line of the CREATE TABLE statement. Here, we replaced the range partitioning clause with PARTITION BY REFERENCE. This clause names the foreign key constraint the database uses to discover what our partitioning scheme will be. Here, we see the foreign key is to the ORDERS table—the database read the structure of the ORDERS table and determined that it had two partitions—therefore, our child table will have two partitions. In fact, if we query the data dictionary right now, we can see that the two tables have the exact same partitioning structure:
SQL> select table_name, partition_name
      from user_tab_partitions
     where table_name in ( 'ORDERS', 'ORDER_LINE_ITEMS' )
     order by table_name, partition_name;
TABLE_NAME           PARTITION_NAME
-------------------- --------------------
ORDERS               PART_2020
ORDERS               PART_2021
ORDER_LINE_ITEMS     PART_2020
ORDER_LINE_ITEMS     PART_2021
Further, since the database understands these two tables are related, we can drop the parent table partition and have it automatically clean up the related child table partitions (since the child inherits from the parent, any alteration of the parent’s partition structure cascades down):
SQL> alter table orders drop partition part_2020 update global indexes;
Table altered.
SQL> select table_name, partition_name
      from user_tab_partitions
     where table_name in ( 'ORDERS', 'ORDER_LINE_ITEMS' )
     order by table_name, partition_name;
TABLE_NAME           PARTITION_NAME
-------------------- --------------------
ORDERS               PART_2021
ORDER_LINE_ITEMS     PART_2021
So, the DROP we were prevented from performing before is now permitted, and it cascades to the child table automatically. Further, if we ADD a partition, as follows, we can see that that operation is cascaded as well; there will be a one-to-one parity between the parent and the child:
SQL> alter table orders add partition
    part_2022 values less than
    (to_date( '01-01-2023', 'dd-mm-yyyy' ));
Table altered.
SQL> select table_name, partition_name
      from user_tab_partitions
     where table_name in ( 'ORDERS', 'ORDER_LINE_ITEMS' )
     order by table_name, partition_name;
TABLE_NAME           PARTITION_NAME
-------------------- ---------------
ORDERS               PART_2021
ORDERS               PART_2022
ORDER_LINE_ITEMS     PART_2021
ORDER_LINE_ITEMS     PART_2022

A part of the preceding CREATE TABLE statement that we did not discuss is the ENABLE ROW MOVEMENT clause. In short, this clause allows an UPDATE to change the partition key value in such a way that the row must move from its current partition into some other partition.

Now, since we defined our parent table originally as permitting row movement, we were forced to define all of our child tables (and their children and so on) as having that capability as well, for if the parent row moves and we are using reference partitioning, we know the child row(s) must move as well. For example:
SQL> select '2021', count(*) from order_line_items partition(part_2021)
    union all
    select '2022', count(*) from order_line_items partition(part_2022);
'202   COUNT(*)
---- ----------
2021          1
2022          0
We can see that right now our data in the child table ORDER_LINE_ITEMS is in the 2021 partition. By performing a simple update against the parent ORDERS table, as follows, we can see our data moved—in the child table:
SQL> update orders set order_date = add_months(order_date,12);
1 row updated.
SQL> select '2021', count(*) from order_line_items partition(part_2021)
    union all
    select '2022', count(*) from order_line_items partition(part_2022);
'202   COUNT(*)
---- ----------
2021          0
2022          1

An update against the parent was cascaded down to the child table and caused the child table to move a row (or rows as needed).

To summarize, reference partitioning removes the need to denormalize data when partitioning parent and child tables. Furthermore, when dropping a parent partition, it will automatically drop the referenced child partition. These features are very useful in data warehousing environments.

Interval Reference Partitioning

Oracle flexibly allows you to use a combination of interval and reference partitioning. For example, if you create an interval range partitioned parent table, as follows:
SQL> create table orders
      (order#      number primary key,
       order_date  timestamp,
       data        varchar2(30))
    PARTITION BY RANGE (order_date)
    INTERVAL (numtoyminterval(1,'year'))
   (PARTITION part_2020 VALUES LESS THAN (to_date('01-01-2021','dd-mm-yyyy')) ,
    PARTITION part_2021 VALUES LESS THAN (to_date('01-01-2022','dd-mm-yyyy')));
Table created.
Next is the code to create a reference partitioned child table:
SQL> create table order_line_items
    ( order#      number,
      line#       number,
      data        varchar2(30),
      constraint c1_pk primary key(order#,line#),
      constraint c1_fk_p foreign key(order#) references orders)
    partition by reference(c1_fk_p);
Table created.
To see interval reference partitioning in action, let’s insert some data. First, we insert rows that will fit within existing range partitions:
SQL> insert into orders values (1, to_date( '01-jun-2020', 'dd-mon-yyyy' ), 'xxx');
1 row created.
SQL> insert into orders values (2, to_date( '01-jun-2021', 'dd-mon-yyyy' ), 'xxx');
1 row created.
SQL> insert into order_line_items values( 1, 1, 'yyy' );
1 row created.
SQL> insert into order_line_items values( 2, 1, 'yyy' );
1 row created.
All of the prior rows fit into the partitions specified when creating the tables. The following query displays the current partitions:
SQL> select table_name, partition_name from user_tab_partitions
    where table_name in ( 'ORDERS', 'ORDER_LINE_ITEMS' )
    order by table_name, partition_name;
TABLE_NAME                PARTITION_NAME
------------------------- -------------------------
ORDERS                    PART_2020
ORDERS                    PART_2021
ORDER_LINE_ITEMS          PART_2020
ORDER_LINE_ITEMS          PART_2021
Next, rows are inserted that don’t fit into an existing range partition; therefore, Oracle automatically creates partitions to hold the newly inserted rows:
SQL> insert into orders values (3, to_date( '01-jun-2022', 'dd-mon-yyyy' ), 'xxx');
1 row created.
SQL> insert into order_line_items values (3, 1, 'zzz' );
1 row created.
The following query shows that two interval partitions were automatically created, one for the parent table and one for the child table:
SQL> select a.table_name, a.partition_name, a.high_value,
    decode( a.interval, 'YES', b.interval ) interval
    from user_tab_partitions a, user_part_tables b
    where a.table_name IN ('ORDERS', 'ORDER_LINE_ITEMS')
    and a.table_name = b.table_name
    order by a.table_name;
TABLE_NAME         PARTITION_ HIGH_VALUE                        INTERVAL
------------------ ---------- --------------------------------- --------
ORDERS             PART_2020  TIMESTAMP' 2021-01-01 00:00:00'
ORDERS             PART_2021  TIMESTAMP' 2022-01-01 00:00:00'
ORDERS             SYS_P1640  TIMESTAMP' 2023-01-01 00:00:00'   NUMTOYMINTERVAL(1,'YEAR')
ORDER_LINE_ITEMS   PART_2020
ORDER_LINE_ITEMS   PART_2021
ORDER_LINE_ITEMS   SYS_P1640                                    YES
Two partitions named SYS_P1640 were created, with the parent table partition having a high value of 2023-01-01. If desired, you can rename the partitions via the ALTER TABLE command:
SQL> alter table orders rename partition sys_p1640 to part_2022;
Table altered.
SQL> alter table order_line_items rename partition sys_p1640 to part_2022;
Table altered.
Tip

See the “Interval Partitioning” section of this chapter for an example of automating the renaming of partitions via PL/SQL.

Virtual Column Partitioning

Virtual column partitioning allows you to partition based on a SQL expression. This type of partitioning is useful when a table column is overloaded with multiple business values and you want to partition on a portion of that column. For example, let’s say you have a RESERVATION_CODE column in a table:
$ sqlplus eoda/foo@PDB1
SQL> create table res(reservation_code varchar2(30));
Table created.

And the first character in the RESERVATION_CODE column defines a region from which the reservation originated. For the purposes of this example, let’s say a first character of an A or C maps to the NE region, values of B map to the SW region, and values of D map to the NW region.

In this scenario, we know that the first character represents the region, and we want to be able to list partition by region. With the data as it is, it’s not practical to list partition by the RESERVATION_CODE column, whereas virtual partitioning allows us to apply a SQL function to the column and list partition by the first character. Here’s what the table definition looks like with virtual column partitioning:
SQL> drop table res;
SQL> create table res(
  reservation_code varchar2(30),
  region as
   (decode(substr(reservation_code,1,1),'A','NE'
                                       ,'C','NE'
                                       ,'B','SW'
                                       ,'D','NW')
   )
  )
  partition by list (region)
  (partition NE values('NE'),
   partition SW values('SW'),
   partition NW values('NW'));
Table created.
We can view the partition information via this query:
SQL> select a.table_name, a.partition_name, a.high_value
from user_tab_partitions a, user_part_tables b
where a.table_name = 'RES'
and a.table_name = b.table_name
order by a.table_name;
TABLE_NAME       PARTITION_ HIGH_VALUE
---------------- ---------- -------------------------------
RES              NE         'NE'
RES              NW         'NW'
RES              SW         'SW'
Next, some random test data is inserted into the table (your random results will be different from this example):
SQL> insert into res (reservation_code)
    select chr(64+(round(dbms_random.value(1,4)))) || level
    from dual connect by level < 10;
9 rows created.
Now let’s see how the data was partitioned:
SQL> select 'NE', reservation_code, region from res partition(NE)
union all
select 'SW', reservation_code, region from res partition(SW)
union all
select 'NW', reservation_code, region from res partition(NW);
'N RESERVATION_CODE               RE
-- ------------------------------ --
NE C3                             NE
NE C5                             NE
NE A6                             NE
NE C8                             NE
SW B2                             SW
SW B7                             SW
SW B9                             SW
NW D1                             NW
NW D4                             NW

In this way, virtual column partitioning is often appropriate when there is a business requirement to partition on portions of data in a column or combinations of data from different columns (especially when there might not be an obvious way to list or range partition). The expression behind a virtual column can be a complex calculation, return a subset of a column string, combine column values, and so on.
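
As a sketch of that flexibility, a virtual column can also combine values from more than one column and serve as the partition key (the table and column names here are hypothetical):
SQL> create table sales
     ( country_code  varchar2(2),
       channel_code  varchar2(2),
       amount        number,
       sales_key as (country_code || '-' || channel_code)  -- virtual column built from two columns
     )
     partition by list (sales_key)
     ( partition p_us_web   values ('US-WB'),
       partition p_us_store values ('US-ST'),
       partition p_other    values (default) );
Table created.
Rows are then list partitioned by the combined COUNTRY_CODE and CHANNEL_CODE value, without that combination ever being stored in the table.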

Composite Partitioning

Lastly, we’ll look at some examples of composite partitioning, which is a mixture of range, hash, and/or list. Table 13-1 lists the various ways you can mix and match types of partitioning. In other words, the table shows there are nine different combinations of composite partitioning currently available.
Table 13-1

Oracle Database Supported Composite Partitioning Schemes by Version

 

Top Level \ Subpartition   Range    List     Hash
Range                      YES      YES      YES
List                       YES      YES      YES
Hash                       YES      YES      YES

It is interesting to note that when you use composite partitioning, there will be no partition segments, only subpartition segments. The partitions themselves do not have segments (much like a partitioned table itself doesn’t have a segment). The data is physically stored in subpartition segments, and each partition becomes a logical container that points to its actual subpartitions.

In our example, we’ll look at a range-hash composite partitioning. Here, we are using a different set of columns for the range partition from those used for the hash partition. This is not mandatory; we could use the same set of columns for both:
$ sqlplus eoda/foo@PDB1
SQL> CREATE TABLE composite_example
    ( range_key_column   date,
      hash_key_column    int,
      data               varchar2(20)
    )
    PARTITION BY RANGE (range_key_column)
    subpartition by hash(hash_key_column) subpartitions 2
    (
    PARTITION part_1
       VALUES LESS THAN(to_date('01/01/2021','dd/mm/yyyy'))
       (subpartition part_1_sub_1,
        subpartition part_1_sub_2
       ),
  PARTITION part_2
      VALUES LESS THAN(to_date('01/01/2022','dd/mm/yyyy'))
      (subpartition part_2_sub_1,
       subpartition part_2_sub_2
      )
  );
Table created.
In range-hash composite partitioning, Oracle will first apply the range partitioning rules to figure out which range the data falls into. Then it will apply the hash function to decide into which physical partition the data should finally be placed. This process is described in Figure 13-4.
Figure 13-4

Range-hash composite partition example
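
Since the data is stored only in subpartition segments, we can verify there are no partition-level segments with a quick dictionary query. This is just a sketch; an insert is included because, with deferred segment creation, segments may not appear until data arrives:
SQL> insert into composite_example
        values ( to_date('15/06/2020','dd/mm/yyyy'), 42, 'some data' );
1 row created.
SQL> select segment_name, partition_name, segment_type
       from user_segments
      where segment_name = 'COMPOSITE_EXAMPLE';
We would expect to see only segments with a SEGMENT_TYPE of TABLE SUBPARTITION (such as PART_1_SUB_1); there will be no TABLE PARTITION segments for this table.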

So, composite partitioning gives you the ability to break your data up by range and, when a given range is considered too large or further partition elimination could be useful, to break it up further by hash or list. It is interesting to note that each range partition need not have the same number of subpartitions; for example, suppose you were range partitioning on a date column in support of data purging (to remove all old data rapidly and easily). In the year 2020 and before, you had roughly equal amounts of data for the odd and even values in the CODE_KEY_COLUMN. But after that, you knew the number of records associated with the odd code values would more than double, and you wanted to have more subpartitions for the odd code values. You can achieve that rather easily just by defining more subpartitions:
SQL> CREATE TABLE composite_range_list_example
    ( range_key_column   date,
      code_key_column    int,
      data               varchar2(20)
    )
    PARTITION BY RANGE (range_key_column)
    subpartition by list(code_key_column)
    (
    PARTITION part_1
       VALUES LESS THAN(to_date('01/01/2021','dd/mm/yyyy'))
       (subpartition part_1_sub_1 values( 1, 3, 5, 7 ),
        subpartition part_1_sub_2 values( 2, 4, 6, 8 )
       ),
  PARTITION part_2
      VALUES LESS THAN(to_date('01/01/2022','dd/mm/yyyy'))
      (subpartition part_2_sub_1 values ( 1, 3 ),
       subpartition part_2_sub_2 values ( 5, 7 ),
       subpartition part_2_sub_3 values ( 2, 4, 6, 8 )
      )
  );
Table created.

Here, you end up with five subpartitions altogether: two for partition PART_1 and three for partition PART_2.

Row Movement

You might wonder what would happen if the value of the column used to determine the partition is modified in any of the preceding partitioning schemes. There are two cases to consider:
  • The modification would not cause a different partition to be used; the row would still belong in this partition. This is supported in all cases.

  • The modification would cause the row to move across partitions. This is supported if row movement is enabled for the table; otherwise, an error will be raised.

We can observe these behaviors easily. In the previous example in the “Range Partitioning” section, we inserted a pair of rows into PART_1 of the RANGE_EXAMPLE table:
$ sqlplus eoda/foo@PDB1
SQL> CREATE TABLE range_example
    ( range_key_column date,
      data             varchar2(20)
    )
    PARTITION BY RANGE (range_key_column)
    ( PARTITION part_1 VALUES LESS THAN
           (to_date('01/01/2021','dd/mm/yyyy')),
      PARTITION part_2 VALUES LESS THAN
           (to_date('01/01/2022','dd/mm/yyyy'))
  );
Table created.
SQL> insert into range_example
    ( range_key_column, data )
    values
    ( to_date( '15-dec-2020 00:00:00',  'dd-mon-yyyy hh24:mi:ss' ),
      'application data...' );
1 row created.
SQL> insert into range_example
    ( range_key_column, data )
    values
    ( to_date( '01-jan-2021 00:00:00', 'dd-mon-yyyy hh24:mi:ss' )-1/24/60/60,
      'application data...' );
1 row created.
SQL> select * from range_example partition(part_1);
RANGE_KEY DATA
--------- --------------------
15-DEC-20 application data...
31-DEC-20 application data...
We take one of the rows and update the value in its RANGE_KEY_COLUMN such that it can remain in PART_1:
SQL> update range_example set range_key_column = trunc(range_key_column)
     where range_key_column =
        to_date( '31-dec-2020 23:59:59', 'dd-mon-yyyy hh24:mi:ss' );
1 row updated.
As expected, this succeeds: the row remains in partition PART_1. Next, we update the RANGE_KEY_COLUMN to a value that would cause it to belong in PART_2:
SQL> update range_example
       set range_key_column = to_date('01-jan-2021','dd-mon-yyyy')
      where range_key_column = to_date('31-dec-2020','dd-mon-yyyy');
update range_example
       *
ERROR at line 1:
ORA-14402: updating partition key column would cause a partition change
This immediately raises an error since we did not explicitly enable row movement. We can enable row movement on this table to allow the row to move from partition to partition. You should be aware of a subtle side effect of doing this, however, namely, that the ROWID of a row will change as the result of the update:
SQL> select rowid  from range_example
      where range_key_column = to_date('31-dec-2020','dd-mon-yyyy');
ROWID
------------------
AAAtzXAAGAAAaO6AAB
SQL> alter table range_example enable row movement;
Table altered.
SQL> update range_example
       set range_key_column = to_date('01-jan-2021','dd-mon-yyyy')
     where range_key_column = to_date('31-dec-2020','dd-mon-yyyy');
1 row updated.
SQL> select rowid  from range_example
      where range_key_column = to_date('01-jan-2021','dd-mon-yyyy');
ROWID
------------------
AAAtzYAAGAAAae6AAA

As long as you understand that the ROWID of the row will change on this update, enabling row movement will allow you to update partition keys.

Note

There are other cases where a ROWID can change as a result of an update. It can happen as a result of an update to the primary key of an IOT. The universal ROWID will change for that row, too. The FLASHBACK TABLE command may also change the ROWID of rows, as might the ALTER TABLE SHRINK command.

You need to understand that, internally, row movement is done as if you had, in fact, deleted the row and reinserted it. It will update every single index on this table, deleting the old entry and inserting a new one. It will do the physical work of a DELETE plus an INSERT. However, it is considered an update by Oracle even though it physically deletes and inserts the row—therefore, it won’t cause INSERT and DELETE triggers to fire, just the UPDATE triggers. Additionally, child tables with foreign key constraints that would normally prevent a DELETE of the parent row will not block the move, because it is treated as an update. You do have to be prepared, however, for the extra work that will be performed; it is much more expensive than a normal UPDATE. Therefore, it would be a bad design decision to construct a system whereby the partition key was modified frequently and that modification would cause a partition movement.
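
A quick sketch of that trigger behavior (the trigger names here are ours): if we put row-level UPDATE and DELETE triggers on the RANGE_EXAMPLE table and then update the partition key so the row moves back to PART_1, only the UPDATE trigger should fire:
SQL> create or replace trigger range_example_upd
        after update on range_example for each row
     begin
        dbms_output.put_line( 'UPDATE trigger fired' );
     end;
     /
Trigger created.
SQL> create or replace trigger range_example_del
        after delete on range_example for each row
     begin
        dbms_output.put_line( 'DELETE trigger fired' );
     end;
     /
Trigger created.
SQL> set serveroutput on
SQL> update range_example
        set range_key_column = to_date('15-dec-2020','dd-mon-yyyy')
      where range_key_column = to_date('01-jan-2021','dd-mon-yyyy');
We would expect to see only the "UPDATE trigger fired" message, even though the row physically left PART_2 and was reinserted into PART_1.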

Table Partitioning Schemes Wrap-Up

In general, range partitioning is useful when you have data that is logically segregated by some value(s). Time-based data immediately comes to the forefront as a classic example—partition by “Sales Quarter,” “Fiscal Year,” or “Month.” Range partitioning is able to take advantage of partition elimination in many cases, including the use of exact equality and ranges (less than, greater than, between, and so on).

Hash partitioning is suitable for data that has no natural ranges by which you can partition. For example, if you had to load a table full of census-related data, there might not be an attribute by which it would make sense to range partition. However, you would still like to take advantage of the administrative, performance, and availability enhancements offered by partitioning. Here, you would simply pick a unique or almost unique set of columns to hash on. This would achieve an even distribution of data across as many partitions as you like. Hash partitioned objects can take advantage of partition elimination when exact equality or IN ( value, value, ... ) is used, but not when ranges of data are used.

List partitioning is suitable for data that has a column with a discrete set of values, and partitioning by the column makes sense based on the way your application uses it (e.g., it easily permits partition elimination in queries). Classic examples would be a state or region code—or, in fact, many code type attributes in general.

Interval partitioning extends the range partitioning feature and allows partitions to automatically be added when data inserted into the table doesn’t fit into an existing partition. This feature greatly enhances range partitioning in that there is less maintenance involved (because the DBA doesn’t have to necessarily monitor the ranges and manually add partitions).

Reference partitioning eases the implementation of partitioned tables that are related through referential integrity constraints. This allows the child table to be logically partitioned in the same manner as the parent table without having to duplicate parent table columns to the child table.

Interval reference partitioning allows you to combine the interval and reference partitioning features. This ability is useful when you need to use the interval and reference partitioning features in tandem.

Virtual column partitioning allows you to partition using a virtual column as the key. This feature provides you the flexibility to partition on a substring of a regular column value (or any other SQL expression). This is useful when it’s not feasible to use an existing column as the partition key, but you can partition on a subset of the value contained in an existing column.

Composite partitioning is useful when you have something logical by which you can range partition, but the resulting range partitions are still too large to manage effectively. You can apply range, list, or hash partitioning at the top level and then further divide each partition by hash, list, or range. This will allow you to spread I/O requests out across many devices in any given large partition. Additionally, you may now achieve partition elimination at both the partition and the subpartition level. If you query on the partition key, Oracle is able to eliminate any partitions that do not meet your criteria. If you add the subpartition key to your query, Oracle can eliminate the other subpartitions within that partition. If you just query on the subpartition key (not using the partition key), Oracle will query only those hash or list subpartitions that apply from each partition.

It is recommended that if there is something by which it makes sense to range partition your data, you should use that over hash or list partitioning. Hash and list partitioning add many of the salient benefits of partitioning, but they are not as useful as range partitioning when it comes to partition elimination. Using hash or list partitions within range partitions is advisable when the resulting range partitions are too large to manage or when you want to use all PDML capabilities or parallel index scanning against a single range partition.

Partitioning Indexes

Indexes, like tables, may be partitioned. There are two possible methods to partition indexes:
  • Equipartition the index with the table: This is also known as a local index. For every table partition, there will be an index partition that indexes just that table partition. All of the entries in a given index partition point to a single table partition, and all of the rows in a single table partition are represented in a single index partition.

  • Partition the index by range or hash: This is also known as a globally partitioned index. Here, the index is partitioned by range, or optionally by hash, and a single index partition may point to any (and all) table partitions.

Figure 13-5 demonstrates the difference between a local and a global index.
Figure 13-5

Local and global index partitions

In the case of a globally partitioned index, note that the number of index partitions may be different from the number of table partitions.

Since global indexes may be partitioned by range or hash only, you must use local indexes if you wish to have a list or composite partitioned index. The local index will be partitioned using the same scheme as the underlying table.
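
For example, a local index on the list partitioned RES table from the “Virtual Column Partitioning” section is automatically created with one index partition per table partition, inheriting the table partition names (a sketch; the index name is ours):
SQL> create index res_region_idx on res(reservation_code) local;
Index created.
SQL> select index_name, partition_name
       from user_ind_partitions
      where index_name = 'RES_REGION_IDX'
      order by partition_position;
We would expect to see three index partitions named NE, SW, and NW, matching the table's list partitions.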

Local Indexes vs. Global Indexes

In my experience, most partition implementations in data warehouse systems use local indexes. In an OLTP system, global indexes are much more common, and we’ll see why shortly. It has to do with the need to perform partition elimination on the index structures to maintain the same query response times after partitioning as before partitioning them.

Local indexes have certain properties that make them the best choice for most data warehouse implementations. They support a more available environment (less downtime), since problems will be isolated to one range or hash of data. On the other hand, since it can point to many table partitions, a global index may become a point of failure, rendering all partitions inaccessible to certain queries.

Local indexes are more flexible when it comes to partition maintenance operations. If the DBA decides to move a table partition, only the associated local index partition needs to be rebuilt or maintained. With a global index, all index partitions must be rebuilt or maintained in real time. The same is true with sliding window implementations, where old data is aged out of the partition and new data is aged in. No local indexes will be in need of a rebuild, but all global indexes will be either rebuilt or maintained during the partition operation. In some cases, Oracle can take advantage of the fact that the index is locally partitioned with the table and will develop optimized query plans based on that. With global indexes, there is no such relationship between the index and table partitions.
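
A sketch of that difference in maintenance work, assuming a table T with a local index T_LOCAL_IDX and one or more global indexes (all of these names are hypothetical):
SQL> alter table t move partition part_2020;
SQL> alter index t_local_idx rebuild partition part_2020;  -- only this one local index partition needs rebuilding
versus having to maintain every global index as part of the move:
SQL> alter table t move partition part_2020 update global indexes;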

Local indexes also facilitate a partition point-in-time recovery operation. If a single partition needs to be recovered to an earlier point in time than the rest of the table for some reason, all locally partitioned indexes can be recovered to that same point in time. All global indexes would need to be rebuilt on this object. This does not mean “avoid global indexes”—in fact, they are vitally important for performance reasons, as you’ll learn shortly—you just need to be aware of the implications of using them.

Local Indexes

Oracle makes a distinction between the following two types of local indexes:
  • Local prefixed indexes: These are indexes whereby the partition keys are on the leading edge of the index definition. For example, if a table is range partitioned on a column named LOAD_DATE, a local prefixed index on that table would have LOAD_DATE as the first column in its column list.

  • Local nonprefixed indexes: These indexes do not have the partition key on the leading edge of their column list. The index may or may not contain the partition key columns.

Both types of indexes are able to take advantage of partition elimination, both can support uniqueness (as long as the nonprefixed index includes the partition key), and so on. The fact is that a query that uses a local prefixed index will always allow for index partition elimination, whereas a query that uses a local nonprefixed index might not. This is why local nonprefixed indexes are said to be slower by some people—they do not enforce partition elimination (but they do support it).

There is nothing inherently better about a local prefixed index as opposed to a local nonprefixed index when that index is used as the initial path to the table in a query. What I mean is that if the query can start with “scan an index” as the first step, there isn’t much difference between a prefixed and a nonprefixed index.

Partition Elimination Behavior

For the query that starts with an index access, whether or not it can eliminate partitions from consideration all really depends on the predicate in your query. A small example will help demonstrate this. The following code creates a table, PARTITIONED_TABLE, that is range partitioned on a numeric column A such that values less than two will be in partition PART_1 and values greater than or equal to two but less than three will be in partition PART_2:
$ sqlplus eoda/foo@PDB1
SQL> CREATE TABLE partitioned_table
    ( a int,
      b int,
      data char(20)
    )
    PARTITION BY RANGE (a)
    (
    PARTITION part_1 VALUES LESS THAN(2) tablespace p1,
    PARTITION part_2 VALUES LESS THAN(3) tablespace p2
  );
Table created.
We then create both a local prefixed index, LOCAL_PREFIXED, and a local nonprefixed index, LOCAL_NONPREFIXED. Note that the nonprefixed index does not have A on the leading edge of its definition, which is what makes it a nonprefixed index:
SQL> create index local_prefixed on partitioned_table (a,b) local;
Index created.
SQL> create index local_nonprefixed on partitioned_table (b) local;
Index created.
Next, we’ll insert some data (the values of column A will be 1 and 2, so rows land in both partitions) and gather statistics:
SQL> insert into partitioned_table
    select mod(rownum-1,2)+1, rownum, 'x'
      from dual connect by level <= 70000;
70000 rows created.
SQL> begin
       dbms_stats.gather_table_stats (user, 'PARTITIONED_TABLE', cascade=>TRUE );
    end;
   /
PL/SQL procedure successfully completed.
We take tablespace P2 offline, which contains the PART_2 partition for both the tables and indexes:
SQL> alter tablespace p2 offline;
Tablespace altered.
Taking tablespace P2 offline will prevent Oracle from accessing those specific index partitions. It will be as if we had suffered media failure, causing them to become unavailable. Now we’ll query the table to see what index partitions are needed by different queries. This first query is written to permit the use of the local prefixed index:
SQL> select * from partitioned_table where a = 1 and b = 1;
         A          B DATA
---------- ---------- --------------------
         1          1 x
This query succeeded, and we can see why by reviewing the explain plan. We’ll use the built-in package DBMS_XPLAN to see what partitions this query accesses. The PSTART (partition start) and PSTOP (partition stop) columns in the output show us exactly what partitions this query needs to have online and available in order to succeed:
SQL> explain plan for select * from partitioned_table where a = 1 and b = 1;
Explained.
Now access DBMS_XPLAN.DISPLAY and instruct it to show the basic explain plan details plus partitioning information:
SQL> select * from table(dbms_xplan.display(null,null,'BASIC +PARTITION'));
---------------------------------------------------------------------------
| Id | Operation                                  | Name              | Pstart| Pstop |
---------------------------------------------------------------------------
|  0 | SELECT STATEMENT                           |                   |       |       |
|  1 |  PARTITION RANGE SINGLE                    |                   |     1 |     1 |
|  2 |   TABLE ACCESS BY LOCAL INDEX ROWID BATCHED| PARTITIONED_TABLE |     1 |     1 |
|  3 |    INDEX RANGE SCAN                        | LOCAL_PREFIXED    |     1 |     1 |
---------------------------------------------------------------------------
So, the query that uses LOCAL_PREFIXED succeeds. The optimizer was able to exclude PART_2 of LOCAL_PREFIXED from consideration because we specified A=1 in the query, and we can see that clearly in the plan PSTART and PSTOP are both equal to 1. Partition elimination kicked in for us. The second query fails, however:
SQL> select * from partitioned_table where b = 1;
ERROR:
ORA-00376: file 23 cannot be read at this time
ORA-01110: data file 23: '/opt/oracle/oradata/CDB/C217E68DF48779E1E0530101007F73B9/datafile/o1_mf_p2_jc8bg9py_.dbf'
And using the same technique, we can see why:
SQL> explain plan for select * from partitioned_table where b = 1;
Explained.
SQL> select * from table(dbms_xplan.display(null,null,'BASIC +PARTITION'));
---------------------------------------------------------------------------
| Id | Operation                                  | Name               | Pstart| Pstop |
---------------------------------------------------------------------------
|  0 | SELECT STATEMENT                           |                    |       |       |
|  1 |  PARTITION RANGE ALL                       |                    |     1 |     2 |
|  2 |   TABLE ACCESS BY LOCAL INDEX ROWID BATCHED| PARTITIONED_TABLE  |     1 |     2 |
|  3 |    INDEX RANGE SCAN                        | LOCAL_NONPREFIXED  |     1 |     2 |
---------------------------------------------------------------------------

Here, the optimizer was not able to remove PART_2 of LOCAL_NONPREFIXED from consideration—it needed to look in both the PART_1 and PART_2 partitions of the index to see if B=1 was in there. Herein lies a performance issue with local nonprefixed indexes: they do not make you use the partition key in the predicate as a prefixed index does. It is not that prefixed indexes are better; it’s just that in order to use them, you must use a query that allows for partition elimination.

If we drop the LOCAL_PREFIXED index and rerun the original successful query, as follows:
SQL> drop index local_prefixed;
Index dropped.
SQL> select * from partitioned_table where a = 1 and b = 1;
         A          B DATA
---------- ---------- --------------------
         1          1 x
It succeeds, but as we’ll see, it used the same index that just a moment ago failed us. The plan shows that Oracle was able to employ partition elimination here—the predicate A=1 was enough information for the database to eliminate index partition PART_2 from consideration:
SQL> explain plan for select * from partitioned_table where a = 1 and b = 1;
Explained.
SQL> select * from table(dbms_xplan.display(null,null,'BASIC +PARTITION'));
---------------------------------------------------------------------------
| Id | Operation                                  | Name              | Pstart| Pstop |
---------------------------------------------------------------------------
|  0 | SELECT STATEMENT                           |                   |       |       |
|  1 |  PARTITION RANGE SINGLE                    |                   |     1 |     1 |
|  2 |   TABLE ACCESS BY LOCAL INDEX ROWID BATCHED| PARTITIONED_TABLE |     1 |     1 |
|  3 |    INDEX RANGE SCAN                        | LOCAL_NONPREFIXED |     1 |     1 |
---------------------------------------------------------------------------

Note the PSTART and PSTOP column values of 1 and 1. This proves that the optimizer is able to perform partition elimination even for nonprefixed local indexes.

If you frequently query the preceding table with the following queries, then you might consider using a local nonprefixed index on (b,a):
select ... from partitioned_table where a = :a and b = :b;
select ... from partitioned_table where b = :b;

That index would be useful for both of the preceding queries. The local prefixed index on (a,b) would be useful only for the first query.

The bottom line here is that you should not be afraid of nonprefixed indexes or consider them as major performance inhibitors. If you have many queries that could benefit from a nonprefixed index as outlined previously, then you should consider using one. The main concern is to ensure that your queries contain predicates that allow for index partition elimination whenever possible. The use of prefixed local indexes enforces that consideration. The use of nonprefixed indexes does not. Consider also how the index will be used. If it will be used as the first step in a query plan, there are not many differences between the two types of indexes.
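
For completeness, a minimal sketch of that nonprefixed index (the index name is ours):
SQL> create index local_nonprefixed_ba on partitioned_table(b,a) local;
Index created.
A query supplying only B can use this index in every partition, and a query that also supplies A can additionally benefit from partition elimination.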

Local Indexes and Unique Constraints

To enforce uniqueness—and that includes UNIQUE constraints and PRIMARY KEY constraints—your partitioning key must be included in the constraint itself if you want to use a local index to enforce the constraint. This is the largest limitation of a local index, in my opinion. Oracle enforces uniqueness only within an index partition—never across partitions. What this implies, for example, is that you cannot range partition on a TIMESTAMP field and have a primary key on the ID that is enforced using a locally partitioned index. Oracle will instead utilize a global index to enforce uniqueness.

In the next example, we will create a range partitioned table that is partitioned by a column named TIMESTAMP but has a primary key on the ID column. We can do that by executing the following CREATE TABLE statement in a schema that owns no other objects, so we can easily see exactly what objects are created by looking at every segment this user owns:
SQL> CREATE TABLE partitioned
    ( timestamp date,
      id        int,
      constraint partitioned_pk primary key(id)
    )
    PARTITION BY RANGE (timestamp)
    (
    PARTITION part_1 VALUES LESS THAN
    ( to_date('01/01/2021','dd/mm/yyyy') ) ,
  PARTITION part_2 VALUES LESS THAN
  ( to_date('01/01/2022','dd/mm/yyyy') )
  );
Table created.
And inserting some data so that we get segments created:
SQL> insert into partitioned values(to_date('01/01/2020','dd/mm/yyyy'),1);
1 row created.
SQL> insert into partitioned values(to_date('01/01/2021','dd/mm/yyyy'),2);
1 row created.
Assuming we run this in a schema with no other objects created, we’ll see the following:
SQL > select segment_name, partition_name, segment_type from user_segments;
SEGMENT_NAME              PARTITION_NAME            SEGMENT_TYPE
------------------------- ------------------------- ---------------
PARTITIONED               PART_1                    TABLE PARTITION
PARTITIONED               PART_2                    TABLE PARTITION
PARTITIONED_PK                                      INDEX
The PARTITIONED_PK index is not even partitioned, let alone locally partitioned, and as we’ll see, it cannot be locally partitioned. Even if we try to trick Oracle by realizing that a primary key can be enforced by a nonunique index as well as a unique index, we’ll find that this approach will not work either:
SQL> drop table partitioned;
SQL> CREATE TABLE partitioned
    ( timestamp date,
      id        int
    )
    PARTITION BY RANGE (timestamp)
    (
    PARTITION part_1 VALUES LESS THAN
    ( to_date('01-jan-2021','dd-mon-yyyy') ) ,
    PARTITION part_2 VALUES LESS THAN
  ( to_date('01-jan-2022','dd-mon-yyyy') )
  );
Table created.
SQL> create index partitioned_idx on partitioned(id) local;
Index created.
And inserting some data so that we get segments created:
SQL> insert into partitioned values(to_date('01/01/2020','dd/mm/yyyy'),1);
1 row created.
SQL> insert into partitioned values(to_date('01/01/2021','dd/mm/yyyy'),2);
1 row created
SQL> select segment_name, partition_name, segment_type from user_segments;
SEGMENT_NAME              PARTITION_NAME            SEGMENT_TYPE
------------------------- ------------------------- ---------------
PARTITIONED               PART_1                    TABLE PARTITION
PARTITIONED               PART_2                    TABLE PARTITION
PARTITIONED_IDX           PART_1                    INDEX PARTITION
PARTITIONED_IDX           PART_2                    INDEX PARTITION
SQL> alter table partitioned
   add constraint
    partitioned_pk
    primary key(id);
alter table partitioned
*
ERROR at line 1:
ORA-01408: such column list already indexed

Here, Oracle attempts to create a global index on ID, but finds that it cannot since an index already exists. The preceding statements would work if the index we created was not partitioned, as Oracle would have used that index to enforce the constraint.

The reasons why uniqueness cannot be enforced, unless the partition key is part of the constraint, are twofold. First, if Oracle allowed this, it would void most of the advantages of partitions. Availability and scalability would be lost, as each and every partition would always have to be available and scanned to do any inserts and updates. The more partitions you had, the less available the data would be. The more partitions you had, the more index partitions you would have to scan, and the less scalable partitions would become. Instead of providing availability and scalability, doing this would actually decrease both.

Additionally, Oracle would have to effectively serialize inserts and updates to this table at the transaction level. This is because if we add ID=1 to PART_1, Oracle would have to somehow prevent anyone else from adding ID=1 to PART_2. The only way to do this would be to prevent others from modifying index partition PART_2 entirely, since there is no existing ID=1 entry in that partition to lock.

In an OLTP system, unique constraints must be system enforced (i.e., enforced by Oracle) to ensure the integrity of data. This implies that the logical model of your application will have an impact on the physical design. Uniqueness constraints will either drive the underlying table partitioning scheme, driving the choice of the partition keys, or point you toward the use of global indexes instead. We’ll take a look at global indexes in more depth next.

Global Indexes

Global indexes are partitioned using a scheme that is different from that used in the underlying table. The table might be partitioned by a TIMESTAMP column into ten partitions, and a global index on that table could be partitioned into five partitions by the REGION column. Unlike local indexes, there is only one class of global index, and that is a prefixed global index. There is no support for a global index whose index key does not begin with the partitioning key for that index. That implies that whatever attribute(s) you use to partition the index will be on the leading edge of the index key itself.

Building on our previous example, here is a quick example of the use of a global index. It shows that a global partitioned index can be used to enforce uniqueness for a primary key, so you can have partitioned indexes that enforce uniqueness, but do not include the partition key of the table. The following example creates a table partitioned by TIMESTAMP that has an index partitioned by ID:
$ sqlplus eoda/foo@PDB1
SQL> drop table partitioned;
SQL> CREATE TABLE partitioned
    ( timestamp date,
      id        int
    )
    PARTITION BY RANGE (timestamp)
    (
    PARTITION part_1 VALUES LESS THAN
    ( to_date('01-jan-2021','dd-mon-yyyy') ) ,
    PARTITION part_2 VALUES LESS THAN
  ( to_date('01-jan-2022','dd-mon-yyyy') )
  );
Table created.
SQL> create index partitioned_index
    on partitioned(id)
    GLOBAL
    partition  by range(id)
    (
    partition part_1 values less than(1000),
    partition part_2 values less than (MAXVALUE)
    );
Index created.

Note the use of MAXVALUE in this index. MAXVALUE can be used in any range partitioned table as well as in the index. It represents an infinite upper bound on the range. In our examples so far, we’ve used hard upper bounds on the ranges (values less than <some value>). However, a global index has a requirement that the highest partition (the last partition) must have a partition bound whose value is MAXVALUE. This ensures that all rows in the underlying table can be placed in the index.
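
As a sketch of that requirement, attempting to create a similar global index whose highest partition has a finite upper bound should fail; we would expect an error along the lines of ORA-14021 (MAXVALUE must be specified for all columns). The index name here is ours:
SQL> create index partitioned_index_nomax
     on partitioned(id)
     GLOBAL
     partition by range(id)
     (
     partition part_1 values less than(1000),
     partition part_2 values less than(2000)
     );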

Now, completing this example, we’ll add our primary key to the table:
SQL> alter table partitioned add constraint partitioned_pk  primary key(id);
Table altered.
It is not evident from this code that Oracle is using the index we created to enforce the primary key (it is to me because I know that Oracle is using it), so we can prove it by simply trying to drop that index:
SQL> drop index partitioned_index;
drop index partitioned_index
           *
ERROR at line 1:
ORA-02429: cannot drop index used for enforcement of unique/primary key
To show that Oracle will not allow us to create a nonprefixed global index, we only need try the following:
SQL> create index partitioned_index2
    on partitioned(timestamp,id)
    GLOBAL
    partition  by range(id)
    (
    partition part_1 values less than(1000),
    partition part_2 values less than (MAXVALUE)
    );
partition  by range(id)
                      *
ERROR at line 4:
ORA-14038: GLOBAL partitioned index must be prefixed

The error message is pretty clear. The global index must be prefixed. So, when would you use a global index? We’ll take a look at two system types, data warehouse and OLTP, and see when they might apply.

Data Warehousing and Global Indexes

Many data warehouses implement a sliding window approach to managing data—that is, drop the oldest partition of a table and add a new partition for the newly loaded data. In the following sections, we’ll take a look at what is meant by a sliding window of data and the potential impact of a global index on it.

Sliding Windows and Indexes

The following example implements a classic sliding window of data. In many implementations, data is added to a warehouse over time and the oldest data is aged out. Many times, this data is range partitioned by a date attribute, so that the oldest data is stored together in a single partition, and the newly loaded data is likewise stored together in a new partition. The monthly load process involves the following:
  • Detaching the old data: The oldest partition is either dropped or exchanged with an empty table (turning the oldest partition into a table) to permit archiving of the old data.

  • Loading and indexing of the new data: The new data is loaded into a work table and indexed and validated.

  • Attaching the new data: Once the new data is loaded and processed, the table it is in is exchanged with an empty partition in the partitioned table, turning this newly loaded data in a table into a partition of the larger partitioned table.

This process is repeated every month, or however often the load process is performed; it could be every day or every week. We will implement this very typical process in this section to show the impact of global partitioned indexes and to demonstrate the options we have during partition operations to increase availability—allowing us to implement a sliding window of data while maintaining continuous availability of that data.

We’ll process yearly data in this example and have fiscal years 2020 and 2021 loaded up. The table will be partitioned by the TIMESTAMP column, and it will have two indexes created on it—one is a locally partitioned index on the ID column, and the other is a global index (nonpartitioned, in this case) on the TIMESTAMP column:
$ sqlplus eoda/foo@PDB1
SQL> drop table partitioned;
SQL> CREATE TABLE partitioned
    ( timestamp date,
      id        int
    )
    PARTITION BY RANGE (timestamp)
    (
    PARTITION fy_2020 VALUES LESS THAN
    ( to_date('01-jan-2021','dd-mon-yyyy') ) ,
    PARTITION fy_2021 VALUES LESS THAN
  ( to_date('01-jan-2022','dd-mon-yyyy') ) );
Table created.
SQL> insert into partitioned partition(fy_2020)
    select to_date('31-dec-2020','dd-mon-yyyy')-mod(rownum,360), rownum
    from dual connect by level <= 70000;
70000 rows created.
SQL> insert into partitioned partition(fy_2021)
      select to_date('31-dec-2021','dd-mon-yyyy')-mod(rownum,360), rownum
      from dual connect by level <= 70000;
70000 rows created.
SQL> create index partitioned_idx_local on partitioned(id) LOCAL;
Index created.
SQL> create index partitioned_idx_global on partitioned(timestamp)  GLOBAL;
Index created.
This sets up our warehouse table. The data is partitioned by fiscal year, and we have the last two years’ worth of data online. This table has two indexes: one is LOCAL and the other is GLOBAL. Now it’s the end of the year and we would like to do the following:
  1. Remove the oldest fiscal year data: We do not want to lose this data forever; we just want to age it out and archive it.

  2. Add the newest fiscal year data: It will take a while to load it, transform it, index it, and so on. We would like to do this work without impacting the availability of the current data, if at all possible.

The first step is to set up an empty table for fiscal year 2020 that looks just like the partitioned table. We’ll use this table to exchange with the FY_2020 partition in the partitioned table, turning that partition into a table and in turn emptying out the partition in the partitioned table. The net effect is that after the exchange, the oldest data in the partitioned table will, in effect, have been removed:
SQL> create table fy_2020 ( timestamp date, id int );
Table created.
SQL> create index fy_2020_idx on fy_2020(id);
Index created.
We’ll do the same to the new data to be loaded. We’ll create and load a table that structurally looks like the existing partitioned table (but that is not itself partitioned):
SQL> create table fy_2022 ( timestamp date, id int );
Table created.
SQL> insert into fy_2022
      select to_date('31-dec-2022','dd-mon-yyyy')-mod(rownum,360), rownum
      from dual connect by level <= 70000;
70000 rows created.
SQL> create index fy_2022_idx on fy_2022(id) nologging;
Index created.
We’ll turn the current full partition into an empty partition and create a full table with the FY_2020 data in it. Also, we’ve completed all of the work necessary to have the FY_2022 data ready to go. This would have involved verifying the data, transforming it—whatever complex tasks we need to undertake to get it ready. Now we’re ready to update the live data using an exchange partition:
SQL> alter table partitioned
    exchange partition fy_2020
    with table fy_2020
    including indexes
    without validation;
Table altered.
SQL> alter table partitioned drop partition fy_2020;
Table altered.

This is all we need to do to age the old data out. We turned the partition into a full table and the empty table into a partition. This was a simple data dictionary update. No large amount of I/O took place—it just happened. We can now export that FY_2020 table (perhaps using a transportable tablespace) out of our database for archival purposes. We could reattach it quickly if we ever needed to.
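For example, a minimal Data Pump sketch for archiving that table might look like the following (the dump file name and the use of the default DATA_PUMP_DIR directory object are my assumptions; adjust for your environment, and drop the table only after the export has been verified):
$ expdp eoda/foo@PDB1 tables=FY_2020 directory=DATA_PUMP_DIR dumpfile=fy_2020.dmp logfile=fy_2020.log
SQL> drop table fy_2020;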

Next, we want to slide in the new data:
SQL> alter table partitioned add partition fy_2022
values less than ( to_date('01-jan-2023','dd-mon-yyyy') );
Table altered.
SQL> alter table partitioned
    exchange partition fy_2022
    with table fy_2022
    including indexes
    without validation;
Table altered.

Again, this was instantaneous; it was accomplished via simple data dictionary updates—the WITHOUT VALIDATION clause allowed us to accomplish that. When you use that clause, the database will trust that the data you are placing into that partition is, in fact, valid for that partition. Adding the empty partition took very little time to process. Then, we exchanged the newly created empty partition with the full table, and that operation was performed quickly as well. The new data is online.

Looking at our indexes, however, we’ll find the following:
SQL> select index_name, status from user_indexes;
INDEX_NAME                STATUS
------------------------- --------
PARTITIONED_IDX_LOCAL     N/A
PARTITIONED_IDX_GLOBAL    UNUSABLE
FY_2020_IDX               VALID
FY_2022_IDX               VALID
The global index is, of course, unusable after this operation. Since each index partition can point to any table partition, and we just took away a partition and added a partition, that index is invalid. It has entries that point into the partition we dropped. It has no entries that point into the partition we just added. Any query that would make use of this index would fail and not execute, or if we skip unusable indexes, the query’s performance would be negatively impacted by not being able to use the index:
SQL> select /*+ index( partitioned PARTITIONED_IDX_GLOBAL ) */ count(*)
    from partitioned
    where timestamp between to_date( '01-mar-2022', 'dd-mon-yyyy' )
      and to_date( '31-mar-2022', 'dd-mon-yyyy' );
select /*+ index( partitioned PARTITIONED_IDX_GLOBAL ) */ count(*)
*
ERROR at line 1:
ORA-01502: index 'EODA.PARTITIONED_IDX_GLOBAL' or partition of such index is in unusable state
SQL> explain plan for select count(*)
      from partitioned
      where timestamp between to_date( '01-mar-2022', 'dd-mon-yyyy' )
      and to_date( '31-mar-2022', 'dd-mon-yyyy' );
SQL> select * from table(dbms_xplan.display(null,null,'BASIC +PARTITION'));
--------------------------------------------------------------
| Id | Operation               | Name        | Pstart| Pstop |
--------------------------------------------------------------
|  0 | SELECT STATEMENT        |             |       |       |
|  1 |  SORT AGGREGATE         |             |       |       |
|  2 |   PARTITION RANGE SINGLE|             |     2 |     2 |
|  3 |    TABLE ACCESS FULL    | PARTITIONED |     2 |     2 |
---------------------------------------------------------------
So, our choices after performing this partition operation with global indexes are
  • Ensure that SKIP_UNUSABLE_INDEXES=TRUE (the default is TRUE), and then Oracle will not use this unusable index. But then we lose the performance the index was giving us.

  • Rebuild this index to make it usable again.

  • Use the UPDATE GLOBAL INDEXES clause when performing partition maintenance (the focus of the next section in this chapter).

The sliding window process, which so far has resulted in virtually no downtime, will now take a very long time to complete while we rebuild the global index. Runtime query performance of queries that relied on these indexes will be negatively affected during this time—either they will not run at all or they will run without the benefit of the index. All of the data must be scanned and the entire index reconstructed from the table data. If the table is many hundreds of gigabytes in size, this will take considerable resources.
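If you go the rebuild route, the statement itself is trivial—it is the time and I/O it takes on a large table that hurt. Continuing the example, the rebuild would simply be
SQL> alter index partitioned_idx_global rebuild;
Index altered.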

“Live” Global Index Maintenance

Oracle has the ability to maintain the global indexes during the partition operation using the UPDATE GLOBAL INDEXES clause. This means that as you drop a partition, split a partition, or perform whatever operation is necessary on a partition, Oracle will perform the necessary modifications to the global index to keep it up to date. Since most partition operations will cause this global index invalidation to occur, this feature can be a boon to systems that need to provide continual access to the data. You’ll find that you trade the raw speed of the partition operation, with its associated window of unavailability immediately afterward while you rebuild indexes, for a slower overall partition operation coupled with 100 percent data availability. In short, if you have a data warehouse that cannot have downtime, but must support these common data warehouse techniques of sliding data in and out, then this feature is for you—but you must understand the implications.

Revisiting our previous example, if our partition operations had used the UPDATE GLOBAL INDEXES clause when relevant (in this example, it would not be needed on the ADD PARTITION statement since the newly added partition would not have any rows in it), we would have discovered the indexes to be perfectly valid and usable both during and after the operation:
SQL> alter table partitioned
    exchange partition fy_2020
    with table fy_2020
    including indexes
    without validation
    UPDATE GLOBAL INDEXES;
Table altered.
SQL> alter table partitioned drop partition fy_2020 update global indexes;
Table altered.
SQL> alter table partitioned add partition fy_2022
          values less than ( to_date('01-jan-2023','dd-mon-yyyy') );
Table altered.
SQL> alter table partitioned
    exchange partition fy_2022
    with table fy_2022
    including indexes
    without validation
    UPDATE GLOBAL INDEXES;
Table altered.
Note in the following output, the N/A status observed for the PARTITIONED_IDX_LOCAL index simply means that statuses are tracked on the index partitions belonging to that index, not on the index itself. It doesn’t make sense to say the locally partitioned index as a whole is valid or not; it is just a container that logically holds the index partitions themselves:
SQL> select index_name, status from user_indexes;
INDEX_NAME                STATUS
------------------------- --------
PARTITIONED_IDX_LOCAL     N/A
PARTITIONED_IDX_GLOBAL    VALID
FY_2020_IDX               VALID
FY_2022_IDX               VALID
SQL> explain plan for select count(*) from partitioned
    where timestamp between to_date( '01-mar-2022', 'dd-mon-yyyy' )
     and to_date( '31-mar-2022', 'dd-mon-yyyy' );
SQL> select * from table(dbms_xplan.display(null,null,'BASIC +PARTITION'));
---------------------------------------------------
| Id | Operation         | Name                   |
---------------------------------------------------
|  0 | SELECT STATEMENT  |                        |
|  1 |  SORT AGGREGATE   |                        |
|  2 |   INDEX RANGE SCAN| PARTITIONED_IDX_GLOBAL |
---------------------------------------------------

But there is a trade-off: we are performing the logical equivalent of DELETE and INSERT operations on the global index structures. When we drop a partition, we have to delete all of the global index entries that might be pointing to that partition. When we did the exchange of a table with a partition, we had to delete all of the global index entries pointing to the original data and then insert all of the new ones that we just slid in there. So the amount of work performed by the ALTER commands was significantly increased.

You should expect that the approach without index maintenance will consume fewer database resources and therefore complete faster, but it will incur a measurable period of downtime. The second approach, involving maintaining the indexes, will consume more resources and perhaps take longer overall, but will not incur downtime. As far as the end users are concerned, their ability to work never ceased. They might have been processing a bit slower (since we were competing with them for resources), but they were still processing, and they never stopped.

The index rebuild approach will almost certainly run faster, considering both the elapsed time and the CPU time. This fact has caused many a DBA to pause and say, “Hey, I don’t want to use UPDATE GLOBAL INDEXES—it’s slower.” That is too simplistic a view, however. What you need to remember is that while the operations overall took longer, processing on your system was not necessarily interrupted. Sure, you as the DBA might be looking at your screen for a longer period of time, but the really important work on your system was still getting done. You need to see if this trade-off makes sense for you. If you have an eight-hour maintenance window overnight in which to load new data, then by all means, use the rebuild approach if that makes sense. However, if you have a mandate to be available continuously, then the ability to maintain the global indexes will be crucial.

One more thing to consider is the redo generated by each approach. You will find that the UPDATE GLOBAL INDEXES generates considerably more redo (due to the index maintenance), and you should expect that to only go up as you add more and more global indexes to the table. The redo generated by the UPDATE GLOBAL INDEXES processing is unavoidable and cannot be turned off via NOLOGGING, since the maintenance of the global indexes is not a complete rebuild of their structure but more of an incremental maintenance. Additionally, since you are maintaining the live index structure, you must generate undo for that—in the event the partition operation fails, you must be prepared to put the index back the way it was. And remember, undo is protected by redo itself, so some of the redo you see generated is from the index updates and some is from the rollback. Add another global index or two and you would reasonably expect these numbers to increase.

So, UPDATE GLOBAL INDEXES is an option that allows you to trade off availability for resource consumption. If you need to provide continuous availability, it’s the option for you. But you have to understand the ramifications and size other components of your system appropriately. Specifically, many data warehouses have been crafted over time to use bulk direct path operations, bypassing undo generation and, when permitted, redo generation as well. Using UPDATE GLOBAL INDEXES cannot bypass either of those two elements. You need to examine the rules you use to size your redo and undo needs before using this feature, so you can assure yourself it can work on your system.

Asynchronous Global Index Maintenance

As shown in the prior section, you can maintain global indexes while dropping or truncating partitions via the UPDATE GLOBAL INDEXES clause. However, as shown previously, such operations come at a cost in terms of time and resource consumption.

When dropping or truncating table partitions, Oracle postpones the removal of the global index entries associated with the dropped or truncated partitions. This is known as asynchronous global index maintenance: the maintenance of the global index is deferred to a future time while the index is kept usable. The idea is that this improves the performance of dropping/truncating partitions while keeping any global indexes in a usable state. The actual cleanup of the index entries is done later (asynchronously) either by the DBA or by an automatically scheduled Oracle job. It’s not that less work is being done; rather, the cleanup of the index entries is decoupled from the DROP/TRUNCATE statement.

A small example will demonstrate asynchronous global index maintenance. To set this up, we create a table, populate it with test data, and create a global index:
$ sqlplus eoda/foo@PDB1
SQL> drop table partitioned;
SQL> CREATE TABLE partitioned
      ( timestamp date,
        id        int
      )
      PARTITION BY RANGE (timestamp)
      (PARTITION fy_2020 VALUES LESS THAN
      (to_date('01-jan-2021','dd-mon-yyyy')),
      PARTITION fy_2021 VALUES LESS THAN
      ( to_date('01-jan-2022','dd-mon-yyyy')));
SQL> insert into partitioned partition(fy_2020)
      select to_date('31-dec-2020','dd-mon-yyyy')-mod(rownum,364), rownum
      from dual connect by level < 100000;
99999 rows created.
SQL> insert into partitioned partition(fy_2021)
     select to_date('31-dec-2021','dd-mon-yyyy')-mod(rownum,364), rownum
     from dual connect by level < 100000;
99999 rows created.
SQL> create index partitioned_idx_global on partitioned(timestamp) GLOBAL;
Index created.
Next, we’ll run a query to retrieve the current values of redo size and db block gets statistics for the current session:
SQL> col r1 new_value r2
SQL> col b1 new_value b2
SQL> select * from
    (select b.value r1
     from v$statname a, v$mystat b
     where a.statistic# = b.statistic#
     and a.name = 'redo size'),
    (select b.value b1
     from v$statname a, v$mystat b
     where a.statistic# = b.statistic#
     and a.name = 'db block gets');
        R1         B1
---------- ----------
  56928036      80829
Next, a partition is dropped with the UPDATE GLOBAL INDEXES clause specified:
SQL> alter table partitioned drop partition fy_2020 update global indexes;
Table altered.
Now we’ll calculate the amount of redo generated and the number of current blocks accessed:
SQL> select * from
    (select b.value - &r2 redo_gen
     from v$statname a, v$mystat b
     where a.statistic# = b.statistic#
     and a.name = 'redo size'),
    (select b.value - &b2 db_block_gets
     from v$statname a, v$mystat b
     where a.statistic# = b.statistic#
     and a.name = 'db block gets');
old   2: (select b.value - &r2 redo_gen
new   2: (select b.value - 56928036 redo_gen
old   6: (select b.value - &b2 db_block_gets
new   6: (select b.value - 80829 db_block_gets
  REDO_GEN DB_BLOCK_GETS
---------- -------------
     16864           103
Only a small amount of redo is generated, and only a small number of blocks are accessed. The reason behind this is that Oracle doesn’t immediately perform the index maintenance of removing the index entries that point to the dropped partition. Rather, these entries are marked as orphaned and will later be cleaned up by Oracle. The existence of orphaned entries can be verified via the following:
SQL> select index_name, orphaned_entries, status from user_indexes where table_name='PARTITIONED';
INDEX_NAME                ORP STATUS
------------------------- --- --------
PARTITIONED_IDX_GLOBAL    YES VALID
How do the orphaned entries get cleaned up? Oracle has an automatically scheduled PMO_DEFERRED_GIDX_MAINT_JOB, which runs in a nightly maintenance window:
SQL> select job_name from dba_scheduler_jobs where job_name like 'PMO%';
JOB_NAME
---------------------------------------------------
PMO_DEFERRED_GIDX_MAINT_JOB
If you don’t want to wait for that job, you can manually clean up the entries yourself:
SQL> exec dbms_part.cleanup_gidx;
PL/SQL procedure successfully completed.
Now checking for orphaned rows shows there are none:
SQL> select index_name, orphaned_entries, status from user_indexes where table_name='PARTITIONED';
INDEX_NAME                 ORP STATUS
-------------------------- --- --------------------
PARTITIONED_IDX_GLOBAL     NO  VALID

In this way, you can perform operations such as dropping and truncating partitions and still leave your global indexes in a usable state without the immediate overhead of cleaning up the index entries as part of the drop/truncate operation.
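If you would rather clean up just one index instead of every index, you should also be able to use an ALTER INDEX ... COALESCE CLEANUP to the same effect for that index (I’m hedging a bit here—verify the syntax against your release’s documentation):
SQL> alter index partitioned_idx_global coalesce cleanup;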

Tip

See Oracle Support note 1482264.1 for further details on asynchronous global index maintenance.

OLTP and Global Indexes

An OLTP system is characterized by the frequent occurrence of many small read and write transactions. In general, fast access to the row (or rows) you need is paramount. Data integrity is vital. Availability is also very important. Global indexes make sense in many cases in OLTP systems. Table data can be partitioned by only one key—one set of columns. However, you may need to access the data in many different ways. You might partition EMPLOYEE data by LOCATION in the table, but you still need fast access to EMPLOYEE data by
  • DEPARTMENT: Departments are geographically dispersed. There is no relationship between a department and a location.

  • EMPLOYEE_ID: While an employee ID will determine a location, you don’t want to have to search by EMPLOYEE_ID and LOCATION; hence, partition elimination cannot take place on the index partitions. Also, EMPLOYEE_ID by itself must be unique.

  • JOB_TITLE: There is no relationship between JOB_TITLE and LOCATION. All JOB_TITLE values may appear in any LOCATION.

There is a need to access the EMPLOYEE data by many different keys in different places in the application, and speed is paramount. In a data warehouse, we might just use locally partitioned indexes on these keys and use parallel index range scans to collect a large amount of data fast. In these cases, we don’t necessarily need to use index partition elimination. In an OLTP system, however, we do need to use it. Parallel query is not appropriate for these systems; we need to provide the indexes appropriately. Therefore, we will need to make use of global indexes on certain fields.

The following are the goals we need to meet:
  • Fast access

  • Data integrity

  • Availability

Global indexes can help us accomplish these goals in an OLTP system. We will probably not be doing sliding windows, auditing aside for a moment. We will not be splitting partitions (unless we have a scheduled downtime), we will not be moving data, and so on. The operations we perform in a data warehouse are not done on a live OLTP system in general.

Here is a small example that shows how we can achieve the three goals just listed with global indexes. I am going to use simple, single partition global indexes, but the results would not be different with global indexes in multiple partitions (except for the fact that availability and manageability would increase as we added index partitions). We start by creating tablespaces P1, P2, P3, and P4, then create a table that is range partitioned by location, LOC, according to our rules, which place all LOC values less than 'C' into partition P1, those less than 'D' into partition P2, and so on:
$ sqlplus eoda/foo@PDB1
SQL> create tablespace p1 datafile size 1m autoextend on next 1m;
Tablespace created.
SQL> create tablespace p2 datafile size 1m autoextend on next 1m;
Tablespace created.
SQL> create tablespace p3 datafile size 1m autoextend on next 1m;
Tablespace created.
SQL> create tablespace p4 datafile size 1m autoextend on next 1m;
Tablespace created.
SQL> create table emp
    (EMPNO             NUMBER(4) NOT NULL,
     ENAME             VARCHAR2(10),
     JOB               VARCHAR2(9),
     MGR               NUMBER(4),
     HIREDATE          DATE,
     SAL               NUMBER(7,2),
     COMM              NUMBER(7,2),
     DEPTNO            NUMBER(2) NOT NULL,
   LOC               VARCHAR2(13) NOT NULL
  )
  partition by range(loc)
  (
  partition p1 values less than('C') tablespace p1,
  partition p2 values less than('D') tablespace p2,
  partition p3 values less than('N') tablespace p3,
  partition p4 values less than('Z') tablespace p4
  );
Table created.
We alter the table to add a constraint on the primary key column:
SQL> alter table emp add constraint emp_pk primary key(empno);
Table altered.
A side effect of this is that there exists a unique index on the EMPNO column. This shows we can support and enforce data integrity, one of our goals. Finally, we create two more global indexes on DEPTNO and JOB to facilitate accessing records quickly by those attributes:
SQL> create index emp_job_idx on emp(job) GLOBAL;
Index created.
SQL> create index emp_dept_idx on emp(deptno) GLOBAL;
Index created.
SQL> insert into emp
      select e.*, d.loc
      from scott.emp e, scott.dept d
     where e.deptno = d.deptno;
14 rows created.
Let’s see what is in each partition:
SQL> break on pname skip 1
SQL> select 'p1' pname, empno, job, loc from emp partition(p1)
    union all
    select 'p2' pname, empno, job, loc from emp partition(p2)
    union all
    select 'p3' pname, empno, job, loc from emp partition(p3)
    union all
    select 'p4' pname, empno, job, loc from emp partition(p4);
PN      EMPNO JOB       LOC
-- ---------- --------- -------------
p2       7499 SALESMAN  CHICAGO
         7521 SALESMAN  CHICAGO
         7654 SALESMAN  CHICAGO
         7698 MANAGER   CHICAGO
         7844 SALESMAN  CHICAGO
         7900 CLERK     CHICAGO
p3       7369 CLERK     DALLAS
         7566 MANAGER   DALLAS
         7788 ANALYST   DALLAS
         7876 CLERK     DALLAS
         7902 ANALYST   DALLAS
p4       7782 MANAGER   NEW YORK
         7839 PRESIDENT NEW YORK
         7934 CLERK     NEW YORK
14 rows selected.
This shows the distribution of data, by location, into the individual partitions. We can now review some query plans to see what we could expect performance-wise:
SQL> variable x varchar2(30);
SQL> begin
       dbms_stats.set_table_stats
       ( user, 'EMP', numrows=>100000, numblks => 10000 );
    end;
  /
PL/SQL procedure successfully completed.
SQL> explain plan for select empno, job, loc from emp where empno = :x;
Explained.
SQL> select * from table(dbms_xplan.display(null,null,'BASIC +PARTITION'));
--------------------------------------------------------------------
| Id | Operation                          | Name   | Pstart| Pstop |
--------------------------------------------------------------------
|  0 | SELECT STATEMENT                   |        |       |       |
|  1 |  TABLE ACCESS BY GLOBAL INDEX ROWID| EMP    | ROWID | ROWID |
|  2 |   INDEX UNIQUE SCAN                | EMP_PK |       |       |
--------------------------------------------------------------------
The plan here shows an INDEX UNIQUE SCAN of the nonpartitioned index EMP_PK that was created in support of our primary key. Then there is a TABLE ACCESS BY GLOBAL INDEX ROWID, with a PSTART and PSTOP of ROWID/ROWID, meaning that when we get the ROWID from the index, it will tell us precisely which table partition to read to get this row. This index access will be as effective as on a nonpartitioned table and will perform the same amount of I/O. It is just a simple, single index unique scan followed by “get this row by rowid.” Now, let’s look at one of the other global indexes, the one on JOB:
SQL> explain plan for select empno, job, loc from emp where job = :x;
Explained.
SQL> select * from table(dbms_xplan.display);
---------------------------------------------------------------------------------
| Id | Operation                                  | Name        | Pstart| Pstop |
---------------------------------------------------------------------------------
|  0 | SELECT STATEMENT                           |             |       |       |
|  1 |  TABLE ACCESS BY GLOBAL INDEX ROWID BATCHED| EMP         | ROWID | ROWID |
|  2 |   INDEX RANGE SCAN                         | EMP_JOB_IDX |       |       |
---------------------------------------------------------------------------------

Sure enough, we see a similar effect for the INDEX RANGE SCAN. Our indexes are used and can provide high-speed OLTP access to the underlying data. If they were partitioned, they would have to be prefixed and enforce index partition elimination; hence, they are scalable as well, meaning we can partition them and observe the same behavior. In a moment, we’ll look at what would happen if we used LOCAL indexes only.

Lastly, let’s look at the area of availability. The Oracle documentation claims that globally partitioned indexes make for less available data than locally partitioned indexes. I don’t fully agree with this blanket characterization. I believe that in an OLTP system they are as highly available as a locally partitioned index. Consider the following:
SQL> alter tablespace p1 offline;
SQL> alter tablespace p2 offline;
SQL> alter tablespace p3 offline;
SQL> select empno, job, loc from emp where empno = 7782;
     EMPNO JOB       LOC
---------- --------- -------------
      7782 MANAGER   NEW YORK

Here, even though most of the underlying data is unavailable in the table, we can still gain access to any bit of data available via that index. As long as the EMPNO we want is in a tablespace that is available, and our GLOBAL index is available, our GLOBAL index works for us. On the other hand, if we had been using the highly available local index in the preceding case, we might have been prevented from accessing the data! This is a side effect of the fact that we partitioned on LOC but needed to query by EMPNO. We would have had to probe each local index partition and would have failed on the index partitions that were not available.

Other types of queries, however, will not (and cannot) function at this point in time:
SQL> select empno, job, loc from emp where job = 'CLERK';
ERROR at line 1:
ORA-00376: file 38 cannot be read at this time
ORA-01110: data file 38: '/opt/oracle/oradata/CDB/C217E68DF48779E1E0530101007F73B9/datafile/o1_mf_p2_jcbnhfh2_.dbf'

The CLERK data is in all of the partitions, and the fact that three of the tablespaces are offline does affect us. This is unavoidable unless we had partitioned on JOB, but then we would have had the same issues with queries that needed data by LOC. Anytime you need to access the data from many different keys, you will have this issue. Oracle will give you the data whenever it can.

Note, however, that if the query can be answered from the index, avoiding the TABLE ACCESS BY ROWID, the fact that the data is unavailable is not as meaningful:
SQL> select count(*) from emp where job = 'CLERK';
  COUNT(*)
----------
         4

Since Oracle didn’t need the table in this case, the fact that most of the partitions were offline doesn’t affect this query (assuming the index isn’t in one of the offline tablespaces of course). As this type of optimization (i.e., answer the query using just the index) is common in an OLTP system, there will be many applications that are not affected by the data that is offline. All we need to do now is make the offline data available as fast as possible (restore it and recover it).
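For completeness, putting things back is simply the reverse of what we did to take the data offline:
SQL> alter tablespace p1 online;
SQL> alter tablespace p2 online;
SQL> alter tablespace p3 online;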

Partial Indexes

Oracle allows you to create either local or global indexes on a subset of partitions in a table. You may want to do this if you’ve pre-created partitions and don’t yet have data for range partitions that map to future dates—the idea being that you’ll build the index after the partitions have been loaded (at some future date).

You set up the use of a partial index by first specifying INDEXING ON|OFF for each partition in the table. In this next example, PART_1 has indexing turned on and PART_2 has indexing turned off:
$ sqlplus eoda/foo@PDB1
SQL> CREATE TABLE p_table (a int)
    PARTITION BY RANGE (a)
    (PARTITION part_1 VALUES LESS THAN(1000) INDEXING ON,
    PARTITION part_2 VALUES LESS THAN(2000) INDEXING OFF);
Table created.
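Before creating the index, you can confirm the indexing property of each partition from the data dictionary. This is just a quick sanity check; I’m assuming your release exposes the INDEXING column in USER_TAB_PARTITIONS (it should in 12c and later):
SQL> select partition_name, indexing
    from user_tab_partitions
    where table_name = 'P_TABLE';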
Next, a partial local index is created:
SQL> create index pi1 on p_table(a) local indexing partial;
Index created.
In this scenario, the INDEXING PARTIAL clause instructs Oracle to build and make usable only those local index partitions that point to table partitions defined with INDEXING ON. In this case, one usable index partition will be created, with index entries pointing to data in the PART_1 table partition:
SQL> select a.index_name, a.partition_name, a.status
    from user_ind_partitions a, user_indexes b
    where b.table_name = 'P_TABLE'
    and a.index_name = b.index_name;
INDEX_NAME           PARTITION_NAME       STATUS
-------------------- -------------------- --------
PI1                  PART_2               UNUSABLE
PI1                  PART_1               USABLE
Next, we’ll insert some test data, generate statistics, and run a query that should locate data in the PART_1 partition:
SQL> insert into p_table select rownum from dual connect by level < 2000;
1999 rows created.
SQL> exec dbms_stats.gather_table_stats(user,'P_TABLE');
PL/SQL procedure successfully completed.
SQL> explain plan for select * from p_table where a = 20;
Explained.
SQL> select * from table(dbms_xplan.display(null,null,'BASIC +PARTITION'));
------------------------------------------------------
| Id | Operation              | Name | Pstart| Pstop |
------------------------------------------------------
|  0 | SELECT STATEMENT       |      |       |       |
|  1 |  PARTITION RANGE SINGLE|      |     1 |     1 |
|  2 |   INDEX RANGE SCAN     | PI1  |     1 |     1 |
------------------------------------------------------
As expected, the optimizer was able to generate an execution plan utilizing the index. Next, a query is issued that selects data from the partition defined with INDEXING OFF:
SQL> explain plan for select * from p_table where a = 1500;
Explained.
SQL> select * from table(dbms_xplan.display(null,null,'BASIC +PARTITION'));
---------------------------------------------------------
| Id | Operation              | Name    | Pstart| Pstop |
---------------------------------------------------------
|  0 | SELECT STATEMENT       |         |       |       |
|  1 |  PARTITION RANGE SINGLE|         |     2 |     2 |
|  2 |   TABLE ACCESS FULL    | P_TABLE |     2 |     2 |
---------------------------------------------------------
The output shows a full table scan of PART_2 was required, as there is no usable index with entries pointing at data in PART_2. We can instruct Oracle to create index entries pointing to data in PART_2 by rebuilding the index partition associated with the PART_2 partition:
SQL> alter index pi1 rebuild partition part_2;
Index altered.
Rerunning the previous select query shows that the optimizer is now utilizing the local partitioned index pointing to the PART_2 table partition:
------------------------------------------------------
| Id | Operation              | Name | Pstart| Pstop |
------------------------------------------------------
|  0 | SELECT STATEMENT       |      |       |       |
|  1 |  PARTITION RANGE SINGLE|      |     2 |     2 |
|  2 |   INDEX RANGE SCAN     | PI1  |     2 |     2 |
------------------------------------------------------

In this way, partial indexes allow you to disable indexing while a table partition is being loaded (increasing the loading speed), and then later rebuild the corresponding index partition to make it usable.

Partitioning and Performance, Revisited

Many times, I hear people say, “I’m very disappointed in partitioning. We partitioned our largest table and it went much slower. So much for partitioning being a performance increasing feature!” Partitioning can do one of the following three things to overall query performance:
  • Make your queries go faster

  • Not impact the performance of your queries at all

  • Make your queries go much slower and use many times the resources of the nonpartitioned implementation

In a data warehouse, with an understanding of the questions being asked of the data, the first bullet point is very much achievable. Partitioning can positively impact queries that frequently full scan large database tables by eliminating large sections of data from consideration. Suppose you have a table with one billion rows in it. There is a timestamp attribute. Your query is going to retrieve one year’s worth of data from this table (and it has ten years of data). Your query uses a full table scan to retrieve this data. Had it been partitioned by this timestamp entry—say, a partition per month—then you could have full scanned one-tenth the data (assuming a uniform distribution of data over the years). Partition elimination would have removed the other 90 percent of the data from consideration. Your query would likely run faster.

Now, take a similar table in an OLTP system. You would never retrieve ten percent of a one billion–row table in that type of application. Therefore, the massive increase in speed seen by the data warehouse just would not be achievable in a transactional system. You are not doing the same sort of work, and the same possible improvements are just not realistic. Therefore, in general, in your OLTP system the first bullet point is not achievable, and you won’t be applying partitioning predominantly for increased performance. Increased availability—absolutely. Administrative ease of use—very much so. But in an OLTP system, I say you have to work hard to make sure you achieve the second point: that you do not impact the performance of your queries at all, negatively or positively. Many times, your goal is to apply partitioning without affecting query response time.

On many occasions, I’ve seen an implementation team discover that it has a medium-sized table, say of 100 million rows. Now, 100 million sounds like an incredibly large number (and five or ten years ago, it would have been, but time changes all things). So the team decides to partition the data. But in looking at the data, there are no logical attributes that make sense for RANGE partitioning. Likewise, LIST partitioning doesn’t make sense. Nothing pops out of this table as being the right thing to partition by. So, the team opts for hash partitioning on the primary key, which just happens to be populated by an Oracle sequence number. It looks perfect: it is unique and easy to hash, and many queries are of the form SELECT * FROM T WHERE PRIMARY_KEY = :X.

But the problem is there are many other queries against this object that are not of that form. For illustrative purposes, assume the table in question is really the ALL_OBJECTS dictionary view, and while internally many queries would be of the form WHERE OBJECT_ID = :X, the end users frequently have these requests of the application as well:
  • Show me the details of SCOTT’s EMP table (where owner = :o and object_type = :t and object_name = :n).

  • Show me all of the tables SCOTT owns (where owner = :o and object_type = :t).

  • Show me all of the objects SCOTT owns (where owner = :o).

In support of those queries, you have an index on (OWNER,OBJECT_TYPE,OBJECT_NAME). But you also read that local indexes are more available, and you would like your system to be more available, so you implement them. You end up re-creating your table like this, with 16 hash partitions:
$ sqlplus eoda/foo@PDB1
SQL> create table t
    ( OWNER, OBJECT_NAME, SUBOBJECT_NAME, OBJECT_ID, DATA_OBJECT_ID,
      OBJECT_TYPE, CREATED, LAST_DDL_TIME, TIMESTAMP, STATUS,
      TEMPORARY, GENERATED, SECONDARY )
    partition by hash(object_id)
    partitions 16
    as
    select OWNER, OBJECT_NAME, SUBOBJECT_NAME, OBJECT_ID, DATA_OBJECT_ID,
      OBJECT_TYPE, CREATED, LAST_DDL_TIME, TIMESTAMP, STATUS,
    TEMPORARY, GENERATED, SECONDARY
   from all_objects;
Table created.
SQL> create index t_idx on t(owner,object_type,object_name) LOCAL;
Index created.
SQL> exec dbms_stats.gather_table_stats( user, 'T' );
PL/SQL procedure successfully completed.
And you execute your typical OLTP queries that you know you will run frequently:
SQL> variable o varchar2(30)
SQL> variable t varchar2(30)
SQL> variable n varchar2(30)
SQL> exec :o := 'SCOTT'; :t := 'TABLE'; :n := 'EMP';
SQL> select *
  from t
 where owner = :o
   and object_type = :t
   and object_name = :n;
SQL> select *
  from t
 where owner = :o
   and object_type = :t;
SQL> select *
  from t
 where owner = :o;
However, when you run this with autotrace on and review the output, you notice the following performance characteristics:
SQL> set autotrace on
SQL> select * from t where owner = :o and object_type = :t and object_name = :n;
Here’s a partial snippet of the output:
Statistics
----------------------------------------------------------
          2  recursive calls
          0  db block gets
         36  consistent gets
          0  physical reads
          0  redo size
You compare that to the same table, only with no partitioning implemented, and discover the following:
SQL> set autotrace on
SQL> select * from t where owner = :o and object_type = :t and object_name = :n;
Here’s a partial snippet of the output:
  Statistics
----------------------------------------------------------
          2  recursive calls
          0  db block gets
          7  consistent gets
          0  physical reads
          0  redo size
You might immediately jump to the (erroneous) conclusion that partitioning causes an increase in I/O. The query gets 7 consistent gets without partitioning and 36 with partitioning. If your system had an issue with high consistent gets (logical I/Os) before, it is worse now. If it didn’t have one before, it might well get one. The root cause? The index partitioning scheme. Look at the following output from an explain plan for the partitioned version of this table:
SQL> explain plan for select * from t where owner = :o and object_type = :t and object_name = :n;
Explained.
SQL> select * from table(dbms_xplan.display(null,null,'BASIC +PARTITION'));
----------------------------------------------------------------------------
| Id  | Operation                                  | Name  | Pstart| Pstop |
----------------------------------------------------------------------------
|   0 | SELECT STATEMENT                           |       |       |       |
|   1 |  PARTITION HASH ALL                        |       |     1 |    16 |
|   2 |   TABLE ACCESS BY LOCAL INDEX ROWID BATCHED| T     |     1 |    16 |
|   3 |    INDEX RANGE SCAN                        | T_IDX |     1 |    16 |
----------------------------------------------------------------------------

This query has to look at each and every index partition here. The reason is that entries for SCOTT may well be in each and every index partition, and probably are. The index is logically hash partitioned by OBJECT_ID; any query that uses this index and that does not also refer to the OBJECT_ID in the predicate must consider every index partition! So, what is the solution here? You should globally partition your index. Using the previous case as the example, we could choose to hash partition the index:

Note

There are considerations to be taken into account with hash partitioned indexes regarding range scans, which we’ll discuss later in this section.

SQL> drop index t_idx;
SQL> create index t_idx
    on t(owner,object_type,object_name)
    global
    partition by hash(owner)
    partitions 16;
Index created.
Much like the hash partitioned tables we investigated earlier, Oracle will take the OWNER value, hash it to a partition between 1 and 16, and place the index entry in there. Now when we review the output from autotrace, we are much closer to the work performed by the nonpartitioned table earlier—that is, we have not negatively impacted the work performed by our queries:
SQL> set autotrace on;
SQL> select * from t where owner = :o and object_type = :t and object_name = :n;
  Statistics
----------------------------------------------------------
          5  recursive calls
          0  db block gets
          5  consistent gets
          0  physical reads
          0  redo size

It should be noted, however, that a hash partitioned index cannot be range scanned; in general, it is most suitable for exact equality (equals or in-lists). If you were to query “WHERE OWNER > :X” using the preceding index, it would not be able to perform a simple range scan using partition elimination. You would be back to inspecting all 16 hash partitions.
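You can verify that for yourself with a quick plan check. This is just a sketch—I won’t reproduce the output here—but you should expect to see a PARTITION HASH ALL step with a Pstart of 1 and a Pstop of 16, rather than single-partition access:
SQL> explain plan for select * from t where owner > :o;
SQL> select * from table(dbms_xplan.display(null,null,'BASIC +PARTITION'));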

USING ORDER BY

This example brought to mind an unrelated but very important fact. When looking at hash partitioned indexes, we are faced with another case where the use of an index to retrieve data would not automatically retrieve the data sorted. Many people assume that if the query plan shows an index is used to retrieve the data, the data will be retrieved sorted. This has never been true. The only way we can retrieve data in any sort of sorted order is to use an ORDER BY clause on the query. If your query does not contain an ORDER BY clause, you cannot make any assumptions about the sorted order of the data.

A quick example demonstrates this. We create a small table as a copy of ALL_USERS and create a hash partitioned index with four partitions on the USER_ID column:
SQL> create table t  as select * from all_users;
SQL> create index t_idx on t(user_id) global partition by hash(user_id) partitions 4;
Now, we will query that table and use a hint to have Oracle use the index. Notice the ordering (actually, the lack of ordering) of the data:
SQL> set autotrace on explain
SQL> select /*+ index( t t_idx ) */ user_id from t where user_id > 0;
Here’s a partial snippet of the output:
   USER_ID
----------
        97
        22
...
       116
        96
43 rows selected.
-----------------------------------------------------------------------------------------
| Id | Operation          | Name  | Rows | Bytes | Cost (%CPU)| Time     | Pstart|Pstop |
-----------------------------------------------------------------------------------------
|  0 | SELECT STATEMENT   |       |   43 |   172 |     4   (0)| 00:00:01 |       |      |
|  1 |  PARTITION HASH ALL|       |   43 |   172 |     4   (0)| 00:00:01 |     1 |  4   |
|* 2 |   INDEX RANGE SCAN | T_IDX |   43 |   172 |     4   (0)| 00:00:01 |     1 |  4   |
-----------------------------------------------------------------------------------------

So, even though Oracle used the index in a range scan, the data is obviously not sorted. In fact, you might observe a pattern in this data: there are four sorted runs here. The ... replaces values that were increasing in the output, and then the sequence starts over (for example, the row with USER_ID = 22 appears right after the row with USER_ID = 97). What we are observing is Oracle returning “sorted data” from each of the four hash partitions, one after the other.

This is just a warning that unless your query has an ORDER BY, you have no reason to anticipate the data being returned to you in any kind of sorted order whatsoever. (And no, GROUP BY doesn’t have to sort either! There is no substitute for ORDER BY.)

Does that mean partitioning won’t affect OLTP performance at all in a positive sense? No, not entirely—you just have to look in a different place. In general, it will not positively impact the performance of your data retrieval in OLTP; rather, care has to be taken to ensure data retrieval isn’t affected negatively. But on data modification, partitioning may provide salient benefits in highly concurrent environments.

Consider the preceding a rather simple example of a single table with a single index, and add into the mix a primary key. Without partitioning, there is a single table: all insertions go into this single table. There is contention perhaps for the freelists on this table. Additionally, the primary key index on the OBJECT_ID column would be a heavy right-hand-side index, as we discussed in Chapter 11. Presumably, it would be populated by a sequence; hence, all inserts would go into the rightmost block, leading to buffer busy waits. Also, there would be a single index structure T_IDX that people would be contending for. So far, a lot of single items.

Enter partitioning. You hash partition the table by OBJECT_ID into 16 partitions. There are now 16 tables to contend for, and each table has one-sixteenth the number of users hitting it simultaneously. You locally partition the primary key index on OBJECT_ID into 16 partitions. You now have 16 right-hand sides, and each index structure will receive one-sixteenth the workload it had before. And so on. That is, you can use partitioning in a highly concurrent environment to reduce contention, much like we used a reverse key index in Chapter 11 to reduce the buffer busy waits. However, you must be aware that the very process of partitioning out the data consumes more CPU itself than not having partitioning. That is, it takes more CPU to figure out where to put the data than it would if the data had but one place to go.
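A minimal sketch of that layout follows. The column list is hypothetical and trimmed down (and you would need to drop the existing T first), and the USING INDEX subclause is just one way of asking for a LOCAL index under the primary key—something that works here only because OBJECT_ID is also the hash partition key:
SQL> create table t
    ( object_id   number,
      owner       varchar2(128),
      object_name varchar2(128),
      object_type varchar2(23)
    )
    partition by hash(object_id)
    partitions 16;
SQL> alter table t add constraint t_pk
    primary key (object_id)
    using index
    (create unique index t_pk on t(object_id) LOCAL);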

So, as with everything, before applying partitioning to a system to increase performance, make sure you understand what that system needs. If your system is currently CPU bound, but that CPU usage is not due to contention and latch waits, introducing partitioning could make the problem worse, not better!

Ease of Maintenance Features

At the beginning of this chapter, I stated the goal was to provide a practical guide to implement applications with partitioning, and that I wouldn’t be focusing so much on administration. However, there are a few new administrative features available that deserve some discussion, namely:
  • Multiple partition maintenance operations

  • Cascade truncate

  • Cascade exchange

These features have a positive impact in terms of ease of maintenance, data integrity, and performance. Therefore, it’s important to be aware of these features when implementing partitioning.

Multiple Partition Maintenance Operations

This feature eases the administration of partitioning and in some scenarios reduces the database resources required to perform maintenance operations. Oracle allows you to combine more than one partition maintenance operation in one DDL statement. Consider the following example:
$ sqlplus eoda/foo@PDB1
SQL> create table p_table
    (a int)
    partition by range (a)
    (partition p1 values less than (1000),
     partition p2 values less than (2000));
Table created.
Now say you want to add multiple partitions to the table that was just created. Oracle allows you to perform multiple partition operations in one statement:
SQL> alter table p_table add
    partition p3 values less than (3000),
    partition p4 values less than (4000);
Table altered.
Note

In addition to adding partitions, multiple partition maintenance operations can be applied to dropping, merging, splitting, and truncating.
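For instance, dropping more than one partition in a single statement would look like this (a sketch against the P_TABLE just created; not run here):
SQL> alter table p_table drop partition p3, p4;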

Performing multiple partition maintenance operations in one DDL statement is particularly advantageous for splitting partitions and thus deserves more discussion. A small example will illustrate this. Let’s set this up by creating a table and loading it with data:
SQL> CREATE TABLE sales(
     sales_id int
    ,s_date   date)
    PARTITION BY RANGE (s_date)
    (PARTITION P2021 VALUES LESS THAN (to_date('01-jan-2022','dd-mon-yyyy')));
Table created.
SQL> insert into sales
    select level, to_date('01-jan-2021','dd-mon-yyyy') + ceil(dbms_random.value(1,364))
    from dual connect by level < 100000;
99999 rows created.
Next, we create a small utility function to help us measure the resources consumed while performing an operation:
SQL> create or replace function get_stat_val( p_name in varchar2 ) return number
      as
           l_val number;
    begin
         select b.value
         into l_val
         from v$statname a, v$mystat b
         where a.statistic# = b.statistic#
         and a.name = p_name;
       return l_val;
  end;
 /
Function created.
Next, we’ll split the P2021 partition into four partitions in one DDL statement and measure the resources consumed:
SQL> var r1 number
SQL> exec :r1 := get_stat_val('redo size');
PL/SQL procedure successfully completed.
SQL> var c1 number
SQL> exec :c1 := dbms_utility.get_cpu_time;
PL/SQL procedure successfully completed.
SQL> alter table sales split partition P2021
    into (partition Q1 values less than (to_date('01-apr-2021','dd-mon-yyyy')),
          partition Q2 values less than (to_date('01-jul-2021','dd-mon-yyyy')),
          partition Q3 values less than (to_date('01-oct-2021','dd-mon-yyyy')),
          partition Q4);
Table altered.
SQL> set serverout on
SQL> exec dbms_output.put_line(get_stat_val('redo size') - :r1);
47068
SQL> exec dbms_output.put_line(dbms_utility.get_cpu_time - :c1);
12

The amount of redo generated via the single DDL statement is relatively low. Depending on the number of partitions being split and if you’re updating indexes at the same time, the amount of redo generated and CPU consumed can be considerably less than when splitting the maintenance operations into multiple statements.
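If you want to quantify the difference on your own system, you could re-create and reload the SALES table as before and then issue the same splits as three separate statements, measuring with the same technique. This is a sketch only—your redo and CPU numbers will certainly differ from mine:
SQL> exec :r1 := get_stat_val('redo size');
SQL> exec :c1 := dbms_utility.get_cpu_time;
SQL> alter table sales split partition P2021
      at (to_date('01-apr-2021','dd-mon-yyyy'))
      into (partition Q1, partition P2021);
SQL> alter table sales split partition P2021
      at (to_date('01-jul-2021','dd-mon-yyyy'))
      into (partition Q2, partition P2021);
SQL> alter table sales split partition P2021
      at (to_date('01-oct-2021','dd-mon-yyyy'))
      into (partition Q3, partition P2021);
SQL> exec dbms_output.put_line(get_stat_val('redo size') - :r1);
SQL> exec dbms_output.put_line(dbms_utility.get_cpu_time - :c1);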

Cascade Truncate

Oracle allows you to truncate parent/child tables in tandem as a single atomic DDL statement. While the truncate cascade is taking place, any queries issued against the parent/child table combination are always presented with a read-consistent view of the data, meaning that the parent and child tables will be seen either as both populated or as both truncated.

The truncate cascade functionality is initiated with a TRUNCATE ... CASCADE statement on the parent table. For the cascade truncate to take place, any child tables must be defined with the foreign key relational constraint of ON DELETE CASCADE. What does cascade truncate have to do with partitioning? In a reference partitioned table, you can truncate a parent table partition and have it cascade to the child table partition in one transaction.

Let’s look at an example of this. Applying the TRUNCATE ... CASCADE functionality to reference partitioned tables, the parent ORDERS table is created here, and the ORDER_LINE_ITEMS table is created with ON DELETE CASCADE applied to the foreign key constraint:
$ sqlplus eoda/foo@PDB1
SQL> create table orders
    (
      order#      number primary key,
      order_date  date,
      data       varchar2(30)
    )
    PARTITION BY RANGE (order_date)
    (
      PARTITION part_2020 VALUES LESS THAN (to_date('01-01-2021','dd-mm-yyyy')) ,
   PARTITION part_2021 VALUES LESS THAN (to_date('01-01-2022','dd-mm-yyyy'))
  );
Table created.
SQL> insert into orders values ( 1, to_date( '01-jun-2020', 'dd-mon-yyyy' ), 'xyz' );
1 row created.
SQL> insert into orders values  ( 2, to_date( '01-jun-2021', 'dd-mon-yyyy' ), 'xyz' );
1 row created.
And now we’ll create the ORDER_LINE_ITEMS table, ensuring we include the ON DELETE CASCADE clause :
SQL> create table order_line_items
      (
        order#      number,
        line#       number,
        data       varchar2(30),
        constraint c1_pk primary key(order#,line#),
        constraint c1_fk_p foreign key(order#) references orders on delete cascade
      )  partition by reference(c1_fk_p);
SQL> insert into order_line_items values ( 1, 1, 'yyy' );
1 row created.
SQL> insert into order_line_items values ( 2, 1, 'yyy' );
1 row created.
Now we can issue a TRUNCATE ... CASCADE that truncates both the parent table partition and the child table partition as a single transaction:
SQL> alter table orders truncate partition PART_2020 cascade;
Table truncated.

In other words, the TRUNCATE ... CASCADE functionality prevents applications from seeing the child table truncated before the parent table is truncated.

You can also truncate all partitions in the parent and child tables via
SQL> truncate table orders cascade;
Table truncated.

Again, just to be clear, the ability to cascade truncate parent/child tables is not exclusively a partitioning feature. This feature also applies to nonpartitioned parent/child tables. This allows you to use one DDL statement to initiate truncate operations and also ensures the database application is always presented with a consistent view of parent/child partitions.

Cascade Exchange

Oracle allows you to exchange the combination of parent/child reference partitioned tables in one atomic DDL statement. A small example will demonstrate this. First, a reference partitioned parent and child table is created to set this up:
$ sqlplus eoda/foo@PDB1
SQL> create table orders
      ( order#      number primary key,
        order_date  date,
        data        varchar2(30))
      PARTITION BY RANGE (order_date)
      (PARTITION part_2020 VALUES LESS THAN (to_date('01-01-2021','dd-mm-yyyy')) ,
       PARTITION part_2021 VALUES LESS THAN (to_date('01-01-2022','dd-mm-yyyy')));
SQL> insert into orders values (1, to_date( '01-jun-2020', 'dd-mon-yyyy' ), 'xyz');
SQL> insert into orders values (2, to_date( '01-jun-2021', 'dd-mon-yyyy' ), 'xyz');
SQL> create table order_line_items
        (order#      number,
          line#       number,
          data       varchar2(30),
          constraint c1_pk primary key(order#,line#),
          constraint c1_fk_p foreign key(order#) references orders
        ) partition by reference(c1_fk_p);
SQL> insert into order_line_items values ( 1, 1, 'yyy' );
SQL> insert into order_line_items values ( 2, 1, 'yyy' );
Next, an empty partition is added to the reference partitioned table (a matching empty partition is automatically added to the child ORDER_LINE_ITEMS table):
SQL> alter table orders add partition part_2022  values less than (to_date('01-01-2023','dd-mm-yyyy'));
Next, a standalone parent and child table are created and loaded with data. These are the tables that will be exchanged with the empty partitions in the reference partitioned tables:
SQL> create table part_2022
      ( order#      number primary key,
        order_date  date,
        data        varchar2(30));
SQL> insert into part_2022 values (3, to_date('01-jun-2022', 'dd-mon-yyyy' ), 'xyz');
SQL> create table c_2022
        (order#      number,
         line#       number,
         data       varchar2(30),
         constraint ce1_pk primary key(order#,line#),
         constraint ce1_fk_p foreign key(order#) references part_2022);
SQL> insert into c_2022 values(3, 1, 'xyz');
Now we can exchange the two standalone tables into the reference partitioned tables as one atomic operation. Notice that the CASCADE option is specified:
SQL> alter table orders
      exchange partition part_2022
      with table part_2022
      without validation
      CASCADE
      update global indexes;

That’s it. With one DDL statement, we simultaneously exchanged two tables related by a foreign key constraint into a reference partitioned table. Anybody accessing the database will see the parent and child table partitions added seamlessly as one unit of work.
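One way to confirm the exchange took effect (these verification queries are illustrative and not part of the original example): the 2022 order and its line item should now be visible in the reference partitioned tables.
SQL> select * from orders partition (part_2022);
SQL> select * from order_line_items where order# = 3;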

Auditing and Segment Space Compression

Not too many years ago, US government regulations such as those imposed by HIPAA (www.hhs.gov/ocr/hipaa) were not in place, companies such as Enron were still in business, and the Sarbanes-Oxley Act did not yet exist. Back then, auditing was considered something that “we might do someday, maybe.” Today, however, auditing is at the forefront, and many DBAs are challenged to retain up to seven years of audit trail information online for their financial, business, and health-care databases.

Audit trail information is the one piece of data in your database that you might well insert but never retrieve during the normal course of operation. It is there predominantly as a forensic, after-the-fact trail of evidence. We need to have it, but from many perspectives, it is just something that sits on our disks and consumes space—lots and lots of space. And then every month, year, or other interval, we have to purge or archive it. Auditing is something that, if not properly designed from the beginning, can kill you at the end. Seven years from now, when you are faced with your first purge or archive of the old data, is not the time to start thinking about how to accomplish it. Unless you designed for it, getting that old information out is going to be painful.

Enter two technologies that make auditing not only bearable but also fairly easy to manage while consuming less space: partitioning and segment space compression, as discussed in Chapter 10. The second one might not be as obvious, since basic segment space compression only works with large bulk operations such as a direct path load (OLTP compression is a feature of the Advanced Compression option and is not available with all database editions), whereas audit trail records are typically inserted one row at a time, as events happen. The trick is to combine sliding window partitions with segment space compression.

Suppose we decide to partition the audit trail by month. During the first month of business, we just insert into the partitioned table; these inserts go in using the conventional path, not a direct path, and hence are not compressed. Now, before the month ends, we’ll add a new partition to the table to accommodate next month’s auditing activity. Shortly after the beginning of the next month, we will perform a large bulk operation on last month’s audit trail—specifically, we’ll use the ALTER TABLE ... MOVE PARTITION command to move last month’s partition, which rebuilds it with a direct path operation and thus compresses the data as well. If we take this a step further, we could move this partition out of the read-write tablespace it must have been in and into a tablespace that is normally read-only (and contains the other historical partitions of this audit trail). In that fashion, we can back up that tablespace once a month, right after we move the partition into it; ensure we have a good, clean, current readable copy of the tablespace; and then not back it up again until we next modify it. We might have the following tablespaces for our audit trail:
  • A current online, read-write tablespace that gets backed up like every other normal tablespace in our system: The audit trail information in this tablespace is not compressed, and it is constantly inserted into.

  • A read-only tablespace containing “this year to date” audit trail partitions in a compressed format: At the beginning of each month, we make this tablespace read-write, move and compress last month’s audit information into this tablespace, make it read-only again, and back it up (a SQL sketch of this monthly step follows this list).

  • A series of tablespaces for last year, the year before, and so on: These are all read-only and might even be on slow, cheap media. In the event of a media failure, we just need to restore from backup. We would occasionally pick a year at random from our backup sets to ensure they are still restorable (tapes go bad sometimes).
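A minimal sketch of that monthly step, assuming a hypothetical audit table AUDIT_TRAIL, a history tablespace AUDIT_HIST, and a partition PART_2021_06 for the month just ended (none of these names come from the chapter's examples):
SQL> alter tablespace audit_hist read write;
SQL> alter table audit_trail
       move partition part_2021_06
       tablespace audit_hist
       compress
       update indexes;
SQL> alter tablespace audit_hist read only;
The MOVE rebuilds the partition's segment with a bulk operation, which is what allows basic segment space compression to take effect, and UPDATE INDEXES keeps the indexes usable during and after the move.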

In this fashion, we have made purging easy (i.e., drop a partition). We have made archiving easy, too—we could just transport a tablespace off and restore it later. We have reduced our space utilization by implementing compression. We have reduced our backup volumes, as in many systems, the single largest set of data is audit trail data. If you can remove some or all of that from your day-to-day backups, the difference will be measurable.
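As a concrete illustration of the purge, dropping the oldest month could be a single DDL statement (the table and partition names are hypothetical, continuing the sketch above):
SQL> alter table audit_trail drop partition part_2014_01 update global indexes;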

In short, audit trail requirements and partitioning are two things that go hand in hand, regardless of the underlying system type, be it data warehouse or OLTP.

Tip

Consider using Oracle’s Flashback Data Archive feature for auditing requirements. When enabled for a table, the Flashback Data Archive will automatically create an underlying partitioned table to record transactional information.
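A minimal sketch of enabling this for a table, assuming a hypothetical archive named AUDIT_FDA stored in tablespace FDA_TS with a seven-year retention (creating the archive requires the FLASHBACK ARCHIVE ADMINISTER privilege):
SQL> create flashback archive audit_fda tablespace fda_ts retention 7 year;
SQL> alter table orders flashback archive audit_fda;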

Summary

Partitioning is extremely useful for scaling large database objects. This scaling is visible from the perspectives of performance, availability, and administration. All three are extremely important to different people. The DBA is concerned with administrative scaling. The owners of the system are concerned with availability, because downtime is lost money, and anything that reduces downtime—or reduces the impact of downtime—boosts the payback for a system. The end users of the system are concerned with performance scaling. No one likes to use a slow system, after all.

We also looked at the fact that in an OLTP system, partitions may not increase performance, especially if applied improperly. Partitions can increase the performance of certain classes of queries, but those queries are generally not issued in an OLTP system. This point is important to understand, as many people associate partitioning with a "free performance increase." This does not mean that partitions should not be used in OLTP systems—they provide many other salient benefits in this environment—just don't expect a massive increase in throughput. Expect reduced downtime. Expect the same good performance (partitioning will not slow you down when applied appropriately). Expect easier manageability, which may itself lead to better performance, because some maintenance operations are performed more frequently once they are easy enough for the DBAs to do.

We investigated the various table partitioning schemes offered by Oracle—range, hash, list, interval, reference, interval reference, virtual column, and composite—and talked about when they are most appropriately used. We spent the bulk of our time looking at partitioned indexes, examining the differences between local and global indexes and between prefixed and nonprefixed indexes. We investigated partition operations in data warehouses combined with global indexes, and the trade-off between resource consumption and availability. We also looked at ease-of-maintenance features such as the ability to perform maintenance operations on multiple partitions at a time, cascade truncate, and cascade exchange. Oracle continues to update and improve partitioning with each new release.

Over time, I see this feature becoming more relevant to a broader audience as the size and scale of database applications grow. The Internet and its database-hungry nature along with legislation requiring longer retention of audit data are leading to more and more extremely large collections of data, and partitioning is a natural tool to help manage that problem.
