Chapter 14

Performance tune SQL Server

This chapter reviews the database concepts and objects most associated with tuning the performance of queries and coded objects within the Database Engine for SQL Server, Azure SQL Database, and Azure SQL Managed Instance. Much of this content also applies to dedicated SQL pools in Azure Synapse Analytics, though that product is not a focus of this book.

The first two sections of this chapter look at isolation levels and durability, including the ACID properties of a relational database management system (RDBMS). These correspond to settings and configurations that affect performance.

As you might have learned in an introductory database class, ACID properties are as follows:

  • Atomicity. A transaction is committed as an all or nothing operation and cannot leave the database in an incomplete state.

  • Consistency. A transaction brings the entire database from one state to another—not just a shard of a database.

  • Isolation. Transactions, though handled concurrently, are processed sequentially and independently. Incomplete transactions should not be visible to other transactions.

  • Durability. In the event of hardware failure, committed data survives. The data must exist in non-volatile memory.

It’s important to understand the principles of ACID not just from an academic or theoretical standpoint. Various features of modern database systems violate ACID principles in creative and advantageous ways to increase performance. You should be aware of the tradeoffs. For example:

  • Globally scalable databases like Azure Cosmos DB violate consistency in important, controlled, and well-documented ways. The database systems behind many global websites rely on eventual consistency. This is typically a design decision related to the application architecture from inception. For more about consistency, refer to Chapter 7, “Understand table features.”

  • If you want a data layer without isolation, try designing a multiuser application to write data to a flat file, where there is no serialization of concurrent writers. Similarly, the READ UNCOMMITTED isolation level in SQL Server violates isolation, allowing the changes of an uncommitted transaction to be read.

  • SQL Server’s in-memory OLTP functionality, introduced in SQL Server 2014, can be configured to violate durability. Similarly, data changes cached by applications in memory, but not immediately committed to the database, violate durability.

This chapter covers various features that tweak SQL Server defaults to improve performance for certain scenarios. It is important to understand both your application performance needs and how SQL Server features and configuration options can meet them.

It also explores the process of how SQL Server executes queries, including the execution plans that the query processor creates to execute your query. It discusses how execution plans are captured, analyzed, reported on, and manipulated by the Query Store feature. It covers execution plans in some detail, what to look for when performance tuning, and how to control when query execution plans go parallel, meaning SQL Server can use multiple processors to execute your query without the code changing at all.

The examples in this chapter behave identically in SQL Server instances and databases in Azure SQL Managed Instance and Azure SQL Database unless otherwise noted. All sample scripts in this book are available for download at https://MicrosoftPressStore.com/SQLServer2022InsideOut/downloads.

Understand isolation levels and concurrency

When working on a multiuser system, the fundamental problem is how to handle scenarios in which users need to read and write the same data, concurrently. So, if there is a row—say row X—and user 1 and user 2 both want to do something with this row, what are the rules of engagement? If both users want to read the row, one set of concerns exists. If one user wants to read the row and the other wants to write to it, this is another set of issues. Finally, if both users want to write to the row, still another set of concerns arises. This is where the concept of isolation comes in, including how to isolate one connection from the other.

This is all related to the concept of atomicity, and as such, to transactions containing one or more statements, because we need to isolate logical atomic operations from one another. Even a single statement in a declarative programming system like Transact-SQL (T-SQL) can result in hundreds or thousands of steps behind the scenes.

Isolation isn’t only a matter of physical access to resources (for example, a disk drive is retrieving data for one reader, so the next reader must wait for that operation to complete); that is a problem for the hardware to serialize. Instead, while one transaction is doing its operations, other transactions need to be just as isolated from the data that transaction has affected. The performance implications are significant, because the more isolated operations need to be, the slower processing must be. However, the freer and less isolated transactions are, the greater the chance for loss of data.

Here are some of the phenomena that can occur between two transactions:

  • Dirty read. Reading data that another connection is in the process of changing. The problem is much like trying to read a paper note that someone else is scribbling on. You might see an incomplete message, or even see words that the writer will scratch out in the future.

  • Non-repeatable read. Reading the same data over again that has changed or gone away. This problem is like when you open a box of doughnuts and see there is one left. While you are standing there, in control of the box, no one can take that last doughnut. But step away to get coffee, and when you come back, that doughnut might have a bite taken out of it. A repeatable read always gives you back rows with the same content as you first read (but might include more rows that did not exist when you first read them).

  • Phantom read. When you read a set of data, but then come back and read it again and get new rows that did not previously exist. In the previous doughnut example, this is the happiest day of your life, because there are now more doughnuts. However, this can be bad if your query needs to get back the same answer every time you ask the same question.

  • Reading a previously committed version of data. In some cases, you might be able to eliminate blocking by allowing connections to read a previously committed version of data that another connection is in the process of changing after your transaction started. A real-world example of this regularly happens in personal banking. You and your partner see you have $50 in your account, and you both attempt to withdraw $50, not realizing the intentions of the other. Without transaction serialization and a fresh version of the row containing your balance, your ATM might even say you have $0 after both transactions, using stale information. This does not change the cruel overdraft fees you will be receiving, of course.

Where this gets complicated is that many operations in a database system are bundled into multistep operations that need to be treated as one atomic operation. Reading data and getting back different results when executing the same query again, during what you expect to be an atomic operation, greatly increases the likelihood of returning incorrect results.

These phenomena are best understood in terms of the isolation levels that allow them to occur. For example, the default isolation level, READ COMMITTED, is subject to nonrepeatable reads and phantom rows, but not dirty reads. This provides adequate protection and performance in most situations, but not all.

You need a fundamental understanding of these effects. These aren’t just arcane keywords you study only when it is certification time; they can have a profound effect on application performance, stability, and—absolutely the most important thing for any RDBMS—data integrity.

For example, suppose you are writing software to control trains using track A. Two trains traveling in opposite directions need to use track A, so both conductors ask if the track is vacant, and are assured that it is. So, both put their trains on the track heading toward each other. Not good.

Understanding the differing impact of isolation levels on locking and blocking, and therefore on concurrency, is the key to understanding when you should use an isolation level different from the default of READ COMMITTED. Table 14-1 presents the isolation levels available in the Database Engine along with the phenomena that are possible in each.

Table 14-1 Isolation levels and phenomena that can be incurred

Isolation level                | Dirty reads | Nonrepeatable reads | Phantom rows | Reading a previously committed version of data
READ UNCOMMITTED               | X           | X                   | X            |
READ COMMITTED                 |             | X                   | X            |
REPEATABLE READ                |             |                     | X            |
SERIALIZABLE                   |             |                     |              |
READ COMMITTED SNAPSHOT (RCSI) |             | X                   | X            | X
SNAPSHOT                       |             |                     |              | X

When you choose an isolation level for a transaction in an application, you should consider primarily the transactional safety and business requirements of the transaction in a highly concurrent multiuser environment. The performance of the transaction should be a distant second priority (yet still a priority) when choosing an isolation level.

Locking, which SQL Server uses for the normal isolation of processes, is not a problem in itself. It is the mechanism by which every transaction in SQL Server cooperates with others when dealing with disk-based tables.

The default isolation level of READ COMMITTED is generally safe because it only allows a connection to access data that has been committed by other transactions. Dirty reads are the one phenomenon that is almost universally considered bad. With READ COMMITTED, modifications to a row will block reads of that same row from other connections. This is especially important during multi-statement transactions, such as when parent and child rows in a foreign key relationship must be created in the same transaction. In that scenario, reads should not access either row in either table until both changes are committed.

Since the READ COMMITTED isolation level allows non-repeatable reads and phantom rows, it does not ensure that row data and row counts won’t change between two SELECT queries on the same data in a transaction. READ COMMITTED allows SQL Server to release the locks on resources it has read as soon as the read completes, letting other users access them, and to hold locks only on resources that it has changed.

For some application scenarios, this might be acceptable or even desired, but not for others. To avoid these two problematic scenarios (which we talk more about soon), you need to choose the proper, more stringent isolation level for the transaction.

For scenarios in which transactions must have a higher degree of isolation from other transactions, escalating the isolation level of a transaction is appropriate. For example, if a transaction writes multiple rows, possibly across multiple tables and statements, and cannot allow other transactions to change data it has read during the transaction, a more stringent isolation level is the right tool.

For example, the REPEATABLE READ isolation level blocks other transactions from changing or deleting rows needed during a multistep transaction. Unlike READ COMMITTED, REPEATABLE READ holds its read locks on resources until the transaction completes, preventing other connections from changing those rows and thus avoiding non-repeatable reads.

If the transaction in this example needs to ensure that the same exact rows in a result set are returned throughout a multistep transaction, the SERIALIZABLE isolation level is necessary. It is the only isolation level that prevents other transactions from inserting new rows inside of a range of rows. It prevents other connections from adding new rows by not only locking rows it has accessed, but also ranges of rows that it would have accessed had they existed. For example, say you queried for rows LIKE 'A%' in a SERIALIZABLE transaction and got back Apple and Annie. If another user tries to insert Aardvark, it is prevented until the LIKE 'A%' transaction is completed.
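To see this behavior for yourself, here is a minimal two-session sketch of that example; the table name and rows are hypothetical and exist only to illustrate the range lock.

--Session 1 (hypothetical table, used only for this illustration)
CREATE TABLE dbo.SerializableDemo (Name varchar(30) NOT NULL PRIMARY KEY);
INSERT INTO dbo.SerializableDemo (Name) VALUES ('Apple'),('Annie');

SET TRANSACTION ISOLATION LEVEL SERIALIZABLE;
BEGIN TRANSACTION;
SELECT Name FROM dbo.SerializableDemo WHERE Name LIKE 'A%';

--Session 2: blocked until session 1 commits or rolls back,
--because session 1 holds locks covering the 'A%' range
INSERT INTO dbo.SerializableDemo (Name) VALUES ('Aardvark');

--Session 1: release the locks and unblock session 2
COMMIT TRANSACTION;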

Lastly, it is essential to understand that every statement is a transaction. UPDATE TableName SET column = 1; operates in a transaction, as does a statement like SELECT 1;. When you do not manually start a transaction, each statement commits as its own transaction (sometimes called an implicit or autocommit transaction). An explicit transaction is one you start with BEGIN TRANSACTION and end with COMMIT TRANSACTION or ROLLBACK TRANSACTION. The REPEATABLE READ and SERIALIZABLE isolation levels can accumulate a lot of locks, especially in explicit transactions of multiple statements that are not quickly closed. The more locks that are held, the more likely your connection is to be stuck waiting indefinitely or to participate in a deadlock, where one session must be terminated.
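You can observe the difference between autocommit statements and an explicit transaction with @@TRANCOUNT, which returns the number of open transactions on the current connection. This is a minimal sketch; the UPDATE is commented out and references a hypothetical table.

SELECT @@TRANCOUNT; --returns 0; each statement commits as its own transaction

BEGIN TRANSACTION;
SELECT @@TRANCOUNT; --returns 1; locks acquired now are held until COMMIT or ROLLBACK
--UPDATE dbo.TableName SET column1 = 1; --hypothetical statement; its locks persist until the COMMIT below
COMMIT TRANSACTION;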

The most complex of the phenomena concerns reading data that is not the committed version that was initially accessed. There are two main places where this becomes an issue.

  • Reading previous versions of data. When you use SNAPSHOT or READ COMMITTED SNAPSHOT (RCSI), your query sees how the data looked when it was first accessed: at the start of the transaction for SNAPSHOT, or at the start of the statement for RCSI. This means the data read later in the transaction might not match the current state of the database.

    A side effect of this is that in SNAPSHOT isolation level, if two transactions try to modify or delete the same row, you will get an update conflict, requiring you to restart the transaction.

  • Reading new versions of data. In any isolation level that allows phantoms and non-repeatable reads, running the same statement twice can return entirely different results. This becomes important in multistep transactions with multiple SELECT statements that access the same data. This might be desirable or problematic; the application developer should understand the difference.

Isolation levels are important to understand. It can be tricky to test your code to see what happens when two connections simultaneously try to make incompatible reads and modifications to data. Mature application performance testing always incorporates simulated concurrent users’ sessions accessing the same data.

Understand how concurrent sessions become blocked

This section reviews a series of examples of how concurrency works in a multiuser application interacting with SQL Server tables. First, it discusses how to diagnose whether a request is being blocked or if it is blocking another request. Note that these initial examples assume that SQL Server has been configured in the default manner for concurrency. We will adjust that later in this chapter to give you more ways to tune performance.

What causes blocking?

We have alluded to it already, and the answer is that when you use resources, they are locked. These locks can be on several different levels and types of resources, as seen in Table 14-2.

Table 14-2 Lockable resources (not every type of resource)

Type of Lock                | Granularity
Row or row identifier (RID) | A single row in a heap (a table without a clustered index).
Key                         | A single value in an index. (A table with a clustered index is represented as an index in all physical structures.)
Key range                   | A range of key values (for example, to lock rows with values from A–M, even if no rows currently exist). Used for SERIALIZABLE isolation level.
Extent                      | A contiguous group of eight 8-KB pages.
Page                        | An 8-KB index or data page.
HoBT                        | An entire heap or B-tree structure.
Object                      | An entire table (including all rows and indexes), view, function, stored procedure, and so on.
Application                 | A special type of lock that is user-defined.
Metadata                    | Metadata about the schema, such as catalog objects.
Database                    | An entire database.
Allocation unit             | A set of related pages that are used as a unit.
File                        | A data or log file in the database.

Locks on a given resource are taken in a particular mode. Table 14-3 lists the modes in which a resource might be locked. Two of the most important are shared (indicating a resource is only being read) and exclusive (indicating a resource should not be accessible to any other connection).

Table 14-3 Lock modes

Lock Mode | Definition
Shared    | Grants access for reads only. This mode is generally used when users are looking at but not editing data. It’s called “shared” because multiple processes can have a shared lock on the same resource, allowing read-only access. However, sharing resources prevents other processes from modifying the resource.
Exclusive | Gives exclusive access to a resource and is also used during data modification. Only one process might have an active exclusive lock on a resource.
Update    | Used to inform other processes that you’re planning to modify the data. Other connections might also issue shared locks, but not update or exclusive locks, while you’re still preparing to do the modification. Update locks are used to prevent deadlocks (covered later in this section) by marking rows that a statement will possibly update rather than upgrading directly from a shared lock to an exclusive one.
Intent    | Communicates to other processes that taking one of the previously listed modes might be necessary. It establishes a lock hierarchy with existing locks. You might see this mode as intent shared, intent exclusive, or shared with intent exclusive.
Schema    | Used to lock the structure of a resource when it’s in use so you cannot alter a structure (like a table) when a user is reading data from it. (Schema locks show up as part of the mode in many views.)

As queries are performing different operations, such as querying data, modifying data, or changing objects, resources are locked in a given mode. Blocking comes when one connection has a resource locked in a certain mode, and another connection needs to lock a resource in an incompatible mode. You can see the compatibility of different modes in Table 14-4.

Note

To read this table, pick the lock mode in one axis; an X will be displayed in any compatible column in the other axis. For example, an update lock is compatible with an intent shared and a shared lock, but not with another update lock, or any of the exclusive variants.

Table 14-4 Lock modes and compatibility

Mode                          | IS | S | U | IX | SIX | X
Intent shared (IS)            | X  | X | X | X  | X   |
Shared (S)                    | X  | X | X |    |     |
Update (U)                    | X  | X |   |    |     |
Intent exclusive (IX)         | X  |   |   | X  |     |
Shared intent exclusive (SIX) | X  |   |   |    |     |
Exclusive (X)                 |    |   |   |    |     |

If a connection is reading data, it takes a shared lock, which allows other readers to also take shared locks without blocking. However, if another connection is modifying data, it takes an exclusive lock, which prevents other connections from accessing the exclusively locked resources in any manner (other than ignoring the locks, as discussed later in this section).

How to observe locks and blocking

You can find out in real time whether a request is being blocked. The dynamic management object (DMO) sys.dm_exec_requests, when combined with sys.dm_exec_sessions on the session_id column, provides data about blocking and the state of sessions on the server. This provides much more information than the legacy sp_who or sp_who2 procedures, as you can see in this query:

--This query will return a plethora of information
--in addition to just the session that is blocked
SELECT r.session_id, r.blocking_session_id, *
FROM sys.dm_exec_sessions s
LEFT OUTER JOIN sys.dm_exec_requests r ON r.session_id = s.session_id;
--note: requests represent actions that are executing, sessions are connections,
--hence LEFT OUTER JOIN
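If you only want to see requests that are currently blocked, and by whom, a minimal variation of the query above filters on blocking_session_id:

--Show only requests that are currently blocked, and the session blocking them
SELECT r.session_id, r.blocking_session_id, r.wait_type, r.wait_time, s.login_name
FROM sys.dm_exec_requests AS r
INNER JOIN sys.dm_exec_sessions AS s
    ON s.session_id = r.session_id
WHERE r.blocking_session_id <> 0;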

You can see details of what objects are locked by using the sys.dm_tran_locks DMO, or what locks are involved in blocking by using this query:

SELECT t1.resource_type,
       t1.resource_database_id,
       t1.resource_associated_entity_id,
       t1.request_mode,
       t1.request_session_id,
       t2.blocking_session_id
FROM sys.dm_tran_locks AS t1
INNER JOIN sys.dm_os_waiting_tasks AS t2
    ON t1.lock_owner_address = t2.resource_address;

The output of this query reveals the type of resource that is locked (listed in Table 14-2) and the lock mode (listed in Table 14-3), with a few exceptions.

Now, let’s review some scenarios to detail exactly why and how requests can block one another in the real world when using disk-based tables. This is the foundation of concurrency in SQL Server and helps you understand why the NOLOCK query hint appears to make queries perform faster.

Change the isolation level

As mentioned, by default, connections use the READ COMMITTED isolation level. If you need to change that for a session, there are two methods: using the SET TRANSACTION ISOLATION LEVEL statement and using hints. In this manner, the isolation level can be changed for an entire transaction, one statement, or one object in a statement.
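Before changing anything, you can confirm which isolation level your session is currently using. The following sketch maps the documented values of the transaction_isolation_level column in sys.dm_exec_sessions:

--Report the isolation level of the current session
SELECT session_id,
       CASE transaction_isolation_level
            WHEN 0 THEN 'Unspecified'
            WHEN 1 THEN 'READ UNCOMMITTED'
            WHEN 2 THEN 'READ COMMITTED'
            WHEN 3 THEN 'REPEATABLE READ'
            WHEN 4 THEN 'SERIALIZABLE'
            WHEN 5 THEN 'SNAPSHOT'
       END AS isolation_level
FROM sys.dm_exec_sessions
WHERE session_id = @@SPID;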

Use the SET TRANSACTION ISOLATION LEVEL statement

You can change the isolation level of a connection at any time, even when already executing in the context of an uncommitted transaction. You cannot, however, switch to or from the SNAPSHOT isolation level after a transaction has started because, as we’ll discuss later in this chapter, this isolation level works very differently.

For example, the following code snippet is valid; it changes from READ COMMITTED to SERIALIZABLE in the middle of a transaction. If one statement in your batch requires the protection of SERIALIZABLE, but the others do not, you can execute:

SET TRANSACTION ISOLATION LEVEL READ COMMITTED;
BEGIN TRAN;
SET TRANSACTION ISOLATION LEVEL SERIALIZABLE;
SELECT...;
SET TRANSACTION ISOLATION LEVEL READ COMMITTED;
COMMIT TRAN;

This code snippet is trying to change from the READ COMMITTED isolation level to the SNAPSHOT isolation level:

SET TRANSACTION ISOLATION LEVEL READ COMMITTED;
BEGIN TRAN;
SET TRANSACTION ISOLATION LEVEL SNAPSHOT;
SELECT...

Attempting this results in the following error:

Msg 3951, Level 16, State 1, Line 4
Transaction failed in database 'databasename' because the statement was run under
snapshot isolation but the transaction did not start in snapshot isolation. You cannot
change the isolation level of the transaction after the transaction has started.

In .NET applications, you should explicitly set the isolation level of each transaction when it is created, because the default might not be READ COMMITTED, which generally offers far better performance than the more restrictive isolation levels.

Use table hints to change isolation

You also can use isolation level hints to change the isolation level at the individual object level. This is an advanced type of coding that you shouldn’t use frequently, because it generally increases the complexity of maintenance and muddies architectural decisions with respect to concurrency. Just as in the previous section, however, you might want to hold locks at a SERIALIZABLE level for one table but not others in the query. For example, you might have seen developers use NOLOCK at the end of a table, effectively (and dangerously) dropping access to that table into the READ UNCOMMITTED isolation level:

SELECT col1 FROM dbo.TableName (NOLOCK);

Note

Aside from the inadvisable use of NOLOCK in the preceding example, using a table hint without WITH is deprecated syntax (since SQL Server 2008). It should be written like this, if you need to ignore locks:

SELECT col1 FROM dbo.TableName WITH (READUNCOMMITTED);

In addition to the (generally undesirable) NOLOCK query hint, there are 20-plus other table hints that can be useful, including the ability for a query to use a certain index, to force a seek or scan on an index, or to override the Query Optimizer’s locking strategy. We discuss how to use UPDLOCK later in this chapter—for example, to force the use of the SERIALIZABLE isolation level.
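For example, to hold SERIALIZABLE-level locks on just one table in a query while the rest of the statement runs at the session’s isolation level, you can use the SERIALIZABLE (or HOLDLOCK) table hint. The table and column names in this sketch are hypothetical:

--Locks on dbo.Orders are held to the end of the transaction, as in SERIALIZABLE;
--dbo.OrderLines still uses the session's isolation level
SELECT o.OrderId, ol.Quantity
FROM dbo.Orders AS o WITH (SERIALIZABLE)
INNER JOIN dbo.OrderLines AS ol
    ON ol.OrderId = o.OrderId
WHERE o.OrderId = 123;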

In almost every case, table hints should be considered for temporary and/or highly situational purposes. Table hints can make maintenance of these queries problematic, and could even cause surprise errors in the future. For example, using the INDEX or FORCESEEK table hint could result in poor query performance or even cause the query to fail if the table’s indexes are changed.

Understand and handle common concurrency scenarios

Here we look at some common concurrency scenarios and discuss and demonstrate how SQL Server processes the rows affected by the scenario.

Note

Chapter 7 covers memory-optimized tables. Their concurrency model is very different from disk-based tables, though similar to how row-versioned concurrency is implemented, particularly the SNAPSHOT isolation level. The differences for memory-optimized tables are discussed later in this chapter.

Understand two requests updating the same rows

Two users attempting to modify the same resource is possibly the most obvious concurrency issue. As an example, suppose one user wants to add $100 to a total, and another wants to add $50. If both processes happened truly simultaneously, one change could overwrite the other; or, taken to the absurd extreme, data corruption could occur in the physical structures holding the data if pointers were mixed up by multiple simultaneous modifications.

Consider the following steps involving two writes, with each transaction coming from a different session. The transactions are explicitly declared using the BEGIN/COMMIT TRANSACTION syntax. In this example, the transactions use the default isolation level READ COMMITTED.

All examples have these two rows simply so it isn’t just a single row, though we only manipulate the row where Type = 1. When testing more complex concurrency scenarios, it is best to have large quantities of data to work with, as indexing, server resources, and so on come into play. These examples illustrate the fundamental concurrency behaviors.

  1. A table contains only two rows with a column Type containing values of 0 and 1.

    CREATE SCHEMA Demo;
    GO
    CREATE TABLE Demo.RC_Test (Type int);
    INSERT INTO Demo.RC_Test VALUES (0),(1);
  2. Transaction 1 begins and updates all rows from Type = 1 to Type = 2.

    --Transaction 1
    SET TRANSACTION ISOLATION LEVEL READ COMMITTED;
    BEGIN TRANSACTION;
    UPDATE Demo.RC_Test SET Type = 2
    WHERE  Type = 1;
  3. Before transaction 1 commits, transaction 2 begins, and issues a statement to update Type = 2 to Type = 3. Transaction 2 is blocked and waits for transaction 1 to commit.

    --Transaction 2
    SET TRANSACTION ISOLATION LEVEL READ COMMITTED;
    UPDATE Demo.RC_Test SET Type = 3
    WHERE  Type = 2;
  4. Transaction 1 commits.

    --Transaction 1
    COMMIT;
  5. Transaction 2 is no longer blocked and processes its update statement. Transaction 2 then commits.

The resulting table will contain a row where Type = 3 and a row where Type = 0, because transaction 2 updated the row after the block ended. When transaction 2 issued its update, it had to wait for transaction 1’s exclusive lock to be released, which happened when transaction 1 committed.
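If you ran the steps exactly as written, a quick check (after both transactions have committed) confirms the end state:

SELECT Type FROM Demo.RC_Test;
--Expected result: the values 0 and 3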

Understand how a write blocks a read

One of the most painful parts of blocking comes when writes prevent other users from reading data. What can be even more problematic is that some modification statements lock rows in the table even if they don’t actually change them (typically due to poorly written WHERE clauses or a lack of indexing causing full table scans).

Consider the following steps involving a write and a read, with each transaction coming from a different session. In this scenario, an uncommitted write in transaction 1 blocks a read in transaction 2. The transactions are explicitly started using the BEGIN/COMMIT TRANSACTION syntax. In this example, the transactions do not override the default isolation level of READ COMMITTED:

  1. A table with a column Type contains only two rows, with values of 0 and 1.

    IF SCHEMA_ID('Demo') IS NULL EXEC ('CREATE SCHEMA Demo AUTHORIZATION dbo;');
    CREATE TABLE Demo.RC_Test_Write_V_Read (Type int);
    INSERT INTO Demo.RC_Test_Write_V_Read VALUES (0),(1);
  2. Transaction 1 begins and updates rows with Type = 1 to Type = 2.

    --Transaction 1
    SET TRANSACTION ISOLATION LEVEL READ COMMITTED;
    BEGIN TRANSACTION;
    UPDATE Demo.RC_Test_Write_V_Read SET Type = 2
    WHERE  Type = 1;

    Note that transaction 1 has not committed or rolled back.

  3. Before transaction 1 commits, in another session, transaction 2 begins and issues a SELECT statement for rows WHERE Type = 2. Transaction 2 is blocked and waits for transaction 1 to commit.

    --Transaction 2
    SET TRANSACTION ISOLATION LEVEL READ COMMITTED;
    SELECT Type
    FROM   Demo.RC_Test_Write_V_Read
    WHERE  Type = 2;
  4. Transaction 1 commits.

    --Transaction 1
    COMMIT;
  5. Transaction 2 is no longer blocked and processes its SELECT statement.

  6. Transaction 2 returns one row where Type = 2. This is because transaction 2 waited for committed data; by the time transaction 1 committed and released its locks, the row had been updated to Type = 2, so the SELECT returned it.

Understand nonrepeatable reads

There are certain scenarios where you need the same row values returned every time you issue a SELECT statement, or read the same data in any data manipulation language (DML) statement. A prime example is the case where you check for the existence of some data before allowing some other action to occur. For example: insert an order row, but only if a payment exists. If that payment is changed or deleted while you are creating the order, free products might be shipped!

Consider the following steps involving a read and a write. In this example, the transactions do not override the default isolation level of READ COMMITTED, and each transaction is started from a different session. The transactions are explicitly declared using the BEGIN/COMMIT TRANSACTION syntax. In this scenario, transaction 1 will suffer a nonrepeatable read when it reads rows that are changed by a different connection because the default READ COMMITTED does not offer any protection against phantom or nonrepeatable reads.

  1. A table contains only two rows with a column Type value of 0 and 1.

    CREATE TABLE Demo.RR_Test (Type int);
    INSERT INTO Demo.RR_Test VALUES (0),(1);
  2. Transaction 1 starts and retrieves rows where Type = 1. One row is returned for Type = 1.

    --Transaction 1
    SET TRANSACTION ISOLATION LEVEL READ COMMITTED;
    BEGIN TRANSACTION;
    SELECT Type
    FROM   Demo.RR_Test
    WHERE  Type = 1;
  3. Before transaction 1 commits, transaction 2 starts and issues an UPDATE statement, setting rows of Type = 1 to Type = 2. Transaction 2 is not blocked and is immediately processed.

    --Transaction 2
    BEGIN TRANSACTION;
    SET TRANSACTION ISOLATION LEVEL READ COMMITTED;
    UPDATE Demo.RR_Test
    SET  Type = 2
    WHERE Type = 1;
  4. Transaction 1 again selects rows where Type = 1 and is blocked.

    --Transaction 1
    SELECT Type
    FROM   Demo.RR_Test
    WHERE  Type = 1;
  5. Transaction 2 commits.

    --Transaction 2
    COMMIT;
  6. Transaction 1 is immediately unblocked. No rows are returned, because no committed rows now exist where Type = 1. Transaction 1 commits.

    --Transaction 1
    COMMIT;

The second result set from transaction 1 is empty, because transaction 2 modified the row so that it no longer matched the WHERE Type = 1 predicate. When transaction 2 started, transaction 1 had not kept any locks on the data, allowing the write to happen; because it was only reading, transaction 1 released its shared locks as soon as each read completed. Transaction 1 suffered from a nonrepeatable read: the same SELECT statement returned different data during the same multistep transaction.

Prevent a nonrepeatable read

Consider the following steps involving a read and a write, with each transaction coming from a different session. This time, we protect transaction 1 from dirty reads and nonrepeatable reads by using the REPEATABLE READ isolation level. A read in the REPEATABLE READ isolation level will block a write. The transactions are explicitly declared using the BEGIN/COMMIT TRANSACTION syntax:

  1. A table contains only rows with a column Type value of 0 and 1.

    CREATE TABLE Demo.RR_Test_Prevent (Type int);
    INSERT INTO Demo.RR_Test_Prevent VALUES (0),(1);
  2. Transaction 1 starts and selects rows where Type = 1 in the REPEATABLE READ isolation level. One row with Type = 1 is returned.

    --Transaction 1
    SET TRANSACTION ISOLATION LEVEL REPEATABLE READ;
    BEGIN TRANSACTION;
    SELECT Type
    FROM   Demo.RR_Test_Prevent
    WHERE  Type = 1;
  3. Before transaction 1 commits, transaction 2 starts and issues an UPDATE statement, setting rows of Type = 1 to Type = 2. Transaction 2 is blocked by transaction 1.

    --Transaction 2
    BEGIN TRANSACTION;
    SET TRANSACTION ISOLATION LEVEL READ COMMITTED;
    UPDATE Demo.RR_Test_Prevent
    SET  Type = 2
    WHERE Type = 1;
  4. Transaction 1 again selects rows where Type = 1. The same rows are returned as in step 2.

  5. Transaction 1 commits.

    --Transaction 1
    COMMIT TRANSACTION;
  6. Transaction 2 is immediately unblocked and processes its update. Transaction 2 commits.

    --Transaction 2
    COMMIT TRANSACTION;

Transaction 1 returned the same rows each time and did not suffer a nonrepeatable read. The resulting table contains two rows, one where Type = 2, and the original row where Type = 0. This is because when transaction 2 started, transaction 1 had placed read locks on the data it was selecting, blocking writes until it committed. Transaction 2 processed its updates only when it could place exclusive locks on the rows it needed.
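If you repeat step 2 and, before committing, query sys.dm_tran_locks from that same session, you can see the shared locks that REPEATABLE READ continues to hold. The exact resource types you see depend on indexes and row counts, so treat this as a rough sketch:

--Run from the session holding the open REPEATABLE READ transaction
SELECT resource_type, request_mode, request_status
FROM sys.dm_tran_locks
WHERE request_session_id = @@SPID;
--Expect shared (S) locks that remain granted until the transaction ends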

Understand phantom rows

Phantom rows cause issues for transactions when you expect the exact same result set back from a query each time. Say you’re writing a value to a table that sums up 100 other values (flouting the fundamentals of database design’s normalization rules!). For example, consider a financial transactions ledger table that stores a calculated current balance. You sum the 100 rows, then write the value. If the total of the 100 rows must match perfectly, you cannot allow nonrepeatable reads or phantom rows.

Consider the following steps involving a read and a write, with each transaction coming from a different session. In this scenario, we describe a phantom read:

  1. A table contains only two rows, with Type values 0 and 1.

    CREATE TABLE Demo.PR_Test (Type int);
    INSERT INTO Demo.PR_Test VALUES (0),(1);
  2. Transaction 1 starts and selects rows where Type = 1 in the REPEATABLE READ isolation level. Rows are returned.

    --Transaction 1
    SET TRANSACTION ISOLATION LEVEL REPEATABLE READ;
    BEGIN TRANSACTION;
    SELECT Type
    FROM   Demo.PR_Test
    WHERE  Type = 1;
  3. Before transaction 1 commits, transaction 2 starts and issues an INSERT statement, adding another row where Type = 1. Transaction 2 is not blocked by transaction 1.

    --Transaction 2
    SET TRANSACTION ISOLATION LEVEL READ COMMITTED;
    INSERT INTO Demo.PR_Test(Type)
    VALUES(1);
  4. Transaction 1 again selects rows where Type = 1. An additional row is returned compared to the first time transaction 1 ran the SELECT.

    --Transaction 1
    SELECT Type
    FROM   Demo.PR_Test
    WHERE  Type = 1;
  5. Transaction 1 commits.

    --Transaction 1
    COMMIT TRANSACTION;

Transaction 1 experienced a phantom read when it returned a different number of rows the second time it selected from the table inside the same transaction. Transaction 1 had not placed any locks on the range of data it needed, allowing writes by another transaction within that same range. The phantom read would have occurred to transaction 1 in any isolation level except SERIALIZABLE (and SNAPSHOT, which reads row versions rather than blocking). Let’s look at SERIALIZABLE next.

Prevent phantom reads

Consider the following steps involving a read and a write, with each transaction coming from a different session. In this scenario, we protect transaction 1 from a phantom read.

  1. A table contains two rows with Type values of 0 and 1.

    CREATE TABLE Demo.PR_Test_Prevent (Type int);
    INSERT INTO Demo.PR_Test_Prevent VALUES (0),(1);
  2. Transaction 1 starts and selects rows where Type = 1 in the SERIALIZABLE isolation level. The one row where Type = 1 is returned.

    --Transaction 1
    SET TRANSACTION ISOLATION LEVEL SERIALIZABLE;
    BEGIN TRANSACTION;
    SELECT Type
    FROM   Demo.PR_Test_Prevent
    WHERE  Type = 1;
  3. Before transaction 1 commits, transaction 2 starts and issues an INSERT statement, adding a row of Type = 1. Transaction 2 is blocked by transaction 1.

    --Transaction 2
    SET TRANSACTION ISOLATION LEVEL READ COMMITTED;
    INSERT INTO Demo.PR_Test_Prevent(Type)
    VALUES(1);
  4. Transaction 1 again selects rows where Type = 1. The same result set is returned as it was in step 2—the one row where Type = 1.

    --Transaction 1
    SELECT Type
    FROM   Demo.PR_Test_Prevent
    WHERE  Type = 1;
  5. Transaction 1 executes COMMIT TRANSACTION.

    --Transaction 1
    COMMIT TRANSACTION;
  6. Transaction 2 is immediately unblocked and processes its insert. Transaction 2 commits.

If you query the table again, you will see there are now two rows where Type = 1.

Transaction 1 did not suffer from a phantom read the second time it selected from the table because it had placed a lock on the range of rows it needed. The table now contains additional rows where Type = 1, but they were not inserted until after transaction 1 had committed.

The case against the READ UNCOMMITTED isolation level

If locks take time, ignoring those locks will make things go faster. While this is true, the tradeoffs are often not worth it.

Note

This section also pertains to using the NOLOCK hint on your queries.

Locks coordinate our access to resources, allowing multiple users to do multiple things in the database without trampling other users’ changes. The READ COMMITTED isolation level (and an extension we will discuss in the section on the SNAPSHOT isolation level called READ COMMITTED SNAPSHOT) strikes the best balance between locking and performance. Locks are still respected on dirty resources (exclusively locked data that has been changed by another user), but shared locks are held only long enough to perform the read on a row and are then released. The process is as follows:

  1. Grab a lock on a resource.

  2. Read that resource.

  3. Release the lock on the resource.

  4. Repeat until you are out of resources to read.

No one else can dirty (modify) the row we are reading because of the lock, but when we are done, we release the lock and move on. Locks on modifications to on-disk tables work the same way in all isolation levels, even READ UNCOMMITTED, and are held until the transaction is committed.

The effect of the NOLOCK table hint and the READ UNCOMMITTED isolation level is that no locks are taken inside the database for reads, save for schema stability locks. (A query using NOLOCK could still be blocked by data definition language (DDL) commands, such as an offline indexing operation.) The net effect is that if you enable the READ UNCOMMITTED isolation level for your connection, reads will go faster.

This is a strategy that many DBAs and developers have tried before: “We had performance problems, but we’ve been putting NOLOCK in all our stored procedures to fix it.” It can improve performance, but it can easily be detrimental to the integrity of your data.

The biggest issue is that a query might read data that doesn’t even meet the constraints of the system. So, if a $1,000,000 transaction is visible to a query and is later rolled back (perhaps because the payment details failed), who knows what celebratory alarms might have gone off, suggesting we had $1,000,000 in sales today!

The case against using the READ UNCOMMITTED isolation level is deeper than performance and more than simply reading dirty data. A developer might argue that data is rarely ever rolled back, or that the data is for reporting only. In production environments, however, these arguments are not enough to justify the potential problems.

A query in the READ UNCOMMITTED isolation level could return invalid data in the following real-world, provable ways:

  • Read uncommitted data (dirty reads).

  • Read committed data twice.

  • Skip committed data.

  • Return corrupted data.

  • The query could fail with the error “Could not continue scan with NOLOCK due to data movement.” In this scenario, where you ignored locks, the data structure that was to be scanned no longer exists because of other changes to data pages. The solution to this problem is the solution to a lot of concurrency issues: be prepared to re-execute your batch on this failure, as sketched below.
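Here is a minimal retry sketch. The stored procedure name is hypothetical; the pattern retries on error 601 (the NOLOCK data-movement error), and the same structure works for deadlocks (error 1205) and the SNAPSHOT update conflicts (error 3960) discussed later in this chapter.

DECLARE @Retries int = 3;
WHILE @Retries > 0
BEGIN
    BEGIN TRY
        EXEC dbo.SomeProcedure; --hypothetical batch that can fail under concurrency
        SET @Retries = 0;       --success: stop retrying
    END TRY
    BEGIN CATCH
        IF ERROR_NUMBER() IN (601, 1205, 3960) AND @Retries > 1
            SET @Retries -= 1;  --transient concurrency error: try again
        ELSE
            THROW;              --out of retries, or a different error: re-raise it
    END CATCH;
END;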

One final caveat: In SQL Server, you cannot apply NOLOCK to the target table of a modification statement, and SQL Server ignores a READ UNCOMMITTED isolation level declaration for the target tables of modification statements. For example:

INSERT INTO dbo.testnolock1 WITH (NOLOCK)
SELECT * FROM dbo.testnolock2;

SQL Server knows that it must use locks for the INSERT target, and makes this clear by throwing the following error:

Msg 1065, Level 15, State 1, Line 17
The NOLOCK and READUNCOMMITTED lock hints are not allowed for target tables of INSERT,
UPDATE, DELETE or MERGE statements.

However, this protection doesn’t apply to the source of any writes, hence yet another danger. The following code is allowed and is very dangerous because it could write invalid, uncommitted data!

INSERT INTO testnolock1
SELECT * FROM testnolock2 WITH (NOLOCK);

In summary, don’t use the READ UNCOMMITTED isolation level or NOLOCK unless you really understand the implications of reading dirty data and have an ironclad reason for doing so. One legitimate use: as a DBA, it is an invaluable tool to be able to see the changes to data being made in another connection before they are committed. For example,

SELECT COUNT(*) FROM dbo.TableName WITH (NOLOCK);

allows you to see the count of rows in dbo.TableName. However, using NOLOCK for performance gains is short-sighted.

Continue reading for the recommended ways to increase performance without the chance of seeing dirty data, as we introduce version-based concurrency in the next section.

Understand row version-based concurrency

In the interest of performance, application developers too often try to solve concurrency concerns (reduce blocking, limit access to locked objects) by avoiding the problem with the tantalizingly named NOLOCK. The performance gains appear too large to consider alternatives, since the problems we have mentioned happen only “occasionally,” even though those occasional problems can cost 30 hours of meetings, coding, and testing to track down, because they seem random and non-repeatable.

A far safer option, without the significant drawbacks and potential for invalid data and errors, is to read a previously committed version of the data using row versioning. This gives you tremendous concurrency gains and never lets the user see dirty data.

Version-based concurrency is available in the SNAPSHOT isolation level, or by altering the implementation of READ COMMITTED to READ COMMITTED SNAPSHOT. The latter is often referred to as RCSI as a shortcut, even in this book, but it is not a separate isolation level; rather, it is a database-level setting that changes how READ COMMITTED behaves.

Row versioning allows queries to read rows that are locked by other queries, by storing previous versions of each modified row and serving those versions to readers. The SQL Server instance’s tempdb keeps the copies of committed data, which can be served to concurrent requests. In this way, row versioning allows access only to committed data, without blocking access to data locked by writes. At the cost of increased tempdb workload for disk-based tables, concurrency is dramatically improved, without the dangers of accessing uncommitted data.

The SNAPSHOT isolation level works at the transaction level. Once you start a transaction and access any data in the transaction, such as with the SELECT statement in the following snippet,

ALTER DATABASE dbname SET ALLOW_SNAPSHOT_ISOLATION ON; -- required once
SET TRANSACTION ISOLATION LEVEL SNAPSHOT;
BEGIN TRANSACTION;
SELECT * FROM dbo.Table1;
--Don't forget to COMMIT or ROLLBACK this transaction
--if you execute this code with a real table

your queries will see a transactionally consistent view of the database. No matter what someone does to the data in dbo.Table1 (or any other table in the same database), you will always see how the data looked as of the start of the first statement executed in that database in your transaction (in this case the SELECT statement). This is great for some things, such as reporting. SNAPSHOT gives you the same level of consistency to the data you are reading as SERIALIZABLE, except that work can continue even while things are changing. It is not susceptible to nonrepeatable reads and phantom rows.

The SNAPSHOT isolation level can be problematic for certain types of code because if you need to check if a row exists to do some action, you can’t see if the row was created or deleted after you started your transaction context. And as discussed, you can’t switch out of SNAPSHOT temporarily, then apply locks to prevent non-repeatable reads, and go back to seeing a consistent, yet possibly expired, view of the database.

Access data in SNAPSHOT isolation level

The beauty of the SNAPSHOT isolation level is its effect on readers of the data. If you want to query the database, you generally want to see it in a consistent state, and you don’t want to block others. A typical example is when you are writing an operational report. Suppose you query a child table and you get back 100 rows with distinct parentId values. But querying the parent table indicates there are only 50 parentId values (because between queries, another process deleted the other 50).

Consider the following steps involving a read and a write, with each transaction coming from a different session. In this scenario, we see that transaction 2 has access to previously committed row data, even though those rows are being updated concurrently.

  1. A table contains only rows with a column Type value of 0 or 1.

    CREATE TABLE Demo.SS_Test (Type int);
    INSERT INTO Demo.SS_Test VALUES (0),(1);
  2. Transaction 1 starts and updates rows where Type = 1 to Type = 2.

    --Transaction 1
    BEGIN TRANSACTION;
    SET TRANSACTION ISOLATION LEVEL READ COMMITTED;
    
    UPDATE Demo.SS_Test
    SET  Type = 2
    WHERE Type = 1;
  3. Before transaction 1 commits, transaction 2 sets its session isolation level to SNAPSHOT and executes BEGIN TRANSACTION.

    --Transaction 2
    SET TRANSACTION ISOLATION LEVEL SNAPSHOT;
    BEGIN TRANSACTION;
  4. Transaction 2 issues a SELECT statement WHERE Type = 1. Transaction 2 is not blocked by transaction 1, and the row where Type = 1 is returned.

    --Transaction 2
    SELECT Type
    FROM   Demo.SS_Test
    WHERE  Type = 1;
  5. Transaction 1 executes a COMMIT TRANSACTION.

  6. Transaction 2 again issues a SELECT statement WHERE Type = 1. The same row from step 4 is returned. Even if the table has all its data deleted, the results will always be the same for the same query while in the SNAPSHOT level transaction. When transaction 2 is committed or rolled back, subsequent queries on that connection will see the changes that have occurred since the transaction started.

Transaction 2 was not blocked when it attempted to query rows that transaction 1 was updating. It had access to previously committed data, thanks to row versioning.

Implement row-versioned concurrency

You can implement row-versioned isolation levels in a database in two different ways:

  • Enabling SNAPSHOT isolation. This simply allows for the use of SNAPSHOT isolation and begins the background process of row versioning.

  • Enabling RCSI. This changes the default isolation level to READ COMMITTED SNAPSHOT.

You can implement both or either.

It’s important now to introduce another fundamental database concept: pessimistic versus optimistic concurrency. Pessimistic concurrency uses locks to prevent write conflict errors. This is the approach SQL Server takes by default with the READ COMMITTED isolation level. Optimistic concurrency uses row versions with a tolerance for write conflict errors and requires sometimes sophisticated conflict resolution.

It’s important to understand the differences between these two settings, because they are not the same:

  • READ COMMITTED SNAPSHOT configures optimistic concurrency for reads by overriding the behavior of the database’s default isolation level. When enabled, all queries use RCSI instead of READ COMMITTED unless overridden.

  • SNAPSHOT isolation mode configures optimistic concurrency for reads and writes. You must then specify the SNAPSHOT isolation level for any transaction to use SNAPSHOT isolation level. It is possible to have update conflicts with SNAPSHOT isolation mode that will not occur with READ COMMITTED SNAPSHOT. Update conflicts are covered in the next section.

The statements to implement SNAPSHOT isolation in the database are not without consequence. Even if no transactions or statements use the SNAPSHOT isolation level, behind the scenes, tempdb begins storing row version data for disk-based tables, minimally for the length of the transaction that modifies the row. This way, if a row-versioned transaction starts while rows are being modified, the previous versions are available.

Note

Memory-optimized tables share properties with SNAPSHOT isolation level but are implemented in an extremely different manner. They are based completely on row-versioning and do not use tempdb. Memory-optimized tables are further discussed in Chapter 15, “Understand and design indexes.”

The following code snippet allows all transactions in a database to start in the SNAPSHOT isolation level:

ALTER DATABASE databasename SET ALLOW_SNAPSHOT_ISOLATION ON;

After you execute only the preceding statement, all transactions will continue to use the default READ COMMITTED isolation level, but you now can specify the use of the SNAPSHOT isolation level at the session level or in table hints, as shown in the following example:

SET TRANSACTION ISOLATION LEVEL SNAPSHOT;

Using the SNAPSHOT isolation level on an existing database can be a lot of work, and, as we discuss in the next section, it changes how queries behave in some very important ways. Because of the optimistic concurrency approach, what was once write blocking becomes an error message telling you to try again. Alternatively, if you want to apply the “go faster” solution that mostly works with existing code, you can alter the meaning of READ COMMITTED to read row versions instead of waiting for locks to clear.

While SNAPSHOT isolation works at the transaction level, READ COMMITTED SNAPSHOT works at the statement level. You can use READ_COMMITTED_SNAPSHOT independently of ALLOW_SNAPSHOT_ISOLATION. Similarly, these settings are not tied to the MEMORY_OPTIMIZED_ELEVATE_TO_SNAPSHOT database setting to promote memory-optimized table access to SNAPSHOT isolation.

Here’s how to enable RCSI:

ALTER DATABASE databasename SET READ_COMMITTED_SNAPSHOT ON;
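You can verify both row-versioning settings for your databases with a simple catalog query:

SELECT name, snapshot_isolation_state_desc, is_read_committed_snapshot_on
FROM sys.databases;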

Caution

Changing the READ_COMMITTED_SNAPSHOT database option on a live database where you have memory-optimized tables set to DURABILITY = SCHEMA_ONLY will empty those tables. You need to move the contents of the table to a more durable table before changing the state of READ_COMMITTED_SNAPSHOT.

  • Chapter 7 discusses memory-optimized tables in greater detail.

For either of the previous ALTER DATABASE statements to succeed, no other transactions can be open in the database. It might be necessary to close other connections manually or to put the database in SINGLE_USER mode. Either way, we do not recommend that you perform this change during production activity.

It is essential to be aware of and prepared for the increased demands on the tempdb database, both in activity and in space requirements. To avoid autogrowth events when enabling RCSI, increase the size of the tempdb data and log files and monitor their size. Although you should try to avoid autogrowth events by growing the tempdb data file(s) yourself, you should also verify that your tempdb file autogrowth settings are appropriate in case things grow larger than expected.

  • For more information on file autogrowth settings, see Chapter 8.

If tempdb exhausts all available space on its volume, SQL Server will be unable to create row versions for transactions and will terminate them with SQL Server error 3958. You can find these errors in the SQL Server Error Log. (Refer to Chapter 1, “Get started with SQL Server tools.”) SQL Server will also issue errors 3967 and 3966 as the oldest row versions are removed from tempdb to make room for new row versions needed by newer transactions.
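To keep an eye on how much space the version store is consuming in tempdb, you can query sys.dm_tran_version_store_space_usage (available in recent versions of SQL Server). In this sketch, the conversion to megabytes is only for readability:

--Version store space consumed in tempdb, per database
SELECT DB_NAME(database_id) AS database_name,
       reserved_space_kb / 1024.0 AS reserved_space_mb
FROM sys.dm_tran_version_store_space_usage
ORDER BY reserved_space_kb DESC;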

Note

Before SQL Server 2016, the READ COMMITTED SNAPSHOT and SNAPSHOT isolation levels were not supported with columnstore indexes. Beginning with SQL Server 2016, SNAPSHOT isolation and columnstore indexes are fully compatible.

Understand update operations in the SNAPSHOT isolation level

Transactions that read data in SNAPSHOT isolation or RCSI have access to previously committed data, instead of being blocked, when the data they need is being changed. This is important to understand, because it can result in an UPDATE statement encountering a concurrency error when you start to change data. Update conflicts change how systems behave; you need to understand this concept before deciding to implement your code in the SNAPSHOT isolation level.

When modifying data in the SNAPSHOT isolation level, there can be only one uncommitted version of a physical row. So, if another connection modifies a row and you only read that row, you see the previous version. But if you try to change a row that another connection has also modified since your transaction started, your update would be based on out-of-date information, and your transaction is rolled back.

For example, consider the following steps, with each transaction coming from a different session. In this example, transaction 2 fails due to a concurrency conflict, or write-write error:

  1. A table contains multiple rows, each with a unique Type value.

    CREATE TABLE Demo.SS_Update_Test
    (Type int CONSTRAINT PKSS_Update_Test PRIMARY KEY,
     Value nvarchar(10));
    INSERT INTO Demo.SS_Update_Test VALUES (0,'Zero'),(1,'One'),(2,'Two'),(3,'Three');
  2. Transaction 1 begins a transaction in the READ COMMITTED isolation level and performs an update on the row where Type = 1.

    --Transaction 1
    BEGIN TRANSACTION;
    UPDATE Demo.SS_Update_Test
    SET  Value = 'Won'
    WHERE Type = 1;
  3. Transaction 2 sets its session isolation level to SNAPSHOT and issues a statement to update the row where Type = 1. This connection is blocked, waiting for the modification locks to clear.

    --Transaction 2
    SET TRANSACTION ISOLATION LEVEL SNAPSHOT;
    BEGIN TRANSACTION
    UPDATE Demo.SS_Update_Test
    SET  Value = 'Wun'
    WHERE Type = 1;
  4. Transaction 1 commits using a COMMIT TRANSACTION statement. Transaction 1’s update succeeds.

  5. Transaction 2 immediately fails with error 3960:

Msg 3960, Level 16, State 2, Line 8
Snapshot isolation transaction aborted due to update conflict. You cannot use
snapshot isolation to access table 'dbo.AnyTable' directly or indirectly in
database 'DatabaseName' to update, delete, or insert the row that has been
modified or deleted by another transaction. Retry the transaction or change
the isolation level for the update/delete statement.

Transaction 2 was rolled back. Let’s try to understand why this error occurred, what to do about it, and how to prevent it.

Note

The SNAPSHOT isolation level with disk-based tables in SQL Server is not pure row-versioned concurrency, which is why in the previous example, transaction 2 was blocked by transaction 1. Using memory-optimized tables, which are based on pure row-versioned concurrency, the transaction would have failed immediately rather than being blocked. In either case, your application must have automated retry logic to gracefully handle update conflict errors.

In SQL Server, SNAPSHOT isolation on disk-based tables still uses locks for writes, which can create blocking, but locking does not prevent update conflicts. It is possible for a statement to fail when committing changes from an UPDATE statement if another transaction has changed the data needed for that update during a transaction in the SNAPSHOT isolation level.

For disk-based tables, the update conflict error will look like error 3960, which we saw a moment ago. For queries on memory-optimized tables, the update conflict error will look like this:

Msg 41302, Level 16, State 110, Line 8
The current transaction attempted to update a record that has been updated since this
transaction started. The transaction was aborted.

If you decide to use SNAPSHOT as your modification query's isolation level, you must be ready to handle an error that isn't really an error; rather, it's just a warning to re-execute your statements after checking to see whether anything has changed since you started your query. This is the same approach used when handling deadlock conditions, and it is the same for handling all modification conflicts with memory-optimized tables.
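
For instance, retry logic along the following lines could wrap the update from the earlier example. This is only a sketch, assuming SNAPSHOT isolation is enabled for the database and that your business logic allows re-executing the statement; error 3960 is the disk-based update conflict and 41302 is its memory-optimized counterpart.

DECLARE @Retries int = 0;
WHILE @Retries < 3
BEGIN
    BEGIN TRY
        SET TRANSACTION ISOLATION LEVEL SNAPSHOT;
        BEGIN TRANSACTION;
        --Re-check the current state here if your business logic requires it
        UPDATE Demo.SS_Update_Test
        SET  Value = 'Wun'
        WHERE Type = 1;
        COMMIT TRANSACTION;
        BREAK; --Success; leave the retry loop
    END TRY
    BEGIN CATCH
        IF @@TRANCOUNT > 0 ROLLBACK TRANSACTION;
        IF ERROR_NUMBER() IN (3960, 41302) --Update conflict; try again
            SET @Retries += 1;
        ELSE
            THROW; --Any other error is re-raised to the caller
    END CATCH;
END;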

Even though optimistic concurrency of the SNAPSHOT isolation level increases the potential for update conflicts, you can mitigate these by doing the following to specifically attempt to avoid update conflicts:

  • Minimize the length of transactions that modify data. While it seems like this would be less of an issue because readers aren’t blocked, long-running transactions increase the likelihood of modification conflicts. Also, tempdb needs to keep track of more row versions.

  • When running a modification in SNAPSHOT isolation level, avoid using statements that place update locks on disk-based tables inside multistep explicit transactions.

  • Specify the UPDLOCK table hint to prevent update conflict errors for long-running SELECT statements. UPDLOCK places locks on rows needed for the multistep transaction to complete. The use of UPDLOCK on SELECT statements with SNAPSHOT isolation level is not a panacea for update conflicts, however, and it could in fact create them. For example, frequent SELECT statements with UPDLOCK could increase the number of update conflicts. Regardless, your application must handle errors and initiate retries when appropriate.

    Note

    If two concurrent statements use UPDLOCK, with one updating and one reading the same data, even in implicit transactions, an update conflict failure is possible if not likely.

  • Consider avoiding writes altogether while in SNAPSHOT isolation mode. Use it only to do reads where you do not plan to write the data in the same transaction you have fetched it in.

Specifying lock granularity hints such as ROWLOCK or TABLOCK can prevent update conflicts, although at the cost of concurrency. The second update transaction must be blocked while the first update transaction is running, essentially bypassing SNAPSHOT isolation for the write. If two concurrent statements are both updating the same data in the SNAPSHOT isolation level, an update conflict failure is likely for the statement that started second.
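
As a sketch of the UPDLOCK approach described above, using the Demo.SS_Update_Test table from the earlier example, a read-then-update transaction in SNAPSHOT isolation can take update locks up front so that a competing writer is blocked rather than causing error 3960 at commit time:

SET TRANSACTION ISOLATION LEVEL SNAPSHOT;
BEGIN TRANSACTION;
--The UPDLOCK hint blocks other writers on these rows for the
--duration of the transaction, trading concurrency for safety
SELECT Value
FROM   Demo.SS_Update_Test WITH (UPDLOCK)
WHERE  Type = 1;
--...other work based on the value read...
UPDATE Demo.SS_Update_Test
SET  Value = 'Wun'
WHERE Type = 1;
COMMIT TRANSACTION;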

Understand on-disk versus memory-optimized concurrency

Queries using memory-optimized tables (also referred to as in-memory OLTP tables) can perform significantly faster than queries based on the same data in disk-based tables. Memory-optimized tables can improve the performance of frequently written-to tables by up to 40 times over disk-based tables.

However, this almost magical performance improvement comes at a price: not just the need for extra memory, but also a different implementation of concurrency controls than disk-based tables use. In the concurrency scenarios previously introduced, all the concurrency protections provided were based on locking; in other words, waiting until the other connection completed, and then applying the changes. However, locking applies only to on-disk tables, not memory-optimized tables.

With memory-optimized tables, locking isn’t the mechanism that ensures isolation. Instead, the in-memory engine uses pure row versioning to provide row content to each transaction. In pure row versioning, an UPDATE statement inserts a new row and updates the effective ending timestamp on the previous row. A DELETE statement only updates the effective ending timestamp on the current row. If you are familiar with the data warehousing concept of a slowly changing dimension (SCD), this is similar to an SCD type 2. It is equally similar to how temporal tables work, though both the current and historical data are in the same physical structure.

  • Image For more explanation of temporal tables, see Chapter 7.

Previous versions hang around as long as they are needed by transactions and are then cleaned up. For durability, the data is also hardened to the transaction log and to checkpoint file pairs (data and delta files).

If two transactions attempt to modify the same physical data resource at the same time, one transaction will immediately fail due to a concurrency error rather than being blocked and waiting. Only one transaction at a time can be in the process of modifying or removing a given row; the other will fail with a concurrency conflict (SQL error 41302). However, if two transactions insert the same value for the primary key, an error will not be returned until a transaction is committed, because the inserts do not involve the same physical resource.

This is the key difference between the behavior of pessimistic and optimistic concurrency. Pessimistic concurrency uses locks to prevent write conflict errors, whereas optimistic concurrency uses row versions with acceptable risk of write conflict errors. On-disk tables offer isolation levels that use pessimistic concurrency to block conflicting transactions, forcing them to wait. Memory-optimized tables offer optimistic concurrency that will cause a conflicting transaction to fail.

Memory-optimized tables allow you to use the SNAPSHOT, REPEATABLE READ, and SERIALIZABLE isolation levels, and provide the same types of protections. In the case of a nonrepeatable read, SQL error 41305 is raised. In the case of a phantom read, SQL error 41325 is raised. Because of these errors, applications that write to memory-optimized tables must include logic that gracefully handles and automatically retries transactions. They should already handle and retry in the case of deadlocks or other fatal database errors.

Understand memory-optimized data and isolation

All data read by a statement in memory-optimized tables behaves like the SNAPSHOT isolation level. Once your transaction starts and you access memory-optimized data in the database, further reads from memory-optimized tables will be from a consistent view of those objects. (The memory-optimized and on-disk tables are in different “containers,” so your consistent view of the memory-optimized data doesn’t start if you read on-disk tables only.)

However, what makes your work more interesting is that in the REPEATABLE READ or SERIALIZABLE isolation level, the check for phantom rows and non-repeatable reads is done at commit time rather than as the conflicts occur.

For example, consider the following steps, with each transaction coming from a different session. A typical scenario might be running a report of some data. You read a set of data, perform some operation on it, and then read another set of rows, and your process requires the data to all stay the same.

  1. A table contains many rows, each with a unique ID. Transaction 1 begins a transaction in the SERIALIZABLE isolation level and reads all the rows in the table.

  2. Transaction 2 updates the row where ID = 1. This transaction commits successfully.

  3. Transaction 1 again reads rows in this same table. No error is raised, and rows are returned as normal.

  4. Transaction 1 commits. An error is raised (41305), alerting you to a non-repeatable read. Even though this was in the SERIALIZABLE isolation level, the check for a non-repeatable read is done first, since this is a reasonably easy operation, whereas the scan for phantom rows requires the engine to run a query on the data to see if extra rows are returned.

Most uses of isolation levels other than SNAPSHOT with memory-optimized data should be limited to operations like data-integrity checks, where you want to make sure that one row exists before you insert the next row.

Specify the isolation level for memory-optimized tables in queries

The isolation level is specified in a few mildly confusing ways for memory-optimized tables. The first applies to an ad hoc query that is not in an existing transaction context: you can query the table as you always have, and it is accessed in the SNAPSHOT isolation level.

If you are in the context of a transaction, it will not automatically default to the SNAPSHOT isolation level. Rather you need to specify the isolation level as a hint, such as:

BEGIN TRANSACTION;
SELECT *
FROM   dbo.MemoryOptimizedTable WITH (SNAPSHOT);

You can make it default to SNAPSHOT isolation level by enabling the MEMORY_OPTIMIZED_ELEVATE_TO_SNAPSHOT database option. This promotes access to all memory-optimized tables in the database up to the SNAPSHOT isolation level if the current isolation level is not REPEATABLE READ or SERIALIZABLE. It also promotes the isolation level to SNAPSHOT from isolation levels such as READ UNCOMMITTED and READ COMMITTED. This option is disabled by default, but you should consider enabling it; otherwise, you cannot use the READ UNCOMMITTED or SNAPSHOT isolation levels for a session including memory-optimized tables.
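
For example, the following statement enables the option for the WideWorldImporters sample database; substitute your own database name:

ALTER DATABASE WideWorldImporters
    SET MEMORY_OPTIMIZED_ELEVATE_TO_SNAPSHOT = ON;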

If you need to use REPEATABLE READ or SERIALIZABLE, or your scenario does not meet the criteria for automatically choosing the SNAPSHOT isolation level, you can specify the isolation level using table concurrency hints. (See the section “Use table hints to change isolation” earlier in this chapter.) Only memory-optimized tables can use this SNAPSHOT table hint, not disk-based tables.
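
For instance, a sketch using the hypothetical dbo.MemoryOptimizedTable from the earlier example might look like the following; the REPEATABLEREAD hint could be swapped for SERIALIZABLE as needed:

BEGIN TRANSACTION;
--Table hint requests REPEATABLE READ semantics for this memory-optimized table
SELECT COUNT(*)
FROM   dbo.MemoryOptimizedTable WITH (REPEATABLEREAD);
COMMIT TRANSACTION;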

Finally, you cannot mix the disk-based SNAPSHOT isolation level with the memory-optimized SNAPSHOT isolation level. For example, you cannot include memory-optimized tables in a session that begins with SET TRANSACTION ISOLATION LEVEL SNAPSHOT, even if MEMORY_OPTIMIZED_ELEVATE_TO_SNAPSHOT = ON or you specify the SNAPSHOT table hint.

  • Image For more information on configuring memory-optimized tables, see Chapter 7. We discuss more about indexes for memory-optimized tables in Chapter 15.

Understand durability settings for performance

As discussed at the beginning of this chapter, the last of the four basic ACID requirements for an RDBMS is that saved data is durable. Once you believe it has been saved in the database, it has been written to non-volatile storage, and it is assumed that the data cannot be lost if the server restarts. This is a very important requirement of relational databases. It is also detrimental to performance, because non-volatile storage is slower than volatile memory.

There are two configurations that increase performance at the expense of durability:

  • Use memory-optimized tables and set the durability property on the table to SCHEMA_ONLY. This creates a logless, memory-based object that will be emptied when the service is restarted, with only the schema remaining. This can be useful in certain scenarios and provides amazing performance, even hundreds of times faster than on-disk tables. It is not, however, a universally applicable tool to increase application performance, because it makes the table completely non-durable. (An example of such a table appears after this list.)

  • Use delayed durability. This slightly relaxes the durability of your data, but where it makes sense, it can meaningfully reduce how long it takes to return control to your server's clients.

    Note

    The process of writing data to disk and memory is changing as powerful new technologies arrive. SQL Server 2019 introduced the hybrid buffer pool to use persistent memory modules (PMEM) that write data to non-volatile memory storage instead. Chapter 2, “Introduction to database server components,” provides more details on the hybrid buffer pool, as does the Microsoft Docs article located here: https://learn.microsoft.com/sql/database-engine/configure-windows/hybrid-buffer-pool.
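
As referenced in the first bullet, a non-durable memory-optimized table is declared with DURABILITY = SCHEMA_ONLY. The following is only a sketch with a hypothetical table name and columns, and it assumes the database already has a memory-optimized filegroup configured:

CREATE TABLE dbo.SessionCache
(
    SessionId int IDENTITY(1,1) NOT NULL
        PRIMARY KEY NONCLUSTERED HASH WITH (BUCKET_COUNT = 1000000),
    Payload nvarchar(4000) NOT NULL
)
WITH (MEMORY_OPTIMIZED = ON, DURABILITY = SCHEMA_ONLY);
--Rows exist only in memory; after a service restart the table is empty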

This section looks at a way to alter how durability is handled in SQL Server in a manner that can be useful on very small departmental servers as well as enterprise servers.

Delayed durability database options

Delayed durability allows transactions to avoid synchronously committing to a disk. Instead they synchronously commit only to memory, but asynchronously commit to storage. This opens the possibility of losing data in the event of a server shutdown before the log has been written, so it does have dangers. This engine feature was introduced in SQL Server 2014 and works the same today.

Databases in Azure SQL Database also support delayed durability transactions, with the same caveat and expectations for data recovery. Some data loss is possible, so you should use this feature only if you can re-create important transactions in the event of a server crash.

Note

Distributed and cross-database transactions are always durable.

At the database level, you can set the DELAYED_DURABILITY option to DISABLED (the default), ALLOWED, or FORCED. ALLOWED permits individual explicit transactions to opt in to delayed durability by using the following statement:

COMMIT TRANSACTION WITH ( DELAYED_DURABILITY = ON );

Setting the DELAYED_DURABILITY option to FORCED means that every transaction, regardless of what the person writing the COMMIT statement wishes, will have asynchronous log writes. This obviously has implications for all database activity, and you should consider it carefully with existing applications.
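
The database-level setting itself is changed with ALTER DATABASE. For example, against the WideWorldImporters sample database:

ALTER DATABASE WideWorldImporters SET DELAYED_DURABILITY = ALLOWED;
--Or force asynchronous log writes for every transaction in the database:
--ALTER DATABASE WideWorldImporters SET DELAYED_DURABILITY = FORCED;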

Additionally, for natively compiled procedures, you can specify DELAYED_DURABILITY in the BEGIN ATOMIC block. Take, for example, this header of a procedure in the WideWorldImporters sample database:

CREATE PROCEDURE [Website].[RecordColdRoomTemperatures_DD]
@SensorReadings Website.SensorDataList READONLY
WITH NATIVE_COMPILATION, SCHEMABINDING, EXECUTE AS OWNER
AS
BEGIN ATOMIC WITH
(
    TRANSACTION ISOLATION LEVEL = SNAPSHOT,
    LANGUAGE = N'English',
    DELAYED_DURABILITY = ON
)
    BEGIN TRY
     …

The delayed durability options, implemented at either the database level or the transaction level, have use in very-high-performance workloads for which the bottleneck to write performance is the transaction log itself. This is accomplished by keeping transaction log records in memory at commit time and writing them to disk asynchronously, in batches. Transactions could therefore be lost in the event of a SQL Server service shutdown; however, you can gain a significant performance increase, especially with write-heavy workloads. As Microsoft Docs advise, "If you cannot tolerate any data loss, do not use delayed transaction durability."

Even if you cannot employ delayed durability in your normal day-to-day operations, it can be a very useful setting when loading a database, particularly for test data. Because log writes are written asynchronously, instead of every transaction waiting for small log writes to complete, log writes are batched together in an efficient manner.

Note

While delayed durability does apply to memory-optimized tables, the DELAYED_DURABILITY database option is not related to the DURABILITY option used when creating memory-optimized tables.

A transaction that changes data under delayed durability will be flushed to the disk as soon as possible, whenever any other durable transaction commits in the same database, or whenever a threshold of delayed durability transactions builds up.

You can also force a flush of the transaction log with the system stored procedure sys.sp_flush_log. Otherwise, the transactions are written to a buffer in memory and await a log flush event to become durable on disk. SQL Server manages the buffer but makes no guarantees as to the amount of time a transaction can remain in buffer.
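
For example, the following forces all delayed durability transactions committed so far in the current database to become durable:

EXEC sys.sp_flush_log;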

It’s important to note that delayed durability is simply about reducing the I/O bottleneck involved with committing a massive quantity of writes to the transaction log. This has no effect on isolation (locking, blocking) or access to any data in the database that must be read to perform the write. Otherwise, delayed durability transactions follow the same rules as other transactions.

Note

Any SQL Server instance service shutdown, whether it be a planned restart or sudden failure, could result in delayed durability transactions being lost. This also applies when a failover cluster instance (FCI), availability group, or database mirror fails over. Transaction log backups and log shipping will similarly contain only transactions made durable. You must be aware of this potential when implementing delayed durability. For more information, see https://learn.microsoft.com/sql/relational-databases/logs/control-transaction-durability.

How SQL Server executes a query

This section dives into the execution plan, which is the operational map provided by SQL Server for each query. It mentions key features, especially database scoped configurations, that are important for being both proactive and reactive about query performance.

Understand the query execution process

When a user writes a query, that query could be one line of code like SELECT * FROM dbo.TableName; or it could be a statement that contains 1,000 lines. This query could be a batch of multiple statements, use temporary objects, or employ other coded objects such as user-defined functions, not to mention table variables, and perhaps a cursor thrown in for good measure.

After writing a query, the user tries to execute it.

First, the code is parsed and translated to structures in the native language of the query processor. If the query is syntactically correct T-SQL, it passes this phase. For example, if there is a syntax error, like SLECT * FROM dbo.TableName;, then the query will fail with a syntax error.

After the code is parsed and prepared for execution, the Query Optimizer tries to calculate the best way to run your code. It can then run it again using the same process, by saving what is referred to as the query plan, execution plan, or explain plan, depending on which tool you are using.

If the query has already been compiled, its plan might be in the cache. This is a complex decision: the Query Optimizer might choose to use the cached query plan, saving valuable CPU time. A new feature in SQL Server 2022, Parameter Sensitive Plan (PSP) optimization, also beneficially affects decisions regarding plan reuse. (More on that later.)

Getting the right query plan for different parameters, or server load, is where the real rocket science comes in for software engineers. That engine work is why a person with no understanding of how computers work can write a query in less than 30 seconds that will access 100 million rows, aggregate the values of one or more columns, filter out values they don’t want to see, and obtain results in just a few seconds.

You will deal with three kinds of execution plans:

  • Estimated. Something you can ask for, to show you what the execution plan will probably look like.

  • Actual. Provides details about what actually occurred, which can vary from the estimated plan for multiple reasons, including SQL Server’s IQP features, which we will discuss more later in this chapter.

  • Live. Shows you the rows of data flowing through during execution.

The execution plan that is created is a detailed blueprint of the Query Optimizer’s plan for processing any statement. Each time you run a statement, including batches with multiple statements, an execution plan is generated. Query plans share with you the strategy of execution for queries; inform you of the steps the Database Engine will take to retrieve data; detail the various transformation steps to sort, join, and filter data; and finally return or affect data. All statements you execute will create an execution plan, including DML and DDL statements.

The plan contains the estimated costs and other metadata of each piece required to process a query, and finally the DML or DDL operation itself. This data can be invaluable to developers and DBAs for tuning query performance. When you look at the query plan, you can see some of the estimates made, compared to the actual values.

Variances between the actual versus estimated values sometimes stack up to become real problems, and that’s why significant engineering effort was put into new features in SQL Server 2022 such as CE feedback, DOP feedback, and memory feedback. These powerful feedback features allow the Query Optimizer to notice actual versus estimate variance and adjust the plan on the next execution.

Execution plans are placed in the procedure cache, an area of active memory that SQL Server uses when a statement is executed again. The procedure cache can be cleared manually, and is reset when you restart the Database Engine. Plans from the procedure cache are reused for a query when that exact same query text is called again.

Queries reuse the same plan only if every character of the query statement matches, including capitalization, whitespace, line breaks, and text in comments. There are a few exceptions to this rule, however:

  • SQL Server will parameterize a query or stored procedure statement, allowing some values, like literals or variables, to be treated as having a different value on each execution. For example, these two queries will share a single execution plan:

    SELECT column_a, column_b FROM table WHERE column_a = 123;
    SELECT column_a, column_b FROM table WHERE column_a = 456;

    This automatic parameterization is extraordinarily valuable to performance, but is also sometimes frustrating. More on this in the section “Understand parameterization and parameter sniffing” later in this chapter.

  • A new feature of SQL Server 2022, PSP optimization, comes into play to help solve a classical problem with parameter sniffing. Again, more on that later in this chapter, but in short: Now SQL Server can keep multiple cached plans for a single query.

View execution plans

Let’s now look at how you can see the different types of execution plans in SQL Server Management Studio (SSMS) and Azure Data Studio and view them in some detail. The differences between the two tools, for the purposes of this section, are minimal.

Display the estimated execution plan

You can generate the estimated execution plan quickly and view it graphically from within SSMS or Azure Data Studio by choosing the Display Estimated Execution Plan option in the Query menu or by pressing Ctrl+L. An estimated execution plan will return for the highlighted region or for the entire file if no text is selected.

You can also retrieve an estimated execution plan in T-SQL code by running the following statement. It will be presented in an XML format:

SET SHOWPLAN_XML ON;

Note

When changing a SET SHOWPLAN option, it must be the only statement in a batch, as it changes the returned output.
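
For example, a typical sequence in SSMS separates the SET statements into their own batches with GO and turns the option back off when you are done. The query shown here is just an illustration:

SET SHOWPLAN_XML ON;
GO
SELECT * FROM Sales.Invoices WHERE InvoiceID = 1;
GO
SET SHOWPLAN_XML OFF;
GO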

In SSMS, in Grid mode, the results are displayed as a link as for any XML output. SSMS knows this is a plan, however, so you can select the link to view the plan graphically in SSMS. You can then save the execution plan as a .sqlplan file by right-clicking the neutral space of the plan window and selecting Save Execution Plan As.

You can also configure the estimated text execution plan in code by running one of the following statements, which return the execution plan in one result set or two, respectively:

SET SHOWPLAN_ALL ON;
SET SHOWPLAN_TEXT ON;

The text plan of the query using one of these two statements can be useful if you want to send the plan to someone in an email in a compact manner.

Note

When any of the aforementioned SET options are enabled for a connection, SQL Server does not run statements, it only returns estimated execution plans. Remember to disable the SET SHOWPLAN_* option before you reuse the same session for other queries.

As expected, the estimated execution plan is not guaranteed to match the actual plan used when you run the statement, but it is usually a very reliable approximation that you can use to see if a query looks like it will execute well enough. The Query Optimizer uses the same information for the estimate as it does for the actual plan when you run it.

To display information for individual steps, hover over a step in the execution plan. In SSMS or Azure Data Studio, select an object, right-click on it, and select Properties. After you have a bit of experience with plans, you'll notice the estimated execution plan is missing some information that the actual plan returns. The missing fields are generally self-explanatory in that they are values you would not have until the query has actually executed, such as Actual Number of Rows for All Executions and Actual Number of Batches.

Display the actual execution plan

You can view the actual execution plan used to execute the query along with the statement’s result set from within SSMS by choosing the Include Actual Execution Plan option in the Query menu or by pressing Ctrl+M to enable the setting. After enabling this setting, when you run a statement, you will see an additional tab appear along with the execution results after the results of the statements have completed.

Note

Turning on the actual execution plan feature will add extra time to the execution of your query. If you are comparing runs of a query, this could skew the results.

You can also return the actual execution plan as a result set using T-SQL code, returning XML that can be viewed graphically in SSMS, by running the following statement:

SET STATISTICS XML ON;

The actual execution plan is returned as an XML string. In SSMS, in Grid mode, the results display as a link, which you can view graphically by selecting the link. Remember to disable the SET STATISTICS option before you reuse the same session if you don’t want to get back the actual plan for every query you run on this connection.

You can save both estimated and actual execution plans as a .sqlplan file by right-clicking the neutral space of the plan window and selecting Save Execution Plan As.

You might see that the total number of rows processed does not match the total estimated number of rows for that step, or rather the product of that step's estimated number of rows and a preceding step's estimated number of rows.

For an example of an execution plan, consider the following query in the WideWorldImporters sample database:

SELECT * FROM Sales.Invoices
JOIN Sales.Customers
on Customers.CustomerId = Invoices.CustomerId
WHERE Invoices.InvoiceID like '1%';

Figure 14-1 shows part of the actual execution plan for this query.

A screenshot of showing a portion of the actual graphical execution plan in SSMS. In this screenshot you can see the query itself, and various plan operators, including SELECT, Merge Join, Nested Loops join, and a Clustered Index Scan.

Figure 14-1 Sample query plan, showing a portion of the actual execution plan.

In Figure 14-1, on the Merge Join (Inner Join) operator, you can see that 11111 of 6654 rows were processed. The 6654 was the estimate, and 11111 was the actual number of rows.

Display live query statistics

Live query statistics, introduced in SSMS v16, are an excellent feature. With this feature, you can generate and display a “live” version of the execution plan using SSMS. It allows you to access live statistics on versions of SQL Server starting with SQL Server 2014. To use it, enable the Live Execution Statistics option for a connection via the Query menu in SSMS.

If you execute the query from the previous section with live query statistics enabled, you will see something like what’s shown in Figure 14-2. Notice that 1,684 of the estimated 6,654 rows have been processed by the Merge Join operator, and 93 rows have been processed by the Nested Loops (Left Semi Join) operator.

A screenshot of a graphical execution plan showing live query statistics during query execution. 1684 of the estimated 6654 rows have been processed by the Merge Join operator.

Figure 14-2 Sample live query statistics.

The Live Query Statistics window initially displays the execution plan more or less like the estimated plan, but fills out the details of how the query is being executed as it is processing it. If your query runs quickly, you'll miss the dotted, moving lines and the various progress metrics, including the duration for each step and overall percentage completion. The percentage is based on the actual number of rows processed so far versus the estimated total number of rows for that step.

The Live Query Statistics window contains more information than the estimated query plan, such as the actual number of rows and number of executions, but less than the actual query plan. The Live Query Statistics window does not display some data from the actual execution plan such as actual execution mode, number of rows read, and actual rebinds.

Returning a live execution plan will noticeably slow down query processing, so be aware that the individual and overall execution durations measured will often be longer than when the query is run without the option to display live query statistics. However, it can be worth it to see where a query is hung up in processing.

If your server is configured correctly to do so, you can see the live query plan of executing queries in action by using the Activity Monitor in SSMS. In the Activity Monitor you can access the live execution plan by right-clicking any query in the Processes or Active Expensive Queries panes and selecting Show Live Execution Plan.

Note

Capturing live query execution statistics has some overhead, so use it when it is valuable, not by default.

Permissions necessary to view execution plans

Not just anyone can view a query’s execution plans. There are two ways you can view plans, and they require different kinds of permissions.

If you want to generate and view a query plan, you will first need permissions to execute the query, even to get the estimated plan. Additionally, retrieving the estimated or actual execution plan requires the SHOWPLAN permission in each database referenced by the query. The live query statistics feature requires SHOWPLAN in each database, plus the VIEW SERVER STATE permission to see live statistics, so it cannot (and should not) be done by just any user.
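
To illustrate what granting these permissions looks like, the statements below use placeholder principal names. SHOWPLAN is granted per database, while VIEW SERVER STATE is a server-level permission granted to a login:

--In each user database referenced by the queries
GRANT SHOWPLAN TO [DevUserName];

--At the server level (run in master), required for live query statistics
GRANT VIEW SERVER STATE TO [DevLoginName];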

It might be appropriate in your environment to grant SHOWPLAN and VIEW SERVER STATE permissions to developers. However, the permission to execute queries against the production database might not be appropriate in your regular environment. If that is the case, there are alternatives to providing valuable execution plan data to developers without production access. For example:

  • Providing database developers with saved execution plan (.sqlplan) files for offline analysis.

  • Configuring dynamic data masking, which might already be appropriate in your environment for hiding sensitive or personally identifying information for users who are not sysadmins on the server. Do not provide UNMASK permission to developers, however; assign that only to application users.

  • Sometimes, an administrator may need to execute queries on a production server due to differences in environment or hardware, but be mindful of what that means in terms of security, privacy, and so on.

There are several tools that provide the ability to see existing execution plans. For example, Extended Events and Profiler can capture execution plans. Query reports in Activity Monitor, such as Recent Expensive Queries, also allow you to see the plan (if it is still available in cache). Dynamic management views (DMVs) that provide access to queries that executed (such as sys.dm_exec_cached_plans) or requests (sys.dm_exec_requests) will have a column named plan_handle that you can pass to the sys.dm_exec_query_plan DMV to retrieve the plan. To access plans in this manner, the server principal needs the VIEW SERVER STATE permission or must be a member of the sysadmin server role.
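
For example, a query along these lines (a sketch, not tuned for production use) joins the cached plan metadata to the query text and the plan XML, which SSMS can display graphically:

SELECT TOP (25)
       cp.usecounts,
       cp.objtype,
       st.text,
       qp.query_plan
FROM sys.dm_exec_cached_plans AS cp
CROSS APPLY sys.dm_exec_sql_text(cp.plan_handle) AS st
CROSS APPLY sys.dm_exec_query_plan(cp.plan_handle) AS qp
ORDER BY cp.usecounts DESC;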

Finally, perhaps the best way to view execution plans is the Query Store, which is covered later in this chapter. For this, you need the VIEW DATABASE STATE permission or membership in the db_owner fixed database role.

Understand execution plans

At this point, we have established the basics of what an execution plan is, where to find it, and what permissions you need to see it. After you have a graphical execution plan in front of you, you need a basic understanding of how the T-SQL statement was processed and how future queries that use this same plan will operate. The next several sections outline this process.

Read graphical execution plans

This section steps you through reviewing execution plans in SSMS and covers some of the most common things to look for.

Start an execution plan

First, start an execution plan. For our purposes, run a simple one, like for the following query:

SELECT Invoices.InvoiceID
FROM Sales.Invoices
WHERE Invoices.InvoiceID like '1%';

Figure 14-3 shows the resulting estimated query plan.

A screenshot of a simple, estimated graphical execution plan, including only the Index Scan operator and the SELECT operator.

Figure 14-3 Simple query plan.

To display details for individual steps, position your pointer over a step in the execution plan to open a detailed tooltip-style window, much like the one shown in Figure 14-4. An interesting detail should immediately stand out when you look at the plan: it doesn't say it is scanning the Sales.Invoices table, but rather an index. If an index has all the data needed to execute the query, the index "covers" the needs of the query, and the table's other data structures are not touched.

A screenshot of the same graphical execution plan, showing the details of the Index Scan operator. A large number of statistics related to the operator are shown, including Physical Operation, Logical Operation, Estimated Execution Mode, Storage, Estimated Operator Cost, Estimated Number of Rows Per Execution, and so on.

Figure 14-4 Simple query plan with statistics.

In SSMS, you can also select an operator in the execution plan and then press F4 or open the View menu and choose Properties Window to open its Properties window, where you will see the same estimated details.

Now, let’s get the actual execution plan, by pressing Ctrl+M or using one of the other methods discussed earlier. Execute the query and you will see the actual plan, as shown in Figure 14-5. It will be nearly identical to the estimate plan in Figure 14-3, but with more information.

A screenshot of an actual graphical execution plan, including only the Index Scan operator and the SELECT operator, with more details than the estimate graphical execution plan underneath the Index Scan operator.

Figure 14-5 Actual execution plan for the sample query.

The first thing you should notice is that the query plan has a few more details. In particular, 11,111 of 6,346 rows were processed, where 6,346 is the estimated number of rows that would be returned by the query. This guess was based on statistics, which do not give perfect answers. When you are tuning larger queries, very large disparities between such guesses and the actual number require investigation.

Open the Properties pane and you’ll notice the returned estimate and actual values for some metrics, including Number of Rows, Executions, IO Statistics, and more. Look for differences in estimates and actual numbers here; they can indicate an inefficient execution plan and the source of a poor performing query. Your query might be suffering from a poorly chosen plan because of the impact of parameter sniffing or due to stale, inaccurate index statistics.

  • Image We discuss parameter sniffing later in this chapter, including a new feature in SQL Server 2022 intended to avoid this classic problem, PSP optimization. Chapter 15 discusses index statistics.

Notice that some values, like Cost Information, contain only estimated values, even when you are viewing the actual execution plan. This is because the operator costs aren't sourced separately; rather, they are generated the same way for both the estimated and actual plans and do not change based on statement execution. Furthermore, cost is not composed entirely of duration. You might find that some statements far exceed others in terms of duration, but not in cost.

Note

You can also review execution plans with a well-known third-party tool called Plan Explorer, a free download from https://www.sentryone.com/plan-explorer. It provides several additional ways to sort plan operators that are often helpful when working with very large query plans.

Start with the upper-left operator

The upper-left operator reflects the basic operation the statement performed—for example, SELECT, DELETE, UPDATE, INSERT, or any of the DML statements. (If your query created an index, you could also see CREATE INDEX.) This operator might contain warnings or other items that require your immediate attention. These typically appear with a small yellow triangle warning icon, over which you can position your mouse pointer to obtain additional details.

This was an intentional teaching opportunity! For example, in our sample plan, the SELECT operator is at the far left, and it has a triangle over it. You can see in the ToolTip that this is due to the following:

Type conversion in expression (CONVERT_IMPLICIT(varchar(12),
[WideWorldImporters].[Sales].[Invoices].[InvoiceID],0)) may affect
"CardinalityEstimate" in query plan choice, Type conversion in expression
(CONVERT_IMPLICIT(varchar(12),[WideWorldImporters].[Sales].[Invoices].[InvoiceID],0)>='1')
may affect "SeekPlan" in query plan choice.

In other words, we used a LIKE on an integer value, so the query plan is warning us that it cannot estimate the number of rows as well as it can if our WHERE clause employed integer comparisons to integers.

Select the upper-left operator and press F4 or open the View menu in SSMS and choose Properties Window. It lists some other things to look for. You’ll see warnings repeated in here, along with additional aggregate information.

Note

Yellow triangles on an operator indicate something that should grab your attention. The alert could tip you off to an implicit conversion—for example, a data-type mismatch that could be costly. Investigate any warnings reported before moving on.

Also look for the Optimization Level entry. This typically says Full. If the Optimization Level is Trivial, it means the Query Optimizer bypassed optimization of the query altogether because it was straightforward enough; for example, if the plan for the query only needed to contain a simple Scan or Seek operation along with the statement operator, like SELECT. If the Optimization Level is not Full or Trivial, this is something to investigate.

Look next for the presence of a value for Reason For Early Termination. This indicates the Query Optimizer did not spend as much time as it could have selecting the best plan. Here are a few possible reasons:

  • Good Enough Plan. This is returned if the Query Optimizer determined that the plan it picked was good enough to not need to keep optimizing.

  • Time Out. This indicates the Query Optimizer tried as many times as it could to find the best plan before taking the best plan available, which might not be good enough. If you see this, consider simplifying the query—in particular, by reducing the use of functions and potentially modifying the underlying indexes.

  • Memory Limit Exceeded. This is a rare and critical error indicating severe memory pressure on the SQL Server instance.

Look right, then read from right to left

Graphical execution plans build from sources (rightmost objects) and apply operators to join, sort, and filter data from right to left, eventually arriving at the leftmost operator. Among the rightmost objects, you’ll see scans, seeks, and lookups of different types. You might find some quick, straightforward insight into how the query is using indexes.

Each of the items in the query plan is referred to as an operator. Each operator is a module of code that performs a certain task to process data.

Two of the main types of operators for fetching data from a table or index are seeks and scans.

A seek operator finds a portion of a set of data through the index structure, similar to how you would use the index of a book to locate coverage of a specific topic. Seek operators are generally the most efficient operators and can rarely be improved by additional indexes. The seek operation finds the leaf page in the index, which contains the keys of the index, plus any included column data (which in the case of a clustered table will be all the data for the row).

If the leaf data contains everything you need, it means the operation was covered by the index. However, if you need more data than the index contains, a lookup operator will join with the seek operator using a join operator. This means that although the Query Optimizer used a seek, it needed a second pass at the table in the form of a lookup on another object, typically the clustered index, using a second seek operator.

Key lookups (on clustered indexes) and RID lookups (on heaps) are expensive and inefficient, particularly when many rows are being accessed. These lookups can add up to a very large percentage of the cost of a query.

If key lookup operators are needed frequently, they can usually be eliminated by modifying the index that is being scanned. For instance, you could modify an existing nonclustered index to include additional columns. For an example, see the section “Design rowstore nonclustered indexes” in Chapter 15.

The other typical data source operator is a scan. Scan operations aren’t great unless your query is intentionally returning a large number of rows out of a table or index. Scans read all rows from the table or index, which can be very inefficient when you need to return only a few rows, but more efficient when you need many rows. Without a nonclustered index with a well-designed key (if one can be found) to enable a seek for the query, a scan might be the Query Optimizer’s only option. Scans can be ordered if the source data is sorted, which is useful for some joins and aggregation options.

Scans on nonclustered indexes are often better than scans on clustered indexes, in part due to what is likely a smaller leaf page size, because a nonclustered index doesn’t usually have all the columns on the leaf page. Nonclustered indexes suffer from the same issues with key lookups.

Note

Very few queries are important enough to deserve their own indexes. Think “big picture” when creating indexes. If you create a new index for every slow query, the accumulated weight of nonclustered indexes will begin to slow writes to the table. As a guiding principle, more than one query should benefit from any new nonclustered index. Avoid redundant or overlapping nonclustered indexes. See Chapter 15 for more information on creating nonclustered indexes, including “missing” indexes.

Other types of scans include the following:

  • Table scan. This indicates that the table has no clustered index. Chapter 15 discusses why this is probably not a good idea.

  • Index scan. This scans the rows of an index, including any included columns of the index, for values.

  • Remote scan. This includes any object that is preceded by “remote,” which is the same operation but over a linked server connection. You troubleshoot these the same way, but potentially by making changes to the remote server instead. An alternative to linked server connections that might be superior in many cases is PolyBase. PolyBase can use T-SQL to query a variety of external data sources, including nonrelational and other relational data sources. Like a linked server, PolyBase accomplishes this by reading the data in place without data movement, or “virtualizing” the data.

    • Image For more details on PolyBase, see Chapter 7.

  • Constant scan. These appear when the Query Optimizer deals with scalar values, repeated numbers, and other constants. These are necessary operators for certain tasks and generally not actionable from a performance standpoint.

  • Columnstore index. This is an incredibly efficient operator when you are working with lots of rows but relatively few columns, and likely will outperform a clustered index scan or index seek where millions of rows, for example, must be aggregated. There is no need to create a nonclustered index to replace this operator unless your query is searching for a few rows.

The weight of the lines connecting operators tells part of the story, but isn’t the full story

SQL Server dynamically changes the thickness of the gray lines to reflect the actual number of rows. You can get a visual idea of where the bulk of data is coming from by observing the pipes (they look like arrows pointing to the left), which draw your attention to the places where performance tuning could have the biggest impact. If you hover over the line in your query plan, you can see the rows transmitted in each step. (See Figure 14-6.)

A screenshot of a simple graphical execution plans, hovering over the pipe to the left of the Index Scan operator, to reveal the popup information box starting with Actual Number of Rows = 11111.

Figure 14-6 Showing the number of rows read and the estimated metrics in the query plan.

The visual weight and the sheer number of rows do not, however, directly translate to cost. Look for where the pipe weight changes from light to heavy, or vice versa. Be aware of when bolder pipes are joined or sorted.

Operator cost share isn’t the full story, either

When you run multiple queries, the cost of each query relative to the batch is displayed in the query execution plan header. The cost of each operator relative to the rest of the operators in the statement is displayed within each plan. SQL Server uses a cost-based process to decide which query plan to use.

When optimizing a query, it is usually useful to start with the costliest operators. But deciding to address only the highest-cost single operator in the execution plan might be a dead end.

Look for join operators and understand the different algorithms

As you read from right to left in a query of any complexity, you’ll likely see the paths meet at a join operator. Two tables, indexes, or the output from another operator can be joined together. There are three types of join operators to be aware of, particularly because they represent where a large percentage of the cost of an execution plan stems from: merge join, hash match, and nested loop. Any one of these can be the fastest way to join two sets of data, depending on the size of the sets, whether they are indexed, and whether they are sorted already (and if not, whether it would be too costly to sort them with a sort operator).

Note

There is also an adaptive join operator, first introduced in SQL Server 2017, which allows the Query Optimizer to situationally choose between the hash match and nested loop operators. This is mentioned again in the “Intelligent query processing” section later in this chapter.

A merge join operator merges two large, sorted sets of data. The query processor can scan each set, in order, matching rows from each table with a single pass through the sets. This can be quite fast, but the requirement for ordered sets is where the cost comes in. If you are joining two sets that are indexed on the same join column, they are already sorted, so two ordered scans can be merged. The Query Optimizer can choose to sort one or both inputs to the merge join, but this is often costly.

Hash match is the join operator used to join two large sets, generally when there is no easily usable index and sorting is too costly. As such, it has the most overhead, because it creates a temporary index based on an internal hashing algorithm to bucketize values to make it easier to match one row to another. This hash structure is kept in memory if possible, but might spill onto disk (using tempdb). The presence of a hash join is not necessarily a bad thing; just know that it is the least efficient algorithm to join two data sets, and it is typically chosen precisely because the inputs are not suited for the other two operators.

Note

When the Query Optimizer does something that seems weird, like sorting sets of data for an internal operation such as a join, it is usually because it has calculated that, for the design of your database, this is the best way to do it. As discussed, with complex queries, it sometimes takes too long to find the absolute best way to process the query. And sometimes the plans are suboptimal; for example, when statistics are not up to date. This is one reason to make sure plans have full optimization.

The most efficient join algorithm is the one that sounds the least optimized: nested loops. This join algorithm is the basic row-by-row processing that programmers are taught never to do. It takes one row from one input and searches the other input for values that meet the join criteria. When one of the two sets is indexed on the join key and doesn't need to fetch additional data using a key lookup, nested loops are very fast. The additional operators were implemented to support larger, typically reporting-style, workloads.

Each of the following options could reduce the cost of a join operator.

  • There might be an opportunity to improve the indexing on the columns being joined, or perhaps, you have a join on a compound key that is incompletely defined. Look for common joins in queries—can an index be crafted to match?

  • In the case of a merge join, you might see a preceding sort operator. This can be an opportunity to store the data already sorted in the way the merge join requires it. If this is a composite key, perhaps change the ASC/DESC property of an index key column, or create a new nonclustered index with columns sorted differently. The change could result in a new execution plan without the cost of the sort operator.

  • Filter at the lowest level possible. Perhaps a WHERE clause could exist in a subquery instead of at the top level of the query, or in the definition of a derived table or common table expression (CTE) instead of in the subsequent query.

  • Hash operators are the most expensive. Reducing the row counts going into a hash match or hash join could allow the Query Optimizer to use a less memory-intensive and less costly join operator.

  • Nested loops are often necessitated by key lookups and are sometimes quite costly. Address them with a new or modified nonclustered index to eliminate the nearby key lookup, or to make an accompanying index seek more capable.

Look for parallel icons

The left-pointing pair of arrows in a yellow circle shown in Figure 14-7 indicate that this operator has been run with a parallel-processing execution plan. We talk more about parallelism later in this chapter; the important thing here is to be aware that your query has gone parallel.

A screenshot of a simple graphical execution plan, showing the SELECT operator on the left. To the right, a Nested Loops (Left Outer Join) operator feeds into a Parallelism (Gather Streams) operator to the left of it. Both operators display the small parallel arrows icon, indicating this part of the query has been run with a parallel-processing execution plan.

Figure 14-7 The parallel indicator on a clustered index scan operator.

This doesn't mean multiple sources or pipes are being read in parallel; rather, it means the work for individual tasks has been broken up behind the scenes. The Query Optimizer decided it was faster if your workload was split up and run in multiple parallel streams of rows. Typically, this is a good thing, but at scale, excessive parallelism can lower overall performance.

You might also see one of the three different parallelism operators—distribute streams, gather streams, and repartition streams—each of which appear only for parallel execution plans.

Understand cardinality estimation

When a query plan is being generated, one of the most important factors you will deal with is the cardinality estimation (CE). Cardinality is defined as the number of items in a set (hence, a cardinal number is a non-negative integer). The importance of cardinality estimation cannot be overstated and is analogous to how you might do a task. If you own an e-commerce store and ship three products a week, you might be able to walk two miles with your stack of packages to the post office at the end of the week and be efficient enough. If you must ship 300,000 products a day, however, the net effect of each product being shipped needs to be the same, but the way you achieve this must be far more optimized and include more than just one person.

SQL Server makes the same choices. You join table X with table Y on column ID. If X has 1,000 rows, and Y has 10, the solution is easy. If they each have a billion rows, and you are looking for a specific value in table X—say, value = 'Test'—there are many choices to make. How many rows in X have a value of 'Test'? And once you know that value, how many values of ID in X will match ID values in Y?

This estimation is done in two ways. The first is with guesses based on histograms of the data. The table is scanned when creating statistics. Statistics are created by executing UPDATE STATISTICS or through the automatic gathering of statistics that occurs as data changes, based on the AUTO_UPDATE_STATISTICS database setting.

Take the first case, where the tables are small. The engine stores a histogram that has something like what’s shown in Table 14-5.

Table 14-5 Sample histogram of a small table

RANGE_HI_KEY   RANGE_ROWS   EQ_ROWS   DISTINCT_RANGE_ROWS   AVERAGE_RANGE_ROWS…
Smith          400          200       20                    10
Tests          200          120       23                    5

From this, the number of rows equal to 'Test' can be guessed to be less than 400 because Test is between Smith and Tests; less than 200, because approximately 200 rows matched Smith; and approximately 10 matches because there are approximately 20 distinct values. Exactly how this estimation is done is proprietary, but it is important to keep your statistics up to date using maintenance methods. (See Chapter 8 for more details on proper upkeep of SQL Server.)
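
You can inspect a real histogram with DBCC SHOW_STATISTICS. For example, against the WideWorldImporters sample database, assuming the index name shown exists (substitute any index or statistics object on the table):

DBCC SHOW_STATISTICS ('Sales.Invoices', 'FK_Sales_Invoices_CustomerID')
    WITH HISTOGRAM;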

From SQL Server 7.0 to SQL Server 2012, the same CE algorithms were used. However, since SQL Server 2014, Microsoft has been significantly tweaking the cardinality estimator with each release. And now, in SQL Server 2022, the new CE feedback feature allows SQL Server to automatically adapt query plans based on the estimate and actual number of rows processed, enhancing performance stability and automating an otherwise complex troubleshooting exercise.

The problem with cardinality estimation is that it is an inexact science. The guesses made are usually close enough, but sometimes can be off just enough to affect performance. Outdated statistics can lead to this, but that’s not the only explanation. Estimating filtered rowcounts in query plans is complex. That’s why it’s important that the new SQL Server 2022 “feedback” features allow SQL Server to quickly learn from mistakes.

There are a few methods to control which cardinality estimator is used. The first is to use the database’s compatibility level, like with this command to use the SQL Server 2022 cardinality estimator:

ALTER DATABASE [db_name] SET COMPATIBILITY_LEVEL = 160;

Or you can elect to go backward and use the legacy SQL Server 2012 compatibility level:

ALTER DATABASE [db_name] SET COMPATIBILITY_LEVEL = 110;

Changing the database’s compatibility level is often an unnecessary or drastic change. In your testing, consider keeping your modern compatibility level and reverting only to the legacy cardinality estimator (which pre-dated SQL Server 2014):

ALTER DATABASE SCOPED CONFIGURATION SET LEGACY_CARDINALITY_ESTIMATION = ON;

Here also, reverting to the legacy cardinality estimator can be too drastic a change for the entire database. In compatibility level 130 (SQL Server 2016) and higher, there is a query hint you can use to modify an individual query to use legacy cardinality estimation (CE version 70):

SELECT …
FROM …
OPTION (USE HINT ('FORCE_LEGACY_CARDINALITY_ESTIMATION'));

What if you cannot modify the query to use the OPTION syntax because it resides deep within a business intelligence suite, an ETL tool, or third-party software? SQL Server 2022 has a solution for you: Identify the query in Query Store metadata and then provide a Query Store hint, shaping the query without altering any code. For example:

EXEC sys.sp_query_store_set_hints @query_id= 555,
@query_hints = N'OPTION(USE HINT(''FORCE_LEGACY_CARDINALITY_ESTIMATION''))';

In this example, the Query Store is used to identify a specific problematic query, which had been assigned query_id 555. This new SQL Server 2022 feature is only available when Query Store is enabled for the database, which we recommend.

Note

Query Store is enabled by default for all new databases on SQL Server 2022.

  • For more information about Query Store hints, see the “Query Store hints” section later in this chapter.

For the most part, we suggest using the latest cardinality estimator possible, and addressing issues in your queries, but realize this is not always a feasible solution.

Understand parameterization and parameter sniffing

SQL Server parameterization occurs when the Query Optimizer detects values (such as the search criteria of a WHERE clause) that can be parameterized. For instance, the statements in a stored procedure are parameterized based on the parameters in the procedure's definition.

A query can be parameterized when it meets certain conditions. For example, a query might contain a literal value that could vary between executions. A query such as

SELECT Value FROM TableName WHERE Value = 'X';

can be parameterized, and the literal 'X' replaced by a parameter, much like if you were writing a stored procedure. This type of automatic parameterization is the default, referred to as simple parameterization. (Prior to SQL Server 2005, this was named auto-parameterization.)

The criteria for a query to qualify for simple parameterization are complex and lengthy. For example, a query that references more than one table will not be parameterized. If you change the PARAMETERIZATION database setting to FORCED, more complex queries will be parameterized. Object-Relational Mapping (ORM) frameworks like Entity Framework (EF) are likely to generate many queries that are too sophisticated for simple parameterization. With load testing, you could determine that your EF application could benefit from forced parameterization.
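
If that testing bears out, forced parameterization is a per-database setting; [db_name] below is a placeholder:

ALTER DATABASE [db_name] SET PARAMETERIZATION FORCED;
-- Return to the default behavior with:
-- ALTER DATABASE [db_name] SET PARAMETERIZATION SIMPLE;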

You can also rewrite a query to ensure it will get a parameterized plan by using a variable:

DECLARE @Value varchar(10) = 'X';
SELECT Value FROM TableName WHERE Value = @Value;

This query will be parameterized no matter how complex it is. There are additional ways a query will be parameterized through client APIs, but the best way to parameterize your queries is with stored procedures.

Note

Instead of complex procedurally developed queries coming out of object-relational mappers (ORMs) like Entity Framework in ADO.NET, consider stored procedures for application create, read, update, and delete (CRUD) operations. Stored procedures provide superior parameterization for security and cached execution plan reusability.

Simple parameterization is extraordinarily valuable, but also sometimes frustrating. With parameterization, it’s possible for two potentially helpful or potentially problematic conditions to occur:

  • You can reuse a query plan for multiple queries for which the query text is exactly the same, except for parameterized values. That’s helpful!

  • The same query could use the same execution plan for two different values of a parameter, resulting in vastly different performance. That’s problematic for one of the two values.

For example, you might create the following stored procedure to fetch orders placed for goods from a certain supplier:

CREATE OR ALTER PROCEDURE Purchasing.PurchaseOrders_BySupplierId
      @SupplierId int
AS
SELECT PurchaseOrders.PurchaseOrderID,
       PurchaseOrders.SupplierID,
       PurchaseOrders.OrderDate
 FROM   Purchasing.PurchaseOrders
WHERE  PurchaseOrders.SupplierID = @SupplierId;

The plan cached for this procedure will depend on the value that is passed in on the first compilation. For example, if the larger rowcount query (@SupplierID = 5) is used first and has its query plan cached, the query plan will choose to scan the clustered index of the table, because the value of 5 has a relatively high cardinality in the table. If the smaller rowcount query (@SupplierID = 1) is run first, its version of the plan will be cached, which will use an index seek and a key lookup. In this case, the plan with a seek and key lookup is far less efficient for very large row counts, but will be used for all values of the parameterized statement.
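
To observe this for yourself, you could compare the actual execution plans for two executions, clearing the plan from cache between tests. This sketch assumes the WideWorldImporters sample database and a preproduction environment:

-- Preproduction only: clear the current database's plan cache
ALTER DATABASE SCOPED CONFIGURATION CLEAR PROCEDURE_CACHE;
GO
-- The first execution compiles and caches a plan for this parameter value
EXEC Purchasing.PurchaseOrders_BySupplierId @SupplierId = 5;
GO
-- Reuses the cached plan, even if far fewer rows qualify for this value
EXEC Purchasing.PurchaseOrders_BySupplierId @SupplierId = 1;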

Here are a few advanced troubleshooting avenues to alleviate this scenario:

  • In SQL Server 2022, a new intelligent query processing (IQP) feature, Parameter Sensitive Plan (PSP) optimization, actively adapts query plans to avoid this exact scenario. (More on this new feature at the end of this chapter, in the section “Intelligent query processing.”) Between PSP optimization and Query Store hints, SQL Server 2022 provides multiple superior options to deal with this classic problem.

  • Query Store hints, also new to SQL Server 2022, further allow administrators to tweak existing query plans with powerful hints, shaping query performance without the need for code changes. For example, even if you set simple parameterization at the database level, you could apply the query hint PARAMETERIZATION FORCED to individual complex queries. Query Store hints can specify this and many other query hints without changing the application code. With SQL Server 2022, you can even specify different Query Store hints on primary and secondary replicas of an availability group.

  • You can use the OPTIMIZE FOR query hint to instruct the Query Optimizer to compile the execution plan as though a provided value had been passed for the parameter. You can also use OPTIMIZE FOR UNKNOWN to instruct the Query Optimizer to optimize for a statistically average value, based on the statistics of the underlying data object. You can modify the code to use OPTIMIZE FOR hints, or use the Query Store hints feature to apply a desired OPTIMIZE FOR hint without the need for code changes. (See the sketch that follows this list.)

  • The RECOMPILE query hint or procedure option does not allow the reuse of a cached plan, forcing a fresh query plan to be generated each time the query is run. Similarly, you can modify the code to use RECOMPILE hints, or use the Query Store hints feature to apply a desired RECOMPILE hint without the need for code changes.

  • You can use the Query Store feature to look at plan performance visually and force a query to use a specific plan that the Query Store has captured, using either the graphical interface in SSMS or the underlying T-SQL stored procedures. For more information, see the section “Leverage the Query Store feature” later in this chapter.

  • You can use the legacy plan guide feature (implemented via stored procedures) to guide the Query Optimizer to a plan currently in cache. You identify the plan via its plan_handle value. Plan guides have been mostly replaced by the Query Store and Query Store hints features, which provide a more granular and far easier way to tune query plans without code changes.

  • Use the USE PLAN query hint to provide the entire XML query plan for SELECT statements. This is the least convenient option for overriding the Query Optimizer. Consider using the Query Store instead.

  • An extreme solution is to disable parameter sniffing at the database level using the PARAMETER_SNIFFING = OFF database-scoped configuration option. This will cause all plans in the database to act like the OPTIMIZE FOR UNKNOWN hint has been provided.
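
As promised in the OPTIMIZE FOR bullet above, here is a sketch of how those hints could be applied to the earlier Purchasing.PurchaseOrders_BySupplierId procedure. The value 5 is only an example; the right choice depends on your data distribution:

CREATE OR ALTER PROCEDURE Purchasing.PurchaseOrders_BySupplierId
      @SupplierId int
AS
SELECT PurchaseOrders.PurchaseOrderID,
       PurchaseOrders.SupplierID,
       PurchaseOrders.OrderDate
FROM   Purchasing.PurchaseOrders
WHERE  PurchaseOrders.SupplierID = @SupplierId
-- Compile the plan as though @SupplierId were always 5
OPTION (OPTIMIZE FOR (@SupplierId = 5));
-- Alternatives: OPTION (OPTIMIZE FOR UNKNOWN) or OPTION (RECOMPILE),
-- trading plan stability or compile time for parameter insensitivity.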

Explore the procedure cache

The procedure cache is a portion of memory that contains query plans for statements that have been executed. New execution plans enter the procedure cache only when a statement is run. If the procedure cache already contains a plan matching a previous run of the current statement, the execution plan is reused, saving valuable time and resources. This is one reason complex statements can appear to run faster the second time they are run, in addition to the fact that data may be cached on a second execution.

The procedure cache is empty when the SQL Server service starts and grows from there. SQL Server manages plans in the cache, removing them as necessary under memory pressure. The size of the procedure cache is managed by SQL Server and is inside the memory space configured for the server in the Max Server Memory configuration setting. Plans are removed based on their cost and how recently they have been used. Smaller, older plans and single-use plans are the first to be cleared, though this formula of automatic plan cache maintenance is complex.

Note

If you are using SQL Server on Azure VMs, Azure SQL Database, or Azure SQL Managed Instance, look for the availability of the newer memory-optimized series of hardware. These are available for Azure VMs, and in preview for various tiers and compute generations, and offer a higher ratio of system memory to vCPUs.

Plans are compiled based on the state of the database and its objects when the plan is generated. Dramatic changes to underlying data might not cause an automatic plan recompilation, so recompiling a plan manually might help by creating a more optimized plan. Many data definition changes to tables referenced in the stored procedure will cause an automatic recompilation.

Clear the procedure cache

You might find that manually clearing the procedure cache is useful when performance testing or troubleshooting. Typically, you want to reserve this activity for preproduction systems.

There are a few common reasons to clear out cached plans in SQL Server. One is to compare two versions of a query or the performance of a query with different indexes; you can clear the cached plan for the statement to allow for proper comparison.

Note

While this can be a good thing to try, what you are testing is not only your query, but your hardware’s ability to fetch data from the disk. When you look at the output of SET STATISTICS IO ON, the Logical Reads measurement gives you an accurate comparison for two or more queries. The presence of Physical Reads tells you that data the query needed was not in cache. Higher amounts of physical reads indicate that the server’s ability to hold everything needed in RAM might not be sufficient.

You can manually flush the entire procedure cache, or individual plans in cache, with the following database-scoped configuration command, which affects only the current database context, as opposed to the entire instance’s procedure cache:

ALTER DATABASE SCOPED CONFIGURATION CLEAR PROCEDURE_CACHE;

This command was introduced in SQL Server 2016, is effectively the same as the DBCC FREEPROCCACHE command within the current database context, and works in both SQL Server and Azure SQL Database. DBCC FREEPROCCACHE is not supported in Azure SQL Database, so favor the ALTER DATABASE SCOPED CONFIGURATION syntax going forward.

Caution

We strongly recommend against clearing the procedure cache in a live production environment during normal business hours. Doing so will cause all new statements to have their execution plans compiled, dramatically increasing processor utilization, and potentially dramatically slowing performance.

You can also remove a single plan from cache by identifying its plan_handle and then providing it as the parameter to the ALTER DATABASE statement. Perhaps this is a plan you would like to remove for testing or troubleshooting purposes that you have identified with the script in the previous section:

ALTER DATABASE SCOPED CONFIGURATION CLEAR PROCEDURE_CACHE 0x06000700CA920912307B86
7DB701000001000000000000000000000000000000000000000000000000000000;
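
If you need to find the plan_handle for a particular statement, a lookup against the plan cache DMOs similar to the following sketch works; the LIKE filter is only an illustrative placeholder:

SELECT cp.plan_handle, cp.usecounts, st.[text]
FROM sys.dm_exec_cached_plans AS cp
CROSS APPLY sys.dm_exec_sql_text(cp.plan_handle) AS st
WHERE st.[text] LIKE N'%YourQueryText%'; -- replace with distinctive text from the statement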

You can alternatively flush the cache by object type. This command clears cached execution plans that are the result of ad hoc statements and prepared statements (from applications, using sp_prepare, typically through an API):

DBCC FREESYSTEMCACHE ('SQL Plans');

The advantage of this statement is that it does not wipe the cached plans from “programmability” database objects such as stored procedures, multi-statement table-valued functions, scalar user-defined functions, and triggers. The following command clears the cached plans from these types of objects:

DBCC FREESYSTEMCACHE ('Object Plans');

Note

DBCC FREESYSTEMCACHE is not supported in Azure SQL Database.

You can also use DBCC FREESYSTEMCACHE to clear cached plans associated with a specific Resource Governor Pool, as follows:

DBCC FREESYSTEMCACHE ('SQL Plans', 'poolname');
Analyze cached execution plans

You can analyze execution plans in aggregate starting with the sys.dm_exec_cached_plans DMV, which contains a column named plan_handle. The plan_handle column contains a system-generated varbinary(64) string that can be used with a number of other DMVs. As seen in the code example that follows, you can use the plan_handle to gather information about aggregate plan usage, obtain plan statement text, and retrieve the graphical execution plan itself.

You might be used to viewing the graphical execution plan only after a statement is run in SSMS, but you can also analyze and retrieve plans for queries executed in the past by using the following query against a handful of dynamic management objects (DMOs). These DMOs return data for all databases in SQL Server instances, and for the current database in Azure SQL Database. The following query can be used to analyze different aspects of cached execution plans. Note that this query might take a considerable amount of time as written, so you might want to pare down what is being output for your normal usage.

SELECT
    p.usecounts AS UseCount,
    p.size_in_bytes / 1024 AS PlanSize_KB,
    qs.total_worker_time/1000 AS CPU_ms,
    qs.total_elapsed_time/1000 AS Duration_ms,
    p.cacheobjtype + ' (' + p.objtype + ')' as ObjectType,
    db_name(convert(int, txt.dbid )) as DatabaseName,
    txt.ObjectID,
    qs.total_physical_reads,
    qs.total_logical_writes,
    qs.total_logical_reads,
    qs.last_execution_time,
    qs.statement_start_offset as StatementStartInObject,
      SUBSTRING (txt.[text], qs.statement_start_offset/2 + 1 ,
     CASE
         WHEN qs.statement_end_offset = -1
         THEN LEN (CONVERT(nvarchar(max), txt.[text]))
         ELSE qs.statement_end_offset/2 - qs.statement_start_offset/2 + 1 END)
     AS StatementText,
      qp.query_plan as QueryPlan,
      aqp.query_plan as ActualQueryPlan
FROM sys.dm_exec_query_stats AS qs
INNER JOIN sys.dm_exec_cached_plans p ON p.plan_handle = qs.plan_handle
OUTER APPLY sys.dm_exec_sql_text (p.plan_handle) AS txt
OUTER APPLY sys.dm_exec_query_plan (p.plan_handle) AS qp
OUTER APPLY sys.dm_exec_query_plan_stats (p.plan_handle) AS aqp
--tqp is used for filtering on the text version of the query plan
CROSS APPLY sys.dm_exec_text_query_plan(p.plan_handle, qs.statement_start_offset,
qs.statement_end_offset) AS tqp
WHERE txt.dbid = db_id()
ORDER BY qs.total_worker_time + qs.total_elapsed_time DESC;

The preceding code sorts queries by a sum of the CPU time and duration, descending, returning the longest running queries first. You can adjust the ORDER BY and WHERE clauses in this query to find, for example, the most CPU-intensive or most busy execution plans. Keep in mind that the Query Store feature, as detailed later in this chapter, will help you visualize the process of identifying the most expensive and longest running queries in cache.

As you will see after running the previous query, you can retrieve a wealth of information from these DMOs, including statistics for each statement within an object that generated the query plan. The query plan appears as a blue hyperlink in SSMS’s Results to Grid mode, opening the plan as a new .sqlplan file. You can save and store the .sqlplan file for later analysis. Note too that this query might take quite a long time to execute, as it returns a row for every statement in every cached query.

For more detailed analysis, you can add conditions to search only for queries with certain details in the plan, such as plans that record a reason for early termination of the optimization process. In the execution plan XML, this reason appears in the StatementOptmEarlyAbortReason attribute. You can add the search condition before the ORDER BY in the script, using the following logic:

and tqp.query_plan LIKE '%StatementOptmEarlyAbortReason%'

Included in the query is sys.dm_exec_query_plan_stats, which provides the actual plan XML for a given plan_handle.

Permissions required to access cached plan metadata

The only permission needed to run the previous query in SQL Server is the server-level VIEW SERVER STATE permission, which might be appropriate for developers to have access to in a production environment because it does not give them access to any data in user databases.

In Azure SQL Database, because of the differences between the Basic/Standard and Premium tiers, different permissions are needed. In the Basic/Standard tier, you must be the server admin or Azure Active Directory Admin to access objects that would usually require VIEW SERVER STATE. In the Premium tier, you can grant VIEW DATABASE STATE in the intended database in Azure SQL Database to a user who needs permission to view the preceding DMVs.

Understand parallelism

Parallelism in query processing, and in computing in general, is a complex topic. Luckily, much of the complexity of parallelism in SQL Server is abstracted away from the DBA and programmer.

A query that uses parallelism and one that doesn’t can be the same query with the same plan (other than allowing one or more operators to work in parallel). When SQL Server decides to split and stream the data needed for a request into multiple threads, it uses more than one logical processor to get the job done. The number of parallel threads used for the query is called the degree of parallelism (DOP). Because parallelism can never exceed the number of logical processors, the maximum degree of parallelism (MAXDOP) is naturally capped.

The main job of the DBA is to tune the MAXDOP for the server, database, and individual queries when the defaults don’t behave well. On a server with a mixed load of OLTP and analytics workloads, some larger analytics queries can overpower other active users.

MAXDOP is set at the server level using the server UI in SSMS, or more commonly using the sp_configure system stored procedure. Starting in SQL Server 2019, there is a MaxDOP tab in SQL Setup, which proposes an initial MAXDOP for your server configuration. In previous versions, the system default was 0 (allowing all processors to be used in a single statement).

Parallelism is a seemingly magical way to make queries run faster (most of the time), but even seemingly magical features come at a price. While an individual query might perform fastest in a vacuum when it goes massively parallel, the overuse of parallelism creates a multithreading bottleneck at scale with multiple users. Split into too many different parts, queries slow down en masse as CPU utilization rises and SQL Server records increasing values in the CXPACKET wait type.

  • We talk about CXPACKET here, but for more about wait type statistics, see Chapter 8.

Until SQL Server 2016, MAXDOP was only a server-level setting, a setting enforced at the query level, or a setting enforced for sessions selectively via the Resource Governor, an Enterprise edition feature. Since SQL Server 2016, the MAXDOP setting is also available as a database-scoped configuration. You can also use the MAXDOP query hint in any statement to override the database or server level MAXDOP setting.
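
For example, the database-scoped setting looks like the following sketch (the value 4 is only a placeholder, not a recommendation):

ALTER DATABASE SCOPED CONFIGURATION SET MAXDOP = 4;
-- Or override an individual statement by appending, for example, OPTION (MAXDOP 2).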

Setting a reasonable value for MAXDOP determines how many CPUs can be used to execute a query, but there is another setting that determines which queries are allowed to use parallelism at all: cost threshold for parallelism (CTFP). This enforces a minimum bar for estimated query cost before a query can use a parallel execution plan. The higher the threshold, the fewer queries go parallel. This setting is low by default (5), and its proper value in your environment depends on the workload and processor count. More expensive queries usually benefit from parallelism more than simpler queries, so limiting the use of parallelism to the most expensive queries in your workload can help. Conversely, setting the CTFP too high has an opportunity cost: queries execute serially, performance is limited, and CPU cores go underutilized. Note that CTFP is a server-level setting only.
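
CTFP is changed with sp_configure; the value 50 below is only an illustration, not a recommendation:

EXEC sys.sp_configure 'show advanced options', 1;
RECONFIGURE;
EXEC sys.sp_configure 'cost threshold for parallelism', 50;
RECONFIGURE;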

If large queries are already a performance problem and multiple large queries regularly run simultaneously, raising the CTFP might not solve the problem. In addition to the obvious solutions of query tuning and index changes, it might be worthwhile to consider columnstore indexes for analytic queries and to use the MAXDOP hint to keep a handful of very large queries from taking over your server.

A possible indication that parallelism is an issue is when CXPACKET is a dominant wait type experienced over time by your SQL Server. You might need to adjust both MAXDOP and CTFP when performance tuning. You can also view the live and last wait types for a request using sys.dm_exec_requests. Make these changes in small, measured increments, and don’t overreact to performance problems caused by a small number of queries. Use the Query Store to benchmark and trend the performance of high-value and high-cost queries as you change configuration settings.
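
For example, a quick check of the live and last wait types for active requests might look like this sketch (filtering out your own session):

SELECT r.session_id, r.status, r.wait_type, r.last_wait_type, r.cpu_time, t.[text]
FROM sys.dm_exec_requests AS r
CROSS APPLY sys.dm_exec_sql_text(r.sql_handle) AS t
WHERE r.session_id <> @@SPID;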

Another flavor of CPU pressure, and in some ways the opposite of the CXPACKET wait type, is the SOS_SCHEDULER_YIELD wait type. SOS_SCHEDULER_YIELD is an indicator of CPU pressure, showing that SQL Server was forced to share time with, or “yield” to, other CPU tasks, which might be normal and expected on busy servers. Whereas CXPACKET is SQL Server complaining about too many threads in parallel, SOS_SCHEDULER_YIELD is the acknowledgement that there were more runnable tasks than available threads. In either case, you should adopt a strategy of reducing CPU-intensive queries and rescheduling or optimizing CPU-intensive maintenance operations. This is more economical than simply adding CPU capacity.

Force a parallel execution plan

You know how to specify MAXDOP = 1 to force a query not to use parallelism. What about forcing parallelism? You can use a query hint to force a statement to compile with a parallel execution plan. This can be valuable in troubleshooting, or to force a behavior in the Query Optimizer for experimentation, but is not usually a necessary or recommended option for live code.

Appending the following hint to a query will force a parallel execution plan, which you can see using the estimate or actual execution plan output options:

… OPTION(USE HINT('ENABLE_PARALLEL_PLAN_PREFERENCE'));

Note

The presence of certain system variables or functions can force a statement to compile to be serial—that is, without any parallelism. This behavior will override the new ENABLE_PARALLEL_PLAN_PREFERENCE option.

The @@TRANCOUNT system variable forces a serial plan, as do any of the built-in error reporting functions, including ERROR_LINE(), ERROR_MESSAGE(), ERROR_NUMBER(), ERROR_PROCEDURE(), ERROR_SEVERITY(), and ERROR_STATE(). This pertains only to using these objects in a query. Using them in the same batch, such as in a TRY … CATCH handler, will not affect the execution plans of other queries in the batch.

Use advanced engine features to tune queries

In the past few versions of SQL Server and Azure SQL Database, the programmers building the Database Engine have added especially advanced features that go beyond the cost-based optimizations we have had for many years, toward tools that can sense when plans need to be adjusted before a DBA does. None of these features replaces well-written code or a DBA who understands how queries work, but as data needs explode, the more the engine can do for you, the better.

Internal improvements in SQL Server 2022

Microsoft has published a few details on some internal performance improvements that were introduced deep inside the Database Engine. Here is a summary of these advanced changes. None require an opt-in or deep understanding of the tech involved in order for you to benefit from them in SQL Server 2022.

  • Reduced buffer pool I/O promotions. There is a documented phenomenon involving the read-ahead mechanism that pulls data from storage into the buffer pool. To read more about this complex topic, see this old Microsoft blog post at https://learn.microsoft.com/archive/blogs/ialonso/the-read-ahead-that-doesnt-count-as-read-ahead. New to SQL Server 2022, the number of incidents where a single page would be promoted to an extent has been reduced. This makes SQL Server’s access of the physical I/O subsystem to populate memory more efficient. This change was delivered with SQL Server 2022 and pushed to Azure SQL Database and Azure SQL Managed Instance at the time of this writing.

  • Enhanced spinlock algorithms. Another complex topic from deep within the Database Engine improves the performance of SQL Server when multiple threads generate spinlocks. This is an oversimplification of a complex topic, of course, but the details on this performance tune are limited. This change was delivered with SQL Server 2022 and pushed to Azure SQL Database and Azure SQL Managed Instance at the time of this writing.

  • Virtual Log File (VLF) allocation improvements. In certain growth scenarios, the number of VLFs allocated to the transaction log was less efficient than it could have been. For small growth increments in small transaction log files, a single 64 MB VLF is created, instead of multiple VLFs. Over time, these VLF growth patterns make for a more efficient transaction log internal structure that benefits recovery and log truncation. For more details on the formulas involved here, visit https://learn.microsoft.com/sql/relational-databases/sql-server-transaction-log-architecture-and-management-guide#virtual-log-files-vlfs. This change was delivered with SQL Server 2022 and pushed to Azure SQL Database at the time of this writing.

Recent improvements to tempdb

Each of the past several versions of SQL Server has introduced significant new features to improve the performance of the tempdb database. SQL Server 2022 introduces some key improvements to how tempdb works internally that do not require any configuration; you benefit from them immediately.

In SQL Server 2022, system page latches in tempdb received concurrency enhancements. Both Global Allocation Map (GAM) and Shared Global Allocation Map (SGAM) pages received updates to improve concurrency, an extension of the concurrency improvements to Page Free Space (PFS) pages in tempdb that were introduced with SQL Server 2019. Also in SQL Server 2019, memory-optimized tempdb metadata addressed some specific bottlenecks at high scale having to do with how temporary objects are created, modified, and destroyed.

Leverage the Query Store feature

The Query Store provides a practical history of execution plan performance for a single database, which persists even when the server has been restarted (unlike the plan cache itself, which is cleared, eliminating all the interesting data one needs for tuning queries over time). It can be invaluable for investigating and troubleshooting sudden negative changes in performance, allowing the administrator or developer to identify both high-cost queries and the quality of their execution plans, especially when the same query has multiple plans, one performing poorly and another performing well.

Starting with SQL Server 2022, and likely to be a default in Azure SQL Database and Azure SQL Managed Instance in the near future, the Query Store is enabled and in read/write mode by default for new databases. Databases migrated to SQL Server 2022 retain their Query Store settings.

The Query Store is most useful for looking back in time at the history of statement execution. It can also assist in identifying and overriding execution plans, using capabilities similar to, but different from, plan guides. As discussed in the previous section, plan guides are used to override a query’s plan. Instead of plan guides, consider the Query Store’s ability to force plans, or the Query Store hints feature (introduced in SQL Server 2022 and Azure SQL Database), to shape query plans without code changes.

The Query Store allows you to find plans that are not working well, but only gives you the ability to force an entire plan that worked better from the history it has stored. The Query Store has a major benefit over legacy plan guides in that there is an SSMS user interface to access it, see the benefits, and find places where you might need to apply a new plan.

You see live Query Store data as it happens, drawn from a combination of in-memory and on-disk sources. Query Store minimizes overhead and performance impact by capturing cached plan information to an in-memory data structure. The data is flushed (persisted) to disk at an interval defined by the Query Store, 15 minutes by default. The Disk Flush Interval setting defines how much Query Store data can be lost in the event of an unexpected system shutdown.
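
If the default interval does not suit your tolerance for data loss, you can adjust it. A minimal sketch, assuming the WideWorldImporters sample database and an explicit 900-second (15-minute) interval:

ALTER DATABASE [WideWorldImporters]
SET QUERY_STORE (DATA_FLUSH_INTERVAL_SECONDS = 900);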

Note

Queries are captured in the context of the database where the query is executed. In the following cross-database query example, the query’s execution is captured in the Query Store of the WideWorldImporters sample database.

USE WideWorldImporters;
GO
SELECT * FROM
AdventureWorks.[Purchasing].[PurchaseOrders];

Microsoft delivered the Query Store to the Azure SQL Database platform first, and then to the SQL Server product. In fact, Query Store is at the heart of the Azure SQL Database Advisor feature that provides automatic query tuning. The Query Store feature’s overhead is quite manageable, tuned to avoid performance hits, and is already in place on millions of customer databases in Azure SQL Database.

The VIEW DATABASE STATE permission is all that is needed to view the Query Store data.

Initial configuration of Query Store

Query Store operates identically in Azure SQL Database and SQL Server, but activation differs. Query Store is enabled automatically on Azure SQL Database and on all new databases starting with SQL Server 2022, but it is not automatically turned on for existing databases that you migrate to SQL Server 2022.

When should you enable Query Store? Enabling Query Store on all databases you have in your environment is a generally acceptable practice, as it will be useful in discovering performance issues in the future when they arise. You can enable Query Store via the database’s Properties dialog box, in which a Query Store page is accessible from the menu on the left, or you can turn it on via T-SQL by using the following command:

ALTER DATABASE [DatabaseOne] SET QUERY_STORE = ON;

This will enable Query Store with the defaults; you can adjust them using the UI.

Note

As with almost any configuration task, while it is acceptable to use the UI the first few times, it will always be better to have a script in T-SQL or PowerShell to capture settings in a repeatable manner. Use the Script button in most SSMS UIs to output a script of what has changed when you are setting new values.

Query Store begins collecting data when you activate it. You will not have any historical data when you first enable the feature on an existing database, but you will begin to immediately see data for live database activity. You can then view plans and statistics about the plan in the Query Store reports.

In versions before SQL Server 2019, the default Query Store capture mode setting was ALL, which included all queries that were executed. In SQL Server 2019, this default has been changed to AUTO, which is recommended. The AUTO Query Store capture mode forgets queries that are insignificant in terms of execution duration or frequency. Further, the CUSTOM Query Store capture mode allows for more fine tuning that may become necessary on large and busy databases. While AUTO works fine for most servers, you can use custom capture policies to configure the tradeoff: the amount of history remembered by Query Store versus the amount of storage the Query Store consumes inside the user database.

The Query Store retains data up to two limits: a max size (1,000 MB by default) and a Stale Query Threshold time limit in days (30 by default). If Query Store reaches its max size, it cleans up the oldest data. Because Query Store data is saved in the database, its historical data is not affected by the commands we looked at earlier in this chapter to clear the procedure cache, such as DBCC FREEPROCCACHE.

You should almost always keep the size-based cleanup mode set to the default, Auto. Otherwise, when the max size is reached, Query Store stops collecting data and enters read-only mode. If you find that the Query Store is not storing more historical days of data than your stale query threshold setting, increase the max size setting.
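
The retention settings discussed in this section can be scripted together rather than set in the UI. The following sketch uses documented option names with values similar to the defaults; adjust them for your own retention needs:

ALTER DATABASE [WideWorldImporters]
SET QUERY_STORE (
    OPERATION_MODE = READ_WRITE,
    QUERY_CAPTURE_MODE = AUTO,
    MAX_STORAGE_SIZE_MB = 1000,
    CLEANUP_POLICY = (STALE_QUERY_THRESHOLD_DAYS = 30),
    SIZE_BASED_CLEANUP_MODE = AUTO
);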

Troubleshoot with Query Store data

Query Store has several built-in dashboards, shown in Figure 14-8, to help you examine query performance and overall performance over recent history.

A screenshot of a portion of the Object Explorer menu of SQL Server Management Studio, showing the built-in Query Store reports.

Figure 14-8 The SQL Server Object Explorer list of built-in dashboards available for Query Store in SSMS.

You can also write your own reports against the collection of system DMOs that present Query Store data to administrators and developers by using the VIEW DATABASE STATE permission.

On many of the dashboards, there is a button with a crosshairs symbol, as shown in Figure 14-9. If a query seems interesting, expensive, or is of high value to the business, you can select this button to view a new window that tracks the query when it’s running as well as various plans identified for that query.

A screenshot of the Query Store menu, hovering on the “Track the selected query in a new Tracked Queries window” button.

Figure 14-9 The Query Store toolbar at the top of the screen on many of the dashboards—in this example, the toolbar for the Regressed Queries report.

You can review the various plans for the same statement, compare the plans, and if necessary, force a plan that you believe is better than the Query Optimizer will choose into place. Compare the execution of each plan by CPU time, duration, logical reads, logical writes, memory consumption, physical reads, and several other metrics.

Most of all, the Query Store can be valuable by informing you when a query started using a new plan. You can see when a plan was generated and the nature of the plan; however, the cause of the plan’s creation and replacement is not easily deduced, especially when you cannot correlate to a specific DDL operation or system change. Query plans can become invalidated automatically due to large changes in statistics resulting from data inserts or deletes, changes made to other statements in the stored procedure, changes to any of the indexes used by the plan, or manual recompilation due to the RECOMPILE option.

As discussed in the upcoming “Query Store hints” section, forcing a statement (see Figure 14-10) to use a specific execution plan should not be a common activity. If you have access to the source code, work on a code change, using a forced plan only temporarily. For systems where you have no code access, you can use a Query Store hint for specific performance cases, problematic queries demanding unusual plans, and so on. If the forced plan becomes invalid, such as when an index it relies on is changed or dropped, SQL Server moves on without the forced plan and without a warning or error, although Query Store will still show that the plan is being forced for that statement. Once a plan has been forced for a statement (using the Force Plan button), the plan is displayed with a check mark.

A screenshot of the execution results chart in Query Store, showing performance over time for different plans for the same query. These are displayed in different colors, with a key on the right referencing the Plan ID.

Figure 14-10 The Query Store records the query’s execution results.

Query Store hints

In past editions of this book, in this space, we’d discuss plan guides—an aging and well-documented feature that was inconvenient if not downright painful to implement. The new Query Store hints feature is a superior alternative to plan guides. Query Store hints provide powerful tools to shape queries without making code changes, giving administrators and developers options to modify query plans substantially and easily.

Query Store hints were in preview for Azure SQL Database and Azure SQL Managed Instance starting mid-2021, and later Microsoft brought the feature to SQL Server 2022. Like many of the new IQP features introduced in SQL Server 2022, Query Store hints require the Query Store to be enabled on the database. For this reason, you should strongly consider enabling, monitoring, and customizing the Query Store feature on every performance-sensitive database.

Query Store hints give administrators an easy replacement for the older plan guides feature, which was introduced in SQL Server 2005. You can use the Query Store to force replacement plans into action, and you can use Query Store hints to force hints into query plans. Neither requires code changes, giving administrators and developers options to modify query plans coming from third-party software, SSIS packages, and business intelligence tools.

With both plan guides and Query Store hints, you can influence a plan by simply adding a hint (a common example is RECOMPILE). This can be very useful when SQL Server is choosing a plan that doesn’t work for you and you have no way to change the query code in question. Combined with new improvements around parameter sniffing thanks to PSP optimization, it is easy to achieve performance gains in SQL Server 2022.

Query Store is a more complete tuning solution than plan guides, capturing plans from the plan cache and tracking query performance with different plans over time. A suite of reports and graphical tools make it easy to investigate query performance. Like with plan guides, you can manually override the query plan chosen with a previously observed plan. Further, the automatic plan correction feature can automatically override a query plan with a previously observed plan that performed better. Automatic plan correction was introduced with SQL Server 2017.

This section reviews aspects of both tools to help guide you as to which tool to choose. Note, however, that tools that force a plan to override what the Query Optimizer has chosen are not considered the best approach to query tuning. If you have an application where you own the source code, forcing a plan might be good to do until you can make a change to code, but should not be your primary tuning tool. If it is a third-party application, you should work with your vendor on a proper solution, but these features will help you to get past a serious issue.

Use Query Store hints

Currently, like plan guides, Query Store hints are driven by T-SQL stored procedures and not accessible from within any GUI (yet). But the identification of queries can occur through the Query Store interface, through a unique key for all queries captured by the Query Store, query_id.

Caution

Query Store hints—like plan guides or forcing plans using USE PLAN or the Query Store—are an advanced troubleshooting technique and are not without some risk.

The following steps detail how to apply a Query Store hint and how to back out of it quickly. Be prepared to observe and roll back any hints.

  1. Identify a useful hint for the query, ideally by using a non-production performance testing environment with similar hardware and data scale.

  2. Identify the Query Store–assigned query_id for the query. Note that the query_id will be different on different instances of SQL Server.

  3. Use sys.sp_query_store_set_hints to apply a hint. Most query hints are supported. For example, to apply the MAXDOP 1 hint to query_id 1234, use the following:

    EXEC sys.sp_query_store_set_hints @query_id= 1234,
    @query_hints = N'OPTION(MAXDOP 1)';

    You can specify more than one query hint in a Query Store hint, just as you would in the OPTION clause of any T-SQL query. For example, to specify MAXDOP 1 and the pre-SQL Server 2014 cardinality estimator, use this:

    EXEC sys.sp_query_store_set_hints @query_id= 1234,
    @query_hints = N'OPTION(MAXDOP 1,
    USE HINT(''FORCE_LEGACY_CARDINALITY_ESTIMATION''))';

    The Query Store hint takes effect immediately and adds three attributes to the execution plan XML: QueryStoreStatementHintId, QueryStoreStatementHintText, and QueryStoreStatementHintSource. If you’re curious, you can review these to see the Query Store hint in action, and prove the hint altered the query without code changes.

  4. Observe and confirm that your hint is helping the query’s execution. Confirm the Query Store hint(s) currently in place for your query as follows:

    SELECT * FROM sys.query_store_query_hints
    WHERE query_id = 1234;
  5. Remove the Query Store hint when necessary with sys.sp_query_store_clear_hints. (You want to prepare for this ahead of time.)

    EXEC sys.sp_query_store_clear_hints @query_id = 1234;
  6. Set yourself a reminder to reevaluate any Query Store hints on a regular basis. Data distributions change and the winds of fate blow. Changes to underlying data and concurrent server workloads might cause Query Store hints to generate suboptimal execution plans in the future.

    If you set a Query Store hint, it will overwrite any existing hint for that query_id. Query Store hints will override other hard-coded statement level hints and plan guides, but you should avoid conflicting instructions as they could be confusing for others in your environment.

Automatic plan correction

Automatic plan correction is a feature that relies on the Query Store to detect and revert query duration regression. For example, suppose a commonly executed query normally runs in 100 milliseconds, but then changes execution plans and starts finishing in 2 minutes. Instead of waiting for complaints of slow performance, the engine can notice this regression and deal with it. The feature was originally released for Azure SQL Database; SQL Server 2017 introduced it to the on-premises Database Engine.

A typical use case is to view the regressed queries report in Query Store, identify a query that has regressed in duration, and then force a better past execution plan into use. With automatic plan correction enabled, the database can detect plan regression and take action to force the previous plan back into action, automatically. The sample syntax for enabling automatic plan correction is below:

ALTER DATABASE WideWorldImporters SET AUTOMATIC_TUNING (FORCE_LAST_GOOD_PLAN = ON );

The sys.dm_db_tuning_recommendations DMO captures plan recommendations based on query performance regression. This doesn’t happen immediately—the feature has an algorithm that requires several executions before regression is identified. When a recommendation appears in sys.dm_db_tuning_recommendations, it includes a large amount of diagnostic data, including a plain-language explanation for the recommendation to be generated, and a block of JSON data containing diagnostic information.
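
A simple query against this DMO, following the documented JSON schema of the details column, might look like this sketch:

SELECT name, reason, score, state,
       JSON_VALUE(details, '$.implementationDetails.script') AS force_plan_script
FROM sys.dm_db_tuning_recommendations;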

Intelligent query processing

Intelligent query processing (IQP) is not a tool or a GUI option, but rather a suite of behind-the-scenes features that make the processing of queries more efficient. Many are specific to the database compatibility level; you don’t benefit from them in an older compatibility level. Some features help to pick or adapt a query plan to current conditions. Others are straight-up changes to how the Database Engine uses long-existing query constructs. In every case, the goal of IQP features is to improve the way queries are processed without code changes.

The remainder of this chapter covers the new IQP features in SQL Server 2022 and some of the most significant features in recent versions, though a review of the complete list of the IQP suite of performance features is worthwhile.

Batch mode on rowstore

One of the features added, along with columnstore indexes, in SQL Server 2012 was a type of processing known as batch mode. Columnstore indexes were built to process compressed rowgroups containing millions of rows. A new processing mode was needed when the heuristics told the query processor it would be worthwhile to work on batches of rows at a time.

Starting with SQL Server 2019 and included in Azure SQL Database, this feature was extended to work for certain types of queries with row store tables and indexes as well as columnstore tables and indexes. A few examples:

  • Queries that use large quantities of rows in a table, often in analytical queries touching hundreds of thousands of rows.

  • Systems that are CPU bound in nature. (I/O bottlenecks are best handled with a columnstore index.)

The feature is enabled when the compatibility level is at least 150. Though unusual, if you find it is harming performance, you can turn it off using ALTER DATABASE without lowering the compatibility level:

ALTER DATABASE SCOPED CONFIGURATION SET BATCH_MODE_ON_ROWSTORE = OFF;

Instead of changing this setting for the entire database, you could disallow this feature for a specific query, perhaps one that touches a large number of rows, by using the DISALLOW_BATCH_MODE query hint. For example:

SELECT …
FROM …
OPTION(USE HINT('DISALLOW_BATCH_MODE'));

Note

Columnstore and rowstore indexes continue to exchange advantageous features. In SQL Server 2022, ordered clustered columnstore indexes arrived, a feature first introduced for Azure Synapse Analytics. We cover this more in the next chapter.

Cardinality estimation (CE) feedback

Similar to the other feedback features, CE feedback allows the Query Optimizer to adapt when suboptimal cardinality estimation hurts query performance. CE estimates the total number of rows processed at the various stages of a query plan. (Constraints and key relationships between tables can help inform and shortcut estimation, another good reason to put them in place.)

CE feedback is a process by which the Query Optimizer learns by modeling query behavior over time. This feature requires both compatibility level 160 (for SQL Server 2022) and the Query Store to be enabled for the database.
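
If you need to turn CE feedback off for a database without lowering the compatibility level, there is a database-scoped configuration for that. A minimal sketch:

ALTER DATABASE SCOPED CONFIGURATION SET CE_FEEDBACK = OFF;
-- Re-enable with:
-- ALTER DATABASE SCOPED CONFIGURATION SET CE_FEEDBACK = ON;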

In the past, the cardinality estimator and the database’s compatibility level were closely linked. (Changes in SQL Server 2014 improved the cardinality estimator in most but not all cases, with severe impacts when it missed.) CE feedback does not change the database compatibility level for the query, but makes incremental corrections via the Query Store hint feature.

At the time of this writing, CE feedback is not yet available in Azure SQL Database or Azure SQL Managed Instance, but should be available in the near future.

SQL Server 2022 also added a persistence feature for CE feedback. Persistence helps CE feedback avoid poor adjustment decisions by remembering query information, even if a plan is evicted from cache. Using the Query Store, estimates can be considered over multiple query executions.

Degree of parallelism (DOP) feedback

DOP feedback has perhaps the biggest impact of all the feedback features introduced to make SQL Server more adaptive to observed performance. The ability for the Query Optimizer to analyze and adapt to observed query performance and make changes to parallelism with each query execution is a powerful self-tuning feature.

DOP feedback is introduced with SQL Server 2022, and requires the Query Store feature to be enabled. Further, the DOP_FEEDBACK database scoped configuration can be enabled or disabled if necessary.
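
For example, a minimal sketch of that database-scoped configuration:

ALTER DATABASE SCOPED CONFIGURATION SET DOP_FEEDBACK = ON;
-- Turn it back off with:
-- ALTER DATABASE SCOPED CONFIGURATION SET DOP_FEEDBACK = OFF;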

Instead of worrying about optimizing the database’s MAXDOP for the most important parts of a workload, DOP feedback automatically adjusts parallelism for repeating queries with the goal of increasing overall concurrency and reducing wait times. Smartly, DOP feedback excludes some wait types that are not relevant to this part of query performance, such as buffer latch, buffer IO, and network IO waits.

DOP feedback adjusts queries automatically by adding the MAXDOP query hint, but never at a value exceeding the database’s MAXDOP setting. You might consider exploring the performance gains with DOP feedback in a different (higher) MAXDOP setting for your server or database in SQL Server 2022.

At the time of this writing, DOP feedback is not yet available in Azure SQL Database or Azure SQL Managed Instance, but should be available in the near future.

SQL Server 2022 also added a persistence feature for DOP feedback. Persistence helps the DOP feedback feature avoid poor adjustment decisions by remembering query information, even if a plan is evicted from cache. Using the Query Store, optimal parallelism can be considered over multiple executions.

Memory grant feedback

When a query executes, it uses some amount of memory. Memory grant feedback lets future executions of the same query know if the memory granted for the execution was too much or too little, so it can adjust future executions. Memory grant feedback was introduced in SQL Server 2017 for batch mode executions, and in SQL Server 2019 for row mode executions.

If for some reason you want to disable this feature in a database, you can do so without lowering the database compatibility level using the following statements:

-- SQL Server 2017
ALTER DATABASE SCOPED CONFIGURATION SET DISABLE_BATCH_MODE_MEMORY_GRANT_FEEDBACK = ON;
-- Azure SQL Database, SQL Server 2019 or later
ALTER DATABASE SCOPED CONFIGURATION SET BATCH_MODE_MEMORY_GRANT_FEEDBACK = OFF;

For row mode queries, this feature is controlled using:

ALTER DATABASE SCOPED CONFIGURATION SET ROW_MODE_MEMORY_GRANT_FEEDBACK = OFF;

SQL Server 2022 further enhances memory grant feedback with two additional features: persistence and percentile improvement. Both are intended to help the Query Optimizer avoid expensive memory spills by accurately estimating (or at worst, overestimating) the amount of memory needed for crucial parts of an execution plan.

Memory grant feedback persistence allows SQL Server to remember query memory grant information even if a plan is evicted from cache. Using the Query Store, feedback can be considered over multiple executions. Persistence also applies to CE feedback and DOP feedback, as stated.

Memory grant feedback percentile adjustment allows the memory grant adjustment to examine the recent history of query execution, not just the most recent execution. The complex calculation now includes the 90th percentile of past memory grants over time. The percentile adjustment only applies to memory grant feedback.

When the new Query Store for secondary replicas feature is enabled, memory grant feedback is replica aware and can help primary and secondary replica workloads differently.

Parameter Sensitive Plan optimization

Parameter Sensitive Plan (PSP) optimization, new in SQL Server 2022, addresses the age-old problem of queries getting a suboptimal execution plan when the value of a filter wildly changes the result sets. PSP solves the problem in an intuitive way: by allowing more than one cached execution plan to be saved for a single query.

Imagine a search query for sales, filtered on customer, where customer_id = @customer_id. Customer 1 has been purchasing goods for decades, and has millions of invoice line items. Customer 2 is a new customer with a single invoice. In previous versions of SQL Server, should the execution plan for customer 2 become the execution plan for this search query, performance of the query on customer 1 would be poor. This is because the operators chosen for the execution plan would be unlikely to scale from one row to millions of rows in the result set.

Previous solutions required an understanding of query parameterization and the application of a variety of strategies, including cache preparation, synthetically calling queries with their largest filter parameters using the OPTIMIZE FOR query hint, or using plan guides. Between PSP optimization and Query Store hints, SQL Server 2022 provides multiple superior options to deal with this classic problem. PSP optimization uses a complex set of instructions that require a significant amount of performance data to act and cache a different plan, in an effort to minimize memory utilization associated with unnecessary plan retention.

PSP optimization is part of the database compatibility level 160 feature set, so be sure to update the compatibility level for any databases migrated up to SQL Server 2022. Query Store is not required for PSP optimization, but it is recommended.

If for some reason you need to disable PSP optimization, you can do so without lowering the database compatibility level with the PARAMETER_SENSITIVE_PLAN_OPTIMIZATION database-scoped configuration option.
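
A minimal sketch of that change:

ALTER DATABASE SCOPED CONFIGURATION SET PARAMETER_SENSITIVE_PLAN_OPTIMIZATION = OFF;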

Caution

A known bug in an early public preview of PSP optimization in SQL Server 2022 raised an error when you did not specify two part names (schemaname.objectname) in stored procedures that were not in the dbo schema. While this bug has been fixed, it’s just another reminder that a good best practice is to always specify two-part names in your T-SQL code, even for objects in the dbo schema.

Table variable deferred compilation

Table variable deferred compilation was introduced in SQL Server 2019 to deal with the glaring lack of statistics for a table variable at compile time. Previously, SQL Server defaulted to a guess of one (1) row for the number of rows in the table variable. This provided poorly performing plans if the programmer had stored any significant number of rows in the table variable.

Similarly, an IQP feature introduced in SQL Server 2017 called interleaved execution improved the performance of multi-statement table-valued functions. Interleaved execution let the Query Optimizer execute parts of the query during optimization to get better estimates, because if your multi-statement table-valued function is going to output 100,000 rows, the plan needs to be considerably different.

Instead of using the guess of 1 to define the query plan, table variable deferred compilation waits to complete the actual plan until the table variable has been loaded the first time, and then the rest of the plan is generated.

Note

Table variable optimizations do not make table variables the best choice for large numbers of rows. Table variables still lack column statistics, a key difference between them and temp tables (prefixed with # or ##) that can make temp tables far superior for large rowsets.

T-SQL scalar user-defined function (UDF) inlining

A common culprit of poor performance in custom applications is user-defined functions (UDFs). Every programmer who has taken any class in object-oriented programming (OOP) instinctively desires to modularize or de-duplicate their code. So, if you have a scenario in which you want to classify some data (say, something simple like CASE WHEN value = 1 THEN 'True' ELSE 'False' END), it makes sense from a programmer’s perspective to bundle this up into a coded module (code reuse). However, UDFs can become a bear trap at scale for application performance.

Introduced in SQL Server 2019, scalar UDF inlining alleviates some of the performance hit introduced by UDFs at scale; it’s an automatic, no-code-change-necessary, performance boost. This is a complex fix that substitutes scalar expressions or subqueries in the query during query optimization. Throughout the post-RTM life of SQL Server 2019, cumulative updates added complexity, issue fixes, and restrictions to UDF inlining.

Example of scalar UDF inlining

To demonstrate UDF inlining, we created the following overly simple UDF in the WideWorldImporters sample database:

USE WideWorldImporters;
GO
CREATE SCHEMA Tools;
GO
CREATE FUNCTION Tools.Bit_Translate
(@value bit)
RETURNS varchar(5)
AS
BEGIN
     RETURN (CASE WHEN @value = 1 THEN 'True' ELSE 'False' END);
END;

To demonstrate, execute the function in the same query twice: once in SQL Server 2017 (14.0) database compatibility level, before scalar UDF inlining was introduced, and again with SQL Server 2022 (16.0) database compatibility level behavior.

SET STATISTICS TIME ON;
ALTER DATABASE WideWorldImporters SET COMPATIBILITY_LEVEL = 140; --SQL Server 2017
GO
SELECT Tools.Bit_Translate(IsCompressed) AS CompressedFlag,
CASE WHEN IsCompressed = 1 THEN 'True' ELSE 'False' END AS CompressedFlag_Desc
FROM  Warehouse.VehicleTemperatures;
GO
ALTER DATABASE WideWorldImporters SET COMPATIBILITY_LEVEL = 160; -- SQL Server 2022
GO
SELECT Tools.Bit_Translate(IsCompressed) AS CompressedFlag,
CASE WHEN IsCompressed = 1 THEN 'True' ELSE 'False' END
FROM   Warehouse.VehicleTemperatures;

On the 65,998 rows returned in each result set, you will likely not notice a difference in performance. Checking the output from SET STATISTICS TIME ON on this author’s machine, the execution in COMPATIBILITY LEVEL = 160 was only about 75 milliseconds faster on average.

Looking at the actual plan CPU used for the two executions in Figure 14-11, you can see an interesting difference.

A screenshot of two graphical execution plans. Both show a Table Scan operator, a Compute Scalar operator, and the SELECT operator. The costs are the same, but the first shows 65998 of 65998, indicating the number of times that operator was called. The second, benefiting from scalar UDF inlining, does not, behaving as if the query were written using a scalar expression rather than a UDF.

Figure 14-11 Query plan output for two runs, the first in SQL Server 2017 (14.0) compatibility level, and the second in SQL Server 2022 (16.0) benefitting from scalar UDF inlining.

The big thing to notice between these two executions is that the Compute Scalar operator in query 2 appears as a typical Compute Scalar operator, as it would for any scalar expression that does not include a UDF. In query 1, it shows rows passing through, and the time spent calculating the scalar for each row that passes through. Even in this extremely simple case, we saved time because we avoided running the function in a cursor-like loop for every row.

There are limitations to scalar UDF inlining, such as not working when time-dependent intrinsic functions like SYSDATETIME() are present. You cannot change the security context using EXECUTE AS (only EXECUTE AS CALLER, the default, is allowed). You also cannot benefit from scalar UDF inlining when referencing table variables or table-valued parameters.

Scalar UDF inlining has immediate value for databases whose programmers have overused scalar UDFs. For many, scalar UDF inlining removes the problematic performance stigma associated with scalar UDFs, opening up more use cases. Formatting functions and translation functions where it might be easier than creating a table are now possible and will perform very well, as opposed to destroying your performance.
