Chapter 8

Maintain and monitor SQL Server

Previous chapters covered the importance and logistics of database backups, but what else do you need to do on a regular basis to maintain a healthy SQL Server?

This chapter lays the foundation for the what and why of Microsoft SQL Server monitoring, based on dynamic management objects (DMOs), Database Consistency Checker (DBCC) commands, Extended Events (which replace Profiler/trace), and other tools provided by Microsoft.

Beyond simply setting up these tools, this chapter reviews what to look for on SQL Server instances on Windows and Linux, as well as SQL monitoring solutions in the Azure portal.

There is a lot for a DBA to monitor in your databases: corrupt data files, unused indexes, stale statistics, improperly sized data files, and performance metrics that have yet to be baselined, just to start. This chapter covers these topics and more.

All sample scripts in this book are available for download at https://MicrosoftPressStore.com/SQLServer2022InsideOut/downloads.

Detect, prevent, and respond to database corruption

After database backups, the second most important task concerning database integrity is proper configuration to prevent—and monitoring to mitigate—database corruption. A very large part of this is a proactive schedule of detection for rare cases when corruption occurs despite your best efforts. This isn’t a complicated topic and mostly revolves around configuring one setting and regularly running one command.

Set the database’s page verify option

For all databases, the page verify option should be CHECKSUM. CHECKSUM has been the superior and default setting since SQL Server 2005, but it is not applied automatically when a database from an older version is restored or attached to a newer SQL Server instance; it requires a manual change.

If you still have databases whose page verify option is not CHECKSUM, you should change this setting immediately. The legacy NONE or TORN_PAGE_DETECTION options are a clear sign that the database has been moved over the years from a pre-SQL Server 2005 version. This setting is never changed automatically; you must change it yourself after restoring or upgrading the database to a new version of SQL Server.

Warning

Before making the change to the CHECKSUM page verify option, take a full backup!

If corruption is found with the newly enabled CHECKSUM setting, the database can drop into a SUSPECT state, in which it becomes inaccessible. It is entirely possible that changing a database from NONE or TORN_PAGE_DETECTION to CHECKSUM could result in the discovery of existing, even long-present database corruption.
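
To find databases with a legacy page verify setting and correct them, a quick check like the following will do; the database name in the ALTER DATABASE statement is illustrative:

--List databases not using CHECKSUM page verification
SELECT name, page_verify_option_desc
FROM sys.databases
WHERE page_verify_option_desc <> 'CHECKSUM';

--Change the setting for one database (illustrative name)
ALTER DATABASE WideWorldImporters SET PAGE_VERIFY CHECKSUM;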

You should periodically run DBCC CHECKDB on all databases. This is a time-consuming but crucial process. Run DBCC CHECKDB at least once within every backup retention period, and consider it nearly as important as the regular database backups themselves.

The only reliable solution to database corruption is restoring from a known good backup.

For example, if you keep local backups around for one month, you should ensure that you perform a successful DBCC CHECKDB at least once per month, but more often is recommended. This ensures you will at least have a recovery point for uncorrupted, unchanged data, and a starting point for corrupted data fixes.
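
One way to verify when DBCC CHECKDB last completed successfully for a database, assuming SQL Server 2016 SP2 or later, is the LastGoodCheckDbTime property of DATABASEPROPERTYEX:

--Date and time of the last successful DBCC CHECKDB (database name is illustrative)
SELECT last_good_checkdb =
    DATABASEPROPERTYEX(N'WideWorldImporters', 'LastGoodCheckDbTime');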

The DBCC CHECKDB command covers other more granular database integrity check tasks, including DBCC CHECKALLOC, DBCC CHECKTABLE, and DBCC CHECKCATALOG, all of which are important, and in only rare cases need to be run separately to split up the workload.

Running DBCC CHECKDB with no other parameters or syntax performs an integrity test on the current database context. Without the parenthesized database argument, however, you cannot provide additional options such as NOINDEX or the repair options.

On large databases, DBCC CHECKDB is a resource-intensive operation (CPU, memory, and I/O) that can take hours to complete and can affect other user queries because of that resource consumption, so it should be run outside of business hours whenever possible. To mitigate the impact, consider specifying the MAXDOP option (more on that in a moment). You can evaluate the progress of a DBCC CHECKDB operation (as well as backup and restore operations) by referencing the value in sys.dm_exec_requests.percent_complete for the executing session.

Here are some parameters worth noting:

  • NOINDEX. This can reduce the duration of the integrity check by skipping checks on nonclustered rowstore and columnstore indexes. It is not recommended.

    Example usage:

    DBCC CHECKDB (databasename, NOINDEX);
  • REPAIR_REBUILD. This performs only repairs that carry no risk of data loss. However, there are some limitations to its potential benefit. You should run this only after considering other options, including a backup and restore, because although it might have some success, it is unlikely to result in a complete repair. It can also be very time consuming, involving the rebuilding of indexes based on attempted repair data.

    Example usage:

    DBCC CHECKDB (databasename) WITH REPAIR_REBUILD;
  • REPAIR_ALLOW_DATA_LOSS. You should run this only as a last resort to achieve a partial database recovery, because it can force a database to resolve errors by simply deallocating pages, potentially creating gaps in rows or columns. You must run this in SINGLE_USER mode, and you should run it in EMERGENCY mode. Review the DBCC CHECKDB documentation for a number of caveats, and do not execute this command casually.

    Example usage (last resort only, not recommended!):

    ALTER DATABASE WideWorldImporters SET EMERGENCY, SINGLE_USER;
    DBCC CHECKDB('WideWorldImporters', REPAIR_ALLOW_DATA_LOSS);
    ALTER DATABASE WideWorldImporters SET MULTI_USER;

    Note

    A complete review of EMERGENCY mode and REPAIR_ALLOW_DATA_LOSS is detailed in this blog post by Microsoft’s original DBCC CHECKDB engineer, Paul Randal: http://sqlskills.com/blogs/paul/checkdb-from-every-angle-emergency-mode-repair-the-very-very-last-resort.

  • WITH NO_INFOMSGS. This suppresses informational status messages and returns only errors.

    Example usage:

    DBCC CHECKDB (databasename) WITH NO_INFOMSGS;
  • WITH ESTIMATEONLY. This estimates the amount of space required in tempdb for the CHECKDB operation.

    Example usage:

    DBCC CHECKDB (databasename) WITH ESTIMATEONLY;
  • WITH MAXDOP = n. Similar to limiting the maximum degree of parallelism in other areas of SQL Server, this option limits the CHECKDB operation’s parallelism, possibly extending its duration but reducing its CPU utilization. SQL Server Enterprise edition supports parallel execution of the DBCC CHECKDB command, up to the server’s MAXDOP setting. In Enterprise edition, you can therefore specify MAXDOP = 1 to run the command single-threaded, or override the instance-level maximum degree of parallelism with MAXDOP = 0 to give CHECKDB unlimited parallelism so that it potentially finishes sooner. Outside of the Enterprise and Developer editions of SQL Server, objects are not checked in parallel.

    Example usage, combined with the aforementioned NO_INFOMSGS option to show multiple parameters:

    DBCC CHECKDB (databasename) WITH NO_INFOMSGS, MAXDOP = 0;

Repair database data file corruption

Of course, the only real remedy to data corruption after it has happened is to restore from a backup that predates the corruption. The well-documented DBCC CHECKDB option for REPAIR_ALLOW_DATA_LOSS, discussed previously, should be a last resort.

If you are fortunate enough that corruption is limited to nonclustered indexes, you can recover simply by rebuilding the indexes that DBCC CHECKDB reports as corrupt. It is sometimes even possible to repair missing pages in clustered indexes by piecing the lost columns back together from nonclustered indexes, admittedly a lucky endeavor that this author has benefited from. In many cases, however, clustered index or system pages are corrupt, and the only option is to restore the database.

Finally, availability groups provide built-in data-corruption detection and automatic page repair, using an uncorrupted copy of a page on one replica to replace the corrupted, inaccessible page on another.

Recover from database transaction log file corruption

In addition to following guidance in the previous chapter on the importance of backups, you can reconstitute a corrupted or lost database transaction log file by using the code that follows. A lost transaction log file could result in the loss of uncommitted data (or in the case of delayed durability tables, the loss of data that hasn’t been made durable in the log yet), but in the event of a disaster recovery involving the loss of the .ldf file with an intact .mdf file, this could be a valuable step.

It is possible to rebuild a blank transaction log file in a new file location for a database by using the following command:

ALTER DATABASE DemoDb SET EMERGENCY, SINGLE_USER;
ALTER DATABASE DemoDb REBUILD LOG
ON (NAME = DemoDb_Log, FILENAME = 'F:\DATA\DemoDb_new.ldf');
ALTER DATABASE DemoDb SET MULTI_USER;

Note

Rebuilding a blank transaction log file using ALTER DATABASE ... REBUILD LOG is not supported for databases containing a MEMORY_OPTIMIZED_DATA filegroup.

Database corruption in Azure SQL Database

Like many other administrative concerns with a platform as a service (PaaS) database, integrity checks for Azure SQL Database are automated. Microsoft takes data integrity in its PaaS database offering very seriously and provides strong assurances of assistance and recovery for this product. Although corruption is rare, Azure engineering teams respond 24×7 globally to data-corruption reports. The Azure SQL Database engineering team details its response promises at https://azure.microsoft.com/blog/data-integrity-in-azure-sql-database/.

Note

While Azure SQL Managed Instance has many PaaS-like qualities, automated integrity checks are not one of them. You should set up maintenance plans to execute DBCC CHECKDB, index maintenance, and the other maintenance tasks discussed in this chapter for Azure SQL Managed Instance.

Maintain indexes and statistics

Index fragmentation occurs when insert, update, and delete activity within tables does not have enough free space to work in, causing rows to be split across pages, and when index pages fall out of logical order, resulting in inefficient scans. In short, index fragmentation is the improper organization of rowstore data within the files that SQL Server maintains. Removing fragmentation is really about minimizing the number of pages that must be involved when queries read or write those data pages. Reducing fragmentation in database objects is vastly different from reducing fragmentation at the drive level, and has little in common with the Windows Disk Defragmenter application. Although fragmentation doesn’t translate to page locations on disk, and has even less relevance on storage area networks (SANs), it does translate to the activity of I/O systems when retrieving data.

In performance terms, the higher the amount of fragmentation (easily measurable in dynamic management views, as discussed later), the more activity is required for accessing the same amount of data.

The cause of index fragmentation is writes. Our data would stay nice and tidy if applications would stop writing to it! Updates and deletes inevitably have a significant effect on clustered and nonclustered index fragmentation, and inserts can also cause fragmentation depending on the clustered index design.

The information in this section is largely unchanged from previous versions of SQL Server and applies to SQL Server instances, databases in Azure SQL Database, Azure SQL Managed Instance, and even dedicated SQL pools in Azure Synapse Analytics (formerly known as Azure SQL Data Warehouse).

Change the fill factor when beneficial

Each rowstore index on disk-based objects has a numeric property called a fill factor that specifies the percentage of space to be filled with rowstore data in each leaf-level data page of the index when it is created or rebuilt. The instance-wide default fill factor is 100 percent, which is represented by the setting value 0, and means that each leaf-level data page will be filled with as much data as possible. A fill factor setting of 80 (percent) means that 20 percent of leaf-level data pages will be intentionally left empty when data is inserted. You can adjust this fill factor percentage for each index to manage the efficiency of data pages.

A non-default fill factor may help reduce the number of page splits, which occur when the Database Engine attempts to add a new row of data or update an existing row with more data to a page that does not have enough space to add a new row. In this case, the Database Engine will clear out space for the new row by moving a proportion of the old rows to a new page. A page split can be a time-consuming and resource-consuming operation, with many page splits possible during writes, and will lead to index fragmentation.

However, setting a non-default fill factor also increases the number of pages needed to store the same data, and therefore the number of reads needed for query operations. For example, a fill factor of 50 roughly doubles the space on the drive initially required to store, and therefore to access, the data when compared to the default fill factor of 0.

In most databases, data is read far more often than it is written; rows are inserted, updated, and deleted only occasionally. Indexes therefore benefit from a high or default fill factor (usually more than 80), because it is almost always more important to keep the number of reads to a manageable level than to minimize the resources needed to perform a page split. You can deal with index fragmentation by using the REBUILD or REORGANIZE commands, as discussed in the next section.

If the key value for an index is constantly increasing, such as an autoincrementing IDENTITY or SEQUENCE-populated column as the first key of a clustered index, new rows are added to the end of the index, and any gaps left by a lower fill factor would never be filled. For a table in which data is always inserted sequentially and never updated, changing the fill factor from the default may offer no advantage. Even after fine-tuning a fill factor, the benefit of reduced page splits might not be noticeable in write performance. The design of your database may also affect your choice of fill factor; for example, if your clustered index key is a GUID, you may choose to lower the fill factor.

You can set a fill factor when an index is first created, or you can change it by using the ALTER INDEX ... REBUILD syntax, as discussed in the next section.
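
For example, the following rebuild (using an index from the WideWorldImporters sample database for illustration) sets an 80 percent fill factor:

--Leave 20 percent of each leaf-level page empty on rebuild
ALTER INDEX FK_Sales_Orders_CustomerID ON Sales.Orders
REBUILD WITH (FILLFACTOR = 80);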

Note

The OPTIMIZE_FOR_SEQUENTIAL_KEY feature, introduced in SQL Server 2019, can further benefit IDENTITY and SEQUENCE-populated columns. For more on this recommended new feature, see Chapter 15, “Understand and design indexes.”

Track page splits

If you intend to fine-tune the fill factor for important tables to maximize the performance/storage space ratio, you can measure page splits in two ways: with a query on a DMV (discussed here), and with an Extended Event session (covered later in this chapter).

You can use the performance counter DMV to measure page splits in aggregate on Windows Server, as shown here:

SELECT * FROM sys.dm_os_performance_counters WHERE counter_name ='Page Splits/sec';

The cntr_value increments whenever a page split is detected. This is a bit misleading because to calculate the page splits per second, you must sample the incrementing value twice and divide by the time difference between the samples. When viewing this metric in Performance Monitor, the calculation is done for you.
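
As a simple sketch of that calculation, the following samples the cumulative counter twice, 10 seconds apart, and computes the rate:

DECLARE @sample1 bigint, @sample2 bigint;
SELECT @sample1 = cntr_value FROM sys.dm_os_performance_counters
WHERE counter_name = 'Page Splits/sec';
WAITFOR DELAY '00:00:10'; --wait 10 seconds between samples
SELECT @sample2 = cntr_value FROM sys.dm_os_performance_counters
WHERE counter_name = 'Page Splits/sec';
SELECT page_splits_per_sec = (@sample2 - @sample1) / 10.;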

You can also track page_split events alongside statement execution by adding the page_split event to sessions such as the Transact-SQL (T-SQL) template in the Extended Events wizard. You’ll see an example of this later in this chapter, in the section “Use Extended Events to detect page splits.”

  • Extended Events and the sys.dm_os_performance_counters DMV are discussed in more detail later in this chapter in the section “Query performance metrics with DMVs.” This section also includes a sample session script to track page_split events.

Monitor index fragmentation

You can find the extent to which an index is fragmented by interrogating the sys.dm_db_index_physical_stats dynamic management function (DMF).

Unlike most DMVs, this function can have a significant impact on server performance because it can tax I/O. To query this DMF, you must be a member of the sysadmin server role or the db_ddladmin or db_owner database roles. Alternatively, you can grant the VIEW DATABASE STATE or VIEW SERVER STATE permissions. The sys.dm_db_index_physical_stats DMF is often joined to catalog views like sys.indexes or sys.objects, which require the user to have some permissions to the tables in addition to VIEW DATABASE STATE or VIEW SERVER STATE.

Keep this in mind when scripting this operation for automated index maintenance. (We talk more about automating index maintenance in Chapter 9.)

Some of the following samples can be executed against the WideWorldImporters sample database. You can download and then restore the WideWorldImporters-Full.bak file from this location: https://go.microsoft.com/fwlink/?LinkID=800630. For example, to find the fragmentation level of all indexes on the Sales.Orders table in the WideWorldImporters sample database, you can use a query such as the following:

USE WideWorldImporters;
SELECT
DB = db_name(s.database_id)
, [schema_name] = sc.name
, [table_name] = o.name
, index_name = i.name
, s.index_type_desc
, s.partition_number -- if the object is partitioned
, avg_fragmentation_pct = s.avg_fragmentation_in_percent
, s.page_count -- pages in object partition
FROM sys.indexes AS i
CROSS APPLY sys.dm_db_index_physical_stats
(DB_ID(),i.object_id,i.index_id, NULL, NULL) AS s
INNER JOIN sys.objects AS o ON o.object_id = s.object_id
INNER JOIN sys.schemas AS sc ON o.schema_id = sc.schema_id
WHERE i.is_disabled = 0
AND o.object_id = OBJECT_ID('Sales.Orders');

The sys.dm_db_index_physical_stats DMF accepts five parameters: database_id, object_id, index_id, partition_number, and mode. The mode parameter defaults to LIMITED, the fastest method, but you can set it to SAMPLED or DETAILED. These additional modes are rarely necessary, but they provide more data, as well as more precise data. Some result set columns will be NULL in LIMITED mode. For the purposes of determining fragmentation, the default mode of LIMITED (used when the parameter value of NULL or the literal LIMITED is provided) suffices.

The five parameters of the sys.dm_db_index_physical_stats DMF are all nullable. For example, if you run the following script, you will see fragmentation statistics for all databases, objects, indexes, and partitions:

SELECT * FROM sys.dm_db_index_physical_stats(NULL,NULL,NULL,NULL,NULL);

We recommend against executing this in a production environment during operational hours because, again, it can have a significant impact on server resources, resulting in a noticeable drop in performance.

Maintain indexes

After your automated script has identified the objects most in need of maintenance with the aid of sys.dm_db_index_physical_stats, it should proceed with steps to remove fragmentation in a timely fashion during a maintenance window. The commands to remove fragmentation are ALTER INDEX and ALTER TABLE, with REBUILD and REORGANIZE options. We explain the differences later, but briefly, rebuild is more thorough and potentially disruptive, whereas reorganize is less thorough, not disruptive, but often sufficient.

You must implement index maintenance for both rowstore and columnstore indexes; we cover strategies for both in this section.

Ideally, your automated index maintenance script runs as often as possible during regularly scheduled maintenance windows and for a limited amount of time. For example, if your business environment allows for a maintenance window each night between 1 a.m. and 4 a.m., try to run index maintenance each night in that window. If possible, modify your script to avoid starting new work after 4 a.m., or use the RESUMABLE option to pause in-flight rebuilds at 4 a.m. (More on the latter strategy in the upcoming section “Rebuild indexes.”) In databases with very large tables, index maintenance may require more time than you have within a single maintenance window. Try to use the limited amount of time in each maintenance window to the greatest effect. Given ample time, this approach tends to reduce fragmentation better than, for example, a single very long maintenance period during a weekend. Pausing resumable rebuilds also allows active transaction log pages to be cleared by a log backup during the paused phases of an index rebuild.

  • For more on maintenance plans and automating index maintenance, including the typical “care and feeding” of a SQL Server, see Chapter 9.

Rebuild indexes

Performing an INDEX REBUILD operation on a rowstore index (clustered or nonclustered) physically re-creates the index B-tree. The goal of moving the pages is to make storage more efficient and to match the logical order provided by the index key. A rebuild operation is destructive to the index object and blocks other queries attempting to access the pages unless you provide the ONLINE option. Because the rebuild operation destroys and re-creates the index, it also updates the index statistics, eliminating the need to perform a subsequent UPDATE STATISTICS operation as part of regular maintenance.

Long-term table locks are held during the rebuild operation. One major advantage of SQL Server Enterprise edition remains the ability to specify the ONLINE option, which allows for rebuild operations that are significantly less disruptive to other queries, though not completely. This makes index maintenance feasible on SQL Servers with round-the-clock activity.

Consider using ONLINE with index rebuild operations whenever short maintenance windows are insufficient for rebuilding fragmented indexes offline. An online index rebuild, however, might take longer than an offline rebuild. There are also scenarios for which an online rebuild is not possible, including indexes containing the deprecated data types image, text, and ntext, or the xml data type. Since SQL Server 2012, it has been possible to perform online index rebuilds on indexes containing varchar(max), nvarchar(max), and varbinary(max) columns.

For the syntax to rebuild the FK_Sales_Orders_CustomerID nonclustered index on the Sales.Orders table with the ONLINE functionality in Enterprise edition, see the following code sample:

ALTER INDEX FK_Sales_Orders_CustomerID
ON Sales.Orders
REBUILD WITH (ONLINE=ON);

It’s important to note that if you perform any kind of index maintenance on the clustered index of a rowstore table, it does not affect the nonclustered indexes. Nonclustered index fragmentation will not change if you rebuild the clustered index.

Instead of rebuilding each index on a table individually, you can rebuild all indexes on a table by replacing the name of the index with the keyword ALL. For example, to rebuild all indexes on the Sales.OrderLines table, do the following:

ALTER INDEX ALL ON [Sales].[OrderLines] REBUILD;

This is usually overkill and inefficient, however, because not all indexes may have the same level of fragmentation or need for maintenance. Remember, we should perform index maintenance as granularly as possible.

For memory-optimized tables, we recommend a manual routine maintenance step using the ALTER TABLE ... ALTER INDEX ... REBUILD syntax. This is not to reduce fragmentation in the in-memory data; rather, it is to adjust the number of buckets in a memory-optimized table’s hash indexes, as shown in the sketch that follows. For more information on rebuilding hash indexes and bucket counts, see Chapter 15.
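
A minimal sketch of that maintenance step follows; the memory-optimized table name, hash index name, and bucket count are hypothetical:

--Adjust the bucket count of a hash index on a memory-optimized table
ALTER TABLE dbo.InMemoryOrders
ALTER INDEX IX_InMemoryOrders_OrderID
REBUILD WITH (BUCKET_COUNT = 2097152);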

Note

You can change the data compression option for indexes with the rebuild operation using the DATA_COMPRESSION option. For more detail on data compression, see Chapter 3, “Design and implement an on-premises database infrastructure.”

Aside from ONLINE, there are other options that you might want to consider for rebuild operations. Let’s look at them:

  • SORT_IN_TEMPDB. Use this when you want to create or rebuild an index using tempdb for sorting the index data, potentially increasing performance by distributing the I/O activity across multiple drives. This also means that the sorting worktables are written to the tempdb transaction log instead of the user database transaction log, potentially reducing the impact on the user database log file and allowing the user database transaction log to be backed up during the operation. (A combined example appears after this list.)

  • MAXDOP. Use this to mitigate some of the impact of index maintenance by limiting the number of parallel processors the operation can use. This can cause the operation to run longer, but with less impact on performance.

  • WAIT_AT_LOW_PRIORITY. First introduced in SQL Server 2014, this is the first of a set of parameters that you can use to instruct the ONLINE index maintenance operation to try not to block other operations. This feature is known as Managed Lock Priority, and this syntax is not usable outside of online index operations and partition-switching operations. (SQL Server 2022 also introduced the ability to use WAIT_AT_LOW_PRIORITY for DBCC SHRINKDATABASE and DBCC SHRINKFILE operations.) Here is the full syntax:

    ALTER INDEX PK_Sales_OrderLines on [Sales].[OrderLines]
    REBUILD WITH (ONLINE=ON (WAIT_AT_LOW_PRIORITY (MAX_DURATION = 5 MINUTES,
    ABORT_AFTER_WAIT = SELF)));

    The parameters for MAX_DURATION and ABORT_AFTER_WAIT instruct the statement on how to proceed if it begins to be blocked by another operation. The online index operation will wait, allowing other operations to proceed.

    The ABORT_AFTER_WAIT parameter provides an action at the end of the MAX_DURATION wait:

    • SELF instructs the statement to terminate its own process, ending the online rebuild step.

    • BLOCKERS instructs the statement to terminate the other process that is being blocked, terminating what is potentially a user transaction. Use with caution.

    • NONE instructs the statement to continue to wait. When combined with MAX_DURATION = 0, it is essentially the same behavior as not specifying WAIT_AT_LOW_PRIORITY.

  • RESUMABLE. Introduced in SQL Server 2017, this feature makes it possible to initiate an online index creation or rebuild that can be paused and resumed later, even after a server shutdown. You can also specify a MAX_DURATION in minutes when starting an index rebuild operation, which will pause the operation if it exceeds the specified duration. You cannot specify the ALL keyword for a resumable operation. The SORT_IN_TEMPDB=ON option is not compatible with the RESUMABLE option.
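
As promised in the SORT_IN_TEMPDB bullet, here is a combined example that rebuilds an index with several of these options; it assumes an edition that supports ONLINE rebuilds and uses an index from the WideWorldImporters sample database:

--Online rebuild, sorting in tempdb, limited to two parallel processors
ALTER INDEX FK_Sales_Orders_CustomerID ON Sales.Orders
REBUILD WITH (ONLINE = ON, SORT_IN_TEMPDB = ON, MAXDOP = 2);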

Note

Starting with SQL Server 2019, the RESUMABLE syntax can also be used when creating an index. Both ALTER INDEX and CREATE INDEX statements can be paused and resumed in this way.

To manage resumable index maintenance operations, you can view a list of resumable and paused index operations in the DMV sys.index_resumable_operations, where the state_desc column reflects RUNNING (and pausable) or PAUSED (and resumable).

Here is a sample scenario of a paused/resumed index maintenance operation on a large table in the sample WideWorldImporters database:

ALTER INDEX PK_Sales_OrderLines on [Sales].[OrderLines]
REBUILD WITH (ONLINE = ON, RESUMABLE = ON);

From another session, show that the index rebuild is RUNNING with the RESUMABLE option:

SELECT object_name = object_name (object_id), *
FROM sys.index_resumable_operations;

From a third session, run the following to pause the operation:

ALTER INDEX PK_Sales_OrderLines on [Sales].[OrderLines] PAUSE;

You can then show that the index rebuild is paused:

SELECT object_name = object_name (object_id), * FROM sys.index_resumable_operations;

This sample is on a relatively small table, so you may not be able to execute the pause before the index rebuild completes. If the pause does succeed, the session running the original index maintenance is disconnected and receives a severe error message. In the SQL Server Error Log, however, the event is recorded not as a severe error but as an informative note that “An ALTER INDEX ‘PAUSE’ was executed for….”

To resume the index maintenance operation, you have two options:

  • Reissue the same index maintenance operation. A warning will inform you that the paused operation will simply be resumed instead.

    ALTER INDEX PK_Sales_OrderLines on [Sales].[OrderLines] REBUILD
    WITH (ONLINE = ON, RESUMABLE = ON);
  • Issue a RESUME to the same index.

    ALTER INDEX PK_Sales_OrderLines on [Sales].[OrderLines] RESUME;
Reorganize indexes

Performing a REORGANIZE operation on an index uses fewer system resources and is much less disruptive than performing a full rebuild, while still accomplishing the goal of reducing fragmentation. It physically reorders the leaf-level pages of the index to match the logical order. It also compacts pages to match the fill factor on the index, though it does not allow the fill factor to be changed. This operation is always performed online; no long-term table locks (other than brief schema locks) are held, so queries and modifications to the underlying table or index data are not blocked for the duration of the REORGANIZE transaction.

Because the REORGANIZE operation is not destructive, it does not automatically update the statistics for the index afterward as a rebuild operation does. Thus, you should always follow a REORGANIZE step with an UPDATE STATISTICS step.

  • For more on statistics objects and their impact on performance, see Chapter 15.

The following example presents the syntax to reorganize the PK_Sales_OrderLines index on the Sales.OrderLines table:

ALTER INDEX PK_Sales_OrderLines on [Sales].[OrderLines] REORGANIZE;

None of the options available to REBUILD that we covered in the previous section are available to the REORGANIZE command. The only additional option specific to REORGANIZE is LOB_COMPACTION, which compacts large object (LOB) data and therefore affects only LOB data types: image, text, ntext, varchar(max), nvarchar(max), varbinary(max), and xml. By default, this option is enabled, but you can disable it for non-heap tables to potentially skip some activity, though we do not recommend doing so. For heap tables, LOB data is always compacted.
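
For example, the following reorganizes the same index with LOB compaction explicitly enabled (the default behavior, shown here only for illustration):

ALTER INDEX PK_Sales_OrderLines ON [Sales].[OrderLines]
REORGANIZE WITH (LOB_COMPACTION = ON);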

Update index statistics

SQL Server uses statistics to describe the distribution and nature of the data in tables. When the auto create statistics setting is enabled (as it is by default), the Query Optimizer creates single-column statistics when compiling queries. These statistics help the Query Optimizer create the most optimal query plans at runtime. The auto update statistics option prompts statistics to be updated automatically when they are accessed by a T-SQL query, but only after the table has passed a threshold of changed rows. Without relevant and up-to-date statistics, the Query Optimizer might not choose the best way to run queries.

An update of index statistics should accompany INDEX REORGANIZE steps to ensure that statistics on the table are current; it is not needed after INDEX REBUILD steps, because the INDEX REBUILD command also updates the index statistics.

The basic syntax to update the statistics for an individual table is as follows:

UPDATE STATISTICS [Sales].[Invoices];

The only command option to be aware of concerns the depth to which the statistics are scanned before being recalculated. By default, SQL Server samples a statistically significant number of rows in the table. This sampling is done with a parallel process starting with database compatibility level 150. This is fast and adequate for most workloads. You can optionally choose to scan the entire table by specifying the FULLSCAN option, or a sample of the table based on a percentage of rows or a fixed number of rows using the SAMPLE option, but these options are typically reserved for cases of unusual data skew where the default sampling may not provide adequate coverage for your column or index.
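
For example, the following statements use the FULLSCAN and SAMPLE options; they are rarely needed outside of unusual data skew:

--Scan every row before recalculating the statistics
UPDATE STATISTICS [Sales].[Invoices] WITH FULLSCAN;
--Or sample a fixed percentage of rows
UPDATE STATISTICS [Sales].[Invoices] WITH SAMPLE 50 PERCENT;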

You can manually verify that statistics are being kept up to date by the Query Optimizer when the auto update statistics setting is enabled. The sys.dm_db_stats_properties DMF accepts an object_id and a stats_id, which is functionally the same as the index_id if the statistics object corresponds to an index. The sys.dm_db_stats_properties DMF returns information such as the modification_counter of rows changed since the last statistics update, and the last_updated date, which is NULL if the statistics object has never been updated since it was created.

Not all statistics are associated with an index, such as statistics that are automatically created. There will generally be more statistics objects than index objects. This function works in SQL Server and Azure SQL Database. You can easily tell if a statistics object (which you can gather from querying sys.stats) is automatically created by its naming convention, WA_Sys_<column_name>_<object_id_hex>, or by looking at the user_created and auto_created columns in the same view.
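
The following sketch, run in the WideWorldImporters sample database, lists the statistics objects on the Sales.Orders table along with their freshness information:

SELECT stats_name = s.name, s.auto_created, s.user_created
, sp.last_updated, sp.rows, sp.rows_sampled, sp.modification_counter
FROM sys.stats AS s
CROSS APPLY sys.dm_db_stats_properties(s.object_id, s.stats_id) AS sp
WHERE s.object_id = OBJECT_ID('Sales.Orders');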

  • For more on statistics objects and their impact on performance, see Chapter 15.

Reorganize columnstore indexes

You must also maintain columnstore indexes, but these use different internal objects to measure the fragmentation of the internal columnstore structure. Columnstore indexes need only the REORGANIZE operation. For more on designing columnstore indexes, see Chapter 15.

You can review the current structure of a columnstore index’s row groups by using the DMV sys.dm_db_column_store_row_group_physical_stats. This returns one row per row group of the columnstore structure. The state of a row group, and the current count of row groups by their states, provides some insight into the health of the columnstore index. Most row group states should be COMPRESSED. Row groups in the OPEN and CLOSED states are part of the delta store and are awaiting compression. These delta store row groups are served up alongside compressed data seamlessly when queries use columnstore data.

The number of deleted rows in a row group is also an indication that the index needs maintenance. As the ratio of deleted rows to total rows in a COMPRESSED row group increases, the performance of the columnstore index degrades. If deleted_rows is a large share of total_rows in a row group, a REORGANIZE step will be beneficial.
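
The following sketch summarizes row group health in the current database, surfacing the deleted-row ratio discussed above:

SELECT object_name = OBJECT_NAME(object_id), index_id, row_group_id, state_desc
, total_rows, deleted_rows
, deleted_pct = 100. * deleted_rows / NULLIF(total_rows, 0)
FROM sys.dm_db_column_store_row_group_physical_stats
ORDER BY deleted_pct DESC;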

Performing a REBUILD operation on a columnstore index is essentially the same as performing a drop/re-create and is not necessary. However, if you want to force the rebuild process, using the WITH (ONLINE = ON) syntax is supported starting with SQL Server 2019 for rebuilding (and creating) columnstore indexes. A REORGANIZE step for a columnstore index, just as for a nonclustered index, is an online operation that has minimal impact to concurrent queries.

You can also use the REORGANIZE WITH (COMPRESS_ALL_ROW_GROUPS=ON) option to force all delta store row groups to be compressed into a single compressed row group. This can be useful when you observe many compressed row groups with fewer than 100,000 rows.

Without COMPRESS_ALL_ROW_GROUPS, only row groups that are already compressed are merged and recompressed. Typically, compressed row groups should contain up to one million rows each, but SQL Server might create compressed row groups whose sizes reflect how the rows were inserted, especially if they were inserted in bulk operations.
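
A minimal example follows; the columnstore index name is hypothetical:

ALTER INDEX CCI_OrderLines ON Sales.OrderLines
REORGANIZE WITH (COMPRESS_ALL_ROW_GROUPS = ON);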

  • We talk more about automating index maintenance in Chapter 9.

Manage database file sizes

It is important to understand the distinction between the size of a database’s data and log files, which act simply as reservations for SQL Server to work in, and the amount of data within those reservations. Note that this section does not apply to Azure SQL Database, where this level of file management is not available and is handled automatically.

In SQL Server Management Studio (SSMS), you can right-click a database, select Reports, and choose Disk Usage to view the Disk Usage report for a database. It contains information about how much data is in the database’s files.

Alternatively, the following query uses the FILEPROPERTY function to reveal how much data there is inside a file reservation. We again use the undocumented but well-understood sp_msforeachdb stored procedure to iterate through each of the databases, accessing the sys.database_files catalog view.

DECLARE @FILEPROPERTY TABLE
( DatabaseName sysname
,DatabaseFileName nvarchar(500)
,FileLocation nvarchar(500)
,FileId int
,[type_desc] varchar(50)
,FileSizeMB decimal(19,2)
,SpaceUsedMB decimal(19,2)
,AvailableMB decimal(19,2)
,FreePercent decimal(19,2) );
INSERT INTO @FILEPROPERTY
exec sp_MSforeachdb 'USE [?];
SELECT
 Database_Name                   = d.name
, Database_Logical_File_Name     = df.name
, File_Location                  = df.physical_name
, df.File_ID
, df.type_desc
, FileSize_MB = CAST(size/128.0 as Decimal(19,2))
, SpaceUsed_MB = CAST(CAST(FILEPROPERTY(df.name, "SpaceUsed") AS int)/128.0 AS decimal(19,2))
, Available_MB = CAST(size/128.0 - CAST(FILEPROPERTY(df.name, "SpaceUsed") AS int)/128.0 AS decimal(19,2))
, FreePercent = CAST((((size/128.0) - (CAST(FILEPROPERTY(df.name, "SpaceUsed") AS int)*8/1024.0)) / (size*8/1024.0)) * 100. AS decimal(19,2))
 FROM sys.database_files as df
 CROSS APPLY sys.databases as d
 WHERE d.database_id = DB_ID();'
SELECT * FROM @FILEPROPERTY
WHERE SpaceUsedMB is not null
ORDER BY FreePercent asc; --Find files with least amount of free space at top

Run this in your environment to see how much data there is within each database file. You might find that some data or log files are nearly full, whereas others have a large amount of free space. Why would this be?

Files that have a large amount of free space might have grown in the past but have since been emptied out. If a transaction log in the full recovery model has grown for a long time without having a transaction log backup, the .ldf file will have grown unchecked. Later, when a transaction log backup is taken, causing the log to truncate, it will be nearly empty, but the size of the .ldf file itself will not have changed. It isn’t until you perform a shrink operation that the .ldf file will give its unused space back to the operating system (OS). In most cases, you should never shrink a data file, and certainly not on a schedule. The two main exceptions are if you mistakenly oversize a file or you applied data compression to a number of large database objects. In these cases, shrinking files as a one-time corrective action may be appropriate.

You should manually grow your database and log files to a size that is well ahead of the database’s growth pattern. You might fret over the best autogrowth rate, but ideally, autogrowth events are best avoided altogether by proactive file management.

Autogrowth events can be disruptive to user activity, causing transactions to wait while the database file asks the OS for more space and grows. Depending on the performance of the I/O system, this could take seconds, during which activity on the database must wait. Depending on the autogrowth setting and the size of the write transactions, multiple autogrowth events could occur in succession.

  • Growth of database data files is also greatly sped up by instant file initialization, which is covered in Chapter 3.

Understand and find autogrowth events

You should change autogrowth rates for database data and log files from the initial (and far too small) default settings, but, more importantly, you should maintain enough free space in your data and log files so that autogrowth events do not occur. As a proactive DBA, you should monitor the space in database files and grow the files ahead of time, manually and outside of peak business hours.
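
For example, the following statement proactively grows a data file and sets a fixed autogrowth increment; the logical file name is assumed from the WideWorldImporters sample database, and SIZE must be larger than the file's current size:

--Grow the data file ahead of demand and set a fixed autogrowth increment
ALTER DATABASE WideWorldImporters
MODIFY FILE (NAME = N'WWI_UserData', SIZE = 4096MB, FILEGROWTH = 512MB);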

You can view recent autogrowth events in a database via a report in SSMS or a T-SQL script (see the code example that follows). In SSMS, in Object Explorer, right-click the database name. Then, on the shortcut menu that opens, select Reports, select Standard Reports, and then select Disk Usage. An expandable/collapsible region of the report contains data/log files autogrow/autoshrink events.

The autogrowth report in SSMS reads data from the SQL Server instance’s default trace, which captures autogrowth events. This data is not captured by the default Extended Events session, called system_health, but you could capture autogrowth events with the sqlserver.database_file_size_change event in an Extended Event session.
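
A minimal sketch of such a session follows; the session name and target file name are illustrative:

CREATE EVENT SESSION [track_file_growth] ON SERVER
ADD EVENT sqlserver.database_file_size_change
ADD TARGET package0.event_file (SET filename = N'track_file_growth');
--Start collecting file size change events
ALTER EVENT SESSION [track_file_growth] ON SERVER STATE = START;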

To view and analyze autogrowth events more quickly, and for all databases simultaneously, you can query the SQL Server instance’s default trace yourself. The default trace files are limited to 20 MB, and there are at most five rollover files, yielding 100 MB of history. The amount of time this covers depends on server activity. The following sample code query uses the fn_trace_gettable() function to open the default trace file in its current location:

SELECT
DB = g.DatabaseName
, Logical_File_Name = mf.name
, Physical_File_Loc = mf.physical_name
, mf.type
-- The size in MB (converted from the number of 8-KB pages) the file increased.
, EventGrowth_MB = convert(decimal(19,2),g.IntegerData*8/1024.)
, g.StartTime --Time of the autogrowth event
-- Length of time (in seconds) necessary to extend the file.
, EventDuration_s = convert(decimal(19,2),g.Duration/1000./1000.)
, Current_Auto_Growth_Set = CASE
WHEN mf.is_percent_growth = 1
 THEN CONVERT(char(2), mf.growth) + '%'
 ELSE CONVERT(varchar(30), mf.growth*8./1024.) + 'MB'
END
, Current_File_Size_MB = CONVERT(decimal(19,2),mf.size*8./1024.)
, d.recovery_model_desc
FROM fn_trace_gettable(
(select substring((SELECT path
FROM sys.traces WHERE is_default =1), 0, charindex('log_',
(SELECT path FROM sys.traces WHERE is_default =1),0)+4)
+ '.trc'), DEFAULT) AS [g]
INNER JOIN sys.master_files mf
ON mf.database_id = g.DatabaseID
AND g.FileName = mf.name
INNER JOIN sys.databases d
ON d.database_id = g.DatabaseID
ORDER BY StartTime desc;

Understanding autogrowth events helps explain what happens to database files when they don’t have enough space. They must grow, or transactions cannot be accepted. What about the opposite scenario, where a database file has “too much” space? We cover that next.

Shrink database files

We need to be as clear as possible about this: Shrinking database files is not something that you should do regularly or casually. If you find yourself every morning shrinking a database file that grew overnight, stop. Think. Isn’t it just going to grow again tonight?

One of the main concerns with shrinking a file is that it indiscriminately moves pages from the end of the file into free space earlier in the file before returning that space to the OS, creating index fragmentation. Aside from practically guaranteeing autogrowth events in the future, shrinking a file creates the need for further index maintenance to alleviate that fragmentation. A shrink step can be time consuming, can block other user activity, and is not part of a healthy, complete maintenance plan.

Database data and logs under normal circumstances—and in the case of the full recovery model with regular transaction log backups—grow to the size they need to be because of actual usage. Frequent autogrowth events and shrink operations are bad for performance and create fragmentation.

To increase the concurrency of shrink operations by allowing DBCC SHRINKDATABASE and DBCC SHRINKFILE to patiently wait for locks, SQL Server 2022 introduces the WAIT_AT_LOW_PRIORITY syntax. The same keyword has a similar application for online index maintenance commands and behaves similarly. When you specify WAIT_AT_LOW_PRIORITY, the shrink operation waits until it can claim the schema stability (Sch-S) and schema modification (Sch-M) locks it needs. Other queries won’t be blocked until the shrink can actually proceed, resulting in less potential for blocked queries. The WAIT_AT_LOW_PRIORITY option is less configurable for the two shrink commands and is hard-coded to a 1-minute timeout. If after 1 minute the shrink operation cannot obtain the necessary locks to proceed, it is cancelled.

Shrink data files

Try to proactively grow database files to avoid autogrowth events altogether. You should shrink a data file only as a one-time event to solve one of three scenarios:

  • A drive volume is out of space and, in an emergency break-fix scenario, you reclaim unused space from a database data or log file.

  • A database transaction log grew to a much larger size than is normally needed because of an adverse condition and should be reduced back to its normal operating size. An adverse condition could be a transaction log backup that stopped working for a timespan, a large uncommitted transaction, or a replication or availability group issue that prevented the transaction log from truncating.

  • For the rare situation in which a database had a large amount of data deleted from the file, an amount of data that is unlikely ever to exist in the database again, a one-time shrink file operation might be appropriate.

Shrink transaction log files

For the case in which a transaction log file should be reduced in size, the best way to reclaim the space and re-create the file with optimal virtual log file (VLF) alignment is to first create a transaction log backup to truncate the log file as much as possible. If transaction log backups have not recently been generated on a schedule, it may be necessary to create another transaction log backup to fully clear out the log file. Once empty, shrink the log file to reclaim all unused space, then immediately grow the log file back to its expected size in increments of no more than 8,000 MB at a time. This allows SQL Server to create the underlying VLF structures in the most efficient way possible.

  • For more information on VLFs in your database log files, see Chapter 3.

The following sample script of this process assumes a transaction log backup has already been generated to truncate the database transaction log and that the database log file is mostly empty. It then grows the transaction log file to 9 GB (9,216 MB or 9,437,184 KB). Note the intermediate step of growing the file first to 8,000 MB, then to its intended size.

USE [WideWorldImporters];
--TRUNCATEONLY returns all free space to the OS
DBCC SHRINKFILE (N'WWI_Log' , 0, TRUNCATEONLY);
GO
USE [master];
ALTER DATABASE [WideWorldImporters]
MODIFY FILE ( NAME = N'WWI_Log', SIZE = 8192000KB );
ALTER DATABASE [WideWorldImporters]
MODIFY FILE ( NAME = N'WWI_Log', SIZE = 9437184KB );
GO

Caution

You should never enable the autoshrink database setting. It automatically returns any free space of more than 25 percent of the data file or transaction log. You should shrink a database only as a one-time operation to reduce file size after unplanned or unusual file growth. This setting could result in unnecessary fragmentation, overhead, and frequent rapid log autogrowth events. This setting was originally intended, and might only be appropriate, for tiny local and/or embedded databases.

Monitor activity with DMOs

SQL Server provides a suite of internal dynamic management objects (DMOs) in the form of views (DMVs) and functions (DMFs). It is important for you as a DBA to have a working knowledge of these objects because they unlock the analysis of SQL Server outside of built-in reporting capabilities and third-party tools. In fact, third-party tools that monitor SQL Server almost certainly use these very dynamic management objects.

DMO queries are discussed in several other places in this book:

  • Chapter 14 discusses reviewing, aggregating, and analyzing cached execution plan statistics, including the Query Store feature introduced in SQL Server 2016.

  • Chapter 14 also discusses reporting from DMOs and querying performance monitor metrics within SQL Server DMOs.

  • Chapter 15 covers index usage statistics and missing index statistics.

  • Chapter 11 details high availability and disaster recovery features like automatic seeding.

  • The section “Monitor index fragmentation” earlier in this chapter talked about using a DMF to query index fragmentation.

Observe sessions and requests

Any connection to a SQL Server instance is a session and is reported live in the DMV sys.dm_exec_sessions. Any actively running query on a SQL Server instance is a request and is reported live in the DMV sys.dm_exec_requests. Together, these two DMVs provide a thorough and far more detailed replacement for the sp_who or sp_who2 system stored procedures, as well as the deprecated sys.sysprocesses system view, with which longtime DBAs might be more familiar. With DMVs, you can do so much more than replace sp_who.

By adding a handful of other DMOs, we can turn this query into a wealth of live information, including:

  • Complete connection source information

  • The actual runtime statement currently being run (like DBCC INPUTBUFFER, but not limited to 254 characters)

  • The actual plan XML (provided with a blue hyperlink in the SSMS results grid)

  • Request duration

  • Cumulative resource consumption

  • The current and most recent wait types experienced

Sure, it might not be as easy to type in as sp_who2, but it provides much more data, which you can easily query and filter. Save this as a go-to script in your personal DBA tool belt. If you are unfamiliar with any of the data being returned, take some time to dive into the result set and explore the information it provides; it will be an excellent hands-on learning resource. You might choose to add more filters to the WHERE clause specific to your environment. Let’s take a look:

SELECT
 when_observed = sysdatetime()
, s.session_id, r.request_id
, session_status = s.[status] -- running, sleeping, dormant, preconnect
, request_status = r.[status] -- running, runnable, suspended, sleeping, background
, blocked_by = r.blocking_session_id
, database_name = db_name(r.database_id)
, s.login_time, r.start_time
, query_text = CASE
 WHEN r.statement_start_offset = 0
 and r.statement_end_offset= 0 THEN left(est.text, 4000)
 ELSE SUBSTRING (est.[text], r.statement_start_offset/2 + 1,
 CASE WHEN r.statement_end_offset = -1
    THEN LEN (CONVERT(nvarchar(max), est.[text]))
    ELSE r.statement_end_offset/2 - r.statement_start_offset/2 + 1
 END
) END --the actual query text is stored as nvarchar,
--so we must divide by 2 for the character offsets
, qp.query_plan
, cacheobjtype = LEFT (p.cacheobjtype + ' (' + p.objtype + ')', 35)
, est.objectid
, s.login_name, s.client_interface_name
, endpoint_name = e.name, protocol = e.protocol_desc
, s.host_name, s.program_name
, cpu_time_ms = r.cpu_time, elapsed_time_ms = r.total_elapsed_time
, wait_time_ms = r.wait_time, r.wait_type, r.wait_resource, r.last_wait_type
, r.reads, r.writes, r.logical_reads --accumulated request statistics
FROM sys.dm_exec_sessions as s
LEFT OUTER JOIN sys.dm_exec_requests as r on r.session_id = s.session_id
LEFT OUTER JOIN sys.endpoints as e ON e.endpoint_id = s.endpoint_id
LEFT OUTER JOIN sys.dm_exec_cached_plans as p ON p.plan_handle = r.plan_handle
OUTER APPLY sys.dm_exec_query_plan (r.plan_handle) as qp
OUTER APPLY sys.dm_exec_sql_text (r.sql_handle) as est
LEFT OUTER JOIN sys.dm_exec_query_stats as stat on stat.plan_handle = r.plan_handle
AND r.statement_start_offset = stat.statement_start_offset
AND r.statement_end_offset = stat.statement_end_offset
WHERE 1=1 --Veteran trick that makes for easier commenting of filters
AND s.session_id >= 50 --retrieve only user spids
AND s.session_id <> @@SPID --ignore this session
ORDER BY r.blocking_session_id desc, s.session_id asc;

Notice that the preceding script returned wait_type and last_wait_type. Let’s dive into these important performance signals now.

Understand wait types and wait statistics

Wait statistics in SQL Server are an important source of information and can be a key resource for finding bottlenecks in performance at the aggregate level and at the individual query level. A wait is a signal recorded by SQL Server indicating what SQL Server is waiting on when attempting to finish processing a query. This section provides insights into this broad and important topic. However, entire books, training sessions, and software packages have been developed to address wait type analysis.

Wait statistics can be queried and provide value for SQL Server instances as well as databases in Azure SQL Database and Azure SQL Managed Instance, though there are some waits specific to the Azure SQL Database platform (which we’ll review). As with many DMOs, membership in the sysadmin server role is not required; only the VIEW SERVER STATE permission is needed, or, in the case of Azure SQL Database, VIEW DATABASE STATE.

You saw in the query in the previous section the ability to see the current and most recent wait type for a session. Let’s dive into how to observe wait types in the aggregate, accumulated at the server level or at the session level. Waits can occur when a request is in the runnable or suspended state. SQL Server can track many different wait types for a single query, many of which are of negligible duration or are benign in nature. There are quite a few waits that can be ignored or that indicate idle activity, as opposed to waits that indicate resource constraints and blocking. There are more than 1,000 distinct wait types in SQL Server 2022 and even more in Azure SQL Database. Some are better documented and understood than others. We review some that you should know about later in this section.

Monitor wait type aggregates

To view accumulated waits for a session, which live only until the close or reset of the session, use the sys.dm_exec_session_wait_stats DMV.

In sys.dm_exec_sessions, you can see the current wait type and most recent wait type, but this isn’t always that interesting or informative. Potentially more interesting is seeing all the accumulated wait stats for an ongoing session. This code sample shows how the DMV returns one row per session, per wait type experienced:

SELECT * FROM sys.dm_exec_session_wait_stats
ORDER BY wait_time_ms DESC;

There is a distinction between the two time measurements in this query and others. The value from signal_wait_time_ms indicates the amount of time the thread waited on CPU activity, correlated with time spent in the runnable state. The wait_time_ms value indicates the accumulated time in milliseconds for the wait type, including the signal_wait_time_ms, and so includes time the request spent in the runnable and suspended states. Typically, wait_time_ms is the wait measurement that we aggregate. The waiting_tasks_count is also informative, indicating the number of times this wait_type was encountered. By dividing wait_time_ms by waiting_tasks_count, you can get an average number of milliseconds (ms) each task encountered this wait.

You can view aggregate wait types at the instance level with the sys.dm_os_wait_stats DMV. It returns the same information as sys.dm_exec_session_wait_stats, but without the session_id: it covers all activity on the SQL Server instance, without any granularity by database, query, time frame, and so on. This can be useful for getting the “big picture,” but it is limited over long spans of time because the wait_time_ms counter only accumulates, as illustrated here:

SELECT TOP (25)
 wait_type
, wait_time_s = wait_time_ms / 1000.
, Pct = 100. * wait_time_ms/nullif(sum(wait_time_ms) OVER(),0)
, avg_ms_per_wait = wait_time_ms / nullif(waiting_tasks_count,0)
FROM sys.dm_os_wait_stats as wt ORDER BY Pct DESC;

Eventually, the wait_time_ms numbers will be so large for certain wait types that trends or changes in wait type accumulations rates will be mathematically difficult to see. You want to use the wait stats to keep a close eye on server performance as it trends and changes over time, so you need to capture these accumulated wait statistics in chunks of time, such as one day or one week.

--Script to set up capturing these statistics over time
CREATE TABLE dbo.usr_sys_dm_os_wait_stats
(   id int NOT NULL IDENTITY(1,1)
,   datecapture datetimeoffset(0) NOT NULL
,   wait_type nvarchar(512) NOT NULL
,   wait_time_s decimal(19,1) NOT NULL
,   Pct decimal(9,1) NOT NULL
,   avg_ms_per_wait decimal(19,1) NOT NULL
,   CONSTRAINT PK_sys_dm_os_wait_stats PRIMARY KEY CLUSTERED (id)
);
--This part of the script should be in a SQL Agent job, run regularly
INSERT INTO
dbo.usr_sys_dm_os_wait_stats
(datecapture, wait_type, wait_time_s, Pct, avg_ms_per_wait)
SELECT
datecapture = SYSDATETIMEOFFSET()
, wait_type
, wait_time_s = convert(decimal(19,1), round( wait_time_ms / 1000.0,1))
, Pct = 100. * wait_time_ms / nullif(sum(wait_time_ms) OVER(),0)
, avg_ms_per_wait = wait_time_ms / nullif(waiting_tasks_count,0)
FROM sys.dm_os_wait_stats wt
WHERE wait_time_ms > 0
ORDER BY wait_time_s;

Using the metrics returned in the preceding code, you can calculate the difference between always-ascending wait times and counts to determine the counts between intervals. You can customize the schedule for this data to be captured in tables, building your own internal wait stats reporting table.
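
Here is a minimal sketch of such a delta query. It assumes the dbo.usr_sys_dm_os_wait_stats table from the preceding script and uses LAG to compare each capture to the previous one; a negative delta indicates the instance was restarted and the counters were reset.

--Compare each capture to the previous capture for the same wait type
SELECT wait_type
, datecapture
, wait_time_s_delta = wait_time_s
    - LAG(wait_time_s) OVER (PARTITION BY wait_type ORDER BY datecapture)
FROM dbo.usr_sys_dm_os_wait_stats
ORDER BY wait_type, datecapture;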

The sys.dm_os_wait_stats DMV is reset—and all accumulated metrics are lost—upon restart of the SQL Server service, but you can also clear the accumulated statistics manually. Understandably, this clears the statistics for the whole SQL Server instance. Here is the command to clear the wait statistics manually, for example at the start of a new measurement interval:

DBCC SQLPERF ('sys.dm_os_wait_stats', CLEAR);

You can also view wait information for currently running requests in the DMV sys.dm_os_waiting_tasks, which contains more data than simply the wait_type; it also shows the blocking resource address in the resource_description column. This data is also available in sys.dm_exec_requests.

The Query Store also tracks aggregated wait statistics for the queries it captures. The waits tracked by the Query Store are not as detailed as those in the DMVs, but they do give you a quick idea of what a query is waiting on.

  • For more information on reviewing waits in the Query Store, see Chapter 14.
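
If you’d like a quick preview before Chapter 14, the following sketch aggregates Query Store wait categories for the current database. It assumes the Query Store is enabled and that you are on SQL Server 2017 or later, where the sys.query_store_wait_stats catalog view is available:

--Top wait categories recorded by the Query Store for the current database
SELECT TOP (10)
  ws.wait_category_desc
, total_wait_time_ms = SUM(ws.total_query_wait_time_ms)
FROM sys.query_store_wait_stats AS ws
GROUP BY ws.wait_category_desc
ORDER BY total_wait_time_ms DESC;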

Understand wait resources

What if you observe a query wait occurring live, and want to figure out what data the query is actually waiting on?

SQL Server 2019 delivered some new tools to explore the archaeology involved in identifying the root of waits. While an exhaustive look at the different wait resource types—some more cryptic than others—is best documented in Microsoft’s online resources, let’s review the tools provided.

The undocumented DBCC PAGE command (and its accompanying Trace Flag 3604) was used for years to review the information contained in a page, based on a specific page number. Whether trying to see the data at the source of waits or trying to peek at corrupted pages reported by DBCC CHECKDB, the DBCC PAGE command didn’t return any visible data without first enabling Trace Flag 3604. Now, for some cases, we have a pair of newer functions: sys.dm_db_page_info and sys.fn_PageResCracker. Both can be used only when the sys.dm_exec_requests.wait_resource value begins with PAGE, so these new tools leave out other common wait_resource types like KEY.

The DMVs in SQL Server 2019 and SQL Server 2022 are preferable to using DBCC PAGE because they are fully documented and supported. They can be combined with sys.dm_exec_requests—the hub DMV for all things active in SQL Server—to return potentially useful information about the object in contention when PAGE blocking is present:

SELECT r.request_id, pi.database_id, pi.file_id, pi.page_id, pi.object_id,
pi.page_type_desc, pi.index_id, pi.page_level, rows_in_page = pi.slot_count
FROM sys.dm_exec_requests AS r
CROSS APPLY sys.fn_PageResCracker (r.page_resource) AS prc
CROSS APPLY sys.dm_db_page_info(r.database_id, prc.file_id, prc.page_id, 'DETAILED') AS pi;

Benign wait types

Many of the waits in SQL Server do not affect the performance of user workload. These waits are commonly referred to as benign waits and are frequently excluded from queries analyzing wait stats. The following code contains a starter list of wait types that you can mostly ignore when querying the sys.dm_os_wait_stats DMV for aggregate wait statistics. You can append this sample WHERE clause to your own wait stats queries.

SELECT * FROM sys.dm_os_wait_stats AS wt
WHERE
    wt.wait_type NOT LIKE '%SLEEP%' --can be safely ignored, sleeping
AND wt.wait_type NOT LIKE 'BROKER%' -- internal process
AND wt.wait_type NOT LIKE '%XTP_WAIT%' -- for memory-optimized tables
AND wt.wait_type NOT LIKE '%SQLTRACE%' -- internal process
AND wt.wait_type NOT LIKE 'QDS%' -- asynchronous Query Store data
AND wt.wait_type NOT IN ( -- common benign wait types
'CHECKPOINT_QUEUE'
,'CLR_AUTO_EVENT','CLR_MANUAL_EVENT' ,'CLR_SEMAPHORE'
,'DBMIRROR_DBM_MUTEX','DBMIRROR_EVENTS_QUEUE','DBMIRRORING_CMD'
,'DIRTY_PAGE_POLL'
,'DISPATCHER_QUEUE_SEMAPHORE'
,'FT_IFTS_SCHEDULER_IDLE_WAIT','FT_IFTSHC_MUTEX'
,'HADR_FILESTREAM_IOMGR_IOCOMPLETION'
,'KSOURCE_WAKEUP'
,'LOGMGR_QUEUE'
,'ONDEMAND_TASK_QUEUE'
,'REQUEST_FOR_DEADLOCK_SEARCH'
,'XE_DISPATCHER_WAIT','XE_TIMER_EVENT'
--Ignorable HADR waits
, 'HADR_WORK_QUEUE'
,'HADR_TIMER_TASK'
,'HADR_CLUSAPI_CALL');

Through your own research into your workload, and as more wait types are added in future versions of SQL Server, you can grow this list so that important and actionable wait types rise to the top of your queries. A prevalence of these benign wait types shouldn’t be a concern; they’re unlikely to be generated by or to negatively affect user requests.

Wait types to be aware of

This section shouldn’t be the start and end of your understanding of or research into wait types. Many of them have multiple possible causes to explore in your SQL Server instance, or at the very least, names that mislead the DBA about their origin. There are some, or groups of some, that you should understand, because they indicate a condition worth investigating. Many wait types are always present in all applications but become problematic when they appear with high frequency and/or with large cumulative waits. Large here is of course relative to your workload and your server.

Different instance workloads will have different profiles of wait types. Just because a wait type is at the top of the aggregate sys.dm_os_wait_stats list doesn’t mean it is the main or only performance problem with a SQL Server instance. It is likely that all SQL Server instances, even finely tuned instances, will show these wait types near the top of the aggregate waits list. You should track and trend these wait stats, perhaps using the script example in the previous section.

Important waits include the following, provided in alphabetical order:

  • ASYNC_NETWORK_IO. This wait type is associated with the retrieval of data to a client (including SQL Server Management Studio and Azure Data Studio), and the wait while the remote client receives and finally acknowledges the data received. This wait almost certainly has very little to do with network speed, network interfaces, switches, or firewalls. Any client, including your workstation or even SSMS running locally on the server, can incur small amounts of ASYNC_NETWORK_IO as data is retrieved to be processed. Transactional and snapshot replication distribution will incur ASYNC_NETWORK_IO. You will see a large amount of ASYNC_NETWORK_IO generated by reporting applications such as Tableau, SSRS, SQL Server Profiler, and Microsoft Office products. The next time a rudimentary Access database application tries to load the entire contents of the Sales.OrderLines table, you’ll likely see ASYNC_NETWORK_IO.

    Reducing ASYNC_NETWORK_IO, like many of the waits we discuss in this chapter, has little to do with hardware purchases or upgrades; it has more to do with poorly designed queries and applications. The solution, therefore, would be an application change. Try suggesting to developers or client applications incurring large amounts of ASYNC_NETWORK_IO that they eliminate redundant queries, use server-side filtering as opposed to client-side filtering, use server-side data paging as opposed to client-side data paging, or use client-side caching.

  • CXPACKET. A common and often-overreacted-to wait type, CXPACKET is a parallelism wait. In a vacuum, execution plans that are created with parallelism run faster. But at scale, with many execution plans running in parallel, the server’s resources might take longer to process the requests. This wait is measured in part as CXPACKET waits.

    When the CXPACKET wait is the predominant wait type experienced over time by your SQL Server, you should consider turning both the Maximum Degree of Parallelism (MAXDOP) and Cost Threshold for Parallelism (CTFP) dials when performance tuning. Make these changes in small, measured gestures, and don’t overreact to performance problems with a small number of queries. Use the Query Store to benchmark and trend the performance of high-value and high-cost queries as you change configuration settings.

    If large queries are already a problem for performance and multiple large queries regularly run simultaneously, raising the CTFP might not solve the problem. In addition to the obvious solutions of query tuning and index changes, including the creation of columnstore indexes, use MAXDOP as well to limit parallelization for very large queries.

    Until SQL Server 2016, MAXDOP was either a setting at the server level, a setting enforced at the query level, or a setting enforced on sessions selectively via Resource Governor (more on this toward the end of this chapter in the section “Protect important workloads with Resource Governor”). Since SQL Server 2016, the MAXDOP setting has also been available as a database-scoped configuration. You can also use the MAXDOP query hint in any statement to override the database-level or server-level MAXDOP setting. A short configuration sketch covering these options appears after this list.

  • IO_COMPLETION. This wait type is associated with synchronous read and write operations that are not related to row data pages, such as reading log blocks or virtual log file (VLF) information from the transaction log, or reading or writing merge join operator results, spools, and buffers to disk. It is difficult to associate this wait type with a single activity or event, but a spike in IO_COMPLETION could be an indication that these same events are now waiting on the I/O system to complete.

  • LCK_M_*. Lock waits have to do with blocking and concurrency (or lack thereof). (Chapter 14 looks at isolation levels and concurrency.) When a request is writing and another request in READ COMMITTED or higher isolation is trying to lock that same row data, one of the 60+ different LCK_M_* wait types will be the reported wait type of the blocked request. For example, LCK_M_IS means that the thread wants to acquire an Intent Shared lock, but some other thread has it locked in an incompatible manner.

    In the aggregate, this doesn’t mean you should reduce the isolation level of your transactions. Whereas READ UNCOMMITTED is not a good solution, read committed snapshot isolation (RCSI) and snapshot isolation are good solutions; see Chapter 14 for more details. Rather, you should optimize execution plans for efficient access, for example, by reducing scans as well as avoiding long-running multistep transactions. Also, avoid index rebuild operations without the ONLINE option. (See the “Rebuild indexes” section earlier in this chapter for more information.)

    The wait_resource provided in sys.dm_exec_requests, or resource_description in sys.dm_os_waiting_tasks, provide a map to the exact location of the lock contention inside the database.

  • MEMORYCLERK_XE. The MEMORYCLERK_XE wait type could spike if you have allowed Extended Events session targets to consume too much memory. We discuss Extended Events later in this chapter, but you should watch out for the maximum buffer size allowed to the ring_buffer session target, among other in-memory targets.

  • OLEDB. This self-explanatory wait type describes waits associated with long-running external communication via the OLE DB provider, which is commonly used by SQL Server Integration Services (SSIS) packages, Microsoft Office applications (including querying Excel files), linked servers using the OLE DB provider, and third-party tools. It could also be generated by internal commands like DBCC CHECKDB. When you observe this wait occurring in SQL Server, in most cases, it’s driven by long-running linked server queries.

  • PAGELATCH_* and PAGEIOLATCH_*. These two wait types are presented together not because they are similar in nature—they are not—but because they are often confused. To be clear, PAGELATCH has to do with contention over pages in memory, whereas PAGEIOLATCH relates to contention over pages in the I/O system (on the drive).

    PAGELATCH_* contention deals with pages in memory, which can rise because of the overuse of temporary objects in memory, potentially with rapid access to the same temporary objects. This can also be experienced when reading in data from an index in memory or reading from a heap in memory.

    A rise in PAGEIOLATCH_* could be due to the performance of the storage system (keeping in mind that the performance of drive systems does not always respond linearly to increases in activity). Aside from throwing (a lot of!) money at faster drives, a more economical solution is to modify queries and/or indexes and reduce the footprint of memory-intensive operations, especially operations involving index and table scans.

    PAGEIOLATCH_* contention has to do with a far more limiting and troubling performance condition: the overuse of reading from the slowest subsystem of all, the physical drives. PAGEIOLATCH_SH deals with reading data from a drive into memory so that the data can be accessed. Keep in mind that this doesn’t necessarily translate to a request’s row count, especially if index or table scans are required in the execution plan. PAGEIOLATCH_EX and PAGEIOLATCH_UP are waits associated with reading data from a drive into memory so that the data can be written to.

  • RESOURCE_SEMAPHORE. This wait type is accumulated when a request is waiting on memory to be allocated before it can start. Although this could be an indication of memory pressure caused by insufficient memory available to process the queries being executed, it is more likely caused by poor query design and poor indexing, resulting in inefficient execution plans. Aside from throwing money at more system memory, a more economical solution is to tune queries and reduce the footprint of memory-intensive operations. The memory grant feedback features that are part of Intelligent Query Processing help address these waits by improving memory grants for subsequent executions of a query.

  • SOS_SCHEDULER_YIELD. Another flavor of CPU pressure, and in some ways the opposite of the CXPACKET wait type, is the SOS_SCHEDULER_YIELD wait type. The SOS_SCHEDULER_YIELD is an indicator of CPU pressure, indicating that SQL Server had to share time with, or yield to, other CPU tasks, which can be normal and expected on busy servers. Whereas CXPACKET is SQL Server complaining about too many threads in parallel, SOS_SCHEDULER_YIELD is the acknowledgement that there were more runnable tasks for the available threads. In either case, first take a strategy of reducing CPU-intensive queries and rescheduling or optimizing CPU-intense maintenance operations. This is more economical than simply adding CPU capacity.

  • WAIT_XTP_RECOVERY. This wait type can occur when a database with memory-optimized tables is in recovery at startup and is expected. As with all wait types on performance-sensitive production SQL Server instances, you should baseline and measure it, but be aware this is not usually a sign of any problem.

  • WRITELOG. The WRITELOG wait type is likely to appear on any SQL Server instance, including availability group primary and secondary replicas, when there is heavy write activity. The WRITELOG wait is time spent flushing the transaction log to a drive and is due to physical I/O subsystem performance. On systems with heavy writes, this wait type is expected.

    You could consider re-creating the heavy-write tables as memory-optimized tables to increase the performance of writes. Memory-optimized tables optionally allow for delayed durability, which would resolve a bottleneck writing to the transaction log by using a memory buffer. For more information, see Chapter 14.

  • XE_FILE_TARGET_TVF and XE_LIVE_TARGET_TVF. These waits are associated with writing Extended Events sessions to their targets. A sudden spike in these waits would indicate that too much is being captured by an Extended Events session. Usually these aren’t a problem, because the asynchronous nature of Extended Events has a much lower impact than traces.
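
As mentioned in the CXPACKET discussion, the following is a minimal sketch of the three levels at which parallelism can be configured. The numeric values are examples only, not recommendations, and the final statement references the Sales.OrderLines table from the WideWorldImporters sample database; measure your workload before and after any change:

--Server level: raise the cost threshold for parallelism (example value only)
EXEC sys.sp_configure 'show advanced options', 1;
RECONFIGURE;
EXEC sys.sp_configure 'cost threshold for parallelism', 50;
RECONFIGURE;
--Database level (SQL Server 2016 and later): database-scoped MAXDOP
ALTER DATABASE SCOPED CONFIGURATION SET MAXDOP = 4;
--Query level: override MAXDOP for a single statement
SELECT COUNT(*) FROM Sales.OrderLines OPTION (MAXDOP 2);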

Monitor with the SQL Assessment API

The SQL Assessment API is a code-delivered method for programmatically evaluating SQL Server instance and database configuration. First introduced with SQL Server Management Objects (SMO) and the SqlServer PowerShell module in 2019, calls to the API can be used to evaluate alignment with best practices, and then can be scheduled to monitor regularly for variance.

You can use this API to assess SQL Servers starting with SQL Server 2012, for SQL Server on Windows and Linux. The assessment is performed by comparing SQL Server configuration to rules, stored as JSON files. Microsoft has provided a standard array of rules.

Use this standard JSON file as a template to assess your own best practices if you like. The assessment configuration files are organized into the following:

  • Probes. These usually contain a T-SQL query—for example, to return data from DMOs (discussed earlier in this chapter). You can also write probes against your own user tables to query for code-driven states or statuses that can be provided by applications, ETL packages, or custom error-handling.

  • Checks. These compare desired values with the actual values returned by probes. They include logical conditions and thresholds. Here, like with any SQL Server monitoring tool, you might want to change numeric thresholds to suit your SQL environment.

Get started with the SQL Assessment API

To begin evaluating default or custom rules against your SQL Server, you must verify the presence of .NET Framework 4.0 and the latest versions of SMO and the SqlServer PowerShell module.

If you want to try this out on a server with an installation of SQL Server 2022 and the latest PowerShell SqlServer module, this sample PowerShell script should work:

$InstanceName='sql2022' #This should be the name of your instance
Get-SqlInstance -ServerInstance $InstanceName | Invoke-SqlAssessment | Out-GridView

For servers with installations of SQL Server, SMO should already be present on the Windows Server. For your local administration machine or a centralized server from which you’ll monitor your SQL Server environment, you’ll need to install SMO and the latest PowerShell SqlServer module.

While a developer might use Visual Studio’s Package Manager, you do not need to install Visual Studio to install the SMO NuGet package. Instead, you can use the cross-platform nuget.exe command line utility. You can also install and use the command line interface tool dotnet.exe if desired.

After you download nuget.exe, follow these steps:

  1. Open a PowerShell window with Administrator permissions and navigate to the folder where you saved nuget.exe. Note that when calling an executable from a PowerShell window, you need to preface the executable with ./ (dot slash), which is a security mechanism carried over from UNIX systems. The command to download and install SMO via nuget.exe should take just a few seconds to complete:

    nuget install Microsoft.SqlServer.SqlManagementObjects

    Optionally, specify an installation location for the installation:

    nuget install Microsoft.SqlServer.SqlManagementObjects -OutputDirectory "c:\nuget\packages"

    SMO is maintained and distributed via a NuGet package at https://nuget.org/packages/Microsoft.SqlServer.SqlManagementObjects.

    In the installation directory, you’ll find several new folders. These version numbers are the latest at the time of writing and will likely be higher by the time you read this book.

    • System.Data.SqlClient.4.6.0

    • Microsoft.SqlServer.SqlManagementObjects.150.18147.0

  2. With SMO in place, the second step to using the SQL Assessment API is to install the SqlServer PowerShell module. If you don’t already have the latest version of this module installed, launch a PowerShell console as an administrator, launch Visual Studio Code as an administrator, or use your favorite PowerShell scripting environment. The following command will download and install the latest version, even if you have a previous version installed:

    Install-Module -Name SqlServer -AllowClobber -Force

    The AllowClobber parameter instructs Install-Module to overwrite cmdlet aliases already in place. Without AllowClobber, the installation will fail if it finds that the new module contains commands with the same name as existing commands.

    • For more information on installing and using the SqlServer PowerShell module, including how to install without Internet connectivity, see Chapter 9.

  3. With SMO and the SqlServer PowerShell module installed, you are ready to compare SQL Server health with the default rule set by a PowerShell command—for example, as we demonstrated at the start of this section:

    Get-SqlInstance -ServerInstance . | Invoke-SqlAssessment | Out-GridView

    Here, the period is shorthand for localhost, so this command executes an assessment against the API of the local default instance of SQL Server. To run the assessment against a remote SQL Server instance using Windows Authentication:

    Get-SqlInstance -ServerInstance servername | Invoke-SqlAssessment | `
    Out-GridView

    Or, for a named instance:

    Get-SqlInstance -ServerInstance servername\instancename | `
    Invoke-SqlAssessment | Out-GridView

    Out-GridView (which is part of the default Windows PowerShell cmdlets) pops open a new window to make it easier to review multiple findings than reading what could be many pages of results in a scrolling PowerShell console. You can output the data any way you like using common PowerShell cmdlets.

The output of Invoke-SqlAssessment includes a helpful link to Microsoft documentation on every finding, the severity of each finding, the name of the check that resulted in a finding, an ID for the check for more granular review, and a helpful message. Again, all of this is provided by Microsoft’s default ruleset, on which you can base your own custom checks and probes with custom severity and messages.

To use your own customized configuration file, you can use the -Configuration parameter:

Get-SqlInstance -ServerInstance servername | Invoke-SqlAssessment `
-Configuration "C:\toolbox\sqlassessment_api_config.json" | Out-GridView
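
To monitor regularly for variance, you can persist assessment results to a table and schedule the script. The following is a sketch only: it assumes a recent version of the SqlServer module that supports the -FlattenOutput switch, and the instance, database, schema, and table names are placeholders you should replace.

$InstanceName = 'sql2022'   #Placeholder instance name
Get-SqlInstance -ServerInstance $InstanceName |
    Invoke-SqlAssessment -FlattenOutput |
    Write-SqlTableData -ServerInstance $InstanceName -DatabaseName 'DBAdmin' `
        -SchemaName 'dbo' -TableName 'SqlAssessmentResults' -Force
#The -Force parameter creates the destination table if it does not already exist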

Use Extended Events

The Extended Events feature is the best way to “look live” at SQL Server activity, replacing deprecated traces. Even though the default Extended Events sessions are not yet complete replacements for the default system trace (we give an example a bit later), consider Extended Events for all new activity related to troubleshooting and diagnostic data collection. Microsoft has positioned Extended Events as the replacement for traces for more than a decade.

Note

The XEvent UI in SSMS is easier than ever to use, so if you haven’t switched to using Extended Events to do what you used to use traces for, the time is now!

We’ll assume you don’t have a lot of experience creating your own Extended Events sessions. Let’s become familiar with some of the most basic terminology for Extended Events; a minimal example session that ties these terms together follows the list:

  • Sessions. A set of data collection instructions that can be started and stopped; the new equivalent of a “trace.”

  • Events. Items selected from an event library. Events are what you may remember “tracing” with SQL Server Profiler. These are predetermined, detectable operations during runtime. Events you’ll most commonly want to look for include sql_statement_completed and sql_batch_completed—for example, for catching an application’s T-SQL code.

    Examples: sql_batch_starting, login, error_reported, sort_warning, table_scan

  • Actions. The headers of the columns of Extended Events data that describe an event, such as when the event happened, who and what called the event, its duration, the number of writes and reads, CPU time, and so on. In this way, actions are additional data captured when an event is recorded. In SSMS, Global Fields is the name for actions, which allow additional information to be captured for any event—for example, database_name or database_id.

    Examples: sql_text, batch_text, timestamp, session_id, client_hostname

  • Predicates. Filter conditions created on actions to limit the data captured. You can filter on any action or field returned by an event you have added to the session.

    Examples: database_id > 4, database_name = 'WideWorldImporters', is_system = 0

  • Targets. Where the data should be sent. You can watch detailed and “live” Extended Events data captured asynchronously in memory for any session. A session, however, can also have multiple targets, such as a ring_buffer (an in-memory buffer), an event_file (an .xel file on the server), or a histogram (an in-memory counter with grouping by actions). A session can have only one of each target.
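
To tie this terminology together, here is a minimal sketch of a session with one event, two actions, a predicate, and an event_file target. The session name and the one-second duration threshold are arbitrary examples:

CREATE EVENT SESSION [long_statements] ON SERVER
ADD EVENT sqlserver.sql_statement_completed(
    ACTION(sqlserver.sql_text, sqlserver.session_id)
    --Predicate: only statements longer than 1 second (duration is in microseconds)
    WHERE ([duration] > (1000000)))
ADD TARGET package0.event_file(SET filename = N'long_statements');
GO
ALTER EVENT SESSION [long_statements] ON SERVER STATE = START;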

SQL Server installs with three Extended Events sessions ready to view. Two of these, system_health and telemetry_xevents, start by default; the third, AlwaysOn_Health, starts when needed. These sessions provide basic coverage for system health, though they are not an exact replacement for the system default trace. (The default trace captures certain server events, such as file autogrowths, errors, and object changes, for troubleshooting purposes.) Do not stop or delete these sessions, which should start automatically.

Note

If the system_health, telemetry_xevents, and/or AlwaysOn_Health sessions are accidentally dropped from the server, you can find the scripts to re-create them for your instance in this file: instancepath\MSSQL\Install\u_tables.sql. Here’s an example: E:\Program Files\Microsoft SQL Server\MSSQL16.MSSQLSERVER\MSSQL\Install\u_tables.sql.

You’ll see the well-documented definitions of the two Extended Event sessions toward the bottom of the file. If you just want to see the script that created the definitions for the built-in Extended Events sessions, you can script them via SSMS by right-clicking the session, selecting Script Session As in the shortcut menu, choosing Create To, and specifying a destination for the script.

Note

Used to using SQL Server Profiler? The XEvent Profiler delivers an improved “tracing” experience that mimics the legacy SQL Server Profiler trace templates. Extended Events sessions provide a modern, asynchronous, and far more versatile replacement for SQL Server traces, which are, in fact, deprecated. For troubleshooting, debugging, performance tuning, and event gathering, Extended Events provide a faster and more configurable solution than traces.

View Extended Events data

The XEvent Profiler in SSMS is the perfect place to view Extended Events data. Since SQL Server Management Studio 17.3, the XEvent Profiler tool has been built in. You’ll find the XEvent Profiler in the SSMS Object Explorer window, just below the SQL Server Agent node. Figure 8-1 shows an example of the XE Profiler T-SQL session.

A screenshot of the XE Profiler window, with two panes, showing a stream of live events as captured by the T-SQL filter. The top pane shows the data stream, and the bottom pane shows details for each event.

Figure 8-1 The XE Profiler T-SQL live events display in SSMS, similar to the deprecated Profiler T-SQL trace template.

Note

Though not a full replacement for SSMS, Azure Data Studio also has capabilities for managing Extended Event sessions, via the SQL Server Profiler extension that can be quickly downloaded and added. Search for the “SQL Server Profiler” extension or add the “Admin Pack for SQL Server” extension via the Extensions Marketplace in Azure Data Studio.

An Extended Events session can generate simultaneous output to multiple destinations, only one of which closely resembles the .trc files of old.

You can create other targets for a session on the Data Storage page of the New Session dialog box in SSMS. To view data collected by a target, expand the session, right-click the target, and select View Target Data in the shortcut menu. (See Figure 8-2.)

A side-by-side look at the difference between right-clicking on the Extended Events session in SSMS and highlighting the option for Watch Live Data, and right-clicking on the Extended Events session’s target, and highlighting the option to View Target Data.

Figure 8-2 A side-by-side look at the difference between Watch Live Data on an Extended Events session and View Target Data on an Extended Events session target.

When viewing target data, you can right-click to re-sort, copy the data to the clipboard, and export most of the target data to .csv files for analysis in other software.

Unlike Watch Live Data, View Target Data does not refresh automatically. However, for some targets, you can configure SSMS to poll the target automatically by right-clicking the View Target Data window, selecting Refresh Interval in the shortcut menu, and choosing a refresh interval (between 5 seconds and 1 hour).

Note

Currently, there is no built-in way in SSMS to write Extended Events session data directly to a SQL Server table. However, the Watch Live Data interface provides easy point-and-click analysis, grouping, and filtering of live session data. We review the target types next. Take some time to explore the other available target types; they can easily and quickly reproduce your analysis of trace data written to SQL Server tables.

The section that follows presents a breakdown of the possible targets. Many of these do some of the heavy lifting that you might have done previously by writing or exporting SQL trace data to a table and then performing your own aggregations, counts, or data analysis. Remember: You don’t need to pick just one target type to collect data for your session.

Understand the variety of Extended Events targets

As mentioned, you can always watch detailed and “live” Extended Events data captured asynchronously in memory for any session through SSMS. You do this by right-clicking a session and selecting Watch Live Data from the shortcut menu. You’ll see asynchronously delivered detailed data, and you can customize the columns you see, apply filters on the data, and even create groups and on-the-fly aggregations, all by right-clicking inside the Live Data window.

The Live Data window, however, isn’t a target. The data isn’t saved anywhere outside the SSMS window, and you can’t look back at data you missed before launching Watch Live Data. You can create a session without a target, and Watch Live Data is all you’ll get, but often that is all you need for a quick observation.

Here is a summary of the Extended Event targets available to be created. Remember, you can create more than one target per session.

  • Event File target (.xel). Writes the event data to a physical file on a drive asynchronously. You can then open and analyze it later, much like deprecated trace files, or merge it with other .xel files to assist analysis. (In SSMS, select the File menu, select Open, and then select Merge Extended Events Files.)

    If you are using Azure SQL Database or Azure SQL Managed Instance and you would like to persist Extended Events data, the only available Event File target destination is Azure Blob Storage. You also need to create a credential using a shared access signature (SAS).

    When you view the event file data in SSMS (right-click the event file and select View Target Data), the data does not refresh live. Data continues to be written to the file behind the scenes while the session is running. So, to view the latest data, you must close the .xel file and open it again.

    By default, .xel files are written to the instancepath\MSSQL\Log folder.

  • Histogram target. Counts the number of times an event has occurred and bucketizes an action, storing the data in memory. For example, you can capture a histogram of sql_statement_completed broken down by the number of observed events by client-hostname action, or by the duration field.

    When configuring the histogram type target, you must choose a number of buckets (or slots, in the T-SQL syntax) that is greater than the number of unique values you expect for the action or field. If you’re bucketizing by a numeric value such as duration, be sure to provide a number of buckets larger than the largest duration you could capture over time. If the histogram runs out of buckets for new values for your action or field, it will not capture data for them.

    You can provide any number of histogram buckets, but the histogram target will round the number up to the nearest power of 2. Thus, if you provide a value of 10 buckets, you’ll see 16 buckets.

  • Pair matching or Pairing target. Used to match events, such as the start and end of a SQL Server batch execution, and find occasions when an event in a pair occurs without the other, such as sql_statement_starting and sql_statement_completed. Select a start and an end from the list of actions you’ve selected.

  • Ring_buffer target. Provides a fast, in-memory first in, first out (FIFO) asynchronous memory buffer to collect rapidly occurring events. Stored in a memory buffer, the data is never written to a drive, allowing for robust data collection with little performance overhead. The customizable dataset is provided in XML format and must be queried; a query sketch for reading a ring_buffer target follows this list. Because this data is in memory, you should be careful how high you configure the Maximum Buffer Memory size, and never set the size to 0 (unlimited).

  • Service Broker target. Used to send messages to a target service of a customizable message type.
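
As mentioned in the ring_buffer description, the in-memory data must be queried as XML. A minimal sketch for retrieving a session’s ring_buffer contents, using the built-in system_health session as an example, follows; you can then shred the XML with the .nodes() and .value() methods to pull out individual events:

SELECT target_xml = CAST(t.target_data AS xml)
FROM sys.dm_xe_session_targets AS t
INNER JOIN sys.dm_xe_sessions AS s
    ON s.address = t.event_session_address
WHERE s.name = 'system_health'
AND t.target_name = 'ring_buffer';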

Although the aforementioned targets are high-performing asynchronous targets, there are two synchronous targets:

  • Event Tracing for Windows (ETW) target. Used to gather SQL Server data, to be combined with Windows event log data, for troubleshooting and debugging Windows applications.

  • Event counter target. Counts the number of events in an Extended Events session. You use this to provide data for trending and later aggregate analysis. The resulting dataset has one row per event with a count. This data is stored in memory, so although it’s synchronous, you shouldn’t expect any noticeable performance impact.

Note

Be aware that the resource demand of synchronous targets might be more noticeable than that of the asynchronous targets.

Further, there are two session options that can affect the performance impact of an Extended Event session. The defaults are reasonably safe and are unlikely to result in noticeable performance overhead, so they don’t typically need to be changed. You might, however, want to change them if the event you’re trying to observe is rare, temporary, and outweighs your performance overhead concerns.

  • EVENT_RETENTION_MODE. Determines whether, under pressure, the Extended Events session can miss a captured event. The default, ALLOW_SINGLE_EVENT_LOSS, can let the target(s) miss a single event when the memory buffers used to stream the data to the target(s) are full.

    You can instead specify ALLOW_MULTIPLE_EVENT_LOSS, which further minimizes the potential for performance impact on the monitored server by allowing more events to be missed.

    Or you could specify NO_EVENT_LOSS, which does not allow events to be missed, even if memory buffers are full. All events are retained and presented to the target. While not the same as using a synchronous target, it can result in the same effect: Performance of the SQL Server could suffer under the weight of the Extended Event session. Using this option is not recommended.

  • MAX_DISPATCH_LATENCY. Determines the upper limit for when events are sent from the memory buffer to the target. By default, events are buffered in memory for up to 30 seconds before being sent to the targets. You could change this value to 1 to force data out of memory buffers faster, reducing the benefit of the memory buffers. A value of INFINITE or 0 allows for the retention of events until memory buffers are full or until the session closes.

Let’s look at querying Extended Events session data in T-SQL with a couple of practical common examples.

Use Extended Events to capture deadlocks

We’ve talked about viewing data in SSMS, so let’s review querying Extended Events data via T-SQL. Let’s query one of the default Extended Events sessions, system_health, for deadlocks.

Before SQL Server 2008, it was not possible to see the details of a deadlock after it had occurred. You had to see it coming, enabling one or more trace flags before the deadlock so that deadlocks would be captured to the SQL Server Error Log. With the system_health Extended Events session, a recent history of event data is captured, including deadlock graphs. This data is captured both to a ring_buffer target with a rolling 4-MB buffer and to an .xel file with a total of 20 MB in rollover files. Either target will contain the most recent occurrences of the xml_deadlock_report event, and although the ring_buffer is faster to read from, the .xel file by default contains more history. Further, the .xel file isn’t subject to the limitations of the 4-MB ring_buffer target or the potential for missed rows.

The T-SQL code sample that follows demonstrates the retrieval of the .xel file target as XML:

DECLARE @XELFile nvarchar(256), @XELFiles nvarchar(256)
           , @XELPath nvarchar(256);
--Get the folder path where the system_health .xel files are
SELECT     @XELFile =       CAST(t.target_data as XML)
          .value('EventFileTarget[1]/File[1]/@name', 'NVARCHAR(256)')
FROM sys.dm_xe_session_targets t
     INNER JOIN sys.dm_xe_sessions s
      ON s.address = t.event_session_address
WHERE s.name = 'system_health' AND t.target_name = 'event_file';
--Provide wildcard path search for all currently retained .xel files
SELECT @XELPath =
    LEFT(@XELFile, Len(@XELFile)-CHARINDEX('\',REVERSE(@XELFile)));
SELECT @XELFiles = @XELPath + '\system_health_*.xel';
--Query the .xel files for deadlock reports
SELECT DeadlockGraph = CAST(event_data AS XML)
     , DeadlockID = Row_Number() OVER(ORDER BY file_name, file_offset)
FROM sys.fn_xe_file_target_read_file(@XELFiles, null, null, null) AS F
WHERE event_data like '<event name="xml_deadlock_report%';

This example returns one row per captured xml_deadlock_report event and includes an XML document, which in SSMS Grid results will appear as a blue hyperlink. Select the hyperlink to open the XML document, which will contain complete details of all elements of the deadlock. If you want to see a deadlock graph, save this file as an .xdl file, and then open it in SSMS.

Use Extended Events to detect autogrowth events

The SQL Server default trace captures historical database data file and log file autogrowth events, but the default Extended Events sessions shipped with SQL Server do not. The Extended Events that capture autogrowth events are database_file_size_change and databases_log_file_size_changed. Both events capture autogrowths as well as manual file growths run by ALTER DATABASE ... MODIFY FILE statements, and include an event field called is_automatic to differentiate them. Additionally, the sql_text action identifies the query statement that prompted the autogrowth event.

The following is a sample T-SQL script to create a startup session that captures autogrowth events to an .xel event file (which is written to F:\DATA—you should change this to an appropriate directory on your system) and also a histogram target that counts the number of autogrowth instances per database:

CREATE EVENT SESSION [autogrowths] ON SERVER
ADD EVENT sqlserver.database_file_size_change(
 ACTION(package0.collect_system_time,sqlserver.database_id
,sqlserver.database_name,sqlserver.sql_text)),
ADD EVENT sqlserver.databases_log_file_size_changed(
 ACTION(package0.collect_system_time,sqlserver.database_id
,sqlserver.database_name,sqlserver.sql_text))
ADD TARGET package0.event_file(
--.xel file target
SET filename=N'F:\DATA\autogrowths.xel'),
ADD TARGET package0.histogram(
--Histogram target, counting events per database_name
SET filtering_event_name=N'sqlserver.database_file_size_change'
,source=N'database_name',source_type=(0))
--Start session at server startup
WITH (STARTUP_STATE=ON);
GO
--Start the session now
ALTER EVENT SESSION [autogrowths]
ON SERVER STATE = START;

Use Extended Events to detect page splits

As discussed, detecting page splits can be useful. You might choose to monitor page splits when load testing a table design with its intended workload, or when finding insert statements that cause the most fragmentation.

The following sample T-SQL script creates a startup session that captures page_split events to an .xel event file, and also a histogram target that counts the number of page splits per database:

CREATE EVENT SESSION [page_splits] ON SERVER
ADD EVENT sqlserver.page_split(
 ACTION(sqlserver.database_name,sqlserver.sql_text))
ADD TARGET package0.event_file(
SET filename=N'page_splits', max_file_size=(100)),
ADD TARGET package0.histogram(
SET filtering_event_name=N'sqlserver.page_split'
,source=N'database_id',source_type=(0))
--Start session at server startup
WITH (STARTUP_STATE=ON);
GO
--Start the session now
ALTER EVENT SESSION [page_splits] ON SERVER STATE = START;

  • Refer to the section “Track page splits” earlier in this chapter for more information, including how to prevent page splits.

Secure Extended Events

You can also think of Extended Events as a diagnostic tool for developers. Given knowledge of your own data classification and regulatory requirements, you should consider granting developers the necessary permissions, even if only temporarily.

There are certain sensitive values that cannot be captured with a trace or an Extended Events session. For example, when CREATE LOGIN creates a SQL-authenticated login, the value of the password is not captured.

To access Extended Events in SQL Server, a developer needs the ALTER ANY EVENT SESSION permission. This grants that person access to create Extended Events sessions by using T-SQL commands, but not to view server metadata in the New Extended Events Session Wizard in SSMS. For that, you need one further commonly granted developer permission: VIEW SERVER STATE.
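
For example, here is a sketch of the two grants just described, where [DevLogin] is a placeholder for an existing login on your instance:

--Both permissions are server-level and are granted to a login
GRANT ALTER ANY EVENT SESSION TO [DevLogin];
GRANT VIEW SERVER STATE TO [DevLogin];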

In Azure SQL Database, Extended Events have the same capability, but for developers to view Extended Events sessions, you must grant them an ownership-level permission, CONTROL DATABASE. However, we do not recommend this for developers or non-administrators in production environments.

Capture performance metrics with DMOs and data collectors

For years, server administrators have used the Windows Performance Monitor (perfmon.exe) application to visually track and collect performance information about server resources, application memory usage, disk response times, and so on. In addition to the live Performance Monitor graph, you can also configure Data Collector Sets in Performance Monitor to gather the same metrics over time.

SQL Server has many metrics made visible through DMVs as well. This book has neither the scope nor the space to investigate and explain every available performance metric, or even every useful one. Instead, this section reviews the tools and covers a sampling of important performance metrics.

These metrics exist at the OS or instance level, so this chapter does not review granular data for individual databases, workloads, or queries. However, identifying performance with isolated workloads in near-production systems is possible. Like aggregate wait statistics, there is significant value in trending these Performance Monitor metrics on server workloads, monitoring peak behavior metrics, and for immediate troubleshooting and problem diagnosis.

Query performance metrics with DMVs

Beyond the Windows Performance Monitor and Linux metrics, this chapter has already mentioned a DMV that exposes most of the performance metrics within SQL Server: sys.dm_os_performance_counters. It behaves the same in Windows and Linux, thanks to the magic of the SQL Platform Abstraction Layer (SQLPAL), which helps SQL Server look and act much the same way on both Windows and Linux.

There are some advantages to this DMV: you can combine it with other DMVs that report on system resource activity (check out sys.dm_os_sys_info, for example), and you can fine-tune the query for ease of monitoring and custom data collection. However, sys.dm_os_performance_counters does not currently have access to metrics outside the SQL Server instance categories, not even the most basic operating system metrics, like % Processor Time.

The following code sample uses sys.dm_os_performance_counters to return the operating system’s total memory, the instance’s current target server memory, total server memory, and page life expectancy:

SELECT Time_Observed = SYSDATETIMEOFFSET()
, OS_Memory_GB = MAX(convert(decimal(19,3), os.physical_memory_kb/1024./1024.))
, OS_Available_Memory_GB = max(convert(decimal(19,3),
sm.available_physical_memory_kb/1024./1024.))
, SQL_Target_Server_Mem_GB = max(CASE counter_name
WHEN 'Target Server Memory (KB)' THEN convert(decimal(19,3), cntr_value/1024./1024.)
END)
, SQL_Total_Server_Mem_GB = max(CASE counter_name
WHEN 'Total Server Memory (KB)' THEN convert(decimal(19,3), cntr_value/1024./1024.)
END)
, PLE_s = MAX(CASE counter_name WHEN 'Page life expectancy' THEN cntr_value END)
FROM sys.dm_os_performance_counters as pc
CROSS JOIN sys.dm_os_sys_info as os
CROSS JOIN sys.dm_os_sys_memory as sm;

Note

In servers with multiple SQL Server instances, sys.dm_os_performance_counters displays only metrics for the instance on which it is run. You cannot access performance metrics for other instances on the same server via this DMV.

Some queries against sys.dm_os_performance_counters are not as straightforward. For example, although Performance Monitor returns the Buffer Cache Hit Ratio as a single value, querying this same memory metric via the DMV requires creating the ratio from two metrics. This code sample divides two metrics to provide the Buffer Cache Hit Ratio:

SELECT Time_Observed = SYSDATETIMEOFFSET(),
Buffer_Cache_Hit_Ratio = convert(decimal(9,1), 100. *
(SELECT cntr_value = convert(decimal (9,1), cntr_value)
FROM sys.dm_os_performance_counters as pc
WHERE pc.COUNTER_NAME = 'Buffer cache hit ratio'
AND pc.OBJECT_NAME like '%:Buffer Manager%')
/
(SELECT cntr_value = convert(decimal (9,1), cntr_value)
FROM sys.dm_os_performance_counters as pc
WHERE pc.COUNTER_NAME = 'Buffer cache hit ratio base'
AND pc.OBJECT_NAME like '%:Buffer Manager%'));

Finally, some counters returned by sys.dm_os_performance_counters are continually incrementing integers. Let’s return to our earlier example of finding page splits, where we demonstrated how to find the accumulating value. The counter name Page Splits/sec is misleading when accessed via the DMV, because it is an incrementing number. To calculate the rate of page splits per second, you need two samples to calculate the difference between the first and second values. This strategy is appropriate only for single-value counters for the entire server or instance. For counters that return one value per database, you would need to use a temporary table to calculate the rate for each database between the two samples. You could also capture these values to a table at regular intervals to enable reporting over time.

DECLARE @page_splits_Start_ms bigint, @page_splits_Start bigint
, @page_splits_End_ms bigint, @page_splits_End bigint;
SELECT @page_splits_Start_ms = ms_ticks
, @page_splits_Start = cntr_value
FROM sys.dm_os_sys_info CROSS APPLY
sys.dm_os_performance_counters
WHERE counter_name ='Page Splits/sec'
AND object_name LIKE '%SQL%Access Methods%'; --Find the object that contains page splits
WAITFOR DELAY '00:00:05'; --Duration between samples 5s

SELECT @page_splits_End_ms = MAX(ms_ticks),
@page_splits_End = MAX(cntr_value)
FROM sys.dm_os_sys_info CROSS APPLY
sys.dm_os_performance_counters
WHERE counter_name ='Page Splits/sec'
AND object_name LIKE '%SQL%Access Methods%'; --Find the object that contains page splits
SELECT Time_Observed = SYSDATETIMEOFFSET(),
Page_Splits_per_s = convert(decimal(19,3),
(@page_splits_End - @page_splits_Start) * 1000.
/ NULLIF(@page_splits_End_ms - @page_splits_Start_ms,0));

You can gain access to some OS metrics via the DMV sys.dm_os_ring_buffers, including metrics on CPU utilization and memory. This DMV returns thousands of XML documents, generated every second, loaded with information on SQL exceptions, memory, schedulers, connectivity, and more. It is worth noting that the sys.dm_os_ring_buffers DMV is one of several OS-level views that are documented but not supported.

In the code sample that follows, we pull the SQL Server instance’s CPU utilization and the server idle CPU percentage for the past few hours. The remaining CPU percentage can be chalked up to other applications or services running on the server, including other SQL Server instances.

DECLARE @ts_now bigint = (SELECT cpu_ticks/(cpu_ticks/ms_ticks) FROM sys.dm_os_sys_info
WITH (NOLOCK));

SELECT TOP(256) SQLProcessUtilization AS [SQL Server Process CPU Utilization],
SystemIdle AS [System Idle Process],
100 - SystemIdle - SQLProcessUtilization AS [Other Process CPU Utilization],
DATEADD(ms, -1 * (@ts_now - [timestamp]), GETDATE()) AS [Event Time]
FROM (SELECT record.value('(./Record/@id)[1]', 'int') AS record_id,
record.value('(./Record/SchedulerMonitorEvent/SystemHealth/SystemIdle)[1]', 'int')
AS [SystemIdle],
record.value('(./Record/SchedulerMonitorEvent/SystemHealth/ProcessUtilization)[1]',
'int')
AS [SQLProcessUtilization], [timestamp]
FROM (SELECT [timestamp], CONVERT(xml, record) AS [record]
FROM sys.dm_os_ring_buffers WITH (NOLOCK)
WHERE ring_buffer_type = N'RING_BUFFER_SCHEDULER_MONITOR'
AND record LIKE N'%<SystemHealth>%') AS x) AS y
ORDER BY record_id DESC;

Now you should have a grasp of how to use these DMVs to gather performance metrics and capture the various types of data streams coming out of SQL Server.

Capture performance metrics with Performance Monitor

To get a complete, graphical picture of server resource utilization, it’s necessary to use a server resource tool. Performance Monitor is more than just a pretty graph; it is a suite of data collection tools that can persist outside your user profile.

To open Performance Monitor, type performance in the Windows Start menu and select the Performance Monitor icon, or press the Windows+R key combination and type perfmon. You can configure the live Performance Monitor graph, available in the Monitoring Tools folder, to show a live picture of server performance. To do so, right-click the (mostly empty) chart to access properties, add counters, clear the graph, and so on.

Choosing Properties in that same shortcut menu opens the Performance Monitor Properties dialog box. Under General, you can configure the sample rate and duration of the graph. You can also display up to 1,000 sample points on the graph live. This can be 1,000 1-second sample points for a total of 16 minutes and 40 seconds, or more time if you continue to decrease the sample frequency. For example, you can display 1,000 5-second sample points, for more than 83 minutes of duration in the graph.

A Data Collector Set allows you to collect data from one or more Performance Monitor counters, and to run that collection non-interactively. This data is stored in log files and is how administrators most commonly use Performance Monitor. You can access the collected Performance Monitor data by navigating to the Reports folder; the User Defined folder contains a new report with the graph created by the Data Collector Set. Figure 8-3 shows more than 15 days of performance data collected by the Data Collector Set, which we’re viewing in the Memory folder by selecting the most recent report, generated when we stopped the Memory Data Collector Set.

A screenshot of the Windows Performance Monitor application. Instead of showing live data from the Monitoring Tools – Performance Monitor screen, we’re showing 15 days’ worth of data recorded by a User Defined Data Collector Set, which generated a User Defined Report.

Figure 8-3 The Windows Performance Monitor application.

Monitor key performance metrics

This section contains some Performance Monitor metrics to look at when assessing the health and performance of your SQL Server. Although we don’t have the space in this book to provide a deep dive into each metric, its causes, and its indicators, you should take time to investigate and research metrics on your server that fall outside the guidelines provided here.

We don’t provide many hard numbers in this section—things like “Metric X should always be lower than Y.” You should track trends, measure metrics at peak activity, and investigate how metrics respond to server, query, and configuration changes. What might be normal for an instance with a read-heavy workload might be problematic for an instance with a high-volume write workload, and vice versa.

The following subsections review common performance monitoring metrics, including where to find them in Windows Performance Monitor and in SQL Server DMOs if available. When one of these sections contains a DMV entry, it means the corresponding metric is available in Windows and Linux. When not available via DMVs, you can find these same OS-level metrics in Linux using tools detailed in the next section, “Monitor key performance metrics in Linux.”

Average Disk seconds per Read or Write

Performance Monitor: PhysicalDisk:Avg. Disk sec/Read and PhysicalDisk:Avg. Disk sec/Write

DMO: sys.dm_io_virtual_file_stats offers similar metrics for each data and log file. Typically, this data is used in conjunction with the Performance Monitor data to confirm or deny behavior.

View this metric on each volume. The _Total metric doesn’t have any value here; you should look at individual volumes in which SQL Server files are present. This metric has the clearest guidance of any with respect to what is acceptable or not for a server.

Try to measure this value during your busiest workload, and also during backups. You want to see the average disk seconds per read and write operation (considering that a single query could have thousands or millions of operations) below 20 milliseconds, or .02 seconds. Below 10 milliseconds is optimal and very achievable with modern storage systems.

Seeing this value spike to a very high value (such as .1 second or 100 milliseconds) isn’t a major cause for concern, but if you see these metrics sustaining an average higher than .02 seconds during peak activity, it is a fairly clear indication that the physical I/O subsystem is being stressed beyond its capacity. Low, healthy measurements for this number don’t provide any insight into the quality or efficiency of queries and execution plans, only the response from the disk subsystem. The Avg. Disk sec/Transfer counter is simply a combination of both read and write activity, unrelated to Avg. Disk Transfers/sec.
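
For the DMO side of this metric, the following sketch computes average read and write latency per database file from sys.dm_io_virtual_file_stats. Keep in mind that these values accumulate from instance startup, so they smooth out the spikes that Performance Monitor would show:

SELECT database_name = DB_NAME(vfs.database_id)
, mf.physical_name
, avg_read_latency_ms  = vfs.io_stall_read_ms  * 1. / NULLIF(vfs.num_of_reads, 0)
, avg_write_latency_ms = vfs.io_stall_write_ms * 1. / NULLIF(vfs.num_of_writes, 0)
FROM sys.dm_io_virtual_file_stats(NULL, NULL) AS vfs
INNER JOIN sys.master_files AS mf
    ON mf.database_id = vfs.database_id
    AND mf.file_id = vfs.file_id
ORDER BY avg_read_latency_ms DESC;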

Page Life Expectancy (PLE)

Performance Monitor: MSSQL$InstanceName:Buffer Manager/Page Life Expectancy (s)

DMV: sys.dm_os_performance_counters
WHERE object_name like '%Buffer Manager%'
AND counter_name = 'Page life expectancy'

Page Life Expectancy (PLE) is a measure of time (in seconds) that indicates the age of data in memory. PLE is one of the most direct indicators of memory pressure, though it doesn’t provide a complete picture of memory utilization in SQL Server. In general, you want pages of data in memory to grow to a ripe old age; when they do, it means there is ample memory available to SQL Server to store data to serve reads without going back to the storage layer.

A dated, oft-quoted metric of 300 seconds isn’t applicable to many SQL Server instances. While 300 seconds might be appropriate for a server with 4 GB of memory, it’s far too low for a server with 64 GB of memory. Instead, you should monitor this value over time. Does PLE bottom out and stay there during certain operations or scheduled tasks? If so, your SQL Server performance might benefit from more memory during those operations. Does PLE grow steadily throughout production hours? If so, the data in memory is likely to be sufficient for the observed workload.

Page Reads

Performance Monitor: MSSQL$InstanceName:Buffer Manager/Page reads/sec

DMV: sys.dm_os_performance_counters
WHERE object_name like '%Buffer Manager%'
AND counter_name = 'Page reads/sec'

This is an average of the number of page read operations completed recently. The title is a bit misleading—these aren’t page reads from the buffer pool; rather, they are reads of physical pages from the drive, which is slower than serving data pages out of memory.

You should make the effort to lower this number by optimizing queries and indexing, improving efficiency of cache storage, and, of course, as a last resort, increasing the amount of server memory. Although every workload is different, a value less than 90 is a broad, overly simple guideline. High numbers indicate inefficient query and index design in read-write workloads or memory constraints in read-heavy workloads.

Memory Pages

Performance Monitor: Memory:Pages/sec

DMV: Not available.

Similar to Buffer Manager/Page reads/sec, this is a way to measure data coming from a drive as opposed to coming out of memory. This metric is a recent average of the number of pages pulled from a drive into memory, which will be high after SQL Server startup. Although every workload is different, a value of less than 50 is a broad guideline. Sustained high or climbing levels during typical production usage indicate inefficient query and index design in read-write workloads, or memory constraints in read-heavy workloads. Spikes during database backup and restore operations, bulk copies, and data extracts are expected.

Batch Requests

Performance Monitor: MSSQL$InstanceName:SQL Statistics/Batch Requests/sec

DMV: sys.dm_os_performance_counters WHERE object_name like '%SQL Statistics%' AND counter_name = 'Batch Requests/sec'

This is a measure of aggregate SQL Server user activity, indicating the recent average of the number of batch requests. Any command issued to the SQL Server contains at least one batch request. Higher sustained numbers are good; they mean your SQL Server instance is handling more traffic. Should this number trend downward during peak business hours, it could mean the instance can no longer keep up with increasing user activity.
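
Keep in mind that Batch Requests/sec appears in sys.dm_os_performance_counters as a cumulative counter that increments from instance startup, so computing an actual per-second rate requires two samples. The following is a minimal sketch that assumes a 10-second sampling window.

-- Batch Requests/sec is cumulative in the DMV; sample twice to compute a rate
DECLARE @batches1 bigint, @batches2 bigint, @seconds int = 10;

SELECT @batches1 = cntr_value
FROM sys.dm_os_performance_counters
WHERE [object_name] LIKE '%SQL Statistics%'
  AND counter_name = 'Batch Requests/sec';

WAITFOR DELAY '00:00:10';   -- wait out the sampling window

SELECT @batches2 = cntr_value
FROM sys.dm_os_performance_counters
WHERE [object_name] LIKE '%SQL Statistics%'
  AND counter_name = 'Batch Requests/sec';

SELECT batch_requests_per_sec = (@batches2 - @batches1) * 1.0 / @seconds;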

Page Faults

Performance Monitor: Memory:Page Faults/sec

DMV: Not available.

A memory page fault occurs when an application seeks a data page in memory, only to find it isn’t there because of memory churn. A soft page fault indicates the page was found elsewhere in memory, outside the application’s working set; a hard page fault indicates the page was not in memory at all and had to be retrieved from the drive. The Page Faults/sec metric captures both.

Page faults are a symptom, the cause being memory churn, so you might see an accompanying drop in Page Life Expectancy (PLE). Spikes in page faults, or an upward trend, indicate the amount of server memory was insufficient to serve requests from all applications, not just SQL Server.

Available Memory

Performance Monitor: Memory:Available Bytes or Memory:Available KBytes or Memory:Available MBytes

DMV: SELECT available_physical_memory_kb FROM sys.dm_os_sys_memory

Available memory is operating system memory currently unallocated to any application. Server memory above and beyond the SQL Server instance(s) total MAX_SERVER_MEMORY setting, minus memory in use by other SQL Server features and services or other applications, is available. This will roughly match what shows as available memory in the Windows Task Manager.

Total Server Memory

Performance Monitor: MSSQL$InstanceName:Memory Manager/Total Server Memory (KB)

DMV: sys.dm_os_performance_counters
WHERE object_name like '%Memory Manager%'
AND counter_name = 'Total Server Memory (KB)'

This is the actual amount of memory that SQL Server is using. It is often contrasted with the next metric (Target Server Memory). This number might be far larger than what Windows Task Manager shows allocated to the SQL Server Windows NT 64 Bit background application, which shows only a portion of the memory that sqlserver.exe controls. The Total Server Memory metric is correct.

Target Server Memory

Performance Monitor: MSSQL$InstanceName:Memory Manager/Target Server Memory (KB)

DMV: sys.dm_os_performance_counters
WHERE object_name like '%Memory Manager%'
AND counter_name = 'Target Server Memory (KB)'

This is the amount of memory to which SQL Server wants to have access and is currently working toward consuming. If the difference between Target Server Memory and Total Server Memory is larger than the value for Available Memory, SQL Server wants more memory than the Windows Server can currently acquire. SQL Server will eventually consume all memory available to it under the Max Server Memory setting, but it might take time.
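
To see these two counters side by side, along with available memory at the OS level, something like the following minimal sketch works; it simply pivots the two Memory Manager counters and then queries sys.dm_os_sys_memory.

-- Compare how much memory SQL Server is using (Total) versus what it wants (Target)
SELECT
      total_server_memory_gb  = MAX(CASE WHEN counter_name = 'Total Server Memory (KB)'
                                         THEN cntr_value END) / 1048576.0
    , target_server_memory_gb = MAX(CASE WHEN counter_name = 'Target Server Memory (KB)'
                                         THEN cntr_value END) / 1048576.0
FROM sys.dm_os_performance_counters
WHERE [object_name] LIKE '%Memory Manager%'
  AND counter_name IN ('Total Server Memory (KB)', 'Target Server Memory (KB)');

-- Available physical memory at the operating system level
SELECT available_physical_memory_gb = available_physical_memory_kb / 1048576.0
FROM sys.dm_os_sys_memory;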

Monitor key performance metrics in Linux

While monitoring SQL Server on Linux is identical to SQL Server on Windows in most ways, there are some exceptions, especially when the monitoring source is coming from outside the SQLPAL.

As stated, you’ll find that the DMOs perform the same for SQL Server instances on Windows and in Linux. It’s in the OS layer that the differences in metrics available, and especially the tools used to collect them, are stark. This section reviews a sampling of tools you can use for Linux-specific OS monitoring, keeping in mind that there is a wealth of monitoring solutions on various Linux distributions.

View performance counters in Linux

The dynamic management view sys.dm_os_performance_counters behaves the same and delivers identical output on Windows and Linux. For example, the Performance Monitor metrics in the previous section listed as available in the DMV are also available in SQL Server on Linux.
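
If you manage a mixed estate, the sys.dm_os_host_info DMV (available since SQL Server 2017) is a quick way to confirm from T-SQL which platform an instance is running on; a minimal sketch:

-- Confirm the host operating system and distribution from inside SQL Server
SELECT host_platform       -- 'Windows' or 'Linux'
     , host_distribution   -- for example, 'Ubuntu'
     , host_release
FROM sys.dm_os_host_info;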

The top command, built into Linux and with near-identical output on all distributions, launches a live full-console display of CPU and memory utilization and process metrics, not dissimilar from Windows Task Manager. The screen is data rich and starkly black and white, however, so consider the more graphical command htop. Though not present by default on all Linux distributions, it can be quickly downloaded and easily installed. This command’s output (see Figure 8-4) shows much of the same useful data with a more pleasant format and with color highlights.

Figure 8-4 The htop command’s live, updating look at the Linux server’s CPU, memory, and process utilization.

Another built-in Linux tool is vmstat, which includes extended information on process memory, like runnable/sleeping processes, memory availability, swap memory use, memory I/O activity, system interrupts, and CPU utilization percentages. While vmstat returns a snapshot of the data, the syntax vmstat n appends fresh data to the console once every n seconds.

For querying items in SQL Server on Linux not available in sys.dm_os_performance_counters, such as Avg Disk sec/read and Avg Disk sec/write for each volume, different Linux tools are needed.

The iostat tool is available to install via the sysstat performance monitoring package, using the package manager for your Linux distribution. Source code for iostat is available at https://github.com/sysstat/sysstat.

For example:

user@instance:~$ iostat -x

Using the -x parameter to return extended statistics yields basic host information, a current CPU activity utilization breakdown, and a variety of live measurements for devices, including logical disk volumes. The measures r_await and w_await are the average durations in milliseconds for read and write requests.

Other alternatives include dtrace and nmon, the latter of which includes a simple text-based console interface.

Monitor key performance metrics in Azure portal

The Azure portal provides a significant amount of intelligence to cloud-based SQL operations with built-in dashboarding. This section doesn’t delve too deeply into those continuously improving standard features, but it does spend a little time talking about the sophisticated custom dashboarding and monitoring via Kusto and Azure Log Analytics.

View data in Azure Monitor

The Azure platform’s built-in metrics tool, Azure Monitor—accessible via the Azure portal—automatically tracks several basic key performance and usage metrics in any Azure SQL Database. Azure Monitor Logs is one half of the data platform that supports Azure Monitor. The other is Azure Monitor Metrics, which stores numeric data in a time-series database. Some of this data is collected for you automatically as part of the service (such as the platform metrics), while other data requires configuration (for example, SQL diagnostic settings).

You query this data via the Azure Monitor Metrics pane, where you can drill down to an Azure resource and choose a metric to generate visualizations. Azure Monitor supports pinning generated visualizations to Azure portal dashboards, allowing you to create and monitor key database metrics at a glance.

When using Azure Monitor Metrics for an Azure SQL Database, for example, you can add metrics for DTU usage, or for percentages of the measures that make up a DTU. In the example in Figure 8-5, the Azure Monitor Metrics pane displays both DTU used and the average Log IO percentage on the same graph.

Figure 8-5 The Azure Monitor Metrics pane for an Azure SQL Database, charting DTU Limit, DTU used, and Log IO percentage during a period of high DTU utilization.

You can do more complicated charting via Azure Monitor by adding more metrics, which allows for visualization of multiple dimensions of interrelated data simultaneously, such as service request count per hour versus database CPU or DTU utilization.

You use filtering or splitting to further break down metrics with more than one value—for example, disk metrics. You can either filter on specific LUNs when viewing Data Disk Read Bytes/sec, for example, or you can split the data into different graph series, one for each LUN. If this sounds familiar, filtering and splitting using the Azure portal is not dissimilar from the same in Windows Performance Monitor. For example, this is similar to the selection of volumes when adding the Physical Disk:Avg. Disk sec/Read counter.

Leverage Azure Monitor logs

Azure Monitor is built on the Azure Log Analytics platform, using the same data storage and query mechanisms. Azure Log Analytics is itself a separate platform built to aggregate and query big data of varying schemas in near-real time. Azure Monitor log data is stored in a Log Analytics workspace, but is distinctly under the Azure Monitor product name, which also includes Application Insights.

Many Microsoft Azure resource types natively support syncing varying diagnostic and metric information to Azure Log Analytics. Azure SQL Database natively supports the export of information to Log Analytics via the Diagnostic Settings pane for the respective database in the Azure portal. Diagnostic settings support streaming basic metrics, as well as various types of logs, to Log Analytics.

Unlike Azure Monitor Metrics, Log Analytics also supports ingesting information from on-premises servers via an agent. You might be familiar with the Microsoft Monitoring Agent (MMA) used by the System Center Operations Manager (SCOM) monitoring tool. The Azure Monitor Agent is the evolution of and replacement for the MMA, and enables you to send data from those servers to a Log Analytics workspace.

Once your Log Analytics workspace is receiving data, you can query the workspace via the Logs pane in the Log Analytics workspace resource using the Kusto Query Language (KQL).

The following sample query gathers all DTU consumption metrics for Azure SQL Databases sending their logs to the Log Analytics workspace. It displays the 80th percentile of DTU consumption per time grain—in this case every 60 minutes. The intent is to normalize spikes of DTU usage and help to visualize sustained increase in DTU percentage that may be indicative of inefficient queries or a degradation between deployments.

AzureMetrics
| where MetricName == 'dtu_consumption_percent'
| summarize percentile(Average, 80) by bin(TimeGenerated, 1h)
| render timechart

Figure 8-6 visualizes the results of this query, which will work with any Log Analytics workspace that has SQL databases sending log data to it.

Figure 8-6 A Log Analytics query using Kusto and its charted result: the 80th percentile of DTU utilization over time, with numerous spikes and valleys.

Note

As with Azure Monitor, results of Log Analytics queries can be pinned to a Microsoft Azure portal dashboard by using the Pin to Dashboard button in the header bar.

As mentioned, the preceding query extrapolates the 80th percentile of average DTU usage as a percent of quota, denoted by 'dtu_consumption_percent', in 1-hour increments. While useful, variances in usage patterns of the databases can lead to numerous peaks and valleys in the data rendered. This can make it hard to visually spot when the analyzed data is indicating a regression in performance—that is, a spike in DTU consumption.

As an alternative, the following query, visualized in Figure 8-7, applies a finite impulse response filter (series_fir()) to produce a 12-hour moving average of the analyzed data. This type of filter is often used in signal processing, which a log stream resembles. This second example is a small demonstration of how easily you can draw meaningful metrics out of the log data stream coming from Azure SQL resources, a more sophisticated view that should be more useful and readable at larger time scales.

AzureMetrics
| where MetricName == 'dtu_consumption_percent'
| make-series 80thPercentile=percentile(Average, 80)
 on TimeGenerated in range(ago(7d), now(), 60m)
| extend 80thPercentile=series_fir(80thPercentile, repeat(1, 12), true, true)
| mv-expand 80thPercentile, TimeGenerated
| project todouble(80thPercentile), todatetime(TimeGenerated)
| render timechart with (xcolumn=TimeGenerated)

Figure 8-7 The charted result of the second Kusto query: a 12-hour moving average of the 80th percentile of DTU utilization, noticeably smoother than the graph in Figure 8-6.

Create Microsoft Log Analytics solutions

Perhaps the most important takeaway from Log Analytics is the ability to add or create solution packages. These can encapsulate queries, dashboards, and drill-down reports.

Added via the Azure Marketplace, the Azure SQL Analytics solution and SQL Health Check solution attach to a Log Analytics workspace and can provide near-immediate feedback across your environment, scaling to hundreds of thousands of databases if necessary.

Figure 8-8 is a sample live display from a production Azure SQL Analytics solution backed by a Log Analytics workspace. It shows at-a-glance information from production Azure SQL databases regarding database tuning recommendations, resource utilization, wait types and duration, as well as health check outcomes for metrics like timeouts and deadlocks.

Figure 8-8 Output from the Azure SQL Analytics solution available from the Microsoft Azure Marketplace.

Protect important workloads with Resource Governor

Resource Governor, an Enterprise edition feature, is the only feature you can use to identify connections as they connect, and to limit the resources they can consume.

You can identify connections from virtually any connection property—basically, anything you can get from a system function, including the login name (SUSER_SNAME() or ORIGINAL_LOGIN()), hostname (HOST_NAME()), application name (APP_NAME()), and time functions (SYSDATETIME()).

After you’ve identified and classified the connection properties, you can limit properties at the individual session level or limit the resources of a resource pool. You can override the MAXDOP setting for these sessions, lower their priority, or cap the CPU, memory, or drive I/O that individual sessions can consume.

For example, you can limit all read-heavy queries coming from an SSRS server, or long-running reports coming from a third-party reporting application, or dashboard/search queries based on their application name or login. Then, you can limit these queries as a set, capping them to 25 percent of the processor, disk I/O, or SQL Server memory. SQL Server will enforce these limitations and potentially slow down the identified queries; meanwhile, important read-write workloads continue to operate with the remaining 75 percent of the server’s resources.

Be aware that using Resource Governor to limit long-running SELECT statements, for example, does not alleviate concurrency issues caused by locking. In fact, limiting long-running queries could alleviate memory or CPU contention but exacerbate existing locking problems.

  • See Chapter 14 for strategies to overcome concurrency issues, keeping in mind that using the READ UNCOMMITTED isolation level is a risky, clumsy strategy for solving concurrency issues in your applications.

When enabled, Resource Governor is transparent to connecting applications. No code changes are required in the queries to implement Resource Governor, only a working knowledge of the connection properties you will use to identify queries, such as those returned by APP_NAME(), HOST_NAME(), or SUSER_SNAME().

Caution

The value returned by APP_NAME(), or that appears in the sys.dm_exec_sessions.program_name column, is specified in the application connection string. Filtering by this value should not be used as a security method, as connection strings can be changed to specify any string. If you’re a paranoid DBA, it may also be something to watch for, if savvy users or tricky developers realize they can change their application connection strings and get more resources for their queries!
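
Before writing classification rules, it can help to survey the values that current connections actually present. The following minimal sketch summarizes them from sys.dm_exec_sessions; the columns correspond to the functions mentioned above.

-- Survey the connection properties you might classify on
SELECT
      s.program_name    -- what APP_NAME() returns for the session
    , s.host_name       -- what HOST_NAME() returns
    , s.login_name      -- what SUSER_SNAME() typically returns
    , session_count = COUNT(*)
FROM sys.dm_exec_sessions AS s
WHERE s.is_user_process = 1
GROUP BY s.program_name, s.host_name, s.login_name
ORDER BY session_count DESC;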

By default, sessions are split between two workload groups: workload group 1, named internal for system queries internal to the Database Engine, and workload group 2, named default for all other user queries. You can find the current groups in the sys.resource_governor_workload_groups catalog view. While these groups still appear in SQL Server editions other than Enterprise (or Developer or Evaluation), Resource Governor is an Enterprise-only feature.

Configure the Resource Governor classifier function

Before configuring Resource Governor to classify workloads arriving at your SQL Server, you must create a classifier function in the master database that operates at the creation of every new session. You can write the classifier function however you like, but keep in mind that it will be run for each new connection, so it should be as efficient and simple as possible.

The classifier function must return a sysname data type value. (The sysname built-in user-defined data type is equivalent to nvarchar(128) NOT NULL.) The classifier function return value must be the name of a Resource Governor workload group to which a new connection is to be assigned. Though sysname defaults to a NOT NULL data type, the function can return a NULL value, meaning that the session is assigned to the default group.

A workload group is simply a container of sessions. Remember, when configuring Resource Governor defensively (as is most common), it is the default workload group that you want to protect; it contains “all other” sessions, including high–business value connections that perform application-critical functions, writes, and so on.

The sample code that follows defines a classifier function that returns GovGroupReports for all queries coming from two known-fictional reporting servers. The comments show other sample connection-identifying functions; many more options are possible.

CREATE FUNCTION dbo.fnCLASSIFIER() RETURNS sysname
WITH SCHEMABINDING AS
BEGIN
    -- Note that any request that you do not assign a @grp_name value for returns NULL,
    -- and is classified into the 'default' group.
    DECLARE @grp_name sysname;
    IF (
        --Use built-in functions for connection string properties
        HOST_NAME() IN ('reportserver1','reportserver2')
        --OR APP_NAME() IN ('some application') --further samples you can use
        --AND SUSER_SNAME() IN ('whateveruser') --further samples you can use
       )
    BEGIN
        SET @grp_name = 'GovGroupReports';
    END;
    RETURN @grp_name;
END;

Be mindful about querying other resources, such as tables in a user database, from within the classifier function; this can cause a noticeable delay in connection creation. If you must have the classifier function query a table, store the table in the master database, and keep the table small and the query efficient.
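
For example, a sketch of this pattern might keep a small list of report hosts in a lookup table in master and test it with a single EXISTS check. The table and function names here (dbo.RG_ReportHosts, dbo.fnCLASSIFIER_Lookup) are hypothetical and not part of the configuration built in this section.

-- Hypothetical example: a small lookup table in master, checked by a classifier function
USE master;
GO
CREATE TABLE dbo.RG_ReportHosts (host_name sysname NOT NULL PRIMARY KEY);
INSERT INTO dbo.RG_ReportHosts (host_name) VALUES ('reportserver1'), ('reportserver2');
GO
CREATE FUNCTION dbo.fnCLASSIFIER_Lookup() RETURNS sysname
WITH SCHEMABINDING AS
BEGIN
    DECLARE @grp_name sysname;
    -- Keep the lookup simple: a single EXISTS check against a small, indexed table
    IF EXISTS (SELECT 1 FROM dbo.RG_ReportHosts WHERE host_name = HOST_NAME())
        SET @grp_name = 'GovGroupReports';
    RETURN @grp_name;   -- NULL falls through to the default group
END;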

After creating the function, which can have any name, you must register it as the classifier function for the instance’s Resource Governor. The function is not yet active for new logins; you must set up workload groups and resource pools first, and then enable your changes.

Configure Resource Governor resource pools and workload groups

Configuring resource pools (limitations that many sessions share) and workload groups (limitations for individual sessions) is the next step. You should take an iterative, gradual approach to configuring the Resource Governor, and avoid making large changes or large initial limitations to the affected groups.

If you have a preproduction environment to test the impact of Resource Governor on workloads with realistic production scale, you should consider performance load testing to make sure the chosen settings will not cause application issues due to throttling resources.

The sample code that follows can serve as an instructional template for creating an initial pool and group. If you want to divide your sessions up further, multiple groups can belong to the same pool, and multiple pools can be limited differently. Commented-out examples of other common uses for Resource Governor are included. In this example, we create a pool that limits all covered sessions, combined, to 50 percent of the instance’s memory, and a group that limits any single query’s memory grant to 30 percent of the pool’s memory and forces its sessions to MAXDOP = 1, overriding any server-, database-, or query-level setting:

CREATE RESOURCE POOL GovPoolMAXDOP1;
CREATE WORKLOAD GROUP GovGroupReports;
GO
ALTER RESOURCE POOL GovPoolMAXDOP1
WITH (-- MIN_CPU_PERCENT = value
      --,MAX_CPU_PERCENT = value
      --,MIN_MEMORY_PERCENT = value
 MAX_MEMORY_PERCENT = 50
);
GO
ALTER WORKLOAD GROUP GovGroupReports
WITH (
       --IMPORTANCE = { LOW | MEDIUM | HIGH }
       --,REQUEST_MAX_CPU_TIME_SEC = value
       --,REQUEST_MEMORY_GRANT_TIMEOUT_SEC = value
       --,GROUP_MAX_REQUESTS = value
          REQUEST_MAX_MEMORY_GRANT_PERCENT = 30
          , MAX_DOP = 1
)
USING GovPoolMAXDOP1;

With the workload groups and resource pools in place, you are ready to tell Resource Governor to start using your changes:

-- Register the classifier function with Resource Governor
ALTER RESOURCE GOVERNOR WITH (CLASSIFIER_FUNCTION= dbo.fnCLASSIFIER);

After you have configured the classifier function, groups, and pools, enable Resource Governor by using the following statement, which loads its configuration into memory. New sessions will then be sorted by the classifier function and appear in their assigned groups. Issue this same RECONFIGURE statement whenever you change the Resource Governor configuration to apply your changes:

-- Start or reconfigure Resource Governor
ALTER RESOURCE GOVERNOR RECONFIGURE;

If anything goes awry, you can disable Resource Governor with the following command, and re-enable it with the same command as above.

--Disable Resource Governor
ALTER RESOURCE GOVERNOR DISABLE;

After you disable Resource Governor, existing sessions continue under their prior classification, but new sessions are no longer sorted into your workload groups, only into the default group. While disabled, sessions behave with the default settings.

After you configure it and turn it on, you can query the status of Resource Governor and the name of the classifier function by using the following sample script:

SELECT rgc.is_enabled, o.name
FROM sys.resource_governor_configuration AS rgc
LEFT OUTER JOIN master.sys.objects AS o
    ON rgc.classifier_function_id = o.object_id
LEFT OUTER JOIN master.sys.schemas AS s
    ON o.schema_id = s.schema_id;

Monitor resource pools and workload groups

The group_id columns in both sys.dm_exec_requests and sys.dm_exec_sessions define the Resource Governor group of which the request or session is a part. Groups are members of pools.

You can query the configured groups and pools via the sys.resource_governor_workload_groups and sys.resource_governor_resource_pools catalog views, or their runtime counterparts, the sys.dm_resource_governor_workload_groups and sys.dm_resource_governor_resource_pools DMVs. Use the following sample query to observe the number of sessions that have been sorted into groups, noting that group_id = 1 is the internal group, group_id = 2 is the default group, and other groups are defined by you, the administrator:

SELECT
      rgg.group_id, rgp.pool_id
    , Pool_Name = rgp.name, Group_Name = rgg.name
    , session_count = ISNULL(COUNT(s.session_id), 0)
FROM sys.dm_resource_governor_workload_groups AS rgg
LEFT OUTER JOIN sys.dm_resource_governor_resource_pools AS rgp
    ON rgg.pool_id = rgp.pool_id
LEFT OUTER JOIN sys.dm_exec_sessions AS s
    ON s.group_id = rgg.group_id
GROUP BY rgg.group_id, rgp.pool_id, rgg.name, rgp.name
ORDER BY rgg.name, rgp.name;

While only Enterprise edition lets you modify Resource Governor, all editions share the same code, so executing this query on other editions (even Express) will still return the internal pool.

Understand the SQL Server servicing model

Database administrators and CIOs alike must adjust their normal comfort levels with new SQL Server versions. No longer can IT leadership say, “Wait until the first service pack,” before moving, because as of SQL Server 2017, there are no more service packs, only cumulative updates!

This section outlines the current processes for SQL Server on-premises versions. Note that Azure SQL Database and Azure SQL Managed Instance keep your database engine up to date soon after new versions are deployed.

Updated servicing model

Microsoft has adopted a new model for its product life cycles. In the past, its service model included service packs (SPs), cumulative updates (CUs), and general distribution releases (GDRs). However, beginning with SQL Server 2017 and continuing with SQL Server 2022, the following changes are in effect:

  • SPs are no longer released.

  • CUs are released approximately every month for the first 12 months of general release, and then every two months for the remaining four years of the five-year duration of the mainstream support period. In October 2018, this cadence was increased from quarterly to every two months for SQL Server 2017 and all future releases.

  • Critical security fixes are still delivered via GDR patches (which contain security-only fixes), providing an update path between CUs.

    Note

    For example, on February 14, 2023, Microsoft released an important security update for all supported versions of SQL Server to patch a remote code execution vulnerability. The authors of this book strongly recommend that you apply this update as soon as possible. For more information on the February 2023 GDR release for each version of SQL Server, visit https://aka.ms/sqlbuilds.

  • Slipstream media is no longer provided for SQL Server (after SQL Server 2017 CU 11) for those who used slipstream media for new instance installs. Instead, Microsoft recommends using the existing SQL Server Setup, which provides automatic download and installation of the latest CU, or downloading CUs manually for offline installations.

    • For more information on offline installations of SQL Server on Windows Server, see Chapter 4.

Microsoft has maintained in recent years that there is no need to wait for an SP, because the general availability (GA) release has been extensively tested by both internal Microsoft QA and external preview customers. For those dealing with stubborn or reactionary clients or leadership, a possible alternative under the new model could be to target an arbitrary CU, such as CU 2.
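
To confirm which build, servicing branch, and update level a given instance is running, SERVERPROPERTY exposes this information directly; the ProductUpdateLevel and ProductUpdateReference properties are populated on recent versions and builds. A minimal sketch:

-- What version, servicing level, and update level is this instance running?
SELECT
      product_version  = SERVERPROPERTY('ProductVersion')          -- build number
    , product_level    = SERVERPROPERTY('ProductLevel')            -- RTM (no more SPs)
    , update_level     = SERVERPROPERTY('ProductUpdateLevel')      -- for example, a CU number
    , update_reference = SERVERPROPERTY('ProductUpdateReference')  -- the KB article for that update
    , edition          = SERVERPROPERTY('Edition');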

Plan for the product support life cycle

As we were writing this book, SQL Server 2012 reached the end of extended support. Unless paying a hefty ransom for continuing support of these products is an option for you, databases on these old versions must be migrated as soon as possible. Similarly, Windows Server 2012 and 2012 R2 reach their end-of-support dates in October 2023. No more security patches, even critical ones, will be released publicly, putting their continued use in violation of any sensible software security policy and raising a red flag on any security audit.

In your planning for long-term use of a particular version of SQL Server, you should keep in mind the following life cycle:

  • 0 to 5 years: mainstream support period. Security and functional issues are addressed through CUs. Security-only fixes might also be addressed through GDRs.

  • 6 to 10 years: extended support. Only critical functional issues will be addressed. Security issues might still be addressed through GDRs.

  • 11 to 16 years: premium assurance. The extended support level can be lengthened with optional payments or by migrating your workload to an Azure service. Either way, you will be paying for the privilege of maintaining an old version of SQL Server.
