by Jordan Pritchard, Ramin Keene, Rodney Lester, Jay Holler, Michael Kehoe, Tammy B
Reducing MTTD for High-Severity Incidents
Reducing Mean Time to Detection for High-Severity Incidents
Introduction
Step 0: Incident Classification
SEV Descriptions and Levels
The SEV Timeline
The TTD Timeline
Step 1: Organization-Wide Critical-Service Monitoring
Critical-Service Dashboard
High-Level Service Dashboards
Critical-Service KPI Metrics Emails
Critical-Service KPI Emails
Daily Service-Specific KPI Email: A Database Example
Step 2: Service Ownership and Metrics
The Role of Services in SEVs
Service Alerting Configuration
Step 3: On-Call Principles
On-Call Schedules
On-Call Alerting
Step 4: Chaos Engineering
Step 5: Detecting Incidents Caused by Self-Healing Systems
Step 6: Listening to Your People and Creating a High-Reliability Culture
Conclusion
Further Reading on Reducing MTTD for High-Severity Incidents