Home Page Icon
Home Page
Table of Contents for
Cover
Close
Cover
by Mike Julian
Practical Monitoring
Preface
Who Should Read This Book
Why I Wrote This Book
A Word on Monitoring Today
Navigating This Book
Online Resources
Conventions Used in This Book
Using Code Examples
O’Reilly Safari
How to Contact Us
Acknowledgments
I. Monitoring Principles
1. Monitoring Anti-Patterns
Anti-Pattern #1: Tool Obsession
Monitoring Is Multiple Complex Problems Under One Name
Avoid Cargo-Culting Tools
Sometimes, You Really Do Have to Build It
The Single Pane of Glass Is a Myth
Anti-Pattern #2: Monitoring-as-a-Job
Anti-Pattern #3: Checkbox Monitoring
What Does “Working” Actually Mean? Monitor That.
OS Metrics Aren’t Very Useful—for Alerting
Collect Your Metrics More Often
Anti-Pattern #4: Using Monitoring as a Crutch
Anti-Pattern #5: Manual Configuration
Wrap-Up
2. Monitoring Design Patterns
Pattern #1: Composable Monitoring
The Components of a Monitoring Service
Pattern #2: Monitor from the User Perspective
Pattern #3: Buy, Not Build
It’s Cheaper
You’re (Probably) Not an Expert at Architecting These Tools
SaaS Allows You to Focus on the Company’s Product
No, Really, SaaS Is Actually Better
Pattern #4: Continual Improvement
Wrap-Up
3. Alerts, On-Call, and Incident Management
What Makes a Good Alert?
Stop Using Email for Alerts
Write Runbooks
Arbitrary Static Thresholds Aren’t the Only Way
Delete and Tune Alerts
Use Maintenance Periods
Attempt Automated Self-Healing First
On-Call
Fixing False Alarms
Cutting Down on Needless Firefighting
Building a Better On-Call Rotation
Incident Management
Postmortems
Wrap-Up
4. Statistics Primer
Before Statistics in Systems Operations
Math to the Rescue!
Statistics Isn’t Magic
Mean and Average
Median
Seasonality
Quantiles
Standard Deviation
Wrap-Up
II. Monitoring Tactics
5. Monitoring the Business
Business KPIs
Two Real-World Examples
Yelp
Reddit
Tying Business KPIs to Technical Metrics
My App Doesn’t Have Those Metrics!
Finding Your Company’s Business KPIs
Wrap-Up
6. Frontend Monitoring
The Cost of a Slow App
Two Approaches to Frontend Monitoring
Document Object Model (DOM)
Frontend Performance Metrics
OK, That’s Great, but How Do I Use This?
Logging
Synthetic Monitoring
Wrap-Up
7. Application Monitoring
Instrumenting Your Apps with Metrics
How It Works Under the Hood
Monitoring Build and Release Pipelines
Health Endpoint Pattern
Application Logging
Wait a Minute…Should I Have a Metric or a Log Entry?
What Should I Be Logging?
Write to Disk or Write to Network?
Serverless / Function-as-a-Service
Monitoring Microservice Architectures
Wrap-Up
8. Server Monitoring
Standard OS Metrics
CPU
Memory
Network
Disk
Load
SSL Certificates
SNMP
Web Servers
Database Servers
Load Balancers
Message Queues
Caching
DNS
NTP
Miscellaneous Corporate Infrastructure
DHCP
SMTP
Monitoring Scheduled Jobs
Logging
Collection
Storage
Analysis
Wrap-Up
9. Network Monitoring
The Pains of SNMP
What Is SNMP?
How Does It Work?
A Word on Security
How Do I Use SNMP?
Interface Metrics
Interface and Logging
Recap
Configuration Tracking
Voice and Video
Routing
Spanning Tree Protocol (STP)
Chassis
CPU and Memory
Hardware
Flow Monitoring
Capacity Planning
Working Backward
Forecasting
Wrap-up
10. Security Monitoring
Monitoring and Compliance
User, Command, and Filesystem Auditing
Setting Up auditd
auditd and Remote Logs
Host Intrusion Detection System (HIDS)
rkhunter
Network Intrusion Detection System (NIDS)
Wrap-Up
11. Conducting a Monitoring Assessment
Business KPIs
Frontend Monitoring
Application and Server Monitoring
Security Monitoring
Alerting
Wrap-Up
A. An Example Runbook: Demo App
Demo App
Metadata
Escalation Procedure
External Dependencies
Internal Dependencies
Tech Stack
Metrics and Logs
Alerts
B. Availability Chart
Index
Search in book...
Toggle Font Controls
Playlists
Add To
Create new playlist
Name your new playlist
Playlist description (optional)
Cancel
Create playlist
Sign In
Email address
Password
Forgot Password?
Create account
Login
or
Continue with Facebook
Continue with Google
Sign Up
Full Name
Email address
Confirm Email Address
Password
Login
Create account
or
Continue with Facebook
Continue with Google
Next
Next Chapter
Practical Monitoring
Add Highlight
No Comment
..................Content has been hidden....................
You can't read the all page of ebook, please click
here
login for view all page.
Day Mode
Cloud Mode
Night Mode
Reset