0%

Book Description

Network infrastructure is in the midst of a paradigm shift. As systems become more distributed, methods for building and operating them are rapidly evolving—and that makes visibility into your services and infrastructure more important than ever. In this practical ebook, author Cindy Sridharan examines new monitoring tools that, while promising, bring their own set of technical and organizational challenges.

Platforms such as Kubernetes have solved several problems that traditional monitoring tools used to flag, but partial, implicit, and "soft" failure modes have risen along with the overall complexity of the system. This ebook provides an honest overview of monitoring challenges and trade-offs to help you choose the best observability strategy for your distributed system.

  • Learn the pros and cons of the three pillars of modern observability—logging, metrics collection, and request tracing
  • Explore the challenges involved when logging, tracing, and metrics are used in conjunction
  • Understand what you need to monitor in a modern cloud-native environment to derive alerts and insightful analysis
  • Examine the current crop of monitoring systems using Prometheus for metrics and Jaegar for tracing
  • Learn when it makes sense to use additional tools to augment logging, tracing, and metrics
  • Get a blueprint for evolving your monitoring stack to include the latest advances in the space

Table of Contents

  1. 1. The Need for Observability
    1. What Is Observability?
    2. Observability Isn’t Purely an Operational Concern
    3. Conclusion
  2. 2. Monitoring and Observability
    1. Alerting Based on Monitoring Data
    2. Best Practices for Alerting
      1. What Monitoring Signals to Use for Alerting?
      2. Debugging “Unmonitorable” Failures
      3. Observability Isn’t a Panacea
    3. Conclusion
  3. 3. Coding and Testing for Observability
    1. Coding for Failure
      1. Operational Semantics of the Application
      2. Operational Characteristics of the Dependencies
      3. Debuggable Code
    2. Testing for Failure
    3. Conclusion
  4. 4. The Three Pillars of Observability
    1. Event Logs
      1. The Pros and Cons of Logs
      2. Logging as a Stream Processing Problem
    2. Metrics
      1. The Anatomy of a Modern Metric
      2. Advantages of Metrics over Event Logs
      3. The Drawbacks of Metrics
    3. Tracing
    4. The Challenges of Tracing
      1. Service Meshes: A New Hope for the Future?
    5. Conclusion
  5. 5. Conclusion
18.119.157.39