0%

Book Description

Why have stream-oriented data systems become so popular, when batch-oriented systems have served big data needs for many years? In this report, author Dean Wampler examines the rise of streaming systems for handling time-sensitive problems—such as detecting fraudulent financial activity as it happens. You’ll explore the characteristics of fast data architectures, along with several open source tools for implementing them.

Batch-mode processing isn’t going away, but exclusive use of these systems is now a competitive disadvantage. You’ll learn that, while fast data architectures are much harder to build, they represent the state of the art for dealing with mountains of data that require immediate attention.

  • Learn step-by-step how a basic fast data architecture works
  • Understand why event logs are the core abstraction for streaming architectures, while message queues are the core integration tool
  • Use methods for analyzing infinite data sets, where you don’t have all the data and never will
  • Take a tour of open source streaming engines, and discover which ones work best for different use cases
  • Get recommendations for making real-world streaming systems responsive, resilient, elastic, and message driven
  • Explore an example streaming application for the IoT: telemetry ingestion and anomaly detection for home automation systems

Table of Contents

  1. 1. Introduction
    1. A Brief History of Big Data
    2. Batch-Mode Architecture
  2. 2. The Emergence of Streaming
    1. Streaming Architecture
    2. What About the Lambda Architecture?
  3. 3. Event Logs and Message Queues
    1. The Event Log Is the Core Abstraction
    2. Message Queues Are the Core Integration Tool
    3. Why Kafka?
  4. 4. How Do You Analyze Infinite Data Sets?
    1. Which Streaming Engine(s) Should You Use?
  5. 5. Real-World Systems
    1. Some Specific Recommendations
  6. 6. Example Application
    1. Machine Learning Considerations
  7. 7. Where to Go from Here
    1. Additional References
3.144.143.31