0%

Explore Kinesis managed services such as Kinesis Data Streams, Kinesis Data Analytics, Kinesis Data Firehose, and Kinesis Video Streams with the help of practical use cases

Key Features

  • Get well versed with the capabilities of Amazon Kinesis
  • Explore the monitoring, scaling, security, and deployment patterns of various Amazon Kinesis services
  • Learn how other Amazon Web Services and third-party applications such as Splunk can be used as destinations for Kinesis data

Book Description

Amazon Kinesis is a collection of secure, serverless, durable, and highly available purpose-built data streaming services. This data streaming service provides APIs and client SDKs that enable you to produce and consume data at scale.

Scalable Data Streaming with Amazon Kinesis begins with a quick overview of the core concepts of data streams, along with the essentials of the AWS Kinesis landscape. You'll then explore the requirements of the use case shown through the book to help you get started and cover the key pain points encountered in the data stream life cycle. As you advance, you'll get to grips with the architectural components of Kinesis, understand how they are configured to build data pipelines, and delve into the applications that connect to them for consumption and processing. You'll also build a Kinesis data pipeline from scratch and learn how to implement and apply practical solutions. Moving on, you'll learn how to configure Kinesis on a cloud platform. Finally, you'll learn how other AWS services can be integrated into Kinesis. These services include Redshift, Dynamo Database, AWS S3, Elastic Search, and third-party applications such as Splunk.

By the end of this AWS book, you'll be able to build and deploy your own Kinesis data pipelines with Kinesis Data Streams (KDS), Kinesis Data Firehose (KFH), Kinesis Video Streams (KVS), and Kinesis Data Analytics (KDA).

What you will learn

  • Get to grips with data streams, decoupled design, and real-time stream processing
  • Understand the properties of KFH that differentiate it from other Kinesis services
  • Monitor and scale KDS using CloudWatch metrics
  • Secure KDA with identity and access management (IAM)
  • Deploy KVS as infrastructure as code (IaC)
  • Integrate services such as Redshift, Dynamo Database, and Splunk into Kinesis

Who this book is for

This book is for solutions architects, developers, system administrators, data engineers, and data scientists looking to evaluate and choose the most performant, secure, scalable, and cost-effective data streaming technology to overcome their data ingestion and processing challenges on AWS. Prior knowledge of cloud architectures on AWS, data streaming technologies, and architectures is expected.

Table of Contents

  1. Scalable Data Streaming with Amazon Kinesis
  2. Contributors
  3. About the authors
  4. About the reviewers
  5. Preface
    1. Who this book is for
    2. What this book covers
    3. To get the most out of this book
    4. Download the example code files
    5. Download the color images
    6. Conventions used
    7. Get in touch
    8. Reviews
  6. Section 1: Introduction to Data Streaming and Amazon Kinesis
  7. Chapter 1: What Are Data Streams?
    1. Introducing data streams
    2. Sources of data
    3. The value of real-time data in analytics
    4. Decoupling systems
    5. Challenges associated with distributed systems
    6. Transactions per second
    7. Scaling
    8. Latency
    9. Fault tolerance/high availability
    10. Overview of messaging concepts
    11. Overview of core messaging components
    12. Messaging concepts
    13. Examples of data streaming
    14. Application log processing
    15. Internet of Things
    16. Real-time recommendations
    17. Video streams
    18. Summary
    19. Further reading
  8. Chapter 2: Messaging and Data Streaming in AWS
    1. Amazon Kinesis Data Streams (KDS)
    2. Encryption, authentication, and authorization
    3. Producing and consuming records
    4. Data delivery guarantees
    5. Integration with other AWS services
    6. Monitoring
    7. Amazon Kinesis Data Firehose (KDF)
    8. Encryption, authentication, and authorization
    9. Monitoring
    10. Producers
    11. Delivery destinations
    12. Transformations
    13. Amazon Kinesis Data Analytics (KDA)
    14. Amazon KDA for SQL
    15. Amazon Kinesis Data Analytics for Apache Flink (KDA Flink)
    16. Amazon Kinesis Video Streams (KVS)
    17. Amazon Simple Queue Service (SQS)
    18. Amazon Simple Notification Service (SNS)
    19. Amazon SNS integrations with other AWS services
    20. Encryption at rest
    21. Amazon MQ for Apache ActiveMQ
    22. IoT Core
    23. Device software
    24. Control services
    25. Analytics services
    26. Amazon Managed Streaming for Apache Kafka (MSK)
    27. Apache Kafka
    28. Amazon MSK
    29. Amazon EventBridge
    30. Service comparison summary
    31. Summary
  9. Chapter 3: The SmartCity Bike-Sharing Service
    1. The mission for sustainable transportation
    2. SmartCity new mobile features
    3. SmartCity data pipeline
    4. SmartCity data lake
    5. SmartCity operations and analytics dashboard
    6. SmartCity video
    7. The AWS Well-Architected Framework
    8. Summary
    9. Further reading
  10. Section 2: Deep Dive into Kinesis
  11. Chapter 4: Kinesis Data Streams
    1. Technical requirements
    2. Discovering Amazon Kinesis Data Streams
    3. Creating streams and shards
    4. Creating a stream producer application
    5. Creating a stream consumer application
    6. Data pipelines with Amazon Kinesis Data Streams
    7. Data pipeline design (simple)
    8. Data pipeline design (intermediate)
    9. Data pipeline design (full design)
    10. Designing for scalable and reliable analytics pipelines
    11. Monitoring and scaling with Amazon Kinesis Data Streams
    12. X-Ray tracing with Amazon Kinesis Data Streams
    13. Scaling up with Amazon Kinesis Data Streams
    14. Securing Amazon Kinesis Data Streams
    15. Implementing least-privilege access
    16. Summary
    17. Further reading
  12. Chapter 5: Kinesis Firehose
    1. Technical requirements
    2. Setting up the AWS account
    3. Using a local development environment
    4. Using an AWS Cloud9 development environment
    5. Code examples
    6. Discovering Amazon Kinesis Firehose
    7. Understanding KDF delivery streams
    8. Understanding encryption in KDF
    9. Using data transformation in KDF with a Lambda function
    10. Understanding delivery stream destinations
    11. Amazon S3
    12. Amazon Redshift
    13. Amazon Elasticsearch Service
    14. Splunk destination
    15. HTTP endpoint destination
    16. Understanding data format conversion in KDF
    17. Deserialization
    18. Schema
    19. Serializer
    20. Data format conversion errors
    21. Understanding monitoring in KDF
    22. Use-case example – Bikeshare station data pipeline with KDF
    23. Steps to recreate the example
    24. Summary
    25. Further reading
  13. Chapter 6: Kinesis Data Analytics
    1. Technical requirements
    2. AWS account setup
    3. AWS CDK
    4. Java and Java IDE
    5. Code examples
    6. Discovering Amazon KDA
    7. Working on SmartCity bike share analytics use cases
    8. Creating operational insights using SQL Engine
    9. Core concepts and capabilities
    10. Creating operational insights using Apache Flink
    11. Options for running Flink applications in AWS Cloud
    12. Flink applications on KDA
    13. Building bike ride analytic applications
    14. Setting up a producer application
    15. Building a KDA SQL application
    16. Building a KDA Flink application
    17. Monitoring KDA applications
    18. Summary
    19. Further reading
    20. Blogs
    21. Workshops
  14. Chapter 7: Amazon Kinesis Video Streams
    1. Technical requirements
    2. AWS account setup
    3. Using a local development environment
    4. Code examples
    5. Understanding video fundamentals
    6. Containers
    7. Codecs
    8. Discovering Amazon Kinesis video streams WebRTC
    9. Core concepts and connection patterns
    10. Creating a signaling channel
    11. Establishing a connection
    12. Discovering Amazon KVS
    13. Key components of KVS
    14. Stream
    15. Kinesis producer
    16. Consuming
    17. Creating a stream
    18. Producing
    19. Integration with Rekognition
    20. Building video-enabled applications with KVS
    21. Summary
    22. Further reading
  15. Section 3: Integrations
  16. Chapter 8: Kinesis Integrations
    1. Technical requirements
    2. AWS account setup
    3. AWS CLI
    4. Kinesis Data Generator
    5. Code examples
    6. Amazon services that can produce data to send to Kinesis
    7. Amazon Connect
    8. Amazon Aurora database activity
    9. DynamoDB activity
    10. Processing Kinesis data with Apache Spark
    11. Amazon services that consume data from Kinesis
    12. Serverless data lake
    13. Amazon services that transform Kinesis data
    14. Routing events with EventBridge
    15. Third-party integrations with Kinesis
    16. Splunk
    17. Summary
    18. Further reading
    19. Why subscribe?
  17. Other Books You May Enjoy
    1. Packt is searching for authors like you
    2. Leave a review - let other readers know what you think
18.118.184.237