Learning Spark

Author Matei Zaharia , Patrick Wendell , Andy Konwinski , Holden Karau

Release Date: 2015/02/01

ISBN: 9781449359034

Topic:

Data

19
Chapters

0-1
Hours read

0k
Total Words

Start Reading Now
Add to Wishlist
View table of contents

Book Description

Data in all domains is getting bigger. How can you work with it efficiently? Recently updated for Spark 1.3, this book introduces Apache Spark, the open source cluster computing system that makes data analytics fast to write and fast to run. With Spark, you can tackle big datasets quickly through simple APIs in Python, Java, and Scala. This edition includes new information on Spark SQL, Spark Streaming, setup, and Maven coordinates.

Foreword
Preface
1. Introduction to Data Analysis with Spark
2. Downloading Spark and Getting Started
3. Programming with RDDs
4. Working with Key/Value Pairs
5. Loading and Saving Your Data
6. Advanced Spark Programming
7. Running on a Cluster
8. Tuning and Debugging Spark
9. Spark SQL
10. Spark Streaming
11. Machine Learning with MLlib
Index

Learning Spark

Book Description

Table of Contents