Putting Structure on Your Big Data with Spark SQL

So far, we have learned how to get big data into the Spark environment using RDDs and how to carry out multiple operations on that data. In this chapter, we'll learn how to manipulate DataFrames with Spark SQL schemas and how to use the Spark DSL to build queries for structured data operations.

In particular, we will cover the following topics:

  • Manipulating DataFrames with Spark SQL schemas
  • Using Spark DSL to build queries