Pseudocluster mode (aka Spark local)

As you already know, Spark jobs can be run in local mode, which is sometimes called the pseudocluster mode of execution. It is a nondistributed, single-JVM deployment mode in which Spark spawns all the execution components (the driver program, executor, LocalSchedulerBackend, and master) inside the same JVM. This is the only mode in which the driver itself is used as an executor. The following figure shows the high-level architecture of local mode for submitting your Spark jobs:

Figure 6: High-level architecture of local mode for Spark jobs (source: https://jaceklaskowski.gitbooks.io/mastering-apache-spark/content/spark-local.html)

Is this too surprising? Probably not, since you can still achieve some sort of parallelism: the default parallelism is the number of threads (that is, the cores used) specified in the master URL, for example, local[4] for 4 cores/threads and local[*] for all the available threads. We will discuss this topic later in this chapter.
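To make the master URL convention concrete, here is a minimal sketch in plain Python of how a local master URL maps to a thread count. It is not Spark's own code: the helper name `local_thread_count` is hypothetical, and it only handles the `local`, `local[N]`, and `local[*]` forms, assuming that plain `local` means a single thread and that `*` means one thread per logical core.

```python
import os
import re

# Hypothetical helper (not part of Spark): mirrors how a local-mode
# master URL is interpreted when choosing the number of worker threads.
_LOCAL_N = re.compile(r"^local\[(\d+|\*)\]$")


def local_thread_count(master: str) -> int:
    """Return the thread count implied by a local-mode master URL."""
    if master == "local":
        return 1  # plain "local" runs with a single thread
    match = _LOCAL_N.match(master)
    if match is None:
        raise ValueError(f"not a local master URL: {master!r}")
    spec = match.group(1)
    if spec == "*":
        # "*" means one thread per logical core on this machine
        return os.cpu_count() or 1
    threads = int(spec)
    if threads < 1:
        raise ValueError("thread count must be positive")
    return threads


if __name__ == "__main__":
    for url in ("local", "local[4]", "local[*]"):
        print(url, "->", local_thread_count(url))
```

In Spark itself, you would simply pass the same string as the master, for example with `spark-submit --master local[4]`, and let Spark resolve the thread count.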
