Apache Spark for data processing

Apache Spark is a new-ish project (at least in the world of big data, which moves at warp speed) that integrates well with Hadoop but does not necessarily require Hadoop components to operate. It is a fast and general engine for large-scale data processing as described on the Spark project team welcome page. The tagline of lightning fast cluster computing is a little catchier: we like that one better.

Apache Spark logo
..................Content has been hidden....................

You can't read the all page of ebook, please click here login for view all page.
Reset
18.224.63.87