Chapter 3. Cloudera's Distribution Including Apache Hadoop

With knowledge of HDFS and MapReduce, you are now ready to explore the world's most used Apache Hadoop distribution, Cloudera's Distribution Including Apache Hadoop (CDH). CDH is thoroughly tested and consists of a host of components that have been carefully packaged to work well with each other.

In this chapter, we will cover the following topics:

  • Getting started with CDH
  • Understanding the CDH components
  • Installing CDH
  • Installing the CDH components

Getting started with CDH

Cloudera is an organization that has been working with Hadoop and its related technologies for a few years now. It is an expert in the field of handling large amounts of data using Hadoop and various other open source tools and projects. It is one of the major contributors to several of the Apache projects. Over the years, Cloudera has deployed several clusters for hundreds of its customers. It is equipped with practical knowledge of the issues and details of real production clusters. To solve these issues, Cloudera built CDH.

In most distributed computing clusters, there are several tools that need to work together to provide the desired output. These tools are individually installed and are then configured to work well with each other. This approach often creates problems as the tools are never tested together.

Also, the setup and configuration of these tools is tedious and prone to errors. CDH solves this problem as it is packaged with thoroughly tested tools that work well together in a single powerful distribution. Installation and configuration of the various tools and components is more organized with CDH.

CDH has everything an enterprise needs for its big data projects. The components packaged into CDH provide tools for storage as well as the computation of large volumes of data. By using CDH, an enterprise is guaranteed to have good support from the community for its Hadoop deployment.

..................Content has been hidden....................

You can't read the all page of ebook, please click here login for view all page.
Reset
3.145.14.200