Cloudera Manager can be installed using the following two methods:
Automatic installation
Manual installation
In this section, we will walk through the steps for automatic installation as most of the configurations are handled efficiently by Cloudera Manager itself.
For this demonstration, we will be using the following machine configuration for the Cloudera Manager Server:
Operating system: CentOS 6.4
RAM: 4 GB
CPU: 4 CPU cores
Disk space: 50 GB
For the machines running the Cloudera Manager Agents, we will use the following configuration:
Operating system: CentOS 6.4
RAM: 4 GB
CPU: 4 CPU cores
Disk space: This will depend on the volume of data you want to store in your cluster
Note
For a complete list of all the hardware and software requirements, visit the following Cloudera website:
Once downloaded, modify the downloaded binary file to be executable using the following command:
$ chmod u+x cloudera-manager-installer-bin
Next, update the config directory under /etc/selinux and then set SELINUX=disabled to disableSecurity-Enhanced Linux (SELinux). This is required for the installation of Cloudera Manager and can be re-enabled after installation.
Start the installation of Cloudera Manager using the following command:
$ ./cloudera-manager-installer-bin
As shown in the previous screenshot, the installer starts with the Cloudera Manager README screen. The README page describes all the steps the Cloudera Manager installer will perform to install the Cloudera Manager Server and its agents. Select Next to proceed.
You will be presented with the Cloudera Express License screen as shown in the following screenshot. Select Next to proceed and accept the license.
On accepting the license, you will be prompted with the Oracle Binary Code License Agreement. Select Next and accept the license.
On accepting the Oracle Binary Code License Agreement, the Cloudera Manager installer starts the installation process, as shown in the following screenshot:
After the installation completes, as shown in the following screenshot, a dialog with the URL for the Cloudera Manager Web console is displayed. Click on OK and open up a browser to visit the link provided.
As shown in the following screenshot, the login screen is displayed for Cloudera Manager. The default username and password is admin.
The first time you log in, you will be prompted to select the Cloudera Manager edition. We will go ahead and install the Cloudera Enterprise Data Hub Edition Trial version, which can be evaluated for 60 days. This gives us enough time to test out all the features of the full version of Cloudera Manager. If you are interested in getting the Cloudera Manager license, you will need to contact Cloudera directly.
The next screen, as shown in the following screenshot, will display the list of services that will be installed as part of Cloudera Enterprise Data Hub Edition Trial 5.0.1. Click on Continue.
In the next step, you will need to enter the hostnames or IP addresses of all machines that are going to be part of your Apache Hadoop cluster. As shown in the following screenshot, you can enter all the addresses and click on Search to check whether they are available:
After you perform the search, all the machines will be listed as shown in the following screenshot along with the response time from each machine. Once you are satisfied with the results, select the required nodes and click on Continue.
After the node selection, you will be presented with a few options to perform the cluster installation as shown in the following screenshot. The cluster installation is a five-step process. The installer provides two types of installation options: packages and parcels. Cloudera recommends the use of parcels. After selecting the required options, click on Continue.
The next screen, as shown in the following screenshot, provides an option to install Java Unlimited Strength Encryption Policy Files. These files are used to set up a secure server. For now, we can leave the box unchecked and click on Continue.
Next, provide the SSH username and password to log in to the different machines as shown in the following screenshot. As you can see, we are using the root user. For this set up, the root password is the same for all the nodes in the cluster. Once you are done entering the credentials, click on Continue.
The next step starts installing all the Cloudera Manager components to the different machines on the cluster as shown in the following screenshot. This may take a few minutes to complete. If for some reason you want to abort the installation, you can click on Abort Installation.
After the installation completes, you should see the output as shown in the following screenshot. This confirms that the initial set of Cloudera Manager components have been installed correctly. Also, you will notice that the previously grayed-out Continue button is now active. Click on Continue to proceed.
After the installation of the initial components of Cloudera Manager, the next step installs the CDH5 parcel as shown in the following screenshot. Once the installation completes, click on Continue to proceed.
After the parcel is installed, all hosts are inspected for correctness as shown in the following screenshot. If there are any errors, a cross mark is displayed next to the failed validation. Click on Continue to proceed.
The next screen, as shown in the following screenshot, will prompt you to select the different combination of services that you would like to install. Select the appropriate one and click on Continue. As you can see, I have selected All Services to get all the services available within CDH5 to our cluster.
The next screen provides the options that can be customized under the Customize Role Assignments option.
Using this screen, you can decide which services need to run on which hosts. After you are done configuring, click on Continue.
The next step, as shown in the following screenshot, is to configure the database. We are selecting the Use Embedded Database option for our installation. After selecting the option, click on Test Connection to test the database connections. Once the connections are successfully tested, the Continue button will be activated. Click on Continue to proceed.
The next screen, as shown in the following screenshot, displays all the configuration values for the cluster for review. You can update the values as you desire or leave them at the defaults. Once you are done updating, click on Continue to proceed.
The next step, as shown in the following screenshot, starts all the services on the cluster. Once all the services have started successfully, click on Continue to proceed.
Once the installation is complete, you will be logged in to the Cloudera Manager Administrator Web console as shown in the following screenshot:
This completes the installation of Cloudera Manager and CDH5 on the cluster. Next, let us look into all the operations available in the Cloudera Manager Web console that could help the administrator manage the Apache Hadoop cluster.