After Impala is installed, you must apply a few mandatory and recommended configuration settings for Impala to run smoothly. Cloudera Manager applies some of these settings automatically; however, a few of them must be completed manually after any kind of installation. The following is a list of post-installation configurations:
- Configure hdfs-site.xml on each DataNode as follows:
  <property>
    <name>dfs.client.read.shortcircuit</name>
    <value>true</value>
  </property>
  <property>
    <name>dfs.domain.socket.path</name>
    <value>/var/run/hadoop-hdfs/dn._PORT</value>
  </property>
  <property>
    <name>dfs.client.file-block-storage-locations.timeout</name>
    <value>3000</value>
  </property>
- If /var/run/hadoop-hdfs/ is group writable, make sure its group is root.
- Copy core-site.xml and hdfs-site.xml from the Hadoop configuration folder to the Impala configuration folder at /etc/impala/conf.
- hdfs-site.xml on each DataNode must have the following setting:
  <property>
    <name>dfs.datanode.hdfs-blocks-metadata.enabled</name>
    <value>true</value>
  </property>
- Make sure the hdfs-site.xml file is placed in the Impala configuration folder at /etc/impala/conf.
- Make sure libhadoop.so, the Hadoop native library, is available. If this library is not available, you might receive the "Unable to load native-hadoop library for your platform... using builtin-java classes where applicable" message in Impala logs, indicating that native checksumming is not enabled.
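The hdfs-site.xml settings listed above are easy to mistype or to leave out on one node, so a small check can help. The following is a minimal sketch in Python, not part of Impala or Hadoop: the function names parse_properties and find_problems are hypothetical, and the expected values are simply the ones shown in the list above. It uses only the standard library.

```python
import xml.etree.ElementTree as ET

# Property names and values copied from the settings listed above.
# Adjust this table if your installation uses different values.
REQUIRED = {
    "dfs.client.read.shortcircuit": "true",
    "dfs.domain.socket.path": "/var/run/hadoop-hdfs/dn._PORT",
    "dfs.client.file-block-storage-locations.timeout": "3000",
    "dfs.datanode.hdfs-blocks-metadata.enabled": "true",
}


def parse_properties(xml_text):
    """Return {name: value} for every <property> in a Hadoop-style config."""
    root = ET.fromstring(xml_text)
    props = {}
    for prop in root.iter("property"):
        name = prop.findtext("name")
        if name is not None:
            # A missing or empty <value> is recorded as an empty string.
            props[name.strip()] = (prop.findtext("value") or "").strip()
    return props


def find_problems(xml_text, required=REQUIRED):
    """Return a list of problems; an empty list means every setting matches."""
    props = parse_properties(xml_text)
    problems = []
    for name, expected in required.items():
        actual = props.get(name)
        if actual is None:
            problems.append(f"missing: {name}")
        elif actual != expected:
            problems.append(f"{name}: expected {expected!r}, found {actual!r}")
    return problems
```

To use it, read the contents of the copy of hdfs-site.xml in /etc/impala/conf (and the copy on each DataNode) and pass them to find_problems; any returned entry names a setting that is missing or differs from the list. The check covers only the file contents, not the directory group or the presence of libhadoop.so.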