Important files in Hadoop

We will look at some important files that we need to consider. We will also see why these files are important in Hadoop. The following is the list of files:

  • Hadoop-default files: The following is the list of files that are shipped with Hadoop, which contains the default settings. Users must not change settings inside this file; rather, the changes must be done in site-specific files, which are listed in Hadoop overridden files.
    • mapred-default.xml
    • core-default.xml
    • hdfs-default.xml
    • yarn-default.xml
    • httpfs-default.xml
  • Hadoop configuration files to override default values: These are the files that are site specific or the values inside these files override the default parameters. So, changes must be made or new parameters must be added to these files:
    • mapred-site.xml
    • core-site.xml
    • hdfs-site.xml
    • yarn-site.xml
    • httpfs-site.xml
  • Hadoop configuration files to specify runtime parameters: These are the runtime files that provide Java-related settings and memory and daemon-related settings:
    • hadoop-env.sh
    • httpfs-env.sh
    • mapred-env.sh
    • yarn-env.sh

We can make changes in site-specific and env files. In env files, we need to add Java settings, JVM-related settings, Java path, Hadoopconfdir, log directory, and some other settings.

..................Content has been hidden....................

You can't read the all page of ebook, please click here login for view all page.
Reset
18.216.88.54