Managing data lakes

A data lake is a central store that collects large volumes of potentially valuable data from multiple sources. Some sources are IoT devices; others are internal company systems that hold production, purchasing, or customer service records. The idea is to put all of this varied data in one place so that it can be accessed through a unified interface. With Hadoop, the data lake would be stored in HDFS and probably accessed through Hive or Spark.
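To make the "one place, one interface" idea concrete, here is a minimal sketch in plain Python. It is only a toy stand-in: a temporary local directory plays the role of HDFS, and a single reader function plays the role that Hive or Spark would play in a real deployment. The folder names, file names, field names, and the `land_raw_files` and `read_lake` helpers are all hypothetical and chosen for illustration.

```python
import csv
import json
import tempfile
from pathlib import Path


def land_raw_files(lake_root: Path) -> None:
    """Write two hypothetical source feeds into the lake in their native formats."""
    iot_dir = lake_root / "iot"
    purchasing_dir = lake_root / "purchasing"
    iot_dir.mkdir(parents=True)
    purchasing_dir.mkdir(parents=True)

    # Assumed IoT feed: one JSON object per line.
    readings = [
        {"device_id": "sensor-1", "temp_c": 21.5},
        {"device_id": "sensor-2", "temp_c": 19.0},
    ]
    with open(iot_dir / "readings.jsonl", "w", encoding="utf-8") as f:
        for reading in readings:
            f.write(json.dumps(reading) + "\n")

    # Assumed internal export: a CSV file of purchase orders.
    with open(purchasing_dir / "orders.csv", "w", newline="", encoding="utf-8") as f:
        writer = csv.writer(f)
        writer.writerow(["order_id", "amount"])
        writer.writerow(["A-100", "250.00"])


def read_lake(lake_root: Path):
    """Yield every record in the lake as a dict tagged with its source folder."""
    for path in sorted(lake_root.rglob("*")):
        if path.suffix == ".jsonl":
            with open(path, encoding="utf-8") as f:
                for line in f:
                    yield {"source": path.parent.name, **json.loads(line)}
        elif path.suffix == ".csv":
            with open(path, newline="", encoding="utf-8") as f:
                for row in csv.DictReader(f):
                    yield {"source": path.parent.name, **row}


def demo() -> list:
    """Build a throwaway lake, read it back through the single interface."""
    with tempfile.TemporaryDirectory() as tmp:
        lake_root = Path(tmp)
        land_raw_files(lake_root)
        return list(read_lake(lake_root))


if __name__ == "__main__":
    for record in demo():
        print(record)
```

Note that the files are stored as they arrive and are only interpreted when read. CSV values come back as strings, so any typing or cleanup is left to the consumer of the data.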
