A beginner's guide to learning and implementing Amazon EMR for building data analytics solutions
Sakti Mishra
BIRMINGHAM—MUMBAI
Copyright © 2022 Packt Publishing
All rights reserved. No part of this book may be reproduced, stored in a retrieval system, or transmitted in any form or by any means, without the prior written permission of the publisher, except in the case of brief quotations embedded in critical articles or reviews.
Every effort has been made in the preparation of this book to ensure the accuracy of the information presented. However, the information contained in this book is sold without warranty, either express or implied. Neither the author, nor Packt Publishing or its dealers and distributors, will be held liable for any damages caused or alleged to have been caused directly or indirectly by this book.
Packt Publishing has endeavored to provide trademark information about all of the companies and products mentioned in this book by the appropriate use of capitals. However, Packt Publishing cannot guarantee the accuracy of this information.
Group Product Manager: Sunith Shetty
Publishing Product Manager: Reshma Raman
Senior Editor: Tazeen Shaikh
Content Development Editor: Shreya Moharir
Technical Editor: Devanshi Ayare
Copy Editor: Safis Editing
Project Coordinator: Aparna Nair
Proofreader: Safis Editing
Indexer: Sejal Dsilva
Production Designer: Nilesh Mohite
Marketing Coordinator: Priyanka Mhatre
First published: March 2022
Production reference: 1170222
Published by Packt Publishing Ltd.
Livery Place
35 Livery Street
Birmingham
B3 2PB, UK.
ISBN 978-1-80107-107-9
Sakti Mishra is an engineer, architect, author, and technology leader with over 16 years of experience in the IT industry. He is currently working as a senior data lab architect at Amazon Web Services (AWS).
He is passionate about technologies and has expertise in big data, analytics, machine learning, artificial intelligence, graph networks, web/mobile applications, and cloud technologies such as AWS and Google Cloud Platform.
Sakti has a bachelor's degree in engineering and a master's degree in business administration. He holds several certifications in Hadoop, Spark, AWS, and Google Cloud. He has also authored multiple technology blogs, workshops, and white papers, and is a public speaker who represents AWS across various domains and events.
Suvojit Dasgupta is a senior data architect with AWS, focusing on data engineering and analytics. In his 17 years of experience, he has led multiple strategic initiatives to design, build, migrate, modernize, and operate petabyte-scale data platforms for Fortune 500 companies. He is passionate about data architecture and takes pride in building well-architected solutions. In his free time, he likes to explore new technologies and listen to audio books. You can follow Suvojit on Twitter at @suvojitdasgupta.
Praveen Gupta is currently a data engineering manager with AWS and has over 17 years of experience in the IT industry. Praveen started his career as an ETL/reporting developer working on traditional RDBMSs and reporting tools. Since 2014, he has been working on the AWS cloud on projects related to data science/machine learning and building complex data engineering pipelines on AWS. He specializes in data ingestion, big data processing, reporting, and building massive data warehouses at the petabyte scale for his customers, helping them make data-driven decisions. Praveen has an undergraduate degree and a master's degree, both in computer science from UIUC, USA. Praveen lives in Portland, USA, with his wife and 8-year-old daughter.