290 | Big Data Simplied
What are the analyses that you are not
getting today and what do you need to
get those analyses done? Do you need Big
Data technology to get there or is it simply
that you need to reshuffle and get different
results from technologies you already have
in place today?
You need to decide on the positioning and
use of your existing BI platform.
Likewise, does Hadoop or NoSQL fit in
and where? You may be able to use these
technologies in combination with Hadoop
or you may be able to use these technolo-
gies instead of Hadoop.
Once you have decided what combina-
tion of Hadoop and BI tools or those SQL
technologies you want to use, determine
whether you want to provision those
on-premises or in the Cloud or probably,
adopt a hybrid approach by combining the
two.
You will also want to weigh the personnel
and economic factors involved. You will
have to correlate your technical choices
with the skill set availability in your
organization and the amount of investment
you have already put into the development
of those skill sets over the years.
Think about all these aspects very carefully
and eventually, you will figure out whether
you need to simply re-architect the kinds
of solutions you have built around existing
technologies or whether you need to bring
in newer technologies, like NoSQL and/or
Hadoop into your organization.
Your next step is to determine a pilot proj-
ect, to execute that pilot, and to reassess
your answers to all the questions post the
pilot project. Refine your answers and then
go forward and implement that project in
a production scale and capacity. Learn
from your mistakes and learn from your
successes.
By putting all these things together and by
balancing that with your now-significant
and rather comprehensive knowledge of
what Big Data is about and what the major
technologies and who the major players are,
I hope you will be in a very good position
to optimally leverage the power of data.
Short-answer Type Questions (5 Marks Questions)
1. How does a Big Data strategy save operat-
ing costs?
2. How can you design a Big Data program
for enhancing value?
3. What is a data warehouse?
4. What do you understand by a data lake?
5. What are the symptoms of bad data for an
enterprise?
6. What are some of the database choices for
Big Data?
7. How does Hive and Sqoop help in integrat-
ing a Big Data Lake with existing systems?
8. Give examples of tooling used in a Big Data
programme.
M11 Big Data Simplified XXXX 01.indd 290 5/13/2019 9:57:46 PM
..................Content has been hidden....................

You can't read the all page of ebook, please click here login for view all page.
Reset
18.227.111.197