Chapter 1
In This Chapter
Understanding what big data is and why it’s so important
Taking a peek at the different types of data
Putting big data to work in your business
Big data has been making big headlines over the last couple of years, but it’s much more than just a buzz phrase or the latest business fad. The phenomenon is very real and is producing concrete benefits in so many different areas – from business to medical research to national security.
In this chapter, I look at how this phenomenon is transforming the way you do business. I also look at what sorts of data are available these days and introduce my step-by-step processes for using big data in business.
Personally, I don’t love the term big data because I think it places far too much emphasis on the sheer volume of data, when, as I talk about in this chapter, what you do with the data is much more important than how much of it you have. I have a feeling the term will gradually disappear and what’s now called big data will, in the future, just be known as data.
Given all the hype around big data, it’s no surprise that market researchers Gartner found in 2014 that 73 per cent of businesses have already invested in a big data plan or are planning to do so in the next few years.
The online behemoths that have come to dominate business in the Internet age – Google, Facebook, Amazon, you know the ones – all base their business models on big data. It’s by collecting and analysing huge amounts of information from us that they’re able to determine precisely what we want. The data also enables them to sell advertising services capable of precisely targeting their clients’ preferred demographics.
But big data isn’t just for giant corporations, it matters to every company – no matter how small or traditional. To cater for this huge demand, many companies have sprung up to offer services to other businesses, enabling them to launch big data initiatives of their own.
Of course, data collection itself isn’t new. But technological advances like chip and sensor technology, the Internet, cloud computing and the ability to store and analyse data have changed the quantity of data you can collect.
And the amount of data being generated every day is staggering. For example, users of Facebook upload around one billion pieces of content to the social network site every day. In industry, machinery and vehicles are fitted with sensors and trackers that record their every move, and whenever you call a call centre, an audio recording of your conversation is made and stored in a huge digital database.
Eventually, every aspect of your lives will be affected by big data. However, there are some areas where big data is already making a real difference today – in business and in other areas. Let’s look at the main areas where big data is most widely used right now.
Big data might seem like it’s something that only big business can make use of. When people first hear that massive volumes of information are being used to fight terrorism, cure cancer or predict the spread of Ebola, it sounds expensive, difficult and time-consuming. But that doesn’t have to be the case.
Huge datasets on everything from demographics to weather and consumer spending habits are freely available online for small businesses to use. Plus, the basic tools to make sense of the data are also free and becoming increasingly simple for anyone to use. For example, if you’re using Google’s AdWords to track what your customers are searching for online, you’re engaging in big data analysis, even if you don’t know it.
Plenty of small businesses are already using big data to better understand and target customers. Retailers can predict what products will sell, car insurance companies can understand how well their customers actually drive and detect potential fraud and takeaway companies can tailor their services to meet local customer preferences and demand. Social media has become a particularly valuable source of data for understanding customers, trends and markets.
Big data can also help improve business processes. Retailers are able to optimise their stock levels based on what’s trending on social media, what people are searching for on the web or even weather forecasts. Supply chains can be optimised so that delivery drivers use less gas and reach customers faster. And you can use data to understand and improve staff engagement or improve your hiring process.
There’s more detail on the many big data uses in Chapter 3 – and there are examples dotted throughout the book. Just look out for the Example icon.
The first thing to understand is that data in itself isn’t a new business phenomenon. Business data is as old as, well, business itself. Just think of sales and financial ledgers or, in more recent history, customer databases. It’s specifically big data that’s the new phenomenon. But, as I mention at the start of the chapter, big data isn’t just about how big it is. In fact, volume is just one of the key defining factors of big data.
To understand big data, and what separates it from normal data, you need to understand four main factors, which all handily start with a V. It’s these Vs that define what’s really special about big data, why it’s different to regular data and why it’s so transformative for businesses. You can find more information on the Vs in Chapter 2.
The four Vs are:
I think there are three main reasons why big data is in the news so much these days:
I look at each of these reasons in Chapter 2.
Another exciting aspect of big data is that it’s only going to get bigger and more widely used. As the tools to collect and analyse data become less and less expensive and more and more accessible, we’ll develop more and more uses for it – everything from smart yoga mats (no, really) to better healthcare tools and a more effective police force.
It may seem like big data has exploded onto the business scene out of nowhere. But in fact it’s been a more like a gradual evolution: from dusty archive rooms to the microfiche to databases and on to data centres. I think it’s part of human nature to want to continually gather information and make sense of what’s going on around us – we’ve just developed sleeker technology for it over the years.
There’s more detail on the technology changes that underpin big data in Chapter 6. But here’s a short overview of these three advances.
Distributed computing gives you greater storage capacity, but it also allows you to connect data faster than ever before. With data being spread across many different locations, you need to be able to bring that data back together quickly. This is where faster networks come into play.
These massive increases in storage and computing power make number crunching possible on a very large scale. Without faster networks that connect data sets together for analysis, big data just wouldn’t be possible for the average business. Now you can break up the analysis of data into manageable chunks, meaning that no one machine has to bear the whole load. This makes analysis faster and far more efficient – and cheaper.
It used to be that data would fit neatly into tables, spreadsheets and databases: think of data like sales figures, customer records, wholesale prices and so on. But now you can look at all sorts of data – including emails, Facebook posts, photos, blog comments and voice recordings – and extract meaning.
In this section I give an overview of the different types of data, but you can find more detail (and some great examples) in Chapters 4 and 5.
There are two main types of data: structured and unstructured.
Structured data has three main things going for it: it’s usually cheap to use, it’s easy to store, and it’s easy to mine for information. But, on the downside, it represents only a small proportion of all the data available these days – as the digital traces you leave behind get bigger and bigger, only a small amount of this data is structured in format.
Another downside is that structured data is simply less rich in insights than unstructured data, meaning it can be more difficult (maybe even impossible) to really understand what’s going on if you’re using only structured data. For best results, structured data often needs to be paired with other data to get a fuller picture. For example, structured data can tell you that hits on your website increased 20 per cent last month, but you need other forms of data to explore why that happened.
Unstructured and semi-structured data tends to be much more difficult to store (not least because so darn much of it is created every day). Now, thanks to massive increases in storage capacities and the ability to tag and categorise this data, as well as huge leaps in analytical technologies, you can finally make use of this data.
The advantages of unstructured and semi-structured data are that there’s absolutely loads of it (it accounts for around 80 per cent of all business-relevant data being generated today), and it providers a richer picture than structured data. However, it’s harder to store and more difficult to analyse, which makes it more expensive to work with.
The beauty of internal data is that it’s cheap (or maybe even free) and, as you own the data, there are no access issues to deal with. But, the downsides include having to maintain and secure the data (especially if it includes personal data). You may also find that internal data on its own doesn’t provide enough information to meet your strategic goals and you may need to supplement it with external data.
External data is powerful because it gives you access to information that’s often more up to date and richer than any information you could gather yourself. And, as it’s someone else’s data, you have the added bonus of not worrying about the security and data protection issues. But, the obvious downside is that you don’t own the data, and you usually have to pay for access (although not always – check out Chapter 15 for some great free data sources).
You leave more and more digital traces of your activities than ever before. If you think about what you’ve done today so far, most of those activities have left some digital trace (data) that can and is being collected and analysed. Some of the data you can now collect is new; some has been around a while but we’ve only just found ways to really analyse it.
Some of the exciting new types of data include:
There are three keys areas of decision making that relate to big data: one is pulling out insights from the data and using that information to guide your decision making, another is deciding how to build your big data skills and competencies, and the final aspect relates to infrastructure decisions. I look at each in turn in the next sections.
In today’s competitive business world, success often comes down to a company’s ability to learn faster than the competition and act on what they learn faster than the competition. The process of turning data into insights and actionable knowledge is the key to that success.
A key part of this process is making sure the right information is delivered to the right people at the right time. In order to aid decision making and ensure the necessary action is taken, insights need to be presented in a clear, concise and interesting way. People are less likely to act if they have to work hard to understand what the data is telling them.
Data and insights can also feed into the machines in your company, as well as your people. This applies to any machine or technology that’s a key part of how the business operates on a day-to-day basis, such as stock control systems or machinery on a production line. Connecting data and machines allows businesses to increase efficiency, improve product quality, cut costs and much more. Processes and systems can also be connected with data, so that you can improve how you do things based on what the data shows.
There’s more on focusing on insights and feeding data to your people, machines and systems in Chapter 7.
There’s currently a skill shortage in big data, meaning there’s more demand for big data experts than there are available experts. This can make it hard for smaller businesses to recruit good data staff. But there are alternatives to hiring in-house staff. You can try training up your existing people, working with external data providers (of which there are now many, big and small) and partnering with other organisations, such as universities.
Whether you want to hire new people or boost your existing skills, I think there are six key skills required to successfully use big data in business:
There’s more on these skills in Chapter 8.
If data is going to be a key part of your business, then it’s a good idea to consider hiring a data scientist to work in-house. The six skills I list are a good starting point when you’re searching for the right person, and I also list some helpful recruitment questions in Chapter 8. If you don’t have any tech experience at all then recruiting in the tech field can seem daunting – with these questions and by focusing on the core skills, you’ll be able to assess candidates with confidence.
Your ultimate goal is to gather insights which will lead to better decision making and improved business performance. In order to do this, you’ll need to invest in some tools or services.
I explore the main options for each element in Chapter 9, along with some of the most commonly used software packages.
The first step is to assess your existing infrastructure so, for each of these four elements, you need to consider what related technology or resources you already have in-house and how they might need to be improved or supplemented. For example, you may already be collecting useful customer data through your website or customer service department but don’t yet have the analytics in place to work with that data.
In the furore about big data it’s easy to forget that it’s all just data at the end of the day. There may be more of it than ever before, and there may be new forms of data but that data is still only really useful if you can use it to answer your strategic business questions and improve the way you do business.
Implementing a data strategy in an intelligent, structured way is what differentiates a big data-driven enterprise from one that is simply using data on an ad hoc basis. And the basics are no different for a small, agile and growing company than they are for the tech industry giants who have been using big data for years.
After all, most small companies don’t want to stay small, right? Data analysis can lead to big things for small business – but it’s much more likely to happen if you go about it in a smart way. Therefore, every company, big or small, in any industry needs a solid big data strategy. Chapter 10 takes you through the process of developing a big data strategy and making a business case for using big data.
You need to know what questions you need answers to before you dive into data. Focusing on strategic questions allows you to forget about big data and focus on smart data instead. By working out exactly what you need to know, you can hone in on the data that you really need.
I believe data should be at the heart of strategic decision making in all businesses, from Fortune 500 companies to the local taxi firm. Data can provide insights that help you answer your key business questions such as ‘How can I improve customer satisfaction?’ or ‘How can I improve staff retention?’.
By using data to make better decisions, you can improve the customer experience, increase employee satisfaction, enhance your business performance and gain competitive advantage. You can solve problems and react to opportunities. The power of data is in how you use it.
Identify your unanswered strategic questions.
For example, what do you need to know in order to meet your goals?
Chapter 11 walks you through this process in more detail.
After you do all that you can with the data and communicate your insights to the key people in the company, it’s time to review the evidence so that everyone in the business can move toward more fact-based decision making and leverage data to meet your objectives.
Many businesses start by using data to inform their decision making, and this remains the most widespread way that businesses use data. But data can also integrate very successfully into your daily business operations.
Thanks to new technologies, you’re seeing more businesses successfully integrate data and algorithms into their everyday operational processes. Increased connectivity, especially the Internet of Things (which I talk about in Chapter 5), has played a big role in this change. Just imagine how having your systems – production, stock control, distribution and security systems – all connected and talking to each other could make your business more efficient.
In Chapter 12 I set out an eight-step process for using data to transform your business operations. In brief, the steps are:
Data can also lead to bigger changes in your business – it could even lead to you reshaping your business model. Many companies are now using their data to create new income streams (for example, by selling data back to clients). For some, it’s even resulted in a complete change in business model. There’s more on this in Chapter 13.
Thanks to big data, the world is getting smarter every day. Businesses that don’t embrace the data revolution run the risk of being left behind. Those that do embrace it can become smarter, more efficient and much more competitive.
3.147.85.175