Descriptive Analytics—Overview, Applications, and a Case
Chapter Highlights
Overview: Descriptive Analytics
Descriptive analytics tools are used to understand the occurrence of certain business phenomenon or outcomes and explaining these outcomes through graphical, quantitative, and numerical analyses. Through the visual and simple analysis, descriptive analytics explores the current performance of the business and the possible reasons for the occurrence of certain phenomenon. Many of the hidden patterns and features not apparent through mere examination of data can be exposed through graphical and numerical analyses. Descriptive analytics uses simple tools to uncover many of the problems quickly and easily. The results enable us question many of the outcomes so that corrective actions can be taken.
Successful use and implementation of descriptive analytics require the understanding of data, types and sources of data, data preparation for analysis (data cleansing, transformation, and modeling), difference between unstructured and structured data, and data quality. Graphical/visual representation of data and graphical techniques using computer are basic requirements of descriptive analytics. These concepts related to data, data types, and the graphical and visual techniques are explained in detail in this chapter. The visual techniques of descriptive analytics tools include the commonly used graphs and charts along with some newly developed graphical tools such as bullet graphs, tree maps, and data dashboards. Dashboards are now becoming very popular with big data. They are used to display the multiple views of business data graphically.
The other aspect of descriptive analytics is an understanding of simple numerical methods, including the measures of central tendency, measures of position, measures of variation, and measures of shape, and how different measures and statistics are used to draw conclusions and make decision from the data. Some other topics of interest are the understanding of empirical rule and the relationship between two variables—the covariance and correlation coefficient. The tools of descriptive analytics are helpful in understanding the data, identifying the trend or patterns in the data, and making sense from the data contained in the databases of companies. The understanding of databases, data warehouse, web search and query, and big data concepts are important in extracting and applying descriptive analytics tools. The flow chart in Figure 4.1 outlines the tools and methods used in descriptive analytics.
Figure 4.1 Tools and methods of descriptive analytics
Descriptive Analytics Applications: A Business Analytics Case
A case analysis showing different aspects of descriptive analytics is presented here. The case demonstrates the graphical and numerical analyses performed in an online order database of a retail store and is described below.
Case Study: Buying Pattern of Online Customers in a Large Department Store
The data file “Case-Online Orders.xlsx” contains data on 500 customer orders. The data were collected over a period of several days from the customers placing orders online. As the orders are placed, customer information is recorded in the database. Data on several categorical and numerical values are recorded. The categorical variables shown in the data file are day of the week, time (morning, midday), payment type (credit, debit cards, etc.), region of the country order was placed from, order volume, sale or promotion item, free shipping offer, gender, and customer survey rating. The quantitative variables include order quantity and the dollar value of the order placed or “Total Orders.” Table 4.1 shows the part of the data.
Table 4.1 Partial data online orders
The operations manager of the store wants to understand the buying pattern of the customers by summarizing and displaying the data visually and numerically. He believes that using the descriptive analytics tools including the data visualization tools, numerical methods, graphical displays, dashboards, and tables of collected data can be created to gain more insight into the online order process. They will also provide opportunities for improving the process.
The manager hired an intern and gave her the responsibility to prepare a descriptive analytics summary of the customer data using graphical and numerical tools that can help understand the buying pattern of the customers and help improve the online order process to attract more online customers to the store.
The intern was familiar with one of the tools available in EXCEL—the Pivot Table/Pivot Chart that she thought can be used in extracting information from a large database. In this case, the pivot tables can help break the data down by categories so that useful insight can be obtained. For example, this tool can create a table of orders received by the geographical region or summarize the orders by the day or time of the week. She performed analyses on the data to answer the questions and concerns the manager expressed in the meeting. As part of the analysis, the following graphs, tables, and numerical analyses were performed.
Figure 4.2 Number of orders by day
Figure 4.3 Number and percent of orders
Table 4.2 Number of orders by time
Row Labels |
Count of Time |
Afternoon |
112 |
Evening |
65 |
Late afternoon |
20 |
Midday |
92 |
Morning |
33 |
Night |
178 |
Grand total |
500 |
Figure 4.4 Plot of number of orders by time
Figure 4.5 Number of orders by region
Figure 4.6 Percent of orders by region
Table 4.3 Customer ratings
Count of Customer Survey Rating |
Column Labels |
||||
Row Labels |
Excellent |
Fair |
Good |
Poor |
Grand Total |
Female |
25 |
48 |
45 |
38 |
156 |
Male |
89 |
62 |
110 |
83 |
344 |
Grand total |
114 |
110 |
155 |
121 |
500 |
Figure 4.7 Customer ratings by gender
Table 4.4 Descriptive statistics of total orders
Figure 4.8 Graphical summary of the total orders data
A symmetrical or bell-shaped data that is characterized by a normal distribution is often used to draw conclusion by combining the mean and standard deviation. If the data can be approximated by a normal distribution, an empirical rule applies. For our case data, if we can assume that the “total orders” data is approximately symmetrical, we can draw the following conclusions relating the mean and standard deviation of the “total orders” data that were calculated in part (5).
Conclusions using Empirical Rule are shown in Table 4.5.
Table 4.5 Conclusions using the empirical rule
Approximately 68 percent of the orders are between the mean and ± 1 standard deviation, or, – 1s = (223.87 – 1(84.23)) = (139.64 – 308.10) or between $139.64 to $308.10 |
Approximately 95 percent of the orders are between the mean and ± 2 standard deviation, or, – 2s = (223.87 – 2(84.23)) = (55.41.64 – 392.33) or between $55.41 to $392.33 |
Approximately 99.7 percent of the orders are between the mean and ± 3 standard deviation, or, – 3s = (223.87 – 3(84.23)) = (–28.82 – 476.56) or between $0 to $476.56 |
The mean and standard deviation of total orders are:
Variable: Total Orders ($): Mean: = 223.87
Standard Deviation: s = 84.23
Figure 4.9 A dashboard of online orders data
The type of analytics that goes beyond descriptive analytics is predictive analytics. Applying the tools of descriptive analytics enables one to gain insight and learn from the data. These tools help to understand what has happened in the past and is very helpful in predicting future business outcome. Predictive analytics tools help answer these questions. The rest of the book explores predictive analytics tools and applications.
Summary
In this chapter, we provided a brief description of descriptive analytics and a case to illustrate the tools and applications of visual techniques used in descriptive analytics. The descriptive analytics is critical in studying the current state of the business and to learn what has happened in the past using the company’s data. The knowledge from the descriptive analytics lays a foundation for further analysis and leads to predictive analytics. As mentioned, the knowledge obtained by descriptive analytics helps us to learn what has happened in the past. This information is used to create predictive analytics models.
The subsequent chapters discuss the predictive analytics and background information needed for predictive analytics along with the analytical tools. Specific predictive analytics models and their applications are the topics of chapters that follow The rest of this book covers mostly predictive analytics.
3.128.30.193