Chapter 13 Applying regression to track trends and make forecasts

Search in book...
Toggle Font Controls
Create new playlist

Name your new playlist

Playlist description (optional)
Sign In

Email address

Password

Forgot Password?

or

Continue with Facebook

Continue with Google
Sign Up

Full Name

Email address

Confirm Email Address

Password

or

Continue with Facebook

Continue with Google

Chapter 13 Applying regression to track trends and make forecasts

In this chapter, you will:

Understand regression analysis
Learn how to use simple regression on linear data
Plot and calculate best-fit trendlines
Plot and calculate forecasted values
Learn how to use simple regression on nonlinear data
Understand and use multiple regression analysis

In these complex and uncertain times, forecasting business performance is increasingly important. Today, more than ever before, managers at all levels need to make intelligent predictions of future sales and profit trends as part of their overall business strategy. By forecasting sales six months, a year, or even three years down the road, managers can anticipate related needs such as employee acquisitions, warehouse space, and raw material requirements. Similarly, a profit forecast enables a company to plan for its future expansion.

Business forecasting has been around for many years, and various methods have been developed—some of them more successful than others. The most common forecasting method is the qualitative “seat of the pants” approach, in which a manager (or a group of managers) estimates future trends based on experience and knowledge of the market. This method, however, suffers from an inherent subjectivity and a short-term focus because many managers tend to extrapolate from recent experience and ignore the long-term trend. Other methods (such as averaging past results) are more objective but generally are useful for forecasting only a few months in advance.

This chapter presents a technique called regression analysis. Regression is a powerful statistical procedure that has become a popular business tool. In its general form, you use regression analysis to determine the relationship between one phenomenon and another. For example, car sales might be dependent on interest rates, and units sold might be dependent on the amount spent on advertising. The dependent phenomenon is called the dependent variable, or the y-value, and the phenomenon upon which it’s dependent is called the independent variable, or the x-value. (Think of a chart or graph on which the independent variable is plotted along the horizontal [x] axis and the dependent variable is plotted along the vertical [y] axis.)

Given these variables, you can do two things with regression analysis:

Determine the relationship between the known x- and y-values and use the results to calculate and visualize the overall trend of the data.
Use the existing trend to forecast new y-values.

As you’ll see in this chapter, Excel is well stocked with tools that enable you to both calculate the current trend and make forecasts no matter what type of data you’re dealing with.

Choosing a regression method

Three methods of regression analysis are used most often in business:

Simple regression: Use this type of regression when you’re dealing with only one independent variable. For example, if the dependent variable is car sales, the independent variable might be interest rates. You also need to decide whether your data is linear or nonlinear:
- Linear means that if you plot the data on a chart, the resulting data points resemble (roughly) a line.
- Nonlinear means that if you plot the data on a chart, the resulting data points form a curve.
Polynomial regression: Use this type of regression when you’re dealing with only one independent variable, but the data fluctuates in such a way that the pattern in the data doesn’t resemble either a straight line or a simple curve.
Multiple regression: Use this type of regression when you’re dealing with more than one independent variable. For example, if the dependent variable is car sales, the independent variables might be interest rates and disposable income.

You’ll learn about all three methods in this chapter.

Using simple regression on linear data

With linear data, the dependent variable is related to the independent variable by some constant factor. For example, you might find that car sales (the dependent variable) increase by 1 million units whenever interest rates (the independent variable) decrease by 1%. Similarly, you might find that division revenue (the dependent variable) increases by $100,000 for every $10,000 you spend on advertising (the independent variable).

Analyzing trends using best-fit lines

You make these sorts of determinations by examining the trend underlying the current data you have for the dependent variable. In linear regression, you analyze the current trend by calculating the line of best fit, or the trendline. This is a line through the data points for which the differences between the points above and below the line cancel each other out (more or less).

Note

Excel includes a tool called Forecast Sheet that simplifies many of the tasks and calculations that I present in this chapter. To use it, select your data, select the Data tab, and then select Forecast Sheet. This opens the Create Forecast Worksheet dialog box, which shows a basic best-fit trendline. You can select Options and change the default settings to gain a bit more control over the result. Select Create to add the new forecasting worksheet.

Plotting a best-fit trendline

The easiest way to see the best-fit line is to use a chart. Note, however, that this works only if your data is plotted using an XY (scatter) chart. For example, Figure 13-1 shows a worksheet with quarterly sales figures plotted on an XY chart. Here, the quarterly sales data is the dependent variable, and the period is the independent variable. (In this example, the independent variable is just time, represented, in this case, by fiscal quarters.) You can add a trendline through the plotted points.

The figure shows an Excel worksheet with quarterly sales data plotted as an XY chart. — **FIGURE 13-1** To see a trendline through your data, first make sure the data is plotted using an XY chart.

The following steps show you how to add a trendline to a chart:

Select the chart and, if more than one data series are plotted, select the series you want to work with.
Select Design > Add Chart Element > Trendline > More Trendline Options. Excel displays the Format Trendline task pane, shown in Figure 13-2.

FIGURE 13-2 In the Format Trendline task pane, use the Trendline Options tab to select the type of trendline you want to see.
On the Trendline Options tab, select Linear.
Select the Display Equation On Chart check box. (See the next section, “Understanding the regression equation.”)
Select the Display R-Squared Value On Chart check box. (See “Understanding R²,” later in this chapter.)
Select Close (X). Excel inserts the trendline. Note that you might need to drag the regression equation text box away from the trendline to read the values.

Figure 13-3 shows the best-fit trendline added to the chart.

The figure shows an Excel worksheet with the quarterly sales chart that includes a best-fit trendline. — **FIGURE 13-3** This quarterly sales chart has an added best-fit trendline.

Understanding the regression equation

In the steps outlined in the previous section, I instructed you to select the Display Equation On Chart check box. Doing this displays the regression equation on the chart, as pointed out in Figure 13-3. This equation is crucial to regression analysis because it gives you a specific formula for the relationship between the dependent variable and the independent variable.

For linear regression, the best-fit trendline is a straight line with an equation that takes the following form:

y = mx + b

Here’s how you can interpret this equation with respect to the quarterly sales data:

y	This is the dependent variable, so it represents the trendline value (quarterly sales) for a specific period.
x	This is the independent variable, which, in this example, is the period (quarter) you’re working with.
m	This is the slope of the trendline. In other words, it’s the amount by which the sales increase per period, according to the trendline.
b	This is the y-intercept, which means that it’s the starting value for the trend.

Here’s the regression equation for the example (refer to Figure 13-3):

y = 1407.6x + 259800

To determine the first point on the trendline, substitute 1 for x:

y = 1407.6 * 1 + 259800

The result is 261,207.6.

Caution

It’s important not to view the trendline values as somehow trying to predict or estimate the actual y-values (sales). The trendline simply gives you an overall picture of how the y-values change when the x-values change.

Understanding R²

When you select the Display R-Squared Value On Chart check box when adding a trendline, Excel places the following on the chart:

R² = n

Here, n is called the coefficient of determination (which statisticians abbreviate as r², and Excel abbreviates as R²). This is actually the square of the correlation; as I discuss in Chapter 12, “Building inferential statistical formulas,” the correlation tells you something about how well two things are related to each other. In this context, R² gives you some idea of how well the trendline fits the data. Roughly, it tells you the proportion of the variance in the dependent variable that is associated with the independent variable. Generally, the closer the result is to 1, the better the fit. Values below about 0.7 mean that the trendline is not a very good fit for the data.

Tip

If you don’t get a good fit with a linear trendline, your data might not be linear. Try using a different trendline type to see if you can increase the value of R².

You’ll see in the next section that it’s possible to calculate values for the best-fit trendline. Having those values enables you to calculate the correlation between the known y-values and the generated trend values by using the CORREL() function, which I describe in Chapter 12. Note that squaring the CORREL() result gives you the value of R².

Calculating best-fit values using `TREND()`

The problem with using a chart best-fit trendline is that you don’t get actual values to work with. If you want to get some values on the worksheet, you can calculate individual trendline values using the regression equation. However, what if the underlying data changes? For example, those values might be estimates, or they might change as more accurate data comes in. In that case, you need to delete the existing trendline, add a new one, and then recalculate the trend values based on the new equation.

If you need to work with worksheet trend values, you can avoid having to perform repeated trendline analyses by calculating the values using Excel’s TREND() function:

Array Location	Statistic	Description
Row 1, column 1	m	The slope of the trendline
Row 1, column 2	b	The y-intercept of the trendline
Row 2, column 1	se	The standard error value for m
Row 2, column 2	seb	The standard error value for b
Row 3, column 1	R²	The coefficient of determination
Row 3, column 2	sey	The standard error value for the y estimate
Row 4, column 1	F	The F statistic
Row 4, column 2	df	The degrees of freedom
Row 5, column 1	ssreg	The regression sum of squares
Row 5, column 2	ssresid	The residual sum of squares

Table of Contents for Chapter 13 Applying regression to track trends and make forecasts

Create new playlist

Sign In

Sign Up

Chapter 13

Applying regression to track trends and make forecasts

Choosing a regression method

Using simple regression on linear data

Analyzing trends using best-fit lines

Plotting a best-fit trendline

Understanding the regression equation

Understanding R2

Calculating best-fit values using TREND()

Calculating best-fit values using LINEST()

Analyzing the sales versus advertising trend

Making forecasts

Plotting forecasted values

Extending a linear trend with the fill handle

Extending a linear trend using the Series command

Forecasting with the regression equation

Forecasting with TREND()

Forecasting with LINEST()

Using simple regression on nonlinear data

Working with an exponential trend

Plotting an exponential trendline

Calculating exponential trend and forecast values

Exponential trending and forecasting using the GROWTH() function

Working with a logarithmic trend

Plotting a logarithmic trendline

Calculating logarithmic trend and forecast values

Working with a power trend

Plotting a power trendline

Calculating power trend and forecast values

Using polynomial regression analysis

Plotting a polynomial trendline

Calculating polynomial trend and forecast values

Using multiple regression analysis

Table of Contents for
Chapter 13 Applying regression to track trends and make forecasts

Understanding R²

Calculating best-fit values using `TREND()`

Calculating best-fit values using `LINEST()`

Forecasting with `TREND()`

Forecasting with `LINEST()`