Like principal component analysis, factor analysis is one of the commonly used dimension reduction methods. It is a statistical technique widely used to explain an m‐dimensional vector with a few underlying factors. After introducing different methods to derive the factors, we will illustrate the method with empirical examples. We will also discuss its use in forecasting.
Just like principal component analysis, the purpose of factor analysis is to approximate the covariance relationships among a set of variables. Specifically, it is used to describe the covariance relationships for many variables in terms of a relatively few underlying factors, which are unobservable random quantities. The concept was developed by researchers in the field of psychometrics in the early twentieth century. It has since become a commonly used statistical method in many areas.
Given a weakly stationary m‐dimensional random vector at time t, Zt = [Z1,t, Z2,t, …, Zm,t]′, with mean μ = (μ1, μ2, …, μm)′ and covariance matrix Γ, the factor model assumes that Zt depends on a small number k of unobservable factors, Fj,t, j = 1, 2, …, k, known as common factors, and m additional noises εi,t, i = 1, 2, …, m, also known as specific factors, that is,

$$Z_{i,t} = \mu_i + \ell_{i,1}F_{1,t} + \ell_{i,2}F_{2,t} + \cdots + \ell_{i,k}F_{k,t} + \varepsilon_{i,t}, \quad i = 1, 2, \ldots, m.$$
More compactly, we can write the system in the following matrix form,

$$Z_t = \mu + LF_t + \varepsilon_t, \tag{5.2}$$
where $F_t = (F_{1,t}, F_{2,t}, \ldots, F_{k,t})'$ is a (k × 1) vector of factors at time t, L = [ℓi,j] is an (m × k) loading matrix, with ℓi,j the loading of the ith variable on the jth factor, i = 1, 2, …, m, j = 1, 2, …, k, and εt = (ε1,t, ε2,t, …, εm,t)′ is an (m × 1) vector of noises, with E(εt) = 0 and

$$\mathrm{Cov}(\varepsilon_t) = E(\varepsilon_t\varepsilon_t') = \Sigma = \mathrm{diag}(\sigma_1^2, \sigma_2^2, \ldots, \sigma_m^2).$$
The factor model in Eq. (5.2) is an orthogonal factor model if it satisfies the following assumptions:

$$E(F_t) = 0, \quad \mathrm{Cov}(F_t) = E(F_tF_t') = I_k, \quad \text{and} \quad \mathrm{Cov}(F_t, \varepsilon_t) = E(F_t\varepsilon_t') = 0.$$
It follows from Eq. (5.2) that the covariance structure of Zt is

$$\Gamma = \mathrm{Cov}(Z_t) = E[(Z_t - \mu)(Z_t - \mu)'] = LL' + \Sigma.$$
The model in Eq. (5.2) shows that the m‐dimensional process Zt is linearly related to the k common factors. More specifically,

$$Z_{i,t} - \mu_i = \ell_{i,1}F_{1,t} + \ell_{i,2}F_{2,t} + \cdots + \ell_{i,k}F_{k,t} + \varepsilon_{i,t},$$

which implies that

$$\mathrm{Var}(Z_{i,t}) = \gamma_{i,i} = \ell_{i,1}^2 + \ell_{i,2}^2 + \cdots + \ell_{i,k}^2 + \sigma_i^2.$$

Also,

$$\mathrm{Cov}(Z_{i,t}, F_{j,t}) = \ell_{i,j}.$$

So the variance of the ith variable Zi,t is the sum of the part due to the k common factors, $h_i^2 = \ell_{i,1}^2 + \cdots + \ell_{i,k}^2$, known as the ith communality, and the part due to the ith specific factor, $\sigma_i^2$, known as the ith specific variance.
Given observations Zt = (Z1,t, Z2,t, …, Zm,t)′, for t = 1, 2, …, n, and its m × m sample covariance matrix $\hat{\Gamma}$, a natural method of estimation is simply to use the principal component analysis introduced in Chapter 4 and choose k, much less than m, common factors from the first k largest eigenvalue‐eigenvector pairs $(\hat{\lambda}_1, \hat{e}_1), (\hat{\lambda}_2, \hat{e}_2), \ldots, (\hat{\lambda}_k, \hat{e}_k)$ of $\hat{\Gamma}$, with $\hat{\lambda}_1 \geq \hat{\lambda}_2 \geq \cdots \geq \hat{\lambda}_k$. Let

$$\hat{L} = \left[\sqrt{\hat{\lambda}_1}\,\hat{e}_1, \sqrt{\hat{\lambda}_2}\,\hat{e}_2, \ldots, \sqrt{\hat{\lambda}_k}\,\hat{e}_k\right]$$

be the estimate of L. Then the estimated specific variances are obtained from the diagonal elements of $\hat{\Gamma} - \hat{L}\hat{L}'$, that is,

$$\hat{\Sigma} = \mathrm{diag}(\hat{\sigma}_1^2, \hat{\sigma}_2^2, \ldots, \hat{\sigma}_m^2),$$

with the ith specific variance estimate being

$$\hat{\sigma}_i^2 = \hat{\gamma}_{i,i} - \sum_{j=1}^{k}\hat{\ell}_{i,j}^2,$$

where the sum of squares $\hat{h}_i^2 = \sum_{j=1}^{k}\hat{\ell}_{i,j}^2$ is the estimate of the ith communality.
The contribution of the first common factor to the total sample variance is given by

$$\frac{\sum_{i=1}^{m}\hat{\ell}_{i,1}^2}{\hat{\gamma}_{1,1} + \hat{\gamma}_{2,2} + \cdots + \hat{\gamma}_{m,m}} = \frac{\hat{\lambda}_1}{\hat{\gamma}_{1,1} + \hat{\gamma}_{2,2} + \cdots + \hat{\gamma}_{m,m}},$$

where we note that the eigenvectors in the principal component analysis are standardized to unit length. More generally, if we let Pj be the proportion of the jth common factor to the total sample variance, we have

$$P_j = \frac{\hat{\lambda}_j}{\hat{\gamma}_{1,1} + \hat{\gamma}_{2,2} + \cdots + \hat{\gamma}_{m,m}}, \quad j = 1, 2, \ldots, k,$$

and it can be used to determine the desired number of common factors.
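To make the principal component method concrete, the following is a minimal Python sketch of these estimates, assuming only a generic (n × m) data array Z; the function name and arguments are ours, not from any particular package.

```python
import numpy as np

def pc_factor_estimates(Z, k, standardize=True):
    """Principal-component estimates of a k-factor model.

    Z: (n, m) data array; k: number of common factors. With
    standardize=True, the sample correlation matrix is used
    (i.e. standardized variables); otherwise the covariance matrix.
    """
    Z = np.asarray(Z, dtype=float)
    S = np.corrcoef(Z, rowvar=False) if standardize else np.cov(Z, rowvar=False)
    # Eigenvalue-eigenvector pairs, sorted so lambda_1 >= lambda_2 >= ...
    eigvals, eigvecs = np.linalg.eigh(S)
    order = np.argsort(eigvals)[::-1]
    eigvals, eigvecs = eigvals[order], eigvecs[:, order]
    # L-hat = [sqrt(lambda_1) e_1, ..., sqrt(lambda_k) e_k]
    L_hat = eigvecs[:, :k] * np.sqrt(eigvals[:k])
    # Communalities, specific variances, and variance proportions P_j
    communality = (L_hat ** 2).sum(axis=1)
    specific_var = np.diag(S) - communality
    proportion = eigvals[:k] / np.trace(S)
    return L_hat, communality, specific_var, proportion
```

The cumulative sum of `proportion` corresponds to the cumulative percentage rows reported in Tables 5.6 and 5.7 and can guide the choice of k.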
In practice, it is important to check carefully whether the units used in the component variables Zi,t, i = 1, 2, …, m, are comparable. If not, to avoid improper influence of the units, we can standardize these variables, that is,

$$U_t = D^{-1/2}(Z_t - \bar{Z}),$$

where D is the diagonal matrix whose ith diagonal element is the sample variance of Zi,t, that is,

$$D = \mathrm{diag}(\hat{\gamma}_{1,1}, \hat{\gamma}_{2,2}, \ldots, \hat{\gamma}_{m,m}).$$
Then, we perform the principal component estimation method on the sample covariance matrix of the standardized observations Ut, t = 1, 2, …, n, which is actually the sample correlation matrix of the original variables Zt, t = 1, 2, …, n.
Again, the results from the sample covariance matrix and the sample correlation matrix may not be the same, and the choice between them depends on the application.
When the common factors Ft and the specific factors εt can be assumed to be normally distributed, Zt, t = 1, 2, …, n, follows a multivariate normal distribution N(μ, Γ), and we can apply the maximum likelihood method to its likelihood function,

$$L(\mu, \Gamma) = \prod_{t=1}^{n}\frac{1}{(2\pi)^{m/2}|\Gamma|^{1/2}}\exp\!\left[-\frac{1}{2}(Z_t - \mu)'\Gamma^{-1}(Z_t - \mu)\right], \tag{5.15}$$
which is actually a function of L and Σ through the relationship Γ = LL′ + Σ, and μ is estimated by the sample mean $\bar{Z}$. However, because Γ = LL′ + Σ = LΦΦ′L′ + Σ for any k × k orthogonal matrix Φ, to make Eq. (5.15) well defined, we also impose the following uniqueness condition,

$$L'\Sigma^{-1}L = \Delta, \ \text{a } k \times k \text{ diagonal matrix}. \tag{5.16}$$
The maximum likelihood estimates $\hat{L}$ and $\hat{\Sigma}$ can be obtained with numerical methods using statistical software such as EViews, MATLAB, MINITAB, R, SAS, and SPSS. However, the useful patterns of the factor loadings from the maximum likelihood solution obtained under the imposed uniqueness condition may not be clear until the factors are rotated, which will be discussed later.
The maximum likelihood estimate of the ith communality is

$$\hat{h}_i^2 = \hat{\ell}_{i,1}^2 + \hat{\ell}_{i,2}^2 + \cdots + \hat{\ell}_{i,k}^2,$$

and the proportion of the contribution of the jth common factor to the total sample variance is given by

$$P_j = \frac{\sum_{i=1}^{m}\hat{\ell}_{i,j}^2}{\hat{\gamma}_{1,1} + \hat{\gamma}_{2,2} + \cdots + \hat{\gamma}_{m,m}}.$$
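For maximum likelihood fitting in practice, one option is scikit-learn's FactorAnalysis, which fits the normal factor model by an EM algorithm; the sketch below uses hypothetical data and standardizes first so the fit is on the correlation scale.

```python
import numpy as np
from sklearn.decomposition import FactorAnalysis

# Hypothetical (n, m) data matrix standing in for the observed series
rng = np.random.default_rng(0)
Z = rng.normal(size=(100, 8))

# Standardize so the analysis is based on the correlation matrix
U = (Z - Z.mean(axis=0)) / Z.std(axis=0, ddof=1)

k = 2
fa = FactorAnalysis(n_components=k).fit(U)
L_hat = fa.components_.T                 # (m, k) estimated loading matrix
sigma2_hat = fa.noise_variance_          # (m,) estimated specific variances
communality = (L_hat ** 2).sum(axis=1)   # ML estimates of the communalities
```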
To build a k‐common‐factor model for an m‐dimensional process, under the normal assumption, we can test the adequacy of the model with the following null hypothesis,

$$H_0: \Gamma = LL' + \Sigma,$$

versus

$$H_1: \Gamma \ \text{is any other positive definite matrix}.$$
To test the null hypothesis, we apply the likelihood ratio test

$$\Lambda = \frac{\max_{\Gamma = LL' + \Sigma} L(\mu, \Gamma)}{\max_{\Gamma} L(\mu, \Gamma)}.$$

It is well known that

$$-2\ln\Lambda = n\ln\!\left(\frac{|\hat{L}\hat{L}' + \hat{\Sigma}|}{|\hat{\Gamma}_n|}\right) \tag{5.22}$$

follows approximately the chi‐square distribution with degrees of freedom

$$\frac{m^2 + m}{2} - \left[(mk + m) - \frac{k^2 - k}{2}\right] = \frac{1}{2}\left[(m - k)^2 - m - k\right],$$

where we note that

$$\hat{\Gamma}_n = \frac{1}{n}\sum_{t=1}^{n}(Z_t - \bar{Z})(Z_t - \bar{Z})'$$

is the MLE of Γ without any restriction, which has [(m² + m)/2] free parameters, and $\hat{L}\hat{L}' + \hat{\Sigma}$ is the MLE of Γ under the restriction of the factor model, which has (mk + m) parameters but with [(k² − k)/2] constraints from the uniqueness condition that $\hat{L}'\hat{\Sigma}^{-1}\hat{L}$ be a k × k diagonal matrix, as given in Eq. (5.16). In practice, the test statistic in Eq. (5.22) is replaced by the Bartlett‐corrected test statistic,

$$\left(n - 1 - \frac{2m + 4k + 5}{6}\right)\ln\!\left(\frac{|\hat{L}\hat{L}' + \hat{\Sigma}|}{|\hat{\Gamma}_n|}\right), \tag{5.26}$$
which gives a better approximation to the chi‐square distribution, as shown by Bartlett (1954). Hence, the null hypothesis is rejected at significance level α if

$$\left(n - 1 - \frac{2m + 4k + 5}{6}\right)\ln\!\left(\frac{|\hat{L}\hat{L}' + \hat{\Sigma}|}{|\hat{\Gamma}_n|}\right) > \chi^2_{[(m-k)^2 - m - k]/2}(\alpha).$$

Because the degrees of freedom must be positive, we also require that

$$k < \frac{1}{2}\left(2m + 1 - \sqrt{8m + 1}\right).$$
When the correlation matrix ρ is used for factor analysis, the test statistic in Eq. (5.26) becomes

$$\left(n - 1 - \frac{2m + 4k + 5}{6}\right)\ln\!\left(\frac{|\hat{L}\hat{L}' + \hat{\Sigma}|}{|\hat{\rho}_n|}\right),$$

with exactly the same degrees of freedom. The reason is that the elements and computation of ρ are based on the covariance matrix Γ, so the number of its free parameters is exactly the same as that of Γ, namely (m² + m)/2 in the m‐dimensional case.
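The Bartlett‐corrected test is straightforward to compute directly; the following sketch (function name ours) takes the unrestricted estimate, the fitted loadings and specific variances, and the sample size, and returns the statistic, degrees of freedom, and p‐value.

```python
import numpy as np
from scipy import stats

def bartlett_lr_test(S, L, psi, n):
    """Bartlett-corrected LR test of H0: Gamma = L L' + Sigma.

    S: (m, m) unrestricted MLE of the covariance (or correlation) matrix;
    L: (m, k) fitted loadings; psi: (m,) fitted specific variances;
    n: sample size.
    """
    m, k = L.shape
    Gamma0 = L @ L.T + np.diag(psi)          # restricted MLE of Gamma
    correction = n - 1 - (2 * m + 4 * k + 5) / 6.0
    stat = correction * (np.log(np.linalg.det(Gamma0))
                         - np.log(np.linalg.det(S)))
    df = ((m - k) ** 2 - m - k) / 2.0        # must be positive
    return stat, df, stats.chi2.sf(stat, df)
```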
It should be noted that, regardless of which method is used in factor analysis, from Eq. (5.2), for any k × k orthogonal matrix Φ, we have

$$Z_t = \mu + LF_t + \varepsilon_t = \mu + (L\Phi)(\Phi'F_t) + \varepsilon_t = \mu + L^*F_t^* + \varepsilon_t,$$

where

$$L^* = L\Phi \quad \text{and} \quad F_t^* = \Phi'F_t.$$

As a result, there are multiple choices for L through orthogonal transformations, each of which is equivalent to rotating the common factors in the m‐dimensional space. This leads researchers to search for a new set of factor loadings so that the resulting common factors have easier and nicer interpretations. The methods are commonly known as factor rotations.
Let $F_t^* = \Phi'F_t$ be the rotated factors and $\hat{L}^* = \hat{L}\Phi$ be the rotated matrix of factor loadings. One of the most widely used orthogonal rotation methods is the varimax proposed by Kaiser (1958), which finds an orthogonal transformation Φ to maximize the sum of the variances of the squared loadings:

$$V = \frac{1}{m}\sum_{j=1}^{k}\left[\sum_{i=1}^{m}\tilde{\ell}_{i,j}^{*4} - \frac{1}{m}\left(\sum_{i=1}^{m}\tilde{\ell}_{i,j}^{*2}\right)^{2}\right],$$

where

$$\tilde{\ell}_{i,j}^{*} = \frac{\hat{\ell}_{i,j}^{*}}{\hat{h}_i}$$

are the rotated coefficients scaled by the square roots of the communalities. The maximum of V is achieved when any given variable has a high loading on a single factor but nearly zero loadings on the remaining factors, or when any given factor is formed by only a few variables with very high loadings while the remaining variables have nearly zero loadings on this factor. This is a widely used method for orthogonal rotation, with all factors remaining uncorrelated. After the orthogonal transformation is determined, we multiply the rotated loadings $\tilde{\ell}_{i,j}^{*}$ by $\hat{h}_i$ so that the original communalities are preserved.
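A compact way to see the mechanics is the classical SVD‐based varimax iteration; this sketch follows the standard published algorithm rather than any particular package, and the function name is ours.

```python
import numpy as np

def varimax(L, normalize=True, max_iter=500, tol=1e-6):
    """Varimax rotation of an (m x k) loading matrix L.

    With normalize=True, rows are scaled by the square roots of their
    communalities before rotation (Kaiser normalization) and rescaled
    afterwards, so the original communalities are preserved.
    """
    L = np.asarray(L, dtype=float)
    m, k = L.shape
    h = np.sqrt((L ** 2).sum(axis=1)) if normalize else np.ones(m)
    A = L / h[:, None]                 # scaled loadings
    R = np.eye(k)                      # accumulated orthogonal rotation
    d_old = 0.0
    for _ in range(max_iter):
        B = A @ R
        # SVD-based update for the varimax criterion (Kaiser 1958)
        G = A.T @ (B ** 3 - B @ np.diag((B ** 2).sum(axis=0)) / m)
        u, s, vt = np.linalg.svd(G)
        R = u @ vt
        d = s.sum()
        if d_old != 0 and d < d_old * (1 + tol):
            break
        d_old = d
    return (A @ R) * h[:, None], R     # rotated loadings, rotation matrix
```

Newer versions of scikit-learn's FactorAnalysis also accept a varimax rotation option, implementing the same criterion.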
Orthogonal rotation methods like varimax assume that the factors in the analysis are independent. On the other hand, some researchers believe that the purpose of factor rotation is to achieve a simple structure with a new set of factor loadings so that the resulting common factors have simpler and nicer interpretations, and hence one should relax the independence assumption for the factors. The resulting methods are known as oblique rotations. Just like orthogonal rotations, oblique rotations take many different forms; see Carroll (1953, 1957) and Jennrich and Sampson (1966).
Although we introduce the rotation concept here, rotations are also used in principal component analysis (PCA). Many factor rotations are available, and they are implemented in statistical software such as EViews, MATLAB, MINITAB, R, SAS, and SPSS. We will not spend more time on the various factor rotations. Instead, we point out that the rationale of factor rotation is to simplify the factor structure for easier interpretation, for which Thurstone (1947) suggested a set of simple‐structure criteria.
Once the factor analysis is completed and the parameters are estimated, we have $\hat{L}$ and $\hat{\Sigma}$. By treating these estimates and their elements, such as the common factor loadings and the variances of the specific factors, as if they were the true values, we can estimate the values of the unobserved random factor vector Ft, known as factor scores, from Eq. (5.2), which we repeat as follows:

$$Z_t - \mu = LF_t + \varepsilon_t,$$
where we regard the specific factors as errors. Because the variances of εi,t, i = 1, 2, …, m, can be unequal, Bartlett (1937) suggested using weighted least squares to estimate the common factor values, that is, choosing the estimate $\hat{F}_t$ to minimize the following weighted sum of squares of the errors,

$$\sum_{i=1}^{m}\frac{\varepsilon_{i,t}^2}{\sigma_i^2} = \varepsilon_t'\Sigma^{-1}\varepsilon_t = (Z_t - \mu - LF_t)'\Sigma^{-1}(Z_t - \mu - LF_t).$$
The solution, from Chapter 3, is easily seen to be

$$\hat{F}_t = (L'\Sigma^{-1}L)^{-1}L'\Sigma^{-1}(Z_t - \mu).$$

Thus, treating $\hat{L}$ and $\hat{\Sigma}$ as the true values, the factor scores for time t are obtained as

$$\hat{F}_t = (\hat{L}'\hat{\Sigma}^{-1}\hat{L})^{-1}\hat{L}'\hat{\Sigma}^{-1}(Z_t - \bar{Z}), \quad t = 1, 2, \ldots, n.$$
When the factor model is obtained through the standardized variables, that is, through the correlation matrix, it becomes

$$\hat{F}_t = (\hat{L}'\hat{\Sigma}^{-1}\hat{L})^{-1}\hat{L}'\hat{\Sigma}^{-1}U_t, \qquad U_t = D^{-1/2}(Z_t - \bar{Z}).$$
When $\hat{L}$ and $\hat{\Sigma}$ are determined by the MLE, we have

$$\hat{F}_t = \hat{\Delta}^{-1}\hat{L}'\hat{\Sigma}^{-1}(Z_t - \bar{Z}),$$

where $\hat{\Delta} = \hat{L}'\hat{\Sigma}^{-1}\hat{L}$ is the diagonal matrix from Eq. (5.16). If the factor model is obtained through the standardized variables, then

$$\hat{F}_t = \hat{\Delta}^{-1}\hat{L}'\hat{\Sigma}^{-1}U_t.$$
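Because Σ is diagonal, the weighted least squares (Bartlett) scores reduce to a few matrix operations; a minimal sketch, with names of our choosing:

```python
import numpy as np

def bartlett_scores(Z, L, psi, mean=None):
    """Weighted-least-squares (Bartlett) factor scores:
    F-hat_t = (L' Sigma^{-1} L)^{-1} L' Sigma^{-1} (Z_t - mean),
    treating the estimated L and Sigma = diag(psi) as true values.
    """
    Z = np.asarray(Z, dtype=float)
    psi = np.asarray(psi, dtype=float)
    mean = Z.mean(axis=0) if mean is None else mean
    W = L / psi[:, None]                  # Sigma^{-1} L (Sigma is diagonal)
    M = np.linalg.solve(L.T @ W, W.T)     # (L' Sigma^{-1} L)^{-1} L' Sigma^{-1}
    return (Z - mean) @ M.T               # (n, k) matrix of factor scores
```

For a model fitted on standardized variables, pass the standardized data Ut in place of Z.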
In many applications, we can consider a factor model where the factors Ft are observable, with observed factor scores. For example, in economic and financial studies, a commonly used factor model is one in which macroeconomic variables and market indices, such as the inflation rate, the industrial production index, employment and unemployment rates, interest rates, the S&P 500 Index, the Dow Jones Industrial Average (DJIA), and the Consumer Price Index (CPI), are used as factors, and they are observable.
When the factors are observable, the factor loadings can be estimated using both Zt and Ft. Specifically, letting $\beta^{(i)} = (\mu_i, \ell_{i,1}, \ldots, \ell_{i,k})'$ collect the intercept and the loadings of the ith variable, we can rewrite the model in Eq. (5.2) or (5.38) for each i as

$$Z_{i,t} = \mu_i + \ell_{i,1}F_{1,t} + \cdots + \ell_{i,k}F_{k,t} + \varepsilon_{i,t}.$$

Thus, for t = 1, 2, …, n, the whole system of the factor model can be expressed as the multivariate multiple time series regression,

$$Z^{(i)} = F\beta^{(i)} + \xi^{(i)}, \quad i = 1, 2, \ldots, m, \tag{5.47}$$

where

$$Z^{(i)} = \begin{bmatrix} Z_{i,1} \\ Z_{i,2} \\ \vdots \\ Z_{i,n} \end{bmatrix}, \qquad F = \begin{bmatrix} 1 & F_{1,1} & \cdots & F_{k,1} \\ 1 & F_{1,2} & \cdots & F_{k,2} \\ \vdots & \vdots & & \vdots \\ 1 & F_{1,n} & \cdots & F_{k,n} \end{bmatrix},$$

and

$$\xi^{(i)} = (\varepsilon_{i,1}, \varepsilon_{i,2}, \ldots, \varepsilon_{i,n})'.$$

Each ξ(i) follows an n‐dimensional multivariate normal distribution $N(0, \sigma_i^2 I_n)$, i = 1, …, m, and ξ(i) and ξ(j) are uncorrelated if i ≠ j.
Equation (5.47) implies that

$$E(Z^{(i)}) = F\beta^{(i)}, \qquad \mathrm{Cov}(Z^{(i)}) = \sigma_i^2 I_n,$$

which is the standard matrix form of the multiple regression model, and the least squares estimate of β(i) is given by

$$\hat{\beta}^{(i)} = (F'F)^{-1}F'Z^{(i)}.$$

Hence,

$$\hat{B} = [\hat{\beta}^{(1)}, \hat{\beta}^{(2)}, \ldots, \hat{\beta}^{(m)}] = (F'F)^{-1}F'[Z^{(1)}, Z^{(2)}, \ldots, Z^{(m)}].$$

The residuals of Eq. (5.47) are

$$\hat{\xi}^{(i)} = Z^{(i)} - F\hat{\beta}^{(i)}, \quad i = 1, 2, \ldots, m.$$

Under the assumption given in Eq. (5.2), the estimate of the covariance matrix of εt is

$$\hat{\Sigma} = \mathrm{diag}\!\left(\frac{1}{n - k - 1}\,[\hat{\xi}^{(1)}, \ldots, \hat{\xi}^{(m)}]'[\hat{\xi}^{(1)}, \ldots, \hat{\xi}^{(m)}]\right),$$

where diag(A) represents the diagonal matrix consisting of the diagonal elements of the matrix A.
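The following sketch carries out exactly this series‐by‐series least squares fit, stacked into one call; the function name and the appended constant column are our assumptions.

```python
import numpy as np

def fit_observable_factor_model(Z, F):
    """OLS estimates of the loadings when the factors are observable.

    Z: (n, m) series; F: (n, k) observed factors. Each column of Z is
    regressed on (1, F_t); the specific variances are estimated from
    the diagonal of the residual covariance.
    """
    n, m = Z.shape
    X = np.column_stack([np.ones(n), F])       # (n, k+1) regressor matrix
    B, *_ = np.linalg.lstsq(X, Z, rcond=None)  # (k+1, m): one beta per series
    resid = Z - X @ B
    Sigma_hat = np.diag(np.diag(resid.T @ resid) / (n - X.shape[1]))
    return B[0], B[1:].T, Sigma_hat            # mu-hat, L-hat (m, k), Sigma-hat
```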
In time series applications, given an m‐dimensional time series, it is natural to build a factor model with a lag operator and a time series model on the factors, as shown in the following,

$$Z_t = \mu + L(B)F_t + \varepsilon_t, \qquad \Phi(B)F_t = u_t,$$

or equivalently,

$$Z_t = \mu + L(B)\Phi^{-1}(B)u_t + \varepsilon_t,$$

where $L(B) = L_0 + L_1B + \cdots + L_sB^s$ is a matrix polynomial of loadings in the backshift operator B, Φ(B) is the matrix autoregressive polynomial of the factor process, all time series are assumed to be stationary, and ut is a k‐dimensional zero‐mean white noise process independent of εt. The model is known as the dynamic factor model. It was first introduced by Geweke (1977) and has been widely used in practice. Once values of the factors are obtained, we can combine the lagged values of Zt or other observed variables in the model and build a model for the h‐step‐ahead forecast of Zt+h, that is,

$$\hat{Z}_{t+h} = \hat{L}\hat{F}_t + \hat{B}X_t, \tag{5.57}$$

where Xt is an m × 1 vector of lagged values of Zt and/or other observed variables.
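As a concrete and deliberately simple version of Eq. (5.57), the sketch below fits per‐series h‐step forecasting regressions in which Xt is taken to be each series' own current value; the function name and this choice of regressors are assumptions for illustration.

```python
import numpy as np

def fit_h_step_forecast(Z, F, h=1):
    """Per-series h-step forecasting regressions Z_{i,t+h} ~ (F_t, Z_{i,t}).

    Z: (n, m) series; F: (n, k) estimated factor scores. Returns the
    factor-coefficient matrix (m, k) and the own-lag coefficients (m,).
    """
    n, m = Z.shape
    k = F.shape[1]
    L_hat = np.empty((m, k))
    beta_hat = np.empty(m)
    for i in range(m):
        X = np.column_stack([F[: n - h], Z[: n - h, i]])  # regressors at t
        y = Z[h:, i]                                      # target at t + h
        b, *_ = np.linalg.lstsq(X, y, rcond=None)
        L_hat[i], beta_hat[i] = b[:k], b[k]
    return L_hat, beta_hat
```

The forecast for series i is then $\hat{Z}_{i,t+h} = \hat{\ell}^{(i)}\hat{F}_t + \hat{\beta}_i Z_{i,t}$, the form used in the empirical example below.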
We now consider a data set that contains yearly STD morbidity rates reported to the National Center for HIV/AIDS, Viral Hepatitis, STD, and TB Prevention (NCHHSTP) of the Centers for Disease Control and Prevention (CDC) from 1984 to 2013. The data set was retrieved from the CDC's website and includes the 50 states plus D.C. The rates per 100,000 persons are calculated as the incidence of STD reports divided by the population and multiplied by 100,000.

For the analysis, we remove the data from the following states due to missing values: Montana, North Dakota, South Dakota, Vermont, Wyoming, Alaska, and Hawaii. Hence, the dimension of the series Xt is m = 44, with n = 30 observations. The data set is known as WW8c in the Data Appendix, and its plot is shown in Figure 5.3.
We first difference the series, Zt = (1 − B)Xt; the differenced data are plotted in Figure 5.4. The analysis is based on the differenced data. We use the first 24 data points for model fitting and the remaining observations for evaluating forecasting performance.
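In code, this preparation step is only a few lines; the array X below is a hypothetical stand‐in for the WW8c data.

```python
import numpy as np

# Hypothetical stand-in for the (30, 44) WW8c data matrix
rng = np.random.default_rng(1)
X = rng.normal(size=(30, 44)).cumsum(axis=0)

Z = np.diff(X, axis=0)           # Z_t = (1 - B) X_t, shape (29, 44)
Z_fit, Z_hold = Z[:24], Z[24:]   # first 24 points for fitting, rest held out

# Standardize using the fitting sample, since the analysis below is
# based on standardized (differenced) variables
mu, sd = Z_fit.mean(axis=0), Z_fit.std(axis=0, ddof=1)
Z_fit_std = (Z_fit - mu) / sd
```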
As discussed in Chapter 4, PCA can be based on a covariance matrix or a correlation matrix. For a high‐dimensional case, the printout of a covariance or correlation matrix is tedious. Since the correlation matrix is simply the covariance matrix of the standardized variables, instead of saying that PCA is based on a covariance matrix or a correlation matrix, we will simply specify whether PCA is based on unstandardized or standardized variables.
We first perform PCA on the standardized Zt. The screeplot in Figure 5.5 shows that the first six principal components explain most of the variance, so six components will be enough. The first six components from the sample PCA for the standardized variables are given in Table 5.6.
Table 5.6 Sample PCA result for standardized variables.
State | Comp. 1 | Comp. 2 | Comp. 3 | Comp. 4 | Comp. 5 | Comp. 6 |
CT | −0.127 | −0.215 | 0.200 | −0.046 | −0.193 | 0.191 |
ME | 0.008 | 0.051 | −0.048 | −0.188 | −0.123 | 0.213 |
MA | −0.196 | −0.176 | −0.071 | −0.144 | 0.063 | −0.064 |
NH | −0.082 | 0.023 | −0.058 | 0.095 | −0.329 | −0.223 |
RI | −0.141 | 0.055 | 0.212 | 0.138 | 0.081 | 0.230 |
NJ | −0.214 | −0.155 | 0.026 | −0.216 | 0.020 | 0.089 |
NY | −0.233 | −0.168 | −0.033 | 0.074 | −0.076 | −0.015 |
DE | −0.137 | −0.031 | 0.217 | 0.067 | −0.336 | −0.070 |
DC | −0.250 | −0.131 | −0.007 | 0.014 | 0.039 | −0.155 |
MD | −0.170 | −0.168 | −0.011 | −0.068 | 0.208 | −0.134 |
PA | −0.223 | −0.127 | 0.075 | −0.138 | −0.201 | 0.004 |
VA | −0.213 | 0.021 | 0.009 | 0.113 | 0.204 | 0.254 |
WV | −0.099 | 0.086 | 0.229 | 0.234 | 0.013 | 0.315 |
AL | −0.237 | −0.042 | 0.060 | −0.083 | 0.204 | −0.025 |
FL | 0 | −0.283 | −0.077 | 0.152 | −0.251 | 0.134 |
GA | −0.208 | −0.183 | 0.122 | −0.027 | −0.101 | 0.044 |
KY | −0.049 | 0.124 | −0.264 | −0.106 | −0.081 | 0.229 |
MS | −0.096 | 0.179 | −0.174 | −0.128 | −0.061 | 0.101 |
NC | −0.170 | 0.135 | −0.049 | 0.076 | −0.166 | −0.196 |
SC | −0.203 | 0.113 | 0.043 | 0.228 | 0.162 | −0.006 |
TN | −0.223 | −0.096 | −0.022 | −0.255 | 0.012 | 0.004 |
IL | −0.212 | 0.178 | 0.007 | 0.105 | 0.003 | −0.149 |
IN | −0.031 | 0.206 | −0.171 | −0.090 | −0.208 | 0.021 |
MI | −0.245 | 0.010 | 0.054 | 0.115 | 0.141 | −0.075 |
MN | −0.119 | 0.070 | −0.110 | 0.008 | −0.213 | 0.063 |
OH | −0.111 | 0.217 | −0.217 | −0.115 | −0.058 | 0.143 |
WI | −0.199 | 0.160 | −0.089 | 0.025 | 0.089 | 0.045 |
AR | −0.199 | 0.164 | −0.003 | 0.144 | 0.006 | −0.194 |
LA | −0.227 | 0.142 | −0.121 | −0.053 | 0.015 | 0.040 |
NM | 0.029 | −0.127 | −0.292 | 0.110 | 0.160 | −0.001 |
OK | −0.101 | 0.071 | −0.178 | −0.009 | −0.179 | −0.331 |
TX | −0.232 | −0.031 | −0.072 | −0.063 | 0.190 | 0.069 |
IA | −0.098 | 0.056 | −0.176 | 0.006 | 0.074 | 0.222 |
KS | −0.129 | 0.175 | 0.087 | 0.308 | −0.054 | −0.125 |
MO | −0.061 | 0.236 | −0.227 | −0.009 | −0.198 | 0.064 |
NE | −0.053 | 0.070 | 0.154 | 0.242 | −0.100 | 0.178 |
CO | 0.017 | 0.034 | −0.289 | 0.070 | 0.111 | 0.212 |
UT | 0.007 | −0.063 | −0.185 | 0.283 | 0.067 | 0.086 |
AZ | −0.073 | −0.227 | −0.174 | −0.187 | 0.089 | −0.165 |
CA | −0.003 | −0.245 | −0.219 | 0.178 | −0.162 | 0.043 |
NV | 0.001 | −0.190 | −0.225 | 0.099 | 0.047 | 0.025 |
ID | 0.018 | −0.129 | −0.17 | 0.276 | 0.153 | −0.168 |
OR | 0.045 | −0.202 | −0.203 | 0.327 | −0.079 | 0.009 |
WA | −0.053 | −0.256 | −0.043 | 0.041 | −0.221 | 0.226 |
Variance | 12.128 | 7.218 | 5.228 | 3.478 | 3.192 | 2.561 |
Cumulative Percentage | 27.56 | 43.97 | 55.85 | 63.76 | 71.01 | 76.83 |
Let us look at the plot of the first and second components of these time series in Figure 5.6. It appears that states that are spatially close also tend to be close to each other in the component plot. For example: (i) NJ, NY, PA, DC; (ii) OH, KS, MS, MO; and (iii) CA, OR, NV, WA.
Now let us look at the result when we apply PCA to the unstandardized Zt, which is equivalent to constructing the PCA model based on the covariance matrix of Zt. The screeplot in Figure 5.7 shows that the first three principal components explain most of the variance, so we choose a three‐component PCA model. The first three components from the sample PCA for the unstandardized variables are given in Table 5.7.
Table 5.7 Sample PCA result for unstandardized variables.
State | Comp. 1 | Comp. 2 | Comp. 3 |
CT | 0.064 | 0.115 | −0.016 |
ME | −0.002 | −0.004 | −0.001 |
MA | 0.042 | 0.018 | 0.001 |
NH | 0.005 | −0.001 | −0.004 |
RI | 0.018 | −0.026 | 0.054 |
NJ | 0.067 | 0.007 | −0.018 |
NY | 0.138 | 0.028 | −0.133 |
DE | 0.073 | 0.025 | 0.079 |
DC | 0.900 | 0.098 | 0.098 |
MD | 0.097 | 0.058 | 0.059 |
PA | 0.093 | 0.011 | −0.024 |
VA | 0.040 | −0.043 | −0.003 |
WV | 0.007 | −0.013 | 0.051 |
AL | 0.117 | −0.061 | 0.115 |
FL | 0.054 | 0.396 | −0.482 |
GA | 0.188 | 0.115 | −0.038 |
KY | 0 | −0.029 | −0.045 |
MS | 0.034 | −0.597 | −0.560 |
NC | 0.057 | −0.116 | −0.029 |
SC | 0.081 | −0.131 | 0.110 |
TN | 0.113 | −0.041 | −0.052 |
IL | 0.058 | −0.137 | 0.056 |
IN | −0.004 | −0.047 | −0.030 |
MI | 0.059 | −0.035 | 0.053 |
MN | 0.006 | −0.013 | −0.012 |
OH | 0.007 | −0.091 | −0.057 |
WI | 0.024 | −0.062 | 0.011 |
AR | 0.091 | −0.206 | 0.050 |
LA | 0.182 | −0.451 | −0.094 |
NM | 0.003 | 0.045 | −0.135 |
OK | 0.017 | −0.038 | −0.026 |
TX | 0.079 | −0.057 | −0.030 |
IA | 0.003 | −0.017 | −0.025 |
KS | 0.012 | −0.027 | 0.047 |
MO | −0.001 | −0.146 | −0.120 |
NE | 0 | −0.001 | 0.006 |
CO | −0.004 | −0.010 | −0.042 |
UT | 0.001 | 0.010 | −0.019 |
AZ | 0.040 | 0.055 | −0.069 |
CA | 0.025 | 0.137 | −0.239 |
NV | 0.046 | 0.254 | −0.487 |
ID | 0.004 | 0.019 | −0.015 |
OR | 0.003 | 0.090 | −0.124 |
WA | 0.011 | 0.039 | −0.052 |
Variance | 3472.528 | 717.536 | 332.014 |
Cumulative Percentage | 65.55 | 79.10 | 85.37 |
As shown in Figure 5.8, the plot of the first and second components for the unstandardized variables is very different from the one for the standardized variables. It suggests that the unstandardized data do not produce as clear a separation as the standardized data do, and that PCA on standardized variables is more robust. So, we will use standardized variables for our PCA model.
To build a factor model, we can use either the principal component method or the maximum likelihood estimation method. However, our data set is a vector series with dimension m = 44 and only n = 30 observations, so the maximum likelihood equations cannot be properly solved, and we can only use the principal component method. Thus, we will use PCA to estimate the factors. In this analysis, we follow Bai and Ng (2002) and select six factors, which is supported by the screeplot shown in Figure 5.5.
From the PCA analysis in Section 5.7.1, we choose a factor model with six common factors,

$$Z_t = LF_t + \varepsilon_t,$$

based on the standardized variables. The corresponding factor loadings, communalities, and specific variances are given in Table 5.8. The six estimated factor scores are plotted in Figure 5.9.
Table 5.8 The principal component estimation for the factor model from the STD data set.

State | Factor 1 | Factor 2 | Factor 3 | Factor 4 | Factor 5 | Factor 6 | Communality | Specific variance |
CT | −0.442 | −0.577 | 0.456 | −0.086 | −0.344 | 0.306 | 0.956 | 0.044 |
ME | 0.029 | 0.136 | −0.110 | −0.350 | −0.219 | 0.341 | 0.319 | 0.681 |
MA | −0.683 | −0.473 | −0.163 | −0.269 | 0.113 | −0.102 | 0.812 | 0.188 |
NH | −0.285 | 0.063 | −0.133 | 0.178 | −0.588 | −0.357 | 0.607 | 0.393 |
RI | −0.492 | 0.149 | 0.485 | 0.257 | 0.145 | 0.368 | 0.722 | 0.278 |
NJ | −0.747 | −0.417 | 0.059 | −0.403 | 0.036 | 0.143 | 0.919 | 0.081 |
NY | −0.813 | −0.451 | −0.075 | 0.137 | −0.136 | −0.024 | 0.907 | 0.093 |
DE | −0.479 | −0.082 | 0.495 | 0.124 | −0.601 | −0.113 | 0.870 | 0.130 |
DC | −0.869 | −0.352 | −0.015 | 0.026 | 0.070 | −0.248 | 0.948 | 0.052 |
MD | −0.592 | −0.452 | −0.025 | −0.128 | 0.372 | −0.214 | 0.756 | 0.244 |
PA | −0.778 | −0.342 | 0.172 | −0.258 | −0.359 | 0.007 | 0.947 | 0.053 |
VA | −0.743 | 0.055 | 0.021 | 0.211 | 0.365 | 0.407 | 0.900 | 0.100 |
WV | −0.345 | 0.231 | 0.525 | 0.436 | 0.022 | 0.503 | 0.892 | 0.108 |
AL | −0.825 | −0.112 | 0.137 | −0.155 | 0.364 | −0.039 | 0.870 | 0.130 |
FL | −0.002 | −0.761 | −0.176 | 0.284 | −0.449 | 0.215 | 0.938 | 0.062 |
GA | −0.723 | −0.491 | 0.279 | −0.050 | −0.180 | 0.070 | 0.883 | 0.117 |
KY | −0.172 | 0.332 | −0.602 | −0.197 | −0.144 | 0.366 | 0.697 | 0.303 |
MS | −0.336 | 0.482 | −0.398 | −0.239 | −0.110 | 0.161 | 0.599 | 0.401 |
NC | −0.591 | 0.363 | −0.111 | 0.142 | −0.297 | −0.314 | 0.701 | 0.299 |
SC | −0.709 | 0.304 | 0.098 | 0.425 | 0.289 | −0.010 | 0.868 | 0.132 |
TN | −0.778 | −0.258 | −0.050 | −0.475 | 0.022 | 0.006 | 0.900 | 0.100 |
IL | −0.737 | 0.479 | 0.017 | 0.196 | 0.006 | −0.238 | 0.869 | 0.131 |
IN | −0.108 | 0.553 | −0.391 | −0.168 | −0.372 | 0.033 | 0.637 | 0.363 |
MI | −0.852 | 0.026 | 0.124 | 0.214 | 0.252 | −0.120 | 0.866 | 0.134 |
MN | −0.415 | 0.187 | −0.253 | 0.016 | −0.381 | 0.101 | 0.426 | 0.574 |
OH | −0.385 | 0.584 | −0.495 | −0.215 | −0.104 | 0.230 | 0.844 | 0.156 |
WI | −0.694 | 0.429 | −0.202 | 0.046 | 0.159 | 0.072 | 0.739 | 0.261 |
AR | −0.693 | 0.441 | −0.006 | 0.268 | 0.011 | −0.311 | 0.843 | 0.157 |
LA | −0.791 | 0.380 | −0.276 | −0.099 | 0.027 | 0.065 | 0.862 | 0.138 |
NM | 0.100 | −0.342 | −0.669 | 0.204 | 0.286 | −0.002 | 0.697 | 0.303 |
OK | −0.350 | 0.190 | −0.408 | −0.017 | −0.320 | −0.530 | 0.709 | 0.291 |
TX | −0.808 | −0.083 | −0.164 | −0.118 | 0.340 | 0.110 | 0.829 | 0.171 |
IA | −0.343 | 0.150 | −0.402 | 0.012 | 0.132 | 0.356 | 0.446 | 0.554 |
KS | −0.451 | 0.470 | 0.199 | 0.574 | −0.096 | −0.200 | 0.843 | 0.157 |
MO | −0.212 | 0.635 | −0.520 | −0.017 | −0.354 | 0.102 | 0.855 | 0.145 |
NE | −0.186 | 0.187 | 0.352 | 0.451 | −0.179 | 0.285 | 0.510 | 0.490 |
CO | 0.058 | 0.090 | −0.661 | 0.131 | 0.198 | 0.339 | 0.620 | 0.380 |
UT | 0.026 | −0.170 | −0.424 | 0.528 | 0.119 | 0.137 | 0.521 | 0.479 |
AZ | −0.253 | −0.609 | −0.397 | −0.349 | 0.158 | −0.264 | 0.810 | 0.190 |
CA | −0.010 | −0.658 | −0.501 | 0.331 | −0.289 | 0.069 | 0.882 | 0.118 |
NV | 0.003 | −0.511 | −0.514 | 0.184 | 0.085 | 0.040 | 0.568 | 0.432 |
ID | 0.064 | −0.347 | −0.389 | 0.514 | 0.274 | −0.269 | 0.687 | 0.313 |
OR | 0.157 | −0.543 | −0.464 | 0.609 | −0.140 | 0.014 | 0.926 | 0.074 |
WA | −0.186 | −0.687 | −0.097 | 0.076 | −0.396 | 0.362 | 0.809 | 0.191 |
So, we have our factor model for the STD data set,

$$Z_t = LF_t + \varepsilon_t,$$

where the elements and specification of L, Ft, and εt are given in Table 5.8. As described in Section 5.6, once the values or scores of the factors are estimated, we can combine the lagged values of Zt or other observed variables in the model and build a forecast equation for the h‐step‐ahead forecast of Zt+h, that is,

$$\hat{Z}_{t+h} = \hat{L}\hat{F}_t + \hat{B}V_t,$$

where Vt contains lagged values of Zt and/or other observed variables, and $\hat{L}$ and $\hat{B}$ are the parameter estimates based on Vt, as given in Eq. (5.57).
In this example, we simply choose Vt to be the first lagged value of Zt. Specifically, our forecasting equations will be

$$\hat{Z}_{i,t+h} = \hat{\ell}^{(i)}\hat{F}_t + \hat{\beta}_i Z_{i,t}, \quad i = 1, 2, \ldots, 44,$$

where $\hat{\ell}^{(i)}$ is the ith row of $\hat{L}$ and $\hat{\beta}_i$ is the ith element of $\hat{\beta} = (\hat{\beta}_1, \ldots, \hat{\beta}_{44})'$. For i = 1, we have

$$\hat{Z}_{1,t+h} = \hat{\ell}^{(1)}\hat{F}_t + \hat{\beta}_1 Z_{1,t},$$

and for h = 1, it becomes

$$\hat{Z}_{1,t+1} = \hat{\ell}^{(1)}\hat{F}_t + \hat{\beta}_1 Z_{1,t}.$$
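Numerically, each forecast is just an inner product plus one scalar term; a minimal sketch (names ours), together with the MSFE computation used to summarize Table 5.10:

```python
import numpy as np

def one_step_forecast(l_hat_i, beta_hat_i, F_t, z_it):
    """Z-hat_{i,t+1} = l-hat^(i) F-hat_t + beta-hat_i Z_{i,t}."""
    return l_hat_i @ F_t + beta_hat_i * z_it

# Mean squared forecast error over the m one-step forecast errors
# (the "Error" column of Table 5.10): msfe = np.mean(errors ** 2)
```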
From the estimation results, we have the estimates $\hat{\ell}^{(1)}$ and $\hat{\beta}_1$, and the factor scores $\hat{F}_t$, for t = 1, 2, …, 24, together with Z1,t, are given in Table 5.9.
Table 5.9 The estimated factor scores and standardized Z1,t.
t | Factor 1 | Factor 2 | Factor 3 | Factor 4 | Factor 5 | Factor 6 | t | Z1,t |
1 | 0.754 | 0.729 | 0.745 | 0.872 | −1.280 | 0.629 | 1 | 0.147 |
2 | 1 | 0.745 | 0.338 | −0.181 | 1.160 | −0.686 | 2 | 0.010 |
3 | −1.165 | 6.198 | −2.948 | −2.638 | 3.382 | 0.107 | 3 | 1.067 |
4 | −4.217 | 3.103 | −0.599 | −0.511 | 1.237 | −2.825 | 4 | 2.033 |
5 | −7.161 | −1.663 | 7.076 | 3.145 | 0.621 | −1.730 | 5 | 3.342 |
6 | −9.685 | −0.185 | −4.299 | 1.901 | −3.252 | 3.630 | 6 | −0.734 |
7 | −2.813 | −7.658 | 0.829 | −6.294 | −0.704 | 0.553 | 7 | −2.073 |
8 | 3.543 | −6.566 | −1.992 | 2.611 | 4.981 | 1.335 | 8 | −1.996 |
9 | 4.445 | −2.224 | −4.613 | 2.797 | −2.742 | −2.706 | 9 | −1.102 |
10 | 5.751 | −0.298 | 1.360 | −0.786 | −3.816 | −1.423 | 10 | −0.866 |
11 | 3.709 | 1.050 | 1.099 | 0.630 | 0.403 | 3.209 | 11 | −0.221 |
12 | 3.850 | 3.066 | 3.592 | −0.027 | −1.059 | 2.854 | 12 | 0.310 |
13 | 1.635 | 1.396 | 1.014 | −0.166 | −0.407 | 0.536 | 13 | 0.013 |
14 | 2.646 | 0.108 | 0.792 | −0.176 | 0.169 | 0.647 | 14 | −0.545 |
15 | 0.848 | −0.276 | −0.209 | 0.475 | 0.757 | 0.074 | 15 | −0.153 |
16 | 0.976 | 0.642 | 0.110 | −0.044 | −0.578 | −0.014 | 16 | 0.151 |
17 | −0.379 | 0.595 | 0.538 | 0.131 | −0.044 | −0.359 | 17 | 0.105 |
18 | 0.043 | 0.731 | −0.428 | −1.389 | −0.046 | −0.573 | 18 | 0.140 |
19 | −0.511 | 0.428 | −0.605 | −0.078 | 0.908 | −0.134 | 19 | 0.124 |
20 | 0.371 | 0.566 | −0.164 | −1.795 | −0.571 | −0.642 | 20 | −0.096 |
21 | −0.046 | −0.380 | 0.455 | 0.857 | 0.764 | −0.215 | 21 | 0.043 |
22 | −1.106 | −0.071 | −0.497 | 0.511 | −0.117 | −1.552 | 22 | 0.175 |
23 | −1.171 | 0.221 | 0.035 | 0.340 | 0.145 | 0.600 | 23 | −0.133 |
24 | −1.317 | −0.255 | −1.629 | −0.185 | 0.089 | −1.316 | 24 | 0.151 |
Based on Z1,24 = 0.151 and the factor scores at t = 24, the one‐step‐ahead forecast of the first series is 0.299, which is shown as the forecast for Connecticut in Table 5.10.
Table 5.10 The one‐step forecast result for the standardized STD series.
State | Forecast | Actual | Error | State | Forecast | Actual | Error |
CT | 0.299 | 0.075 | 0.224 | IN | 0.08 | −0.194 | 0.274 |
ME | −0.488 | −0.922 | 0.434 | MI | 0.06 | 0.194 | −0.134 |
MA | −0.313 | −0.088 | −0.225 | MN | 0.042 | −0.875 | 0.917 |
NH | 0.375 | −0.465 | 0.84 | OH | 0.392 | 0.042 | 0.35 |
RI | 0.8 | 0.297 | 0.503 | WI | 0.333 | −0.143 | 0.476 |
NJ | 0.198 | −0.168 | 0.366 | AR | 0.38 | 0.14 | 0.24 |
NY | 0.281 | −0.456 | 0.737 | LA | 0.162 | −0.077 | 0.239 |
DE | 0.233 | 0.283 | −0.05 | NM | 0.256 | 0.227 | 0.029 |
DC | 0.08 | 0.213 | −0.133 | OK | −0.285 | 0.352 | −0.637 |
MD | −0.254 | −0.217 | −0.037 | TX | 0.25 | 0.574 | −0.324 |
PA | 0.198 | 0.127 | 0.071 | IA | 0.689 | −0.435 | 1.124 |
VA | 0.587 | −0.067 | 0.654 | KS | −0.059 | 0.382 | −0.441 |
WV | 0.339 | −0.046 | 0.385 | MO | −0.382 | −0.074 | −0.308 |
AL | 0.067 | −0.066 | 0.133 | NE | 0.33 | 0.369 | −0.039 |
FL | −0.03 | −0.161 | 0.131 | CO | −0.789 | −1.05 | 0.261 |
GA | 0.332 | −0.047 | 0.379 | UT | −0.313 | 0.279 | −0.592 |
KY | 0.046 | 0.219 | −0.173 | AZ | −0.381 | −0.952 | 0.571 |
MS | 0.995 | 0.073 | 0.922 | CA | −0.525 | −0.276 | −0.249 |
NC | 0.549 | 0.888 | −0.339 | NV | 0.771 | −0.053 | 0.824 |
SC | 0.241 | 0.367 | −0.126 | ID | −0.095 | 0.225 | −0.32 |
TN | 0.11 | 0.07 | 0.04 | OR | −0.058 | 0.173 | −0.231 |
IL | 0.23 | 0.46 | −0.23 | WA | 0.052 | −0.893 | 0.945 |
The mean squared forecast error (MSFE) is 0.222. It should be noted that the series we have used in model fitting and forecasting are standardized differenced series of the original data set.
In time series applications, it is natural to consider the observable factors in terms of time series structures and to use the lag operator and time series models on the factors. As a result, many different forms of factor models have been developed. The use of time series models, such as vector autoregressive models, on the factors has also been extended to vector time series modeling in both time domain and frequency domain approaches. Good examples are the various dynamic factor models.
For further information on factor models and applications, we refer readers to Sharpe (1970), Geweke (1977), Engle and Watson (1981), Geweke and Singleton (1981), Harvey (1989), Bai and Ng (2002, 2007), Garcia‐Ferrer and Poncela (2002), Bai (2003), Peña and Poncela (2004), Deistler and Zinner (2007), Hallin and Liska (2007), Doz et al. (2011), Jungbacker et al. (2011), Stock and Watson (2009, 2011, 2016), Lam and Yao (2012), Fan et al. (2016), Rockova and George (2016), Chan et al. (2017), and Gonçalves et al. (2017), among others.