2
GARCH(p, q) Processes

Autoregressive conditionally heteroscedastic (ARCH) models were introduced by Engle (1982), and their GARCH (generalised ARCH) extension is due to Bollerslev (1986). In these models, the key concept is the conditional variance, that is, the variance conditional on the past. In the classical GARCH models, the conditional variance is expressed as a linear function of the squared past values of the series. This particular specification is able to capture the main stylised facts characterising financial series, as described in Chapter 1. At the same time, it is simple enough to allow for a complete study of the solutions. The ‘linear’ structure of these models can be displayed through several representations that will be studied in this chapter.

We first present definitions and representations of GARCH models. Then we establish the strict and second‐order stationarity conditions. Starting with the first‐order GARCH model, for which the proofs are easier and the results are more explicit, we extend the study to the general case. We also study the so‐called ARCH(∞) models, which allow for a slower decay of squared‐return autocorrelations. Then, we consider the existence of moments and the properties of the autocorrelation structure. We conclude this chapter by examining forecasting issues.

2.1 Definitions and Representations

We start with a definition of GARCH processes based on the first two conditional moments.

Definition 2.1 (GARCH(p, q) process) A process (ε_t) is called a GARCH(p, q) process if its first two conditional moments exist and satisfy:

(i) E(ε_t ∣ ε_u, u < t) = 0, t ∈ ℤ;

(ii) there exist constants ω, α_i (i = 1, …, q) and β_j (j = 1, …, p) such that

σ_t² = Var(ε_t ∣ ε_u, u < t) = ω + Σ_{i=1}^q α_i ε²_{t−i} + Σ_{j=1}^p β_j σ²_{t−j},  t ∈ ℤ.  (2.1)

Equation (2.1) can be written in a more compact way as

σ_t² = ω + α(B)ε_t² + β(B)σ_t²,  (2.2)

where B is the standard backshift operator (B^i ε_t² = ε²_{t−i} and B^i σ_t² = σ²_{t−i} for any integer i), and α and β are polynomials of degrees q and p, respectively:

α(z) = α_1 z + α_2 z² + ⋯ + α_q z^q,  β(z) = β_1 z + β_2 z² + ⋯ + β_p z^p.

If β(z) = 0 we have

σ_t² = ω + Σ_{i=1}^q α_i ε²_{t−i},  (2.3)

and the process is called an ARCH(q) process. By definition, the innovation of the process (ε_t²) is the variable ν_t = ε_t² − σ_t². Substituting in Eq. (2.1) the variables σ²_{t−j} by ε²_{t−j} − ν_{t−j}, we get the representation

ε_t² = ω + Σ_{i=1}^r (α_i + β_i) ε²_{t−i} + ν_t − Σ_{j=1}^p β_j ν_{t−j},  (2.4)

where r = max(p, q), with the convention α_i = 0 (β_j = 0) if i > q (j > p). This equation has the linear structure of an ARMA model, allowing for simple computation of the linear predictions. Under additional assumptions (implying the second-order stationarity of (ε_t²)), we can state that if (ε_t) is GARCH(p, q), then (ε_t²) is an ARMA(r, p) process. In particular, the square of an ARCH(q) process admits, if it is stationary, an AR(q) representation. The ARMA representation will be useful for the estimation and identification of GARCH processes.

Definition 2.1 does not directly provide a solution process satisfying those conditions. The next definition is more restrictive but allows explicit solutions to be obtained. The link between the two definitions will be given in Remark 2.5. Let η denote a probability distribution with null expectation and unit variance.

Definition 2.2 (Strong GARCH(p, q) process) Let (η_t) be an iid sequence with distribution η. The process (ε_t) is called a strong GARCH(p, q) process (with respect to the sequence (η_t)) if

ε_t = σ_t η_t,  σ_t² = ω + Σ_{i=1}^q α_i ε²_{t−i} + Σ_{j=1}^p β_j σ²_{t−j},  (2.5)

where the α_i and β_j are non-negative constants and ω is a strictly positive constant.

GARCH processes in the sense of Definition 2.1 are sometimes called semi-strong, following the paper by Drost and Nijman (1993) on temporal aggregation. Substituting ε_{t−i} by σ_{t−i}η_{t−i} in (2.5), we get

σ_t² = ω + Σ_{i=1}^r (α_i η²_{t−i} + β_i) σ²_{t−i},

which can be written as follows:

σ_t² = ω + Σ_{i=1}^r a_i(η_{t−i}) σ²_{t−i},  (2.6)

where a_i(z) = α_i z² + β_i, i = 1, …, r. This representation shows that the volatility process of a strong GARCH is the solution of an autoregressive equation with random coefficients.

Properties of Simulated Paths

Contrary to standard time series models (ARMA), the GARCH structure allows the magnitude of the noise ε_t to depend on its past values. Thus, periods with a high volatility level (corresponding to large values of ε_t²) are followed by periods where the fluctuations have a smaller amplitude. Figures 2.1–2.7 illustrate the volatility clustering for simulated GARCH models. Large absolute values are not uniformly distributed over the whole period, but tend to cluster. We will see that all these trajectories correspond to strictly stationary processes which, except for the ARCH(1) models of Figures 2.3–2.5, are also second-order stationary. Even if the absolute values can be extremely large, these processes are not explosive, as can be seen from these figures. Higher values of α (theoretically, α > 3.56 for the N(0, 1) distribution, as will be established below) lead to explosive paths. Figures 2.6 and 2.7, corresponding to GARCH(1, 1) models, have been obtained with the same simulated sequence (η_t). As we will see, permuting α and β does not modify the variance of the process but has an effect on the higher-order moments. For instance, the simulated process of Figure 2.7, with α = 0.7 and β = 0.2, does not admit a fourth-order moment, in contrast to the process of Figure 2.6. This is reflected by the presence of larger absolute values in Figure 2.7. The two processes are also different in terms of persistence of shocks: when β approaches 1, a shock on the volatility has a persistent effect. On the other hand, when α is large, sudden volatility variations can be observed in response to shocks.
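
To make the mechanism concrete, here is a minimal Python sketch (ours, not part of the original text) of the simulation of a strong GARCH(1, 1) path with Gaussian noise; with ω = 1, α = 0.2 and β = 0.7 it matches the setting of Figure 2.6, and setting β = 0 gives ARCH(1) paths like those of Figures 2.1–2.5.

import numpy as np

def simulate_garch11(n, omega, alpha, beta, seed=None, burn=500):
    # eps_t = sigma_t * eta_t,  sigma_t^2 = omega + alpha*eps_{t-1}^2 + beta*sigma_{t-1}^2
    rng = np.random.default_rng(seed)
    eta = rng.standard_normal(n + burn)
    eps = np.empty(n + burn)
    # start at the unconditional variance when it exists, otherwise at omega
    sig2 = omega / (1 - alpha - beta) if alpha + beta < 1 else omega
    for t in range(n + burn):
        eps[t] = np.sqrt(sig2) * eta[t]
        sig2 = omega + alpha * eps[t] ** 2 + beta * sig2  # variance for date t+1
    return eps[burn:]  # drop the burn-in so the initial value is forgotten

x = simulate_garch11(500, omega=1.0, alpha=0.2, beta=0.7, seed=123)

Plotting |x| makes the clusters of large values plainly visible.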


Figure 2.1 Simulation of size 500 of the ARCH(1) process with ω = 1, α = 0.5 and η_t ∼ N(0, 1).


Figure 2.2 Simulation of size 500 of the ARCH(1) process with ω = 1, α = 0.95 and η_t ∼ N(0, 1).


Figure 2.3 Simulation of size 500 of the ARCH(1) process with ω = 1, α = 1.1 and η_t ∼ N(0, 1).


Figure 2.4 Simulation of size 200 of the ARCH(1) process with ω = 1, α = 3 and η_t ∼ N(0, 1).


Figure 2.5 Observations 100–140 of Figure 2.4.


Figure 2.6 Simulation of size 500 of the GARCH(1, 1) process with ω = 1, α = 0.2, β = 0.7 and η_t ∼ N(0, 1).


Figure 2.7 Simulation of size 500 of the GARCH(1, 1) process with ω = 1, α = 0.7, β = 0.2 and η_t ∼ N(0, 1).

2.2 Stationarity Study

This section is concerned with the existence of stationary solutions (in the strict and second-order senses) to model (2.5). We are mainly interested in non-anticipative solutions, that is, processes (ε_t) such that ε_t is a measurable function of the variables η_{t−s}, s ≥ 0. For such processes, σ_t is independent of the σ-field generated by {η_{t+h}, h ≥ 0} and ε_t is independent of the σ-field generated by {η_{t+h}, h > 0}. It will be seen that such solutions are also ergodic. The concept of ergodicity is discussed in Appendix A.1. We first consider the GARCH(1, 1) model, which can be studied in a more explicit way than the general case. For x > 0, let log⁺x = max(log x, 0).

2.2.1 The GARCH(1,1) Case

When p = q = 1, model (2.5) has the form

ε_t = σ_t η_t,  σ_t² = ω + α ε²_{t−1} + β σ²_{t−1},  (2.7)

with ω > 0, α ≥ 0, β ≥ 0. Let a(z) = αz² + β.

The next result shows that non‐stationary GARCH processes are explosive.

Figure 2.8 shows the zones of strict and second-order stationarity for the strong GARCH(1, 1) model when η_t ∼ N(0, 1). Note that the distribution of η_t only matters for the strict stationarity. As noted above, the frontier of the strict stationarity zone corresponds to a random walk (for the process log(h_t − ω)). A similar interpretation holds for the second-order stationarity zone. If α + β = 1 we have

h_t = ω + (αη²_{t−1} + β)h_{t−1} = ω + h_{t−1} + αh_{t−1}(η²_{t−1} − 1),  where h_t = σ_t².

Figure 2.8 Stationarity regions for the GARCH(1, 1) model when η_t ∼ N(0, 1): 1, second-order stationarity; 1 and 2, strict stationarity; 3, non-stationarity.

Thus, since the last term in this equality is centred and uncorrelated with any variable belonging to the past of h t − 1 , the process (h t ) is a random walk. The corresponding GARCH process is called integrated GARCH (or IGARCH(1, 1)) and will be studied later: it is strictly stationary, has an infinite variance, and a conditional variance which is a random walk (with a positive drift).
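
The strict stationarity frontier of Figure 2.8 corresponds to E log a(η_t) = 0, and it can be reproduced numerically. The following Python sketch (ours) estimates γ(α, β) = E log(αη_t² + β) by Monte Carlo for Gaussian η_t; a negative value corresponds to the strict stationarity zone.

import numpy as np

def gamma_mc(alpha, beta, n=10**7, seed=0):
    # Monte Carlo estimate of gamma = E log(alpha*eta^2 + beta), eta ~ N(0,1)
    eta2 = np.random.default_rng(seed).standard_normal(n) ** 2
    return float(np.mean(np.log(alpha * eta2 + beta)))

print(gamma_mc(0.95, 0.0))  # ARCH(1), alpha = 0.95: gamma < 0 (Figure 2.2)
print(gamma_mc(3.00, 0.0))  # alpha = 3 < 3.56...: still gamma < 0 (Figure 2.4)
print(gamma_mc(3.60, 0.0))  # beyond exp(-E log eta_t^2) ~ 3.56: gamma > 0, explosive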

2.2.2 The General Case

In the general case of a strong GARCH(p, q) process, the following vector representation will be useful. We have

z_t = b_t + A_t z_{t−1},  (2.16)

where

z_t = (ε_t², …, ε²_{t−q+1}, σ_t², …, σ²_{t−p+1})′ ∈ ℝ^{p+q},  b_t = (ωη_t², 0, …, 0, ω, 0, …, 0)′ ∈ ℝ^{p+q},

and

A_t =
⎛ α_1η_t²  ⋯  α_{q−1}η_t²   α_qη_t²   β_1η_t²  ⋯  β_{p−1}η_t²   β_pη_t² ⎞
⎜        I_{q−1}               0            0                      0     ⎟
⎜ α_1      ⋯  α_{q−1}         α_q       β_1      ⋯  β_{p−1}       β_p    ⎟
⎝           0                  0          I_{p−1}                  0     ⎠
(2.17)

is a (p + q) × (p + q) matrix. In the ARCH(q) case, z_t reduces to the vector formed by ε_t² and its first q − 1 past values, and A_t to the upper-left block of the above matrix. Equation (2.16) defines a first-order vector autoregressive model, with positive and iid matrix coefficients. The distribution of z_t conditional on its infinite past coincides with its distribution conditional on z_{t−1} only, which means that (z_t) is a Markov process. Model (2.16) is thus called the Markov representation of the GARCH(p, q) model. Iterating Eq. (2.16) yields

z_t = b_t + Σ_{k=1}^∞ A_t A_{t−1} ⋯ A_{t−k+1} b_{t−k},  (2.18)

provided that the series exists a.s. Finding conditions ensuring the existence of this series is the object of what follows. Notice that the existence of the right-hand vector in (2.18) does not ensure that its components are positive. One sufficient condition for

b_t + Σ_{k=1}^∞ A_t A_{t−1} ⋯ A_{t−k+1} b_{t−k} > 0,  (2.19)

in the sense that all the components of this vector are strictly positive (but possibly infinite), is that

α_i ≥ 0 (i = 1, …, q),  β_j ≥ 0 (j = 1, …, p).  (2.20)

This condition is very simple to use but may not be necessary, as we will see in Section 2.3.2.
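
As an illustration of the Markov representation, the following Python sketch (ours; it assumes the ordering of z_t given above) builds one draw of (b_t, A_t) and iterates Eq. (2.16); the first component of z_t then reproduces ε_t².

import numpy as np

def markov_step(z_prev, eta2, alpha, beta, omega):
    # One iteration z_t = b_t + A_t z_{t-1} of Eq. (2.16), with
    # z_t = (eps_t^2,...,eps_{t-q+1}^2, sigma_t^2,...,sigma_{t-p+1}^2)'
    q, p = len(alpha), len(beta)
    n = p + q
    row = np.concatenate([alpha, beta])
    A = np.zeros((n, n))
    A[0, :] = eta2 * row                      # eps_t^2 equation
    A[q, :] = row                             # sigma_t^2 equation
    A[1:q, 0:q - 1] = np.eye(q - 1)           # shift the eps^2 block
    A[q + 1:n, q:q + p - 1] = np.eye(p - 1)   # shift the sigma^2 block
    b = np.zeros(n)
    b[0], b[q] = omega * eta2, omega
    return b + A @ z_prev

alpha, beta, omega = np.array([0.05, 0.10]), np.array([0.80]), 1.0
rng = np.random.default_rng(1)
z = np.full(3, omega / (1 - alpha.sum() - beta.sum()))  # start at E(z_t)
for _ in range(10):
    z = markov_step(z, rng.standard_normal() ** 2, alpha, beta, omega)
# z[0] is eps_t^2 and z[2] is sigma_t^2 of the simulated GARCH(1, 2) model (p = 1, q = 2)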

Strict Stationarity

The main tool for studying strict stationarity is the concept of the top Lyapunov exponent. Let A be a (p + q) × (p + q) matrix. The spectral radius of A , denoted by ρ(A), is defined as the greatest modulus of its eigenvalues. Let ‖ ⋅ ‖ denote any norm on the space of the (p + q) × (p + q) matrices. We have the following algebra result:

ρ(A) = lim_{t→∞} ‖A^t‖^{1/t}  (2.21)

(Exercise 2.3). This property has the following extension to random matrices: if (A_t) is a strictly stationary and ergodic sequence of random matrices such that E log⁺‖A_1‖ < ∞, then γ := lim_{t→∞} (1/t) log‖A_t A_{t−1} ⋯ A_1‖ exists a.s. and equals inf_{t≥1} (1/t) E log‖A_t A_{t−1} ⋯ A_1‖; this limit γ is called the top Lyapunov exponent of the sequence (A_t).
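
The deterministic property (2.21) is easy to check numerically; a small Python sketch (ours), for an arbitrary 2 × 2 matrix:

import numpy as np

A = np.array([[0.5, 0.3],
              [0.2, 0.4]])
rho = max(abs(np.linalg.eigvals(A)))             # spectral radius rho(A)
for t in (1, 5, 20, 80):
    approx = np.linalg.norm(np.linalg.matrix_power(A, t)) ** (1 / t)
    print(t, approx)                             # converges to rho(A)
print("rho(A) =", rho)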

As for ARMA models, we are mostly interested in the non-anticipative solutions (ε_t) to model (2.5), that is, those for which ε_t belongs to the σ-field generated by {η_t, η_{t−1}, …}.

We now give two illustrations allowing us to obtain more explicit stationarity conditions than in the theorem.

Figure 2.9, constructed from these simulations, gives a more precise idea of the strict stationarity region for an ARCH(2) process. We shall establish in Corollary 2.3 a result showing that any strictly stationary GARCH process admits small‐order moments. We begin with two lemmas which are of independent interest.


Figure 2.9 Stationarity regions for the ARCH(2) model: 1, second‐order stationarity; 1 and 2, strict stationarity; 3, non‐stationarity.
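
The frontier of Figure 2.9 can be reproduced numerically: for an ARCH(2) model the matrices of representation (2.16) reduce to 2 × 2 random matrices, and the top Lyapunov exponent can be estimated from a long product of simulated matrices, renormalised at each step to avoid overflow. A Python sketch (ours):

import numpy as np

def lyapunov_arch2(a1, a2, n=100_000, seed=0):
    # Estimates the top Lyapunov exponent of the iid matrices
    # A_t = [[a1*eta_t^2, a2*eta_t^2], [1, 0]], eta_t ~ N(0,1);
    # the ARCH(2) model is strictly stationary when the estimate is negative.
    rng = np.random.default_rng(seed)
    P = np.eye(2)
    acc = 0.0
    for eta2 in rng.standard_normal(n) ** 2:
        P = np.array([[a1 * eta2, a2 * eta2], [1.0, 0.0]]) @ P
        s = np.abs(P).sum()       # any norm will do
        acc += np.log(s)
        P /= s                    # renormalise and accumulate the log norms
    return acc / n

print(lyapunov_arch2(0.5, 0.3))   # negative: a strictly stationary point
print(lyapunov_arch2(3.0, 2.0))   # positive here: outside the stationarity region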

The following result, which is stated for any sequence of positive iid matrices, provides another characterisation of the strict stationarity of GARCH models.

Using Lemma 2.3 and Corollary 2.3 together, it can be seen that for s ∈ (0, 1],

lim_{k→∞} E(‖A_k A_{k−1} ⋯ A_1‖^s) = 0  ⟹  E(σ_t^{2s}) < ∞ and E(|ε_t|^{2s}) < ∞.  (2.34)

The converse is generally true. For instance, we have for s ∈ (0, 1],

E(|ε_t|^{2s}) < ∞  ⟹  lim_{k→∞} E(‖A_k A_{k−1} ⋯ A_1‖^s) = 0  (2.35)

(Exercise 2.15).

Second‐Order Stationarity

The following theorem gives necessary and sufficient second-order stationarity conditions: model (2.5) admits a non-anticipative second-order stationary solution if and only if Σ_{i=1}^q α_i + Σ_{j=1}^p β_j < 1, and this solution is then unique.

IGARCH(p, q) Processes

When

Σ_{i=1}^q α_i + Σ_{j=1}^p β_j = 1,

the model is called an integrated GARCH(p, q) or IGARCH(p, q) model (see Engle and Bollerslev 1986). This name is justified by the existence of a unit root in the autoregressive part of representation (2.4) and is introduced by analogy with the integrated ARMA models, or ARIMA. However, this analogy can be misleading: there exists no (strict or second-order) stationary solution of an ARIMA model, whereas an IGARCH model admits a strictly stationary solution under very general conditions. In the univariate case (p = q = 1), the latter property is easily shown.

This property extends to the general case under slightly more restrictive conditions on the law of η t .

Note that this strictly stationary solution has an infinite variance in view of Theorem 2.5.

2.3 ARCH(∞) Representation*

A process (ε_t) is called an ARCH(∞) process if there exists a sequence of iid variables (η_t) such that E(η_t) = 0 and E(η_t²) = 1, and a sequence of constants φ_i ≥ 0, i ≥ 1, and φ_0 > 0 such that

ε_t = σ_t η_t,  σ_t² = φ_0 + Σ_{i=1}^∞ φ_i ε²_{t−i}.  (2.40)

This class obviously contains the ARCH(q) process, and we shall see that it more generally contains the GARCH(p, q) process.

2.3.1 Existence Conditions

The existence of a stationary ARCH(∞) process requires assumptions on the sequences (φ i ) and (η t ). The following result gives an existence condition.

Equality (2.42) is called a Volterra expansion (see Priestley 1988). It shows in particular that, under the conditions of the theorem, if φ_0 = 0, the unique strictly stationary and non-anticipative solution of the ARCH(∞) model is the identically null sequence. An application of Theorem 2.6, obtained for s = 1, is that the condition

Σ_{i=1}^∞ φ_i < 1

ensures the existence of a second-order stationary ARCH(∞) process. If

(Eη_t⁴)^{1/2} Σ_{i=1}^∞ φ_i < 1,

it can be shown (see Giraitis, Kokoszka, and Leipus 2000a) that Eε_t⁴ < ∞, Cov(ε_t², ε²_{t−h}) ≥ 0 for all h, and

2.45 equation

The fact that the squares have positive autocovariances will be verified later in the GARCH case (see Section 2.5). In contrast to GARCH processes, for which they decrease at exponential rate, the autocovariances of the squares of an ARCH(∞) process can decrease at the rate h^{−γ} with γ > 1 arbitrarily close to 1. A strictly stationary process of the form (2.40) such that

Σ_{i=1}^∞ φ_i = 1

is called integrated ARCH(∞), or IARCH(∞). Notice that an IARCH(∞) process has infinite variance. Indeed, if σ² := Eε_t² < ∞, then, by (2.40), σ² = φ_0 + σ², which is impossible. From Theorem 2.8, the strictly stationary solutions of IGARCH models (see Corollary 2.5) admit IARCH(∞) representations. The next result provides a condition for the existence of IARCH(∞) processes.

Other integrated processes are the long-memory ARCH processes, for which the rate of decrease of the φ_i is not geometric.

2.3.2 ARCH(∞) Representation of a GARCH

It is sometimes useful to consider the ARCH(∞) representation of a GARCH(p, q) process. For instance, this representation allows the conditional variance σ_t² of ε_t to be written explicitly as a function of its infinite past. It also allows the positivity condition (2.20) on the coefficients to be weakened. Let us first consider the GARCH(1, 1) model. If β < 1, we have

σ_t² = ω/(1 − β) + α Σ_{i=1}^∞ β^{i−1} ε²_{t−i}.  (2.46)

In this case, we have

φ_0 = ω/(1 − β),  φ_i = αβ^{i−1} (i ≥ 1),  A_s := Σ_{i=1}^∞ φ_i^s = α^s/(1 − β^s).

The condition A_s μ_{2s} < 1, where μ_{2s} = E|η_t|^{2s}, thus takes the form

μ_{2s} α^s + β^s < 1.

For example, if α + β < 1 this condition is satisfied for s = 1. However, second-order stationarity is not necessary for the validity of (2.46). Indeed, if (ε_t) denotes the strictly stationary and non-anticipative solution of the GARCH(1, 1) model, then, for any q ≥ 1,

σ_t² = ω Σ_{i=0}^{q−1} β^i + α Σ_{i=1}^q β^{i−1} ε²_{t−i} + β^q σ²_{t−q}.  (2.47)

By Corollary 2.3 there exists s ∈ (0, 1) such that E(σ_t^{2s}) < ∞. It follows that E{(β^q σ²_{t−q})^s} = β^{qs} E(σ_t^{2s}) → 0 as q → ∞. So β^q σ²_{t−q} converges a.s. to 0 and, by letting q go to infinity in (2.47), we get (2.46). More generally, we have the following property.

The ARCH(∞) representation can be used to weaken the condition (2.20) imposed to ensure the positivity of σ_t². Consider the GARCH(p, q) model with ω > 0, without any a priori positivity constraint on the coefficients α_i and β_j, and assume that the roots of the polynomial B(z) = 1 − β_1z − ⋯ − β_pz^p have moduli strictly greater than 1. The coefficients φ_i introduced in (2.48) are then well defined, and under the assumption

φ_i ≥ 0 for all i ≥ 1,  (2.49)

we have

σ_t² = φ_0 + Σ_{i=1}^∞ φ_i ε²_{t−i} > 0.

Indeed, φ_0 > 0 because otherwise B(1) ≤ 0, which would imply the existence of a root inside the unit circle since B(0) = 1. Moreover, the proofs of the sufficient parts of Theorem 2.4, Lemma 2.3, and Corollary 2.3 do not use the positivity of the coefficients of the matrices A_t. It follows that if the top Lyapunov exponent of the sequence (A_t) is such that γ < 0, the variable σ_t² is a.s. finite-valued. To summarise, the conditions

γ < 0,  ω > 0,  B(z) ≠ 0 for |z| ≤ 1,  φ_i ≥ 0 for all i ≥ 1

imply that there exists a strictly stationary and non-anticipative solution to model (2.5). The condition (2.49) is not generally simple to use, however, because it involves an infinite number of constraints on the coefficients α_i and β_j. In the ARCH(q) case, it reduces to the condition (2.20), that is, α_i ≥ 0, i = 1, …, q. Similarly, in the GARCH(1, 1) case, it is necessary to have α_1 ≥ 0 and β_1 ≥ 0. However, for p ≥ 1 and q > 1, the condition (2.20) can be weakened (Exercise 2.16), as the sketch below illustrates.
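
The coefficients φ_i are easy to generate numerically: expanding α(z)/(1 − β(z)) gives the recursion φ_i = α_i + Σ_{j=1}^{min(i,p)} β_j φ_{i−j}, with α_i = 0 for i > q. The following Python sketch (ours) computes them and checks condition (2.49); the GARCH(1, 2) parameter values at the end are only illustrative.

def arch_inf_coeffs(alpha, beta, m):
    # first m coefficients phi_1..phi_m of sum_i phi_i z^i = alpha(z)/(1 - beta(z))
    q, p = len(alpha), len(beta)
    phi = [0.0] * (m + 1)
    for i in range(1, m + 1):
        s = alpha[i - 1] if i <= q else 0.0
        s += sum(beta[j - 1] * phi[i - j] for j in range(1, min(i, p) + 1))
        phi[i] = s
    return phi[1:]

# GARCH(1,1): phi_i = alpha * beta**(i-1), in line with (2.46)
print(arch_inf_coeffs([0.2], [0.7], 5))
# A GARCH(1,2) with a negative alpha_2 can still satisfy (2.49):
phis = arch_inf_coeffs([0.10, -0.05], [0.8], 200)
print(all(c >= 0 for c in phis))   # True: positivity holds despite alpha_2 < 0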

2.3.3 Long‐Memory ARCH

The introduction of long memory into the volatility can be motivated by the observation that the empirical autocorrelations of the squares, or of the absolute values, of financial series generally decay very slowly (see, for example, Table 2.1). We shall see that it is possible to reproduce this property with ARCH(∞) processes, provided the moduli of the coefficients φ_i decay sufficiently slowly. A process (X_t) is said to have long memory if it is second-order stationary and satisfies, for h → ∞,

Cov(X_t, X_{t−h}) ∼ Kh^{2d−1},  where d < 1/2,  (2.50)

and K is a non-zero constant. An alternative definition relies on distinguishing 'intermediate-memory' processes, for which d < 0 and thus Σ_h |Cov(X_t, X_{t−h})| < ∞, and 'long-memory' processes, for which d ∈ (0, 1/2) and thus Σ_h |Cov(X_t, X_{t−h})| = ∞ (see Brockwell and Davis 1991, p. 520). The autocorrelations of an ARMA process decrease at exponential rate when the lag increases. The need for processes with a slower autocovariance decay led to the introduction of the fractional ARIMA models. These models are defined through the fractional difference operator

(1 − B)^d = Σ_{j=0}^∞ [Γ(j − d)/(Γ(j + 1)Γ(−d))] B^j.

Denoting by π_j the coefficient of B^j in this sum, it can be shown that π_j ∼ Kj^{−d−1} when j → ∞, where K is a constant depending on d. An ARIMA(p, d, q) process with d ∈ (−0.5, 0.5) is defined as a stationary solution of

ψ(B)(1 − B)^d X_t = θ(B)ε_t,

where ε_t is a white noise, and ψ and θ are polynomials of degrees p and q, respectively. If the roots of these polynomials are all outside the unit disk, the unique stationary and purely non-deterministic solution is causal and invertible, and its covariances satisfy condition (2.50) (see Brockwell and Davis 1991, Theorem 13.2.2). By analogy with the ARIMA models, the class of FIGARCH(p, d, q) processes is defined by the equations

ε_t = σ_t η_t,  σ_t² = φ_0 + Σ_{i=1}^∞ φ_i ε²_{t−i},  (2.51)

where ψ and θ are polynomials of degrees p and q, respectively, such that ψ(0) = θ(0) = 1, the roots of ψ have moduli strictly greater than 1 and φ_i ≥ 0, where the φ_i are defined by

Σ_{i=1}^∞ φ_i z^i = 1 − ψ(z)^{−1} θ(z)(1 − z)^d.

We have φ_i ∼ Ki^{−d−1}, where K is a positive constant, when i → ∞, and Σ_{i=1}^∞ φ_i = 1. The process introduced in (2.51) is thus IARCH(∞), provided it exists. Note that the existence of this process cannot be obtained from Theorem 2.7 because, for the FIGARCH model, the φ_i decrease more slowly than at a geometric rate. The following result, which is a consequence of Theorem 2.6, and whose proof is the subject of Exercise 2.22, provides another sufficient condition for the existence of IARCH(∞) processes.

This result can be used to prove the existence of FIGARCH(p, d, q) processes for d ∈ (0, 1) sufficiently close to 1, if the distribution of η_t² is assumed to be non-degenerate (hence, Eη_t⁴ > 1); see Douc, Roueff, and Soulier (2008). The FIGARCH process of Corollary 2.6 does not admit a finite second-order moment. Its square is thus not a long-memory process in the sense of definition (2.50). More generally, it can be shown that the squares of an ARCH(∞) process do not have the long-memory property. This motivated the introduction of an alternative class, called linear ARCH (LARCH) and defined by

ε_t = σ_t η_t,  σ_t = b_0 + Σ_{j=1}^∞ b_j ε_{t−j}.  (2.53)

Under appropriate conditions, this model is compatible with the long-memory property for (ε_t²).
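
The hyperbolic decay of the FIGARCH coefficients can be checked numerically from the recursion π_0 = 1, π_j = π_{j−1}(j − 1 − d)/j for the coefficients of (1 − B)^d. A Python sketch (ours), for the FIGARCH(0, d, 0) case where φ_j = −π_j:

import numpy as np

def frac_diff_coeffs(d, m):
    # coefficients pi_0..pi_m of (1 - B)^d = sum_j pi_j B^j
    pi = np.empty(m + 1)
    pi[0] = 1.0
    for j in range(1, m + 1):
        pi[j] = pi[j - 1] * (j - 1 - d) / j
    return pi

d = 0.4
phi = -frac_diff_coeffs(d, 10_000)[1:]    # FIGARCH(0,d,0): phi_j = -pi_j >= 0
print(phi.sum())                          # tends to 1: an IARCH(inf) model
j = np.arange(1, phi.size + 1)
print((phi * j ** (1 + d))[-3:])          # roughly constant: phi_j ~ K j^(-d-1)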

2.4 Properties of the Marginal Distribution

We have seen that, under quite simple conditions, a GARCH(p, q) model admits a strictly stationary solution (ε_t). However, the marginal distribution of the process (ε_t) is never known explicitly. The aim of this section is to highlight some properties of this distribution through the marginal moments.

2.4.1 Even‐Order Moments

We are interested in finding existence conditions for the moments of order 2m, where m is any positive integer. Let ⊗ denote the tensor product, or Kronecker product, and recall that it is defined as follows: for any matrices A = (a_ij) and B, we have A ⊗ B = (a_ij B). For any matrix A, let A^{⊗m} = A ⊗ ⋯ ⊗ A (m factors). We have the following result.

This example shows that for a non‐trivial GARCH process, that is, when the α i and β j are not all equal to zero, the moments cannot exist for any order.

2.4.2 Kurtosis

An easy way to measure the size of distribution tails is to use the kurtosis coefficient. This coefficient is defined, for a centred (zero-mean) distribution, as the ratio of the fourth-order moment, which is assumed to exist, to the squared second-order moment. This coefficient is equal to 3 for a normal distribution, this value serving as a gauge for the other distributions. In the case of GARCH processes, it is interesting to note the difference between the tails of the marginal and conditional distributions. For a strictly stationary solution (ε_t) of the GARCH(p, q) model defined by (2.5), the conditional moments of order k are proportional to σ_t^k:

E(ε_t^k ∣ ε_u, u < t) = σ_t^k E(η_t^k).

The kurtosis coefficient of this conditional distribution is thus constant and equal to the kurtosis coefficient of η t . For a general process of the form

ε_t = σ_t η_t,

where σ_t is a measurable function of the past of ε_t, η_t is independent of this past and (η_t) is iid and centred, the kurtosis coefficient of the stationary marginal distribution is equal, provided that it exists, to

κ_ε = Eε_t⁴/(Eε_t²)² = κ_η Eσ_t⁴/(Eσ_t²)² = κ_η {1 + Var(σ_t²)/(Eσ_t²)²},

where κ_η denotes the kurtosis coefficient of (η_t). It can thus be seen that the tails of the marginal distribution of (ε_t) are fatter when the variance of σ_t² is large relative to its squared expectation. The minimum (corresponding to the absence of ARCH effects) is given by the kurtosis coefficient of (η_t),

κ_ε ≥ κ_η,

with equality if and only if σ_t² is a.s. constant. In the GARCH(1, 1) case we thus have, from the previous calculations,

κ_ε = κ_η (1 − (α + β)²) / (1 − (α + β)² − (κ_η − 1)α²).  (2.60)

The excess kurtosis coefficients of ε t and η t , relative to the normal distribution, are related by

κ_ε − 3 = {(κ_η − 3)(1 − (α + β)²) + 3(κ_η − 1)α²} / {1 − (α + β)² − (κ_η − 1)α²}.

The excess kurtosis of ε t increases with that of η t and when the GARCH coefficients approach the zone of non‐existence of the fourth‐order moment. Notice the asymmetry between the GARCH coefficients in the excess kurtosis formula. For the general GARCH(p, q), we have the following result.

It will be seen in Chapter 7 that the Gaussian quasi‐maximum likelihood estimator of the coefficients of a GARCH model is consistent and asymptotically normal even if the distribution of the variables η t is not Gaussian. Since the autocorrelation function of the squares of the GARCH process does not depend on the law of η t , the autocorrelations obtained by replacing the unknown coefficients by their estimates are generally very close to the empirical autocorrelations. In contrast, the kurtosis coefficients obtained from the theoretical formula, by replacing the coefficients by their estimates and the kurtosis of η t by 3, can be very far from the coefficients obtained empirically. This is not very surprising since the preceding result shows that the difference between the kurtosis coefficients of ε t computed with a Gaussian and a non‐Gaussian distribution for η t ,

equation

is not bounded as the coefficients approach the zone of non-existence of the fourth-order moment.
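
For the GARCH(1, 1) case, formula (2.60) is immediate to evaluate; the short Python sketch below (ours) makes the asymmetry between α and β explicit on the parameter pairs of Figures 2.6 and 2.7.

def kurtosis_garch11(alpha, beta, k_eta=3.0):
    # kurtosis of eps_t from Eq. (2.60); k_eta = kurtosis of eta_t (3 if Gaussian);
    # requires (alpha+beta)^2 + (k_eta-1)*alpha^2 < 1 (fourth moment exists)
    lam2 = (alpha + beta) ** 2
    den = 1 - lam2 - (k_eta - 1) * alpha ** 2
    if den <= 0:
        raise ValueError("no fourth-order moment for these parameters")
    return k_eta * (1 - lam2) / den

print(kurtosis_garch11(0.2, 0.7))   # ~5.2: moderate excess kurtosis (Figure 2.6)
# kurtosis_garch11(0.7, 0.2) raises: the process of Figure 2.7 has no 4th moment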

2.5 Autocovariances of the Squares of a GARCH

We have seen that if (ε_t) is a GARCH process which is fourth-order stationary, then (ε_t²) is an ARMA process. It must be noted that this ARMA model is very constrained, as can be seen from representation (2.4): the order of the AR part is larger than that of the MA part, and the AR coefficients are greater than the corresponding MA coefficients, which are positive. We shall start by examining some consequences of these constraints on the autocovariances of (ε_t²). Then we shall show how to compute these autocovariances explicitly.

2.5.1 Positivity of the Autocovariances

For a GARCH(1, 1) model such that Eε_t⁴ < ∞, the autocorrelations of the squares take the form

ρ_{ε²}(h) = ρ_{ε²}(1)(α + β)^{h−1},  h ≥ 1,  (2.61)

where

ρ_{ε²}(1) = α(1 − αβ − β²)/(1 − 2αβ − β²)  (2.62)

(Exercise 2.8). It follows immediately that these autocorrelations are non‐negative. The next property generalises this result.

Note that the property of positive autocorrelations for the squares, or for the absolute values, is typically observed on real financial series (see, for instance, the second row of Table  2.1).

2.5.2 The Autocovariances Do Not Always Decrease

Formulas (2.61) and (2.62) show that for a GARCH(1,1) process, the autocorrelations of the squares decrease. An illustration is provided in Figure 2.11. A natural question is whether this property remains true for more general GARCH(p, q) processes. The following computation shows that this is not the case. Consider an ARCH(2) process admitting moments of order 4 (the existence condition and the computation of the fourth‐order moment are the subject of Exercise 2.8):

ε_t = σ_t η_t,  σ_t² = ω + α_1 ε²_{t−1} + α_2 ε²_{t−2}.

Figure 2.11 Autocorrelation function (a) and partial autocorrelation function (b) of the squares of the GARCH(1, 1) model ε_t = σ_t η_t, σ_t² = ω + αε²_{t−1} + βσ²_{t−1}, (η_t) iid N(0, 1).

We know that (ε_t²) is an AR(2) process, whose autocorrelation function satisfies

ρ_{ε²}(h) = α_1 ρ_{ε²}(h − 1) + α_2 ρ_{ε²}(h − 2),  h > 0.

It readily follows that

ρ_{ε²}(1) = α_1/(1 − α_2)

and hence that

ρ_{ε²}(2) − ρ_{ε²}(1) = {α_1(α_1 − 1) + α_2(1 − α_2)}/(1 − α_2).

The latter quantity is, of course, negative for the ARCH(1) process (α_2 = 0) but is not negative for every (α_1, α_2). Figure 2.12 gives an illustration of this non-decreasing feature of the first autocorrelations (and partial autocorrelations). The sequence of autocorrelations is, however, decreasing after a certain lag (Exercise 2.18).


Figure 2.12 Autocorrelation function (a) and partial autocorrelation function (b) of the squares of the ARCH(2) process ε_t = σ_t η_t, σ_t² = ω + α_1ε²_{t−1} + α_2ε²_{t−2}, (η_t) iid N(0, 1).

2.5.3 Explicit Computation of the Autocovariances of the Squares

The autocorrelation function of (ε_t²) will play an important role in identifying the orders of the model. This function is easily obtained from the ARMA(max(p, q), p) representation

ε_t² = ω + Σ_{i=1}^r (α_i + β_i) ε²_{t−i} + ν_t − Σ_{j=1}^p β_j ν_{t−j},  ν_t = ε_t² − σ_t².

The autocovariance function is more difficult to obtain (Exercise 2.9) because one has to compute

the fourth-order moment Eε_t⁴.

One can use the method of Section 2.4.1. Consider the vector representation

z_t = b_t + A_t z_{t−1}

defined in (2.16) and (2.17). Using the independence between z_{t−1} and (A_t, b_t), together with elementary properties of the Kronecker product ⊗, we get

E(z_t ⊗ z_t) = E(b_t ⊗ b_t) + {E(b_t ⊗ A_t) + E(A_t ⊗ b_t)} E(z_{t−1}) + E(A_t ⊗ A_t) E(z_{t−1} ⊗ z_{t−1}).

Thus

E(z_t ⊗ z_t) = {I − A^{(2)}}^{−1} [E(b_t ⊗ b_t) + {E(b_t ⊗ A_t) + E(A_t ⊗ b_t)} E(z_t)],  (2.63)

where

A^{(m)} = E(A_t ⊗ ⋯ ⊗ A_t)  (m factors).

To compute A^{(m)}, we can use the decomposition A_t = η_t² B + C, where B and C are deterministic matrices. We then have, letting μ_4 = Eη_t⁴,

A^{(2)} = E(A_t ⊗ A_t) = μ_4 B ⊗ B + B ⊗ C + C ⊗ B + C ⊗ C.

We obtain E(b_t ⊗ A_t) and E(A_t ⊗ b_t) similarly. All the components of E(z_t) are equal to ω/(1 − Σα_i − Σβ_j). Note that for h > 0, we have

E(z_t ⊗ z_{t−h}) = E(b_t) ⊗ E(z_t) + {E(A_t) ⊗ I_{p+q}} E(z_t ⊗ z_{t−h+1}).  (2.64)

Let γ_{ε²}(h) = Cov(ε_t², ε²_{t−h}). The following algorithm can be used:

  • Define the vectors E(b_t), E(b_t ⊗ b_t), E(z_t), and the matrices B, C, A^{(1)}, A^{(2)} as functions of α_i, β_i, and ω, μ_4.
  • Compute E(z_t ⊗ z_t) from Eq. (2.63).
  • For h = 1, 2, …, compute E(z_t ⊗ z_{t−h}) from Eq. (2.64).
  • For h = 0, 1, …, compute γ_{ε²}(h) = E(ε_t² ε²_{t−h}) − (Eε_t²)², where E(ε_t² ε²_{t−h}) is the first component of E(z_t ⊗ z_{t−h}).

This algorithm is not very efficient in terms of computation time and memory space, but it is easy to implement, as the sketch below shows.
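
A Python sketch of the algorithm in the GARCH(1, 1) case (ours; z_t = (ε_t², σ_t²)′, A_t = η_t²B + C and b_t = η_t²d + e as above), validated against the closed-form autocorrelations (2.61) and (2.62):

import numpy as np

def sq_autocovariances(omega, alpha, beta, mu4=3.0, hmax=10):
    B = np.array([[alpha, beta], [0.0, 0.0]])
    C = np.array([[0.0, 0.0], [alpha, beta]])
    d = np.array([omega, 0.0]); e = np.array([0.0, omega])
    dc, ec = d.reshape(2, 1), e.reshape(2, 1)
    A = B + C                                    # A = E(A_t)
    Ez = np.linalg.solve(np.eye(2) - A, d + e)   # components omega/(1-alpha-beta)
    A2  = mu4*np.kron(B, B) + np.kron(B, C) + np.kron(C, B) + np.kron(C, C)
    Ebb = mu4*np.kron(d, d) + np.kron(d, e) + np.kron(e, d) + np.kron(e, e)
    EbA = mu4*np.kron(dc, B) + np.kron(dc, C) + np.kron(ec, B) + np.kron(ec, C)
    EAb = mu4*np.kron(B, dc) + np.kron(B, ec) + np.kron(C, dc) + np.kron(C, ec)
    # Eq. (2.63): E(z (x) z) obtained by solving a linear system
    Ezz = np.linalg.solve(np.eye(4) - A2, Ebb + (EbA + EAb) @ Ez)
    m2 = Ez[0]                                   # E eps_t^2
    gam = [Ezz[0] - m2 ** 2]                     # gamma(0) = E eps^4 - (E eps^2)^2
    cross = Ezz
    for _ in range(hmax):                        # Eq. (2.64), h = 1, 2, ...
        cross = np.kron(d + e, Ez) + np.kron(A, np.eye(2)) @ cross
        gam.append(cross[0] - m2 ** 2)
    return np.array(gam)

g = sq_autocovariances(1.0, 0.2, 0.7)
rho1 = 0.2*(1 - 0.2*0.7 - 0.7**2) / (1 - 2*0.2*0.7 - 0.7**2)   # Eq. (2.62)
print(g[1] / g[0], rho1)    # agree; and g[h]/g[0] = rho1 * 0.9**(h-1)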

2.6 Theoretical Predictions

The definition of GARCH processes in terms of conditional expectations allows us to compute the optimal predictions of the process and its square given its infinite past. Let (ε_t) be a stationary GARCH(p, q) process, in the sense of Definition 2.1. The optimal prediction (in the L² sense) of ε_t given its infinite past is 0, by Definition 2.1(i). More generally, for h ≥ 0,

E(ε_{t+h} ∣ ε_u, u < t) = E{E(ε_{t+h} ∣ ε_u, u < t + h) ∣ ε_u, u < t} = 0,

which shows that the optimal prediction of any future variable given the infinite past is zero. The main attraction of GARCH models obviously lies not in the prediction of the GARCH process itself but in the prediction of its square. The optimal prediction of ε_t² given the infinite past of ε_t is σ_t². More generally, the predictions at horizon h ≥ 0 are obtained recursively by

E(ε²_{t+h} ∣ ε_u, u < t) = ω + Σ_{i=1}^q α_i E(ε²_{t+h−i} ∣ ε_u, u < t) + Σ_{j=1}^p β_j E(σ²_{t+h−j} ∣ ε_u, u < t),

with, for i ≤ h,

E(ε²_{t+h−i} ∣ ε_u, u < t) = E(σ²_{t+h−i} ∣ ε_u, u < t),

for i > h,

E(ε²_{t+h−i} ∣ ε_u, u < t) = ε²_{t+h−i},

and, for i ≥ h,

E(σ²_{t+h−i} ∣ ε_u, u < t) = σ²_{t+h−i}.

These predictions coincide with the optimal linear predictions of the future values of ε_t² given its infinite past. Note that there exists a more general class of GARCH models (weak GARCH) for which the two types of predictions, optimal and linear optimal, do not necessarily coincide. Weak GARCH representations appear, in particular, when GARCH processes are temporally aggregated (see Drost and Werker 1996; Drost, Nijman, and Werker 1998), when they are observed with a measurement error as in Gouriéroux, Monfort, and Renault (1993) and King, Sentana, and Wadhwani (1994), or for the beta-ARCH of Diebolt and Guégan (1991). See Francq and Zakoïan (2000) for other examples.

It is important to note that E(ε²_{t+h} ∣ ε_u, u < t) is the conditional variance of the prediction error of ε_{t+h}. Hence, the accuracy of the predictions depends on the past: it is particularly low after a turbulent period, that is, when the past values are large in absolute value (assuming that the coefficients α_i and β_j are non-negative). This property constitutes a crucial difference with standard ARMA models, for which the magnitude of the prediction intervals is constant for a given horizon.
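
In the GARCH(1, 1) case the recursion above collapses to a one-line update, since E(ε²_{t+h} ∣ ε_u, u < t) = ω + (α + β)E(ε²_{t+h−1} ∣ ε_u, u < t) for h ≥ 1. A Python sketch (ours; the value of σ_t² below is illustrative) of the resulting forecasts and of the conditionally Gaussian 95% intervals used in the figures that follow:

def var_forecasts(omega, alpha, beta, sig2_t, hmax):
    # v[h] = E(eps_{t+h}^2 | past), h = 0..hmax, starting from v[0] = sigma_t^2;
    # converges to omega/(1 - alpha - beta) when alpha + beta < 1
    v = [sig2_t]
    for _ in range(hmax):
        v.append(omega + (alpha + beta) * v[-1])
    return v

# sigma_t^2 = 25 mimics a turbulent date
v = var_forecasts(1.0, 0.1, 0.8, sig2_t=25.0, hmax=5)
bands = [1.96 * vh ** 0.5 for vh in v]   # 95% interval: +/- 1.96 sqrt(v_h)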

Figures 2.13–2.16, based on simulations, allow us to visualise this difference. In Figure 2.13, obtained from a Gaussian white noise, the predictions at horizon 1 have a constant variance: the confidence interval [−1.96, 1.96] contains roughly 95% of the realisations. Using a constant interval for the next three series, displayed in Figures 2.14–2.16, would give very bad results. In contrast, the intervals constructed here (for conditionally Gaussian distributions, with zero mean and variance σ_t²) do contain about 95% of the observations: in the quiet periods a small interval is enough, whereas in turbulent periods, the variability increases and larger intervals are needed.


Figure 2.13 Prediction intervals at horizon 1, at 95%, for the strong N(0, 1) white noise.


Figure 2.14 Prediction intervals at horizon 1, at 95%, for the GARCH(1, 1) process simulated with ω = 1, α = 0.1, β = 0.8 and N(0, 1) distribution for (η_t).


Figure 2.15 Prediction intervals at horizon 1, at 95%, for the GARCH(1, 1) process simulated with ω = 1, α = 0.6, β = 0.2 and N(0, 1) distribution for (η_t).


Figure 2.16 Prediction intervals at horizon 1, at 95%, for the GARCH(1, 1) process simulated with ω = 1, α = 0.7, β = 0.3 and N(0, 1) distribution for (η_t).

For a strong GARCH process it is possible to go further, by computing optimal predictions of the powers of ε_t², provided that the corresponding moments exist for the process (η_t). For instance, computing the predictions of ε_t⁴ allows us to evaluate the variance of the prediction errors of ε_t². However, these computations are tedious, the linearity property being lost for such powers.

When the GARCH process is not directly observed but is the innovation of an ARMA process, the accuracy of the prediction at some date t directly depends on the magnitude of the conditional heteroscedasticity at that date. Consider, for instance, a stationary AR(1) process whose innovation is a GARCH(1, 1) process:

X_t = φX_{t−1} + ε_t,  ε_t = σ_t η_t,  σ_t² = ω + αε²_{t−1} + βσ²_{t−1},  (2.65)

where ω > 0, α ≥ 0, β ≥ 0, α + β ≤ 1 and |φ| < 1. We have, for h ≥ 0,

X_{t+h} = φ^h X_t + Σ_{i=0}^{h−1} φ^i ε_{t+h−i}.

Hence,

E(X_{t+h} ∣ X_u, u ≤ t) = φ^h X_t,

since the past of X t coincides with that of its innovation ε t . Moreover,

Var(X_{t+h} ∣ X_u, u ≤ t) = Σ_{i=0}^{h−1} φ^{2i} E(ε²_{t+h−i} ∣ X_u, u ≤ t).

Since E(ε²_{t+1} ∣ X_u, u ≤ t) = σ²_{t+1} and, for i ≥ 1,

E(ε²_{t+i} ∣ X_u, u ≤ t) = σ_ε² + (α + β)^{i−1}(σ²_{t+1} − σ_ε²),  where σ_ε² = ω/(1 − α − β),

we have

Var(X_{t+h} ∣ X_u, u ≤ t) = σ_ε² (1 − φ^{2h})/(1 − φ²) + (σ²_{t+1} − σ_ε²) Σ_{i=0}^{h−1} φ^{2i}(α + β)^{h−i−1}.

Consequently,

Var(X_{t+h} ∣ X_u, u ≤ t) = σ_ε² (1 − φ^{2h})/(1 − φ²) + (σ²_{t+1} − σ_ε²) {(α + β)^h − φ^{2h}}/(α + β − φ²)

if φ² ≠ α + β, and

Var(X_{t+h} ∣ X_u, u ≤ t) = σ_ε² (1 − φ^{2h})/(1 − φ²) + (σ²_{t+1} − σ_ε²) h φ^{2(h−1)}

if φ² = α + β. The coefficient of σ²_{t+1} − σ_ε² always being positive, it can be seen that the variance of the prediction at horizon h increases linearly with the difference between the conditional variance at time t + 1 and the unconditional variance of ε_t. A large negative difference (corresponding to a low-volatility period) thus results in highly accurate predictions. Conversely, the accuracy deteriorates when σ²_{t+1} is large. When the horizon h increases, the importance of this factor decreases. If h tends to infinity, we retrieve the unconditional variance of X_t:

lim_{h→∞} Var(X_{t+h} ∣ X_u, u ≤ t) = σ_ε²/(1 − φ²) = Var(X_t).

Now we consider two non-stationary situations. If |φ| = 1, initialising, for instance at 0, all the variables at negative dates (because here the infinite pasts of X_t and ε_t do not coincide), the previous formula becomes

Var(X_{t+h} ∣ X_u, u ≤ t) = h σ_ε² + (σ²_{t+1} − σ_ε²) {1 − (α + β)^h}/(1 − α − β).

Thus, the impact of the observations before time t does not vanish as h increases. It becomes negligible, however, compared to the deterministic part, which is proportional to h. If α + β = 1 (IGARCH(1, 1) errors), we have

Var(X_{t+h} ∣ X_u, u ≤ t) = σ²_{t+1} (1 − φ^{2h})/(1 − φ²) + ω Σ_{i=0}^{h−1} φ^{2i}(h − i − 1),

and it can be seen that the impact of the past variables on the variance of the predictions remains constant as the horizon increases. This phenomenon is called persistence of shocks on the volatility. Note, however, that, as in the preceding case, the non-random part of the decomposition of Var(ε_{t+i} ∣ ε_u, u < t) becomes dominant when the horizon tends to infinity. The asymptotic precision of the predictions of ε_t is null, and this is also the case for X_t since

Var(X_{t+h} ∣ X_u, u ≤ t) ≥ ω Σ_{i=0}^{h−1} φ^{2i}(h − i − 1) → ∞  as h → ∞.
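
The variance decompositions above translate directly into a small Python function (ours; the parameter values in the example are illustrative), obtained by summing φ^{2i}E(ε²_{t+h−i} ∣ X_u, u ≤ t); it covers the IGARCH case α + β = 1 as well.

def ar1_garch_pred_var(phi, omega, alpha, beta, sig2_next, h):
    # Var(X_{t+h} | X_u, u <= t) for model (2.65); sig2_next = sigma_{t+1}^2
    lam = alpha + beta
    total = 0.0
    for i in range(h):
        j = h - i                      # innovation horizon, j = 1..h
        # E(eps_{t+j}^2 | past) = omega*(1 + lam + ... + lam^(j-2)) + lam^(j-1)*sig2_next
        ej = omega * sum(lam ** k for k in range(j - 1)) + lam ** (j - 1) * sig2_next
        total += phi ** (2 * i) * ej
    return total

# accuracy depends on the volatility at the forecast origin:
print(ar1_garch_pred_var(0.9, 1.0, 0.1, 0.8, sig2_next=5.0,  h=10))  # calm date
print(ar1_garch_pred_var(0.9, 1.0, 0.1, 0.8, sig2_next=50.0, h=10))  # turbulent date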

2.7 Bibliographical Notes

The strict stationarity of the GARCH(1, 1) model was first studied by Nelson (1990a) under the assumption Eη_t² < ∞. His results were extended by Klüppelberg, Lindner, and Maller (2004) to the case Eη_t² = ∞. For GARCH(p, q) models, the strict stationarity conditions were established by Bougerol and Picard (1992b). For model (2.16), where (A_t, b_t) is a strictly stationary and ergodic sequence with a logarithmic moment, Brandt (1986) showed that γ < 0 ensures the existence of a unique strictly stationary solution. In the case where (A_t, b_t) is iid, Bougerol and Picard (1992a) established the converse property, showing that, under an irreducibility condition, a necessary condition for the existence of a strictly stationary and non-anticipative solution is that γ < 0. Liu (2006) used representation (2.30) to obtain stationarity conditions for more general GARCH models. The second-order stationarity condition for the GARCH(p, q) model was obtained by Bollerslev (1986), as well as the existence of an ARMA representation for the square of a GARCH (see also Bollerslev 1988). Nelson and Cao (1992) obtained necessary and sufficient positivity conditions for the GARCH(p, q) model. These results were extended by Tsai and Chan (2008). The 'zero-drift' GARCH(1, 1) model in which ω = 0 was studied by Hafner and Preminger (2015) and Li et al. (2017). In the latter article it was shown that, though non-stationary, the zero-drift GARCH(1, 1) model is stable when γ = 0, with its sample paths oscillating randomly between zero and infinity over time.

GARCH models are not the only ones that generate uncorrelated time series with correlated squares. Another class of interest which shares these characteristics is that of the all-pass time series models (ARMA models in which the roots of the AR polynomial are reciprocals of roots of the MA polynomial and vice versa) studied by Breidt, Davis, and Trindade (2001).

ARCH(∞) models were introduced by Robinson (1991); see Giraitis, Leipus, and Surgailis (2009) for the study of these models. The condition for the existence of a strictly stationary ARCH(∞) process was established by Robinson and Zaffaroni (2006) and Douc, Roueff, and Soulier (2008). The condition for the existence of a second-order stationary solution, as well as the positivity of the autocovariances of the squares, were obtained by Giraitis, Kokoszka, and Leipus (2000a). Theorems 2.8 and 2.7 were proven by Kazakevičius and Leipus (2002, 2003). The uniqueness of an ARCH(∞) solution is discussed in Kazakevičius and Leipus (2007). The asymptotic properties of quasi-maximum likelihood estimators (see Chapter 7) were established by Robinson and Zaffaroni (2006). See Doukhan, Teyssière, and Winant (2006) for the study of multivariate extensions of ARCH(∞) models. The introduction of FIGARCH models is due to Baillie, Bollerslev, and Mikkelsen (1996), but the existence of solutions was only recently established by Douc, Roueff, and Soulier (2008), where Corollary 2.6 is proven. LARCH(∞) models were introduced by Robinson (1991) and their probability properties studied by Giraitis, Robinson, and Surgailis (2000b), Giraitis and Surgailis (2002), Berkes and Horváth (2003a), and Giraitis et al. (2004). The estimation of such models has been studied by Beran and Schützner (2009), Truquet (2008), and Francq and Zakoïan (2009c).

The fourth-order moment structure and the autocovariances of the squares of GARCH processes were analysed by Milhøj (1984), Karanasos (1999), and He and Teräsvirta (1999). The necessary and sufficient condition for the existence of even-order moments was established by Ling and McAleer (2002a), the sufficient part having been obtained by Chen and An (1998). Ling and McAleer (2002b) derived an existence condition for the moment of order s, with s > 0, for a family of GARCH processes including the standard model and the extensions presented in Chapter 4. The computation of the kurtosis coefficient for a general GARCH(p, q) model is due to Bai, Russell, and Tiao (2004).

Several authors have studied the tail properties of the stationary distribution. See Mikosch and Stărică (2000), Borkovec and Klüppelberg (2001), Basrak, Davis, and Mikosch (2002), and Davis and Mikosch (2009a,b). See Embrechts, Klüppelberg and Mikosch (1997) for a classical reference on extremal events.

Andersen and Bollerslev (1998) discussed the predictive qualities of GARCH, making a clear distinction between the prediction of volatility and that of the squared returns (Exercise 2.23).

2.8 Exercises

2.1 (Non-correlation of ε_t with any function of its past?)

For a GARCH process, does Cov(ε_t, f(ε_{t−h})) = 0 hold for any function f and any h > 0?

2.2 (Strict stationarity of GARCH(1, 1) for two laws of η_t) For the GARCH(1, 1) model, give an explicit strict stationarity condition in the two following cases: (i) the only possible values of η_t are −1 and 1; (ii) η_t follows a uniform distribution.
2.3 (Lyapunov coefficient of a constant sequence of matrices) Prove equality (2.21) for a diagonalisable matrix. Use the Jordan representation to extend this result to any square matrix.
2.4 (Lyapunov coefficient of a sequence of matrices) Consider the sequence (A_t) defined by A_t = z_t A, where (z_t) is an ergodic sequence of real random variables such that E log⁺|z_t| < ∞, and A is a square non-random matrix. Find the Lyapunov coefficient γ of the sequence (A_t) and give an explicit expression for the condition γ < 0.
2.5 (Alternative definition of the top Lyapunov exponent)
  1. Show that, everywhere in Theorem 2.3, the matrix product A_t A_{t−1} ⋯ A_1 can be replaced by A_{−1} A_{−2} ⋯ A_{−t}.
  2. Justify the first convergence after (2.25).
2.6 (Multiplicative norms) Show the results of footnote 6 on page 28.
2.7 (Another vector representation of the GARCH(p, q) model) Verify that the vector z_t = (σ²_{t+1}, …, σ²_{t−p+2}, ε_t², …, ε²_{t−q+2})′ allows us to define, for p ≥ 1 and q ≥ 2, a vector representation which is equivalent to the one used in this chapter, of the form z_t = b_t + A_t z_{t−1}.
2.8 (Fourth-order moment of an ARCH(2) process) Show that for an ARCH(2) model, the condition for the existence of the moment of order 4, with η_t ∼ N(0, 1), is written as
    equation

    Compute this moment.

2.9 (Direct computation of the autocorrelations and autocovariances of the square of a GARCH(1, 1) process) Find the autocorrelation and autocovariance functions of (ε_t²) when (ε_t) is the solution of the GARCH(1, 1) model
ε_t = σ_t η_t,  σ_t² = ω + αε²_{t−1} + βσ²_{t−1},

where η_t ∼ N(0, 1) and 1 − 3α² − β² − 2αβ > 0.

2.10 (Computation of the autocovariance of the square of a GARCH(1, 1) process by the general method) Use the method of Section 2.5.3 to find the autocovariance function of (ε_t²) when (ε_t) is the solution of a GARCH(1, 1) model. Compare with the method used in Exercise 2.9.
2.11 (Fekete's lemma for subadditive sequences) A sequence (a_n)_{n≥1} is called subadditive if a_{n+m} ≤ a_n + a_m for all n and m. Show that we then have lim_{n→∞} a_n/n = inf_{n≥1} a_n/n.
2.12 (Characteristic polynomial of EA_t) Let A = EA_t, where {A_t, t ∈ ℤ} is the sequence defined in Eq. (2.17).
  1. Prove equality (2.38).
  2. If Σ_{i=1}^q α_i + Σ_{j=1}^p β_j = 1, show that ρ(A) = 1.
2.13 (A condition for a sequence X_n to be o(n)) Let (X_n) be a sequence of identically distributed random variables admitting a finite expectation. Show that

lim_{n→∞} X_n/n = 0

with probability 1. Prove that the convergence may fail if the expectation of X_n does not exist (an iid sequence with density f(x) = x^{−2}𝟙_{[1,∞)}(x) may be considered).

2.14 (Expectation of a product of dependent random variables equal to the product of their expectations) Prove equality (2.31).
2.15 (Necessary condition for the existence of the moment of order 2s) Suppose that (ε_t) is the strictly stationary solution of model (2.5) with E|ε_t|^{2s} < ∞ for s ∈ (0, 1]. Let

2.66 equation

  1. Show that when K → ∞,
     equation
  2. Use this result to prove that E(‖A_k A_{k−1} ⋯ A_1 b_0‖^s) → 0 as k → ∞.
  3. Let (X_n) be a sequence of ℓ × m matrices and Y = (Y_1, …, Y_m)′ a vector which is independent of (X_n) and such that, for all i, 0 < E|Y_i|^s < ∞. Show that, when n → ∞,
     E(‖X_n Y‖^s) → 0  ⟹  E(‖X_n‖^s) → 0.
  4. Let A = EA_t, b = Eb_t, and suppose there exists an integer N such that A^N b > 0 (in the sense that all elements of this vector are strictly positive). Show that there exists k_0 ≥ 1 such that
     equation
  5. Deduce condition (2.35) from the preceding question.
  6. Is the condition α_1 + β_1 > 0 necessary?
2.16 (Positivity conditions) In the GARCH(1, q) case, give a more explicit form for the conditions in (2.49). Show, by taking q = 2, that these conditions are less restrictive than (2.20).
2.17 (A lower bound for the first autocorrelations of the square of an ARCH) Let (ε_t) be an ARCH(q) process admitting moments of order 4. Show that, for i = 1, …, q,

ρ_{ε²}(i) ≥ α_i.
2.18 (Asymptotic decrease of the autocorrelations of the square of an ARCH(2) process) Figure 2.12 shows that the first autocorrelations of the square of an ARCH(2) process, admitting moments of order 4, can be non-decreasing. Show that this sequence decreases after a certain lag.
2.19 (Convergence in probability to −∞) Show that if (X_n) and (Y_n) are two independent sequences of random variables such that X_n + Y_n → −∞ in probability and (X_n) is bounded in probability, then Y_n → −∞ in probability.
2.20 (GARCH model with a random coefficient) Proceeding as in the proof of Theorem 2.1, study the stationarity of the GARCH(1, 1) model with random coefficient ω = ω(η_{t−1}),

ε_t = σ_t η_t,  σ_t² = ω(η_{t−1}) + a(η_{t−1})σ²_{t−1},  (2.67)

under the usual assumptions and ω(η_{t−1}) > 0 a.s. Use the result of Exercise 2.19 to deal with the case γ ≔ E log a(η_t) = 0.

2.21 (RiskMetrics model) The RiskMetrics model used to compute the value at risk (see Chapter 11) relies on the following equations:

ε_t = σ_t η_t,  σ_t² = λσ²_{t−1} + (1 − λ)ε²_{t−1},

where 0 < λ < 1. Show that this model has no non-trivial stationary solution.

2.22 (IARCH(∞) models: proof of Corollary 2.6)
  1. In model (2.40), under the assumption A_1 = 1, show that
     equation
  2. Suppose that condition (2.41) holds. Show that the function f : [p, 1] → ℝ defined by f(q) = log(A_q μ_{2q}) is convex. Compute its derivative at 1 and deduce that condition (2.52) holds.
  3. Establish the converse and show that E|ε_t|^q < ∞ for any q ∈ [0, 2).
2.23 (On the predictive power of GARCH) In order to evaluate the quality of the prediction of ε_t² obtained by using the volatility of a GARCH(1, 1) model, econometricians have considered the linear regression

ε_t² = a + bσ_t² + u_t,

where σ_t² is replaced by the volatility estimated from the model. They generally obtained a very small coefficient of determination, meaning that the quality of the regression was bad. Is this surprising? In order to answer this question compute, under the assumption that Eε_t⁴ exists, the theoretical R² defined by

R² = 1 − Var(u_t)/Var(ε_t²).

Show, in particular, that R² < 1/κ_η.
