8: Tests Based on the Likelihood


In the previous chapter, we saw that the asymptotic normality of the QMLE of a GARCH model holds true under general conditions, in particular without any moment assumption on the observed process. An important application of this result concerns testing problems. In particular, we are able to test the IGARCH assumption, or more generally a given GARCH model with infinite variance. This problem is the subject of Section 8.1.

The main aim of this chapter is to derive tests for the nullity of coefficients. These tests are complex in the GARCH case, because of the constraints that are imposed on the estimates of the coefficients to guarantee that the estimated conditional variance is positive. Without these constraints, it is impossible to compute the Gaussian log-likelihood of the GARCH model. Moreover, asymptotic normality of the QMLE has been established assuming that the parameter belongs to the interior of the parameter space (assumption A5 in Chapter 7). When some coefficients α_i or β_j are null, Theorem 7.2 does not apply. It is easy to see that, in such a situation, the asymptotic distribution of √n(θ̂_n − θ₀) cannot be Gaussian. Indeed, the components θ̂_in of θ̂_n are constrained to be positive or null. If, for instance, θ_0i = 0 then √n(θ̂_in − θ_0i) = √n θ̂_in ≥ 0 for all n and the asymptotic distribution of this variable cannot be Gaussian.

Before considering significance tests, we shall therefore establish in Section 8.2 the asymptotic distribution of the QMLE without assumption A5, at the cost of a moment assumption on the observed process. In Section 8.3, we present the main tests (Wald, score and likelihood ratio) used for testing the nullity of some coefficients. The asymptotic distribution obtained for the QMLE will lead to modification of the standard critical regions. Two cases of particular interest will be examined in detail: the test of nullity of only one coefficient and the test of conditional homoscedasticity, which corresponds to the nullity of all the coefficients α_i and β_j. Section 8.4 is devoted to testing the adequacy of a particular GARCH(p, q) model, using portmanteau tests. The chapter also contains a numerical application in which the preeminence of the GARCH(1, 1) model is questioned.

8.1 Test of the Second-Order Stationarity Assumption

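For the GARCH(p, q) model defined by (7.1), testing for second-order stationarity involves testing

$$H_0 : \sum_{i=1}^{q} \alpha_{0i} + \sum_{j=1}^{p} \beta_{0j} < 1 \quad \text{against} \quad H_1 : \sum_{i=1}^{q} \alpha_{0i} + \sum_{j=1}^{p} \beta_{0j} \ge 1.$$

Introducing the vector c = (0, 1, …, 1)′ ∈ ℝ^{p+q+1}, the testing problem is

$$H_0 : c'\theta_0 < 1 \quad \text{against} \quad H_1 : c'\theta_0 \ge 1. \tag{8.1}$$

In view of Theorem 7.2, the QMLE θ̂_n = (ω̂_n, α̂_1n, …, α̂_qn, β̂_1n, …, β̂_pn) of θ₀ satisfies

$$\sqrt{n}(\hat\theta_n - \theta_0) \xrightarrow{\mathcal L} \mathcal N\{0, (\kappa_\eta - 1)J^{-1}\},$$

under assumptions which are compatible with H₀ and H₁. In particular, if c′θ₀ = 1 we have

$$\sqrt{n}(c'\hat\theta_n - 1) \xrightarrow{\mathcal L} \mathcal N\{0, (\kappa_\eta - 1)c'J^{-1}c\}.$$

It is thus natural to consider the Wald statistic

$$T_n = \frac{\sqrt{n}\left(\sum_{i=1}^{q} \hat\alpha_{in} + \sum_{j=1}^{p} \hat\beta_{jn} - 1\right)}{\{(\hat\kappa_\eta - 1)c'\hat J^{-1}c\}^{1/2}},$$

where κ̂_η and Ĵ are consistent estimators in probability of κ_η and J. The following result follows immediately from the convergence of T_n to N(0, 1) when c′θ₀ = 1.

Proposition 8.1 (Critical region of stationarity test) Under the assumptions of Theorem 7.2, a test of (8.1) at the asymptotic level α is defined by the rejection region

$$C = \{T_n > \Phi^{-1}(1 - \alpha)\},$$

where Φ is the N(0, 1) cumulative distribution function.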

Table 8.1 Test of the infinite variance assumption for 11 stock market returns. Estimated standard deviations are in parentheses.

Index	α̂ + β̂	p-value
CAC	0.983 (0.007)	0.0089
DAX	0.981 (0.011)	0.0385
DJA	0.982 (0.007)	0.0039
DJI	0.986 (0.006)	0.0061
DJT	0.983 (0.009)	0.0023
DJU	0.983 (0.007)	0.0060
FTSE	0.990 (0.006)	0.0525
Nasdaq	0.993 (0.003)	0.0296
Nikkei	0.980 (0.007)	0.0017
SMI	0.962 (0.015)	0.0050
S&P 500	0.989 (0.005)	0.0157

Note that for most real series (see, for instance, Table 7.4), the sum of the estimated coefficients α̂ and β̂ is strictly less than 1: second-order stationarity thus cannot be rejected, for any reasonable asymptotic level (when T_n < 0, the p-value of the test is greater than 1/2). Of course, the non-rejection of H₀ does not mean that stationarity is proved. It is interesting to test the reverse assumption that the data generating process is an IGARCH, or more generally that it does not have moments of order 2. We thus consider the problem

$$H_0 : c'\theta_0 \ge 1 \quad \text{against} \quad H_1 : c'\theta_0 < 1. \tag{8.2}$$

Proposition 8.2 (Critical region of nonstationarity test) Under the assumptions of Theorem 7.2, a test of (8.2) at the asymptotic level α is defined by the rejection region

$$C = \{T_n < \Phi^{-1}(\alpha)\}.$$

As an application, we take up the data sets of Table 7.4 again, and we give the p-values of the previous test for the 11 series of daily returns. For the FTSE (DAX, Nasdaq, S&P 500), the assumption of infinite variance cannot be rejected at the 5% (3%, 2%, 1%) level (see Table 8.1). The other series can be considered as second-order stationary (if one believes in the GARCH(1, 1) model, of course).
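The p-values of the nonstationarity test can be approximated from the printed estimates. The following minimal sketch assumes that the p-value of the test of (8.2) is Φ(T_n) and that the figure in parentheses in Table 8.1 is the estimated standard deviation of α̂ + β̂, so that T_n ≈ (α̂ + β̂ − 1)/standard deviation. The function names are illustrative, and because the table entries are rounded to three decimals the output only approximates the published p-values.

```python
import math


def normal_cdf(x: float) -> float:
    """Standard normal cumulative distribution function Phi(x)."""
    return 0.5 * (1.0 + math.erf(x / math.sqrt(2.0)))


def nonstationarity_p_value(coef_sum: float, std_dev: float) -> float:
    """Approximate p-value Phi(T_n) for H0: c'theta0 >= 1 against H1: c'theta0 < 1."""
    t_n = (coef_sum - 1.0) / std_dev  # assumed form of T_n from rounded inputs
    return normal_cdf(t_n)


# (estimated alpha + beta, estimated standard deviation) as printed in Table 8.1
rounded_inputs = {"CAC": (0.983, 0.007), "Nikkei": (0.980, 0.007), "SMI": (0.962, 0.015)}
p_values = {name: nonstationarity_p_value(s, sd) for name, (s, sd) in rounded_inputs.items()}

for name, p in p_values.items():
    print(f"{name}: approximate p-value = {p:.4f}")
```

Small p-values lead to rejecting the infinite variance assumption; the recomputed values have the same order of magnitude as those of Table 8.1 but do not reproduce them exactly.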

8.2 Asymptotic Distribution of the QML When θ₀ is at the Boundary

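In view of (7.3) and (7.9), the QMLE θ̂_n is constrained to have a strictly positive first component, while the other components are constrained to be positive or null. A general technique for determining the distribution of a constrained estimator involves expressing it as a function of the unconstrained estimator (see Gouriéroux and Monfort, 1995). For the QMLE of a GARCH, this technique does not work because the objective function

$$\tilde l_n(\theta) = n^{-1}\sum_{t=1}^{n} \tilde\ell_t(\theta), \qquad \tilde\ell_t(\theta) = \frac{\epsilon_t^2}{\tilde\sigma_t^2(\theta)} + \log \tilde\sigma_t^2(\theta),$$

cannot be computed outside Θ (for an ARCH(1), it may happen that σ̃_t² := ω + α₁ε²_{t−1} is negative when α₁ < 0). It is thus impossible to define the unconstrained estimator.

The technique that we will utilize here (see, in particular, Andrews, 1999) involves writing θ̂_n with the aid of the normalized score vector, evaluated at θ₀:

$$Z_n := -J_n^{-1} n^{1/2} \frac{\partial l_n(\theta_0)}{\partial \theta}, \tag{8.3}$$

with

$$l_n(\theta) = n^{-1}\sum_{t=1}^{n} \ell_t(\theta), \qquad \ell_t(\theta) = \frac{\epsilon_t^2}{\sigma_t^2(\theta)} + \log \sigma_t^2(\theta), \qquad J_n = \frac{\partial^2 l_n(\theta_0)}{\partial\theta\,\partial\theta'},$$

where the components of ∂l_n(θ₀)/∂θ and of J_n are right derivatives (see (a) in the proof of Theorem 8.1).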

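In the proof of Theorem 7.2, we showed that

$$\sqrt{n}(\hat\theta_n - \theta_0) = Z_n + o_P(1), \quad \text{when } \theta_0 \in \mathring{\Theta}. \tag{8.4}$$

For any value of θ₀ ∈ Θ (even when θ₀ does not belong to the interior of Θ), it will be shown that the vector Z_n is well defined and satisfies

$$Z_n \xrightarrow{\mathcal L} Z \sim \mathcal N\{0, (\kappa_\eta - 1)J^{-1}\}, \tag{8.5}$$

provided J exists. By contrast, when

$$\theta_0 \in \partial\Theta := \{\theta \in \Theta : \theta_i = 0 \text{ for some } i > 1\},$$

equation (8.4) is no longer valid. However, we will show that the asymptotic distribution of n^{1/2}(θ̂_n − θ₀) is well approximated by that of the vector n^{1/2}(θ − θ₀) which is located at the minimal distance of Z_n, under the constraint θ ∈ Θ. Consider thus a random vector θ_{J_n}(Z_n) (which is not an estimator, of course) solving the minimization problem

$$\theta_{J_n}(Z_n) = \arg\inf_{\theta \in \Theta} \{Z_n - \sqrt{n}(\theta - \theta_0)\}' J_n \{Z_n - \sqrt{n}(\theta - \theta_0)\}. \tag{8.6}$$

It will be shown that J_n converges to the positive definite matrix J. For n large enough, we thus have

$$\theta_{J_n}(Z_n) = \arg\inf_{\theta \in \Theta} \operatorname{dist}^2_{J_n}\{Z_n, \sqrt{n}(\theta - \theta_0)\},$$

where dist_{J_n}(x, y) := {(x − y)′J_n(x − y)}^{1/2} is a distance between two points x and y of ℝ^{p+q+1}, and where the distance between a point x and a subset S of ℝ^{p+q+1} is defined by dist_{J_n}(x, S) = inf_{s∈S} dist_{J_n}(x, s).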

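We allow θ₀ to have null components, but we do not consider the (less interesting) case where θ₀ reaches another boundary of Θ. More precisely, we assume that

$$\text{B1:}\quad \theta_0 \in (\underline{\omega}, \overline{\omega}) \times [0, \overline{\theta}_2) \times \cdots \times [0, \overline{\theta}_{p+q+1}) \subset \Theta,$$

where 0 < ω̲ < ω̄ and 0 < min{θ̄₂, …, θ̄_{p+q+1}}. In this case √n(θ_{J_n}(Z_n) − θ₀) and √n(θ̂_n − θ₀) belong to the 'local parameter space'

$$\Lambda := \bigcup_{n} \sqrt{n}(\Theta - \theta_0) = \Lambda_1 \times \cdots \times \Lambda_{p+q+1}, \tag{8.7}$$

where Λ₁ = ℝ and, for i = 2, …, p + q + 1, Λ_i = ℝ if θ_0i ≠ 0 and Λ_i = [0, ∞) if θ_0i = 0. With the notation

$$\lambda_n^{\Lambda} = \arg\inf_{\lambda \in \Lambda} \{\lambda - Z_n\}' J_n \{\lambda - Z_n\},$$

we thus have, with probability 1,

$$\sqrt{n}(\theta_{J_n}(Z_n) - \theta_0) = \lambda_n^{\Lambda} \quad \text{for } n \text{ large enough}. \tag{8.8}$$

The vector λ_n^Λ is the projection of Z_n on Λ, with respect to the norm ‖x‖²_{J_n} := x′J_n x (see Figure 8.1). Since Λ is closed and convex, such a projection is unique. We will show that

$$\sqrt{n}(\hat\theta_n - \theta_0) = \lambda_n^{\Lambda} + o_P(1). \tag{8.9}$$

Since (Z_n, J_n) tends in law to (Z, J) and λ_n^Λ is a function of (Z_n, J_n) which is continuous everywhere except at the points where J_n is singular (that is, almost everywhere with respect to the distribution of (Z, J) because J is invertible), we have λ_n^Λ → λ^Λ in law, where λ^Λ is the solution of the limiting problem

$$\lambda^{\Lambda} := \arg\inf_{\lambda \in \Lambda} \{\lambda - Z\}' J \{\lambda - Z\}, \qquad Z \sim \mathcal N\{0, (\kappa_\eta - 1)J^{-1}\}. \tag{8.10}$$

Figure 8.1 ARCH(1) model with θ₀ = (ω₀, 0) and Θ = [ω̲, ω̄] × [0, ᾱ]: √n(Θ − θ₀) = [−√n(ω₀ − ω̲), √n(ω̄ − ω₀)] × [0, √n(ᾱ − α₀)] is the gray area; √n(θ̂_n − θ₀) and λ_n^Λ have the same asymptotic distribution.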

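In addition to B1, we retain most of the assumptions of Theorem 7.2:

B2: θ₀ ∈ Θ and Θ is a compact set.

B3: γ(A₀) < 0 and for all θ ∈ Θ, Σ_{j=1}^p β_j < 1.

B4: η_t² has a nondegenerate distribution with Eη_t² = 1.

B5: If p > 0, 𝒜_θ₀(z) and ℬ_θ₀(z) do not have common roots, 𝒜_θ₀(1) ≠ 0, and α_0q + β_0p ≠ 0.

B6: κ_η = Eη_t⁴ < ∞.

We also need the following moment assumption:

B7: Eε_t⁶ < ∞.

When θ₀ belongs to the interior of Θ, we can show the existence of the information matrix

$$J = E_{\theta_0}\left\{\frac{1}{\sigma_t^4(\theta_0)} \frac{\partial \sigma_t^2(\theta_0)}{\partial\theta} \frac{\partial \sigma_t^2(\theta_0)}{\partial\theta'}\right\}$$

without moment assumptions similar to B7. The following example shows that, in the ARCH case, this is no longer possible when we allow θ₀ ∈ ∂Θ.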

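Example 8.1 (The existence of J may require a moment of order 4) Consider the ARCH(2) model

$$\epsilon_t = \sigma_t \eta_t, \qquad \sigma_t^2 = \omega_0 + \alpha_{01}\epsilon_{t-1}^2 + \alpha_{02}\epsilon_{t-2}^2, \tag{8.11}$$

where the true values of the parameters are such that ω₀ > 0, α₀₁ ≥ 0, α₀₂ = 0, and the distribution of the iid sequence (η_t) is defined, for a > 1, by

$$P(\eta_t = a) = P(\eta_t = -a) = \frac{1}{2a^2}, \qquad P(\eta_t = 0) = 1 - \frac{1}{a^2}.$$

The process (ε_t) is always stationary, for any value of α₀₁ (since exp{−E(log η_t²)} = +∞, the strict stationarity constraint (2.10) holds true). By contrast, ε_t does not possess moments of order 2 when α₀₁ ≥ 1 (see Proposition 2.2).

We have

$$\frac{1}{\sigma_t^2(\theta_0)} \frac{\partial \sigma_t^2(\theta_0)}{\partial \alpha_2} = \frac{\epsilon_{t-2}^2}{\omega_0 + \alpha_{01}\epsilon_{t-1}^2},$$

so that

$$E\left\{\frac{1}{\sigma_t^2(\theta_0)} \frac{\partial \sigma_t^2(\theta_0)}{\partial \alpha_2}\right\}^2 \ge E\left[\left\{\frac{\epsilon_{t-2}^2}{\omega_0 + \alpha_{01}\epsilon_{t-1}^2}\right\}^2 \Big| \eta_{t-1} = 0\right] P(\eta_{t-1} = 0) = \frac{1}{\omega_0^2}\left(1 - \frac{1}{a^2}\right) E(\epsilon_{t-2}^4),$$

because on the one hand η_{t−1} = 0 entails ε_{t−1} = 0, and on the other hand η_{t−1} and ε_{t−2} are independent. Consequently, if Eε_t⁴ = ∞ then the matrix J does not exist.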

We then have the following result.

Theorem 8.1 (QML asymptotic distribution at the boundary) Under assumptions B1–B7, the asymptotic distribution of √n(θ̂_n − θ₀) is that of λ^Λ satisfying (8.10), where Λ is given by (8.7).

Remark 8.1 (We retrieve the standard results in the interior of Θ) For θ₀ in the interior of Θ, the result is shown in Theorem 7.2. Indeed, in this case Λ = ℝ^{p+q+1} and

$$\lambda^{\Lambda} = Z \sim \mathcal N\{0, (\kappa_\eta - 1)J^{-1}\}.$$

Theorem 8.1 is thus only of interest when θ₀ is at the boundary ∂Θ of the parameter space.

Remark 8.2 (The moment condition B7 can sometimes be relaxed) Apart from the ARCH(q) case, it is sometimes possible to get rid of the moment assumption B7. Note that under the condition γ(A₀) < 0, we have σ_t²(θ₀) = b₀₀ + Σ_{j≥1} b_0j ε²_{t−j} with b₀₀ > 0, b_0j ≥ 0. The derivatives ∂σ_t²/∂θ_k have the form of similar series. It can be shown that the ratio {∂σ_t²/∂θ}/σ_t² admits moments of all orders whenever any term ε²_{t−j} which appears in the numerator is also present in the denominator. This allows us to show (see the references at the end of the chapter) that, in the theorem, assumption B7 can be replaced by

B7′: b_0j > 0 for all j ≥ 1, where σ_t²(θ₀) = b₀₀ + Σ_{j≥1} b_0j ε²_{t−j}.

Note that a sufficient condition for B7′ is α₀₁ > 0 and β₀₁ > 0 (because b_0j ≥ α₀₁β₀₁^{j−1}). A necessary condition is obviously that α₀₁ > 0 (because b₀₁ = α₀₁). Finally, a necessary and sufficient condition for B7′ is

$$\{j : \beta_{0j} > 0\} \ne \emptyset \quad \text{and} \quad \prod_{i=1}^{j_0} \alpha_{0i} > 0 \quad \text{for } j_0 = \min\{j : \beta_{0j} > 0\}.$$

Obviously, according to Example 8.1, assumption B7′ is not satisfied in the ARCH case.

8.2.1 Computation of the Asymptotic Distribution

In this section, we will show how to compute the solutions of (8.10). Switching the components of θ, if necessary, it can be assumed without loss of generality that the vector θ₀⁽¹⁾ of the first d₁ components of θ₀ has strictly positive elements and that the vector θ₀⁽²⁾ of the last d₂ = p + q + 1 − d₁ components of θ₀ is null. This can be written as

$$K\theta_0 = 0_{d_2 \times 1}, \qquad K = (0_{d_2 \times d_1}, I_{d_2}). \tag{8.12}$$

More generally, it will be useful to consider all the subsets of these constraints. Let

$$\mathcal K = \{K_1, \dots, K_{2^{d_2} - 1}\}$$

be the set of the matrices obtained by deleting no, one, or several (but not all) rows of K. Note that the solution of the constrained minimization problem (8.10) is the unconstrained solution λ = Z when the latter satisfies the constraint, that is, when

$$Z \in \Lambda = \mathbb R^{d_1} \times [0, \infty)^{d_2} = \{\lambda \in \mathbb R^{p+q+1} : K\lambda \ge 0\}.$$

When Z ∉ Λ, the solution λ^Λ coincides with that of an equality constrained problem of the form

$$\lambda_{K_i} = \arg\min_{\lambda : K_i\lambda = 0} (\lambda - Z)'J(\lambda - Z), \qquad K_i \in \mathcal K.$$

An important difference, compared to the initial minimization program (8.10), is that the minimization is done here on a vector space. The solution is given by a projection (nonorthogonal when J is not the identity matrix). We thus obtain (see Exercise 8.1)

$$\lambda_{K_i} = P_i Z, \quad \text{where} \quad P_i = I_{p+q+1} - J^{-1}K_i'(K_iJ^{-1}K_i')^{-1}K_i \tag{8.13}$$

is the projection matrix (orthogonal for the metric defined by J) on the orthogonal subspace of the space generated by the rows of K_i. Note that λ_{K_i} does not necessarily belong to Λ because K_iλ = 0 does not imply that Kλ ≥ 0. Let 𝒞 = {λ_{K_i} : K_i ∈ 𝒦 and Kλ_{K_i} ≥ 0} be the class of the admissible solutions. It follows that the solution that we are looking for is

$$\lambda^{\Lambda} = Z\,1_{\Lambda}(Z) + 1_{\Lambda^c}(Z) \times \arg\min_{\lambda \in \mathcal C} Q(\lambda), \qquad Q(\lambda) = (\lambda - Z)'J(\lambda - Z).$$

This formula can be used in practice to obtain realizations of λ^Λ from realizations of Z. The Q(λ_{K_i}) can be obtained by writing

$$Q(P_iZ) = Z'K_i'(K_iJ^{-1}K_i')^{-1}K_iZ. \tag{8.14}$$

Another expression (of theoretical interest) for λ^Λ is

$$\lambda^{\Lambda} = \sum_{i=0}^{2^{d_2} - 1} P_iZ\,1_{\mathcal D_i}(Z), \qquad P_0 = I_{p+q+1},$$

where 𝒟₀ = Λ and the 𝒟_i form a partition of ℝ^{p+q+1}. Indeed, according to the zone to which Z belongs, a solution λ^Λ = λ_{K_i} is obtained. We will make these formulas explicit in a few examples. Let d = p + q + 1, z⁺ = z 1_{(0,+∞)}(z) and z⁻ = z 1_{(−∞,0)}(z).

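Example 8.2 (Law when only one component is at the boundary) When d₂ = 1, that is, when only the last component of θ₀ is zero, we have

$$\Lambda = \mathbb R^{d_1} \times [0, \infty), \qquad K = (0, \dots, 0, 1), \qquad \mathcal K = \{K\},$$

and

$$P_1 = I_d - J^{-1}K'(KJ^{-1}K')^{-1}K.$$

We finally obtain

$$\lambda^{\Lambda} = Z\,1_{\{Z_d \ge 0\}} + P_1Z\,1_{\{Z_d < 0\}} = Z - Z_d^{-}c,$$

where c is the last column of J⁻¹ divided by the (d, d)th element of this matrix. Note that the last component of λ^Λ is Z_d⁺. Noting that J⁻¹ is, up to a multiplicative factor, the variance of Z, it can also be seen that

$$\lambda_i^{\Lambda} = Z_i - Z_d^{-}\,\frac{\operatorname{Cov}(Z_i, Z_d)}{\operatorname{Var}(Z_d)}, \qquad i = 1, \dots, d. \tag{8.15}$$

Thus λ_i^Λ = Z_i if and only if Cov(Z_i, Z_d) = 0.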
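The projection of Example 8.2 is simple enough to evaluate directly. The sketch below assumes the reading λ^Λ = Z − Z_d⁻ c, with c the last column of J⁻¹ divided by its (d, d)th element; since c is unchanged when J⁻¹ is rescaled, any matrix proportional to J⁻¹ (for instance the variance of Z) can be supplied. The function name and the numerical matrix are hypothetical.

```python
def project_one_boundary(z, sigma):
    """Return lambda^Lambda = Z - Z_d^- * c when only the last component is constrained.

    sigma is any matrix proportional to J^{-1}, given as a list of rows.
    """
    d = len(z)
    z_d_minus = min(z[-1], 0.0)  # z^- = z * 1{z < 0}
    c = [sigma[i][d - 1] / sigma[d - 1][d - 1] for i in range(d)]
    return [z[i] - z_d_minus * c[i] for i in range(d)]


# Hypothetical variance matrix of Z, chosen only for illustration
sigma = [[2.0, -0.5], [-0.5, 1.0]]

inside = project_one_boundary([0.3, 0.8], sigma)    # Z already in Lambda: unchanged
outside = project_one_boundary([0.3, -0.8], sigma)  # last component is pushed to 0
print(inside, outside)
```

When Z_d is negative, the last component is set to zero and every component correlated with Z_d is shifted, which is the content of (8.15).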

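Example 8.3 (ARCH(2) model when the data generating process is a white noise) Consider an ARCH(2) model with θ₀ = (ω₀, 0, 0). We thus have d₂ = 2, d₁ = 1 and

$$\mathcal K = \{K_1, K_2, K_3\}, \qquad K = \begin{pmatrix} 0 & 1 & 0 \\ 0 & 0 & 1 \end{pmatrix},$$

with K₁ = K, K₂ = (0, 1, 0) and K₃ = (0, 0, 1). Exercise 8.6 shows that

$$\Sigma = (\kappa_\eta - 1)J^{-1} = \begin{pmatrix} (\kappa_\eta + 1)\omega_0^2 & -\omega_0 & -\omega_0 \\ -\omega_0 & 1 & 0 \\ -\omega_0 & 0 & 1 \end{pmatrix}.$$

Using KΣK′ = I₂ and K_iΣK_i′ = 1 for i = 2, 3 in particular, we thus obtain

$$P_1Z = \begin{pmatrix} Z_1 + \omega_0(Z_2 + Z_3) \\ 0 \\ 0 \end{pmatrix}, \qquad P_2Z = \begin{pmatrix} Z_1 + \omega_0Z_2 \\ 0 \\ Z_3 \end{pmatrix}, \qquad P_3Z = \begin{pmatrix} Z_1 + \omega_0Z_3 \\ Z_2 \\ 0 \end{pmatrix}.$$

Let P₀ = I₃ and K₀ = 0. Using (8.14), we have

$$Q(P_iZ) = (\kappa_\eta - 1)\,\|K_iZ\|^2, \quad \text{that is,} \quad Q(P_0Z) = 0,\; Q(P_1Z) = (\kappa_\eta - 1)(Z_2^2 + Z_3^2),\; Q(P_2Z) = (\kappa_\eta - 1)Z_2^2,\; Q(P_3Z) = (\kappa_\eta - 1)Z_3^2.$$

This shows that

$$\lambda^{\Lambda} = \begin{pmatrix} Z_1 + \omega_0(Z_2^{-} + Z_3^{-}) \\ Z_2^{+} \\ Z_3^{+} \end{pmatrix}. \tag{8.16}$$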
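The selection of the admissible projection in Example 8.3 can be checked numerically. The sketch below assumes the projections P₁Z = (Z₁ + ω₀(Z₂ + Z₃), 0, 0)′, P₂Z = (Z₁ + ω₀Z₂, 0, Z₃)′ and P₃Z = (Z₁ + ω₀Z₃, Z₂, 0)′, with Q(P_iZ) proportional to the squared norm of K_iZ, and compares the enumeration with the closed form λ^Λ = (Z₁ + ω₀(Z₂⁻ + Z₃⁻), Z₂⁺, Z₃⁺)′. All function names are illustrative.

```python
def candidate_solutions(z, omega):
    """Pairs (Q(P_i Z) up to the factor kappa_eta - 1, P_i Z) for i = 0, ..., 3."""
    z1, z2, z3 = z
    return [
        (0.0, (z1, z2, z3)),                                  # P0 = I3, no constraint
        (z2 ** 2 + z3 ** 2, (z1 + omega * (z2 + z3), 0.0, 0.0)),  # K1 = K
        (z2 ** 2, (z1 + omega * z2, 0.0, z3)),                # K2 = (0, 1, 0)
        (z3 ** 2, (z1 + omega * z3, z2, 0.0)),                # K3 = (0, 0, 1)
    ]


def project_by_enumeration(z, omega):
    """Keep the admissible candidates (K lambda >= 0) and minimize Q."""
    admissible = [(q, lam) for q, lam in candidate_solutions(z, omega)
                  if lam[1] >= 0.0 and lam[2] >= 0.0]
    return min(admissible)[1]


def project_closed_form(z, omega):
    """Closed form with z^- = min(z, 0) and z^+ = max(z, 0)."""
    z1, z2, z3 = z
    return (z1 + omega * (min(z2, 0.0) + min(z3, 0.0)), max(z2, 0.0), max(z3, 0.0))


points = [(0.5, -1.0, 2.0), (0.5, -1.0, -2.0), (0.5, 1.0, 2.0), (-0.2, 3.0, -0.7)]
results = [(project_by_enumeration(z, 2.0), project_closed_form(z, 2.0)) for z in points]
for enumerated, closed in results:
    print(enumerated, closed)
```

The two computations agree on each test point, which illustrates how the zone containing Z determines the active constraints.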

In order to obtain a slightly simpler expression for the projections defined in (8.13), note that the constraint (8.12) can again be written as

$$\theta_0 = H\theta_0^{(1)}, \qquad H = \begin{pmatrix} I_{d_1} \\ 0_{d_2 \times d_1} \end{pmatrix}. \tag{8.17}$$

We define a dual of 𝒦, by

$$\mathcal H = \{H_0, \dots, H_{2^{d_2} - 1}\},$$

the set of the matrices obtained by deleting from 0 to d₂ of the last d₂ columns of the matrix I_d. Note that the elements of ℋ can always be numbered in such a way that H₀ = I_d corresponds to the absence of constraint on θ₀ and that, for i = 1, …, 2^{d₂} − 1, the constraint K_iθ₀ = 0 corresponds to the constraint θ₀ = H_iH′_iθ₀. Exercise 8.2 then shows that

$$P_i = H_i(H_i'JH_i)^{-1}H_i'J \tag{8.18}$$

for i = 0, …, 2^{d₂} − 1 (with P₀ = I_d). Note that (8.18) requires the inversion of only one matrix of size (d − k) × (d − k) (d − k being the number of columns of H_i), whereas (8.13) requires the inversion of one matrix of size d × d and another matrix of size k × k. To illustrate this new formula, we return to our previous examples.

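Example 8.4 (Example 8.2 continued) We have

$$\mathcal H = \{H_0, H_1\}, \qquad H_1 = H = \begin{pmatrix} I_{d_1} \\ 0_{1 \times d_1} \end{pmatrix},$$

and

$$P_1Z = H(H'JH)^{-1}H'JZ = \begin{pmatrix} Z^{(1)} + J_{11}^{-1}J_{12}Z^{(2)} \\ 0 \end{pmatrix},$$

using the notation

$$J = \begin{pmatrix} J_{11} & J_{12} \\ J_{21} & J_{22} \end{pmatrix}, \qquad Z = \begin{pmatrix} Z^{(1)} \\ Z^{(2)} \end{pmatrix},$$

where the matrix J₁₁ is of size d₁ × d₁, the vectors J₁₂ = J′₂₁ and Z⁽¹⁾ are of size d₁ × 1, and J₂₂ and Z⁽²⁾ = Z_d are scalars. We finally obtain

$$\lambda^{\Lambda} = Z - Z_d^{-}\begin{pmatrix} -J_{11}^{-1}J_{12} \\ 1 \end{pmatrix},$$

which can be shown using (8.15) and

$$\frac{\operatorname{Cov}(Z^{(1)}, Z_d)}{\operatorname{Var}(Z_d)} = -J_{11}^{-1}J_{12}.$$

Example 8.5 (Example 8.3 continued) We have

$$\mathcal H = \{H_0, H_1, H_2, H_3\}, \qquad H_2 = \begin{pmatrix} 1 & 0 \\ 0 & 0 \\ 0 & 1 \end{pmatrix}, \qquad H_3 = \begin{pmatrix} 1 & 0 \\ 0 & 1 \\ 0 & 0 \end{pmatrix},$$

with H₁ = H = (1, 0, 0)′.

In view of Exercise 8.6, we have

$$J = \begin{pmatrix} \omega_0^{-2} & \omega_0^{-1} & \omega_0^{-1} \\ \omega_0^{-1} & \kappa_\eta & 1 \\ \omega_0^{-1} & 1 & \kappa_\eta \end{pmatrix},$$

which allows us to obtain

$$P_1 = \begin{pmatrix} 1 & \omega_0 & \omega_0 \\ 0 & 0 & 0 \\ 0 & 0 & 0 \end{pmatrix}, \qquad P_2 = \begin{pmatrix} 1 & \omega_0 & 0 \\ 0 & 0 & 0 \\ 0 & 0 & 1 \end{pmatrix}, \qquad P_3 = \begin{pmatrix} 1 & 0 & \omega_0 \\ 0 & 1 & 0 \\ 0 & 0 & 0 \end{pmatrix},$$

and to retrieve (8.16). Note that, in this example, the calculations are simpler with (8.13) than with (8.18), because J⁻¹ has a simpler expression than J.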

8.3 Significance of the GARCH Coefficients

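We make the assumptions of Theorem 8.1 and use the notation of Section 8.2.1. Assume θ₀⁽¹⁾ > 0 and consider the testing problem

$$H_0 : \theta_0^{(2)} = 0 \quad \text{against} \quad H_1 : \theta_0^{(2)} \ne 0. \tag{8.19}$$

Recall that under H₀, we have

$$\sqrt{n}\,\hat\theta_n^{(2)} \xrightarrow{\mathcal L} K\lambda^{\Lambda}, \qquad K = (0_{d_2 \times d_1}, I_{d_2}),$$

where the distribution of λ^Λ is defined by

$$\lambda^{\Lambda} := \arg\inf_{\lambda \in \Lambda} \{\lambda - Z\}' J \{\lambda - Z\}, \qquad Z \sim \mathcal N\{0, (\kappa_\eta - 1)J^{-1}\}, \tag{8.20}$$

with Λ = ℝ^{d₁} × [0, ∞)^{d₂}.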

8.3.1 Tests and Rejection Regions

For parametric assumptions of the form (8.19), the most popular tests are the Wald, score and likelihood ratio tests.

Wald Statistic

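The Wald test looks at whether θ̂_n⁽²⁾ is close to 0. The usual Wald statistic is defined by

$$W_n = n\,\hat\theta_n^{(2)\prime}\{K\hat\Sigma K'\}^{-1}\hat\theta_n^{(2)},$$

where Σ̂ is a consistent estimator of Σ = (κ_η − 1)J⁻¹.

Score (or Lagrange Multiplier or Rao) Statistic

Let

$$\hat\theta_{n|2} = \begin{pmatrix} \hat\theta_{n|2}^{(1)} \\ 0 \end{pmatrix}$$

denote the QMLE of θ constrained by θ⁽²⁾ = 0. The score test aims to determine whether ∂l_n(θ̂_{n|2})/∂θ is not too far from 0, using a statistic of the form

$$R_n = \frac{n}{\hat\kappa_{\eta|2} - 1}\,\frac{\partial l_n(\hat\theta_{n|2})}{\partial\theta'}\,\hat J_{n|2}^{-1}\,\frac{\partial l_n(\hat\theta_{n|2})}{\partial\theta},$$

where κ̂_{η|2} and Ĵ_{n|2} denote consistent estimators of κ_η and J.

Likelihood Ratio Statistic

The likelihood ratio test is based on the fact that under H₀ : θ⁽²⁾ = 0, the constrained (quasi) log-likelihood log L_n(θ̂_{n|2}) = −(n/2) l_n(θ̂_{n|2}) should not be much smaller than the unconstrained log-likelihood −(n/2) l_n(θ̂_n). The test employs the statistic

$$L_n = n\{l_n(\hat\theta_{n|2}) - l_n(\hat\theta_n)\}.$$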

Usual Rejection Regions

From the practical viewpoint, the score statistic presents the advantage of only requiring constrained estimation, which is sometimes much simpler than the unconstrained estimation required by the two other tests. The likelihood ratio statistic does not require estimation of the information matrix J, nor the kurtosis coefficient κ_η. For each test, it is clear that the null hypothesis must be rejected for large values of the statistic. For standard statistical problems, the three statistics asymptotically follow the same χ²_{d₂} distribution under the null. At the asymptotic level α, the standard rejection regions are thus

$$\{W_n > \chi^2_{d_2}(1 - \alpha)\}, \qquad \{R_n > \chi^2_{d_2}(1 - \alpha)\}, \qquad \{L_n > \chi^2_{d_2}(1 - \alpha)\},$$

where χ²_{d₂}(1 − α) is the (1 − α)-quantile of the χ² distribution with d₂ degrees of freedom. In the case d₂ = 1, for testing the significance of only one coefficient, the most widely used test is Student's t test, defined by the rejection region

$$\{t_n > \Phi^{-1}(1 - \alpha/2)\}, \tag{8.21}$$

where t_n = √n θ̂_n⁽²⁾/{KΣ̂K′}^{1/2}. This test is equivalent to the standard Wald test because t_n = √W_n (t_n being here always positive or null, because of the positivity constraints of the QML estimates) and

$$\{\Phi^{-1}(1 - \alpha/2)\}^2 = \chi^2_1(1 - \alpha).$$

Our testing problem is not standard because, by Theorem 8.1, the asymptotic distribution of θ̂_n is not normal. We will see that, among the previous rejection regions, only that of the score test asymptotically has the level α.

8.3.2 Modification of the Standard Tests

The following proposition shows that for the Wald and likelihood ratio tests, the asymptotic distribution is not the usual χ²_{d₂} under the null hypothesis. The proposition also shows that the asymptotic distribution of the score test remains the χ²_{d₂} distribution. The asymptotic distribution of R_n is not affected by the fact that, under the null hypothesis, the parameter is at the boundary of the parameter space. These results are not very surprising. Take the example of an ARCH(1) with the hypothesis H₀ : α₀ = 0 of absence of ARCH effect. As illustrated by Figure 8.2, there is a nonzero probability that α̂ be at the boundary, that is, that α̂ = 0. Consequently W_n = nα̂² admits a mass at 0 and does not follow, even asymptotically, the χ²₁ law. The same conclusion can be drawn for the likelihood ratio test. On the contrary, the score n^{1/2}∂l_n(θ₀)/∂θ can take positive as well as negative values, and does not seem to have a specific behavior when θ₀ is at the boundary.

Proposition 8.3 (Asymptotic distribution of the three statistics under H₀) Under H₀ and the assumptions of Theorem 8.1,

$$W_n \xrightarrow{\mathcal L} W = \lambda^{\Lambda\prime}\,\Omega\,\lambda^{\Lambda}, \tag{8.22}$$

$$R_n \xrightarrow{\mathcal L} R \sim \chi^2_{d_2}, \tag{8.23}$$

$$L_n \xrightarrow{\mathcal L} L = -\frac{1}{2}(\lambda^{\Lambda} - Z)'J(\lambda^{\Lambda} - Z) + \frac{1}{2}Z'K'\{KJ^{-1}K'\}^{-1}KZ, \tag{8.24}$$

where Ω = K′{(κ_η − 1)KJ⁻¹K′}⁻¹K and λ^Λ satisfies (8.20).

Figure 8.2 Concentrated log-likelihood (solid line) α ↦ log L_n(ω̂, α) for an ARCH(1) model. Assume there is no ARCH effect: the true value of the ARCH parameter is α₀ = 0. In the configuration on the right, the likelihood maximum does not lie at the boundary and the three statistics W_n, R_n and L_n take strictly positive values. In the configuration on the left, we have W_n = nα̂² = 0 and L_n = 2{log L_n(ω̂, α̂) − log L_n(ω̂, 0)} = 0, whereas R_n = {∂ log L_n(ω̂, 0)/∂α}² continues to take a strictly positive value.

Remark 8.3 (Equivalence of the statistics W_n and L_n) Let κ̂_η be an estimator which converges in probability to κ_η. We can show that

$$W_n = \frac{2L_n}{\hat\kappa_\eta - 1} + o_P(1)$$

under the null hypothesis. The Wald and likelihood ratio tests will thus have the same asymptotic critical values, and will have the same local asymptotic powers (see Exercises 8.8 and 8.9, and Section 8.3.4). They may, however, have different asymptotic behaviors under nonlocal alternatives.

Remark 8.4 (Assumptions on the tests) In order for the Wald statistic to be well defined, Ω must exist, that is, J = J(θ₀) must exist and must be invertible. This is not the case, in particular, for a GARCH(p, q) at θ₀ = (ω₀, 0, …, 0), when p ≠ 0. It is thus impossible to carry out a Wald test on the simultaneous nullity of all the α_i and β_j coefficients in a GARCH(p, q), p ≠ 0. The assumptions of Theorem 8.1 are actually required, in particular the identifiability assumptions. It is thus impossible to test, for instance, an ARCH(1) against a GARCH(2, 1), but we can test, for instance, an ARCH(1) against an ARCH(3).

A priori, the asymptotic distributions W and L depend on J, and thus on nuisance parameters. We will consider two particular cases: the case where we test the nullity of only one GARCH coefficient and the case where we test the nullity of all the coefficients of an ARCH. In the two cases the asymptotic laws of the test statistics are simpler and do not depend on nuisance parameters. In the second case, both the test statistics and their asymptotic laws are simplified.

8.3.3 Test for the Nullity of One Coefficient

Consider the case d₂ = 1, which is perhaps the most interesting case and corresponds to testing the nullity of only one coefficient. In view of (8.15), the last component of λ^Λ is equal to Z_d⁺. We thus have

$$W_n \xrightarrow{\mathcal L} W = \frac{(Z_d^{+})^2}{\operatorname{Var}(Z_d)} = (Z^{*})^2\,1_{\{Z^{*} > 0\}},$$

where Z* ~ N(0, 1). Using the symmetry of the Gaussian distribution, and the independence between Z*² and 1_{Z*>0} when Z* follows the real normal law, we obtain

$$W \sim \frac{1}{2}\delta_0 + \frac{1}{2}\chi^2_1.$$

Testing

$$H_0 : \theta_0^{(2)} := \theta_0(p + q + 1) = 0 \quad \text{against} \quad H_1 : \theta_0(p + q + 1) > 0$$

can thus be achieved by using the critical region {W_n > χ²₁(1 − 2α)} at the asymptotic level α ≤ 1/2. In view of Remark 8.3, we can define a modified likelihood ratio test of critical region {2L_n/(κ̂_η − 1) > χ²₁(1 − 2α)}. Note that the standard Wald test {W_n > χ²₁(1 − α)} has the asymptotic level α/2, and that the asymptotic level of the standard likelihood ratio test {L_n > χ²₁(1 − α)} is much larger than α when the kurtosis coefficient κ_η is large. A modified version of the Student t test is defined by the rejection region

$$\{t_n > \Phi^{-1}(1 - \alpha)\}. \tag{8.25}$$

We observe that commercial software - such as GAUSS, R, RATS, SAS and SPSS - do not use the modified version (8.25), but the standard version (8.21). This standard test is not of asymptotic level α but only α/2. To obtain a t test of asymptotic level α it then suffices to use a test of nominal level 2α.
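The difference between the standard and modified t tests can be made concrete with a few lines of code. The sketch below assumes that (8.21) is the region {t_n > Φ⁻¹(1 − α/2)} and that (8.25) is the region {t_n > Φ⁻¹(1 − α)}, and that under H₀ the statistic t_n behaves asymptotically like the positive part of a standard normal variable. The function name is illustrative.

```python
from statistics import NormalDist

STD_NORMAL = NormalDist()


def t_test_critical_values(alpha: float):
    """Critical values of the standard (8.21) and modified (8.25) t tests."""
    standard = STD_NORMAL.inv_cdf(1.0 - alpha / 2.0)
    modified = STD_NORMAL.inv_cdf(1.0 - alpha)
    return standard, modified


standard, modified = t_test_critical_values(0.05)

# Asymptotic level when t_n behaves like max(Z*, 0) with Z* ~ N(0, 1)
level_standard = 1.0 - STD_NORMAL.cdf(standard)
level_modified = 1.0 - STD_NORMAL.cdf(modified)

print(f"standard: t_n > {standard:.4f}, asymptotic level {level_standard:.3f}")
print(f"modified: t_n > {modified:.4f}, asymptotic level {level_modified:.3f}")
# Squaring gives the corresponding Wald critical values
print(f"Wald critical values: {standard ** 2:.4f} and {modified ** 2:.4f}")
```

With a nominal level of 5%, the standard region has asymptotic level 2.5%, so a package that reports the standard test can still be used by doubling the nominal level.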

Example 8.6 (Empirical behavior of the tests under the null) We simulated 5000 independent samples of length n = 100 and n = 1000 of a strong N(0, 1) white noise. On each realization we fitted an ARCH(1) model ε_t = {ω + αε²_{t−1}}^{1/2}η_t, by QML, and carried out tests of H₀ : α = 0 against H₁ : α > 0.

We began with the modified Wald test with rejection region

$$\{W_n = n\hat\alpha^2 > \chi^2_1(0.90) = 2.71\}.$$

This test is of asymptotic level 5%. For the sample size n = 100, we observed a relative rejection frequency of 6.22%. For n = 1000, we observed a relative rejection frequency of 5.38%, which is not significantly different from the theoretical 5%. Indeed, an elementary calculation shows that, over 5000 independent replications of the same experiment with success probability 5%, the success percentage should vary between 4.4% and 5.6% with a probability of approximately 95%. Figure 8.3 shows that the empirical distribution of W_n is quite close to the asymptotic distribution ½δ₀ + ½χ²₁, even for the small sample size n = 100.
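The "elementary calculation" behind the 4.4% to 5.6% band is a normal approximation to a binomial proportion. The sketch below assumes this approximation with the usual 1.96 quantile; the function name is illustrative.

```python
import math


def rejection_frequency_band(level: float, replications: int, z: float = 1.96):
    """Approximate 95% band for an observed rejection frequency under the nominal level."""
    half_width = z * math.sqrt(level * (1.0 - level) / replications)
    return level - half_width, level + half_width


low, high = rejection_frequency_band(0.05, 5000)
print(f"band: [{100 * low:.2f}%, {100 * high:.2f}%]")

# Observed frequencies quoted in Example 8.6 for the modified Wald test
for n, freq in ((100, 0.0622), (1000, 0.0538)):
    print(n, "inside band" if low <= freq <= high else "outside band")
```

The frequency observed for n = 1000 falls inside the band, whereas the one observed for n = 100 lies slightly above it.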

We then carried out the score test defined by the rejection region

$$\{R_n = nR^2 > \chi^2_1(0.95) = 3.84\},$$

where R² is the determination coefficient of the regression of ε_t² on 1 and ε²_{t−1}. This test is also of asymptotic level 5%. For the sample size n = 100, we observed a relative rejection frequency of 3.40%. For n = 1000, we observed a relative rejection frequency of 4.32%.
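The score statistic of this example only requires a simple regression. The sketch below assumes the form R_n = nR² with R² the squared empirical correlation between ε_t² and ε²_{t−1}; for simplicity it drops the first observation instead of using a pre-sample convention, so n here is the number of regression pairs. The function name, the seed and the simulated series are illustrative.

```python
import random


def arch1_score_statistic(eps):
    """n * R^2 from the regression of eps_t^2 on a constant and eps_{t-1}^2."""
    y = [e * e for e in eps[1:]]
    x = [e * e for e in eps[:-1]]
    n = len(y)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    r_squared = sxy * sxy / (sxx * syy)
    return n * r_squared


random.seed(0)  # arbitrary seed, for reproducibility only
white_noise = [random.gauss(0.0, 1.0) for _ in range(1000)]
stat = arch1_score_statistic(white_noise)
reject = stat > 3.8415  # chi-square(1) quantile of order 0.95
print(f"nR^2 = {stat:.3f}, reject H0 at the 5% level: {reject}")
```

Because the score test keeps its χ²₁ limit at the boundary, no modification of the critical value is needed here.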

We also used the modified likelihood ratio test. For the sample size n = 100, we observed a relative rejection frequency of 3.20%, and for n = 1000 we observed 4.14%.

On these simulation experiments, the Type I error is thus slightly better controlled by the modified Wald test than by the score and modified likelihood ratio tests.

Example 8.7 (Comparison of the tests under the alternative hypothesis) We implemented the W_n, R_n and L_n tests of the null hypothesis H₀ : α₀₁ = 0 in the ARCH(1) model

$$\epsilon_t = \{\omega + \alpha_{01}\epsilon_{t-1}^2\}^{1/2}\eta_t.$$

Figure 8.4 compares the observed powers of the three tests, that is, the relative frequency of rejection of the hypothesis H₀ that there is no ARCH effect, on 5000 independent realizations of length n = 100 and n = 1000 of an ARCH(1) model with α₀₁ = 0.2 when n = 100, and α₀₁ = 0.05 when n = 1000. On these simulated series, the modified Wald test turns out to be the most powerful.

Figure 8.3 Comparison between a kernel density estimator of the Wald statistic (dotted line) and the ½χ²₁ density on [0.5, ∞) (solid line) on 5000 simulations of an ARCH(1) process with α₀₁ = 0: (left) for sample size n = 100; (right) for n = 1000.

Figure 8.4 Comparison of the observed powers of the Wald test (thick line), of the score test (dotted line) and of the likelihood ratio test (thin line), as function of the nominal level α, on 5000 simulations of an ARCH(1) process: (left) for n = 100 and α₀₁ = 0.2; (right) for n = 1000 and α₀₁ = 0.05.

8.3.4 Conditional Homoscedasticity Tests with ARCH Models

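Another interesting case is that obtained with d₁ = 1, θ⁽¹⁾ = ω, p = 0 and d₂ = q. This case corresponds to the test of the conditional homoscedasticity null hypothesis

$$H_0 : \alpha_{01} = \cdots = \alpha_{0q} = 0 \tag{8.26}$$

in an ARCH(q) model

$$\epsilon_t = \sigma_t\eta_t, \qquad \sigma_t^2 = \omega + \sum_{i=1}^{q} \alpha_i\epsilon_{t-i}^2, \qquad \omega > 0, \quad \alpha_i \ge 0. \tag{8.27}$$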

We will see that for testing (8.26) there exist very simple forms of the Wald and score statistics.

Simplified Form of the Wald Statistic

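Using Exercise 8.6, we have

$$\Sigma(\theta_0) = (\kappa_\eta - 1)J(\theta_0)^{-1} = \begin{pmatrix} (\kappa_\eta + q - 1)\omega^2 & -\omega & \cdots & -\omega \\ -\omega & & & \\ \vdots & & I_q & \\ -\omega & & & \end{pmatrix}.$$

Since KΣK′ = I_q, we obtain a very simple form for the Wald statistic:

$$W_n = n\sum_{i=1}^{q} \hat\alpha_i^2. \tag{8.28}$$

Asymptotic Distribution W and L

A trivial extension of Example 8.3 yields

$$\lambda^{\Lambda} = \begin{pmatrix} Z_1 + \omega\sum_{i=2}^{d} Z_i^{-} \\ Z_2^{+} \\ \vdots \\ Z_d^{+} \end{pmatrix}. \tag{8.29}$$

The asymptotic distribution of n Σ_{i=1}^q α̂_i² is thus that of

$$\sum_{i=1}^{q} (Z_i^{+})^2,$$

where the Z_i are independent N(0, 1). Thus, in the case where an ARCH(q) is fitted to a white noise we have

$$W_n = n\sum_{i=1}^{q} \hat\alpha_i^2 \xrightarrow{\mathcal L} \frac{1}{2^q}\delta_0 + \sum_{i=1}^{q} \binom{q}{i}\frac{1}{2^q}\chi^2_i. \tag{8.30}$$

This asymptotic distribution is tabulated and the critical values are given in Table 8.2. In view of Remark 8.3, Table 8.2 also yields the asymptotic critical values of the modified likelihood ratio statistic 2L_n/(κ̂_η − 1). Table 8.3 shows that the use of the standard χ²_q-based critical values of the Wald test would lead to large discrepancies between the asymptotic levels and the nominal level α.

Table 8.2 Asymptotic critical value C_{q,α}, at level α, of the Wald test of rejection region {n Σ_{i=1}^q α̂_i² > C_{q,α}} for the conditional homoscedasticity hypothesis H₀ : α₁ = … = α_q = 0 in an ARCH(q) model.

Table 8.3 Exact asymptotic level (%) of erroneous Wald tests, of rejection region {n Σ_{i=1}^q α̂_i² > χ²_q(1 − α)}, under the conditional homoscedasticity assumption H₀ : α₁ = … = α_q = 0 in an ARCH(q) model.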
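Critical values and exact levels of the kind reported in Tables 8.2 and 8.3 can be computed from the limiting law. The sketch below assumes that the limit of the Wald statistic is the law of the sum of the (Z_i⁺)², i = 1, …, q, with independent N(0, 1) variables, that is, a mixture of a mass 2^{−q} at 0 and of χ²_i laws with weights C(q, i)2^{−q}. The function names are illustrative and the output is not claimed to reproduce the published tables digit for digit.

```python
import math


def chi2_sf(x: float, k: int) -> float:
    """P(chi2_k > x) for an integer k >= 1, via the incomplete gamma recursion."""
    if x <= 0.0:
        return 1.0
    h = x / 2.0
    if k % 2 == 1:
        sf, a = math.erfc(math.sqrt(h)), 0.5   # one degree of freedom
    else:
        sf, a = math.exp(-h), 1.0              # two degrees of freedom
    while a < k / 2.0:                         # add two degrees of freedom per step
        sf += h ** a * math.exp(-h) / math.gamma(a + 1.0)
        a += 1.0
    return sf


def wald_limit_sf(x: float, q: int) -> float:
    """P(W > x), x > 0, for W = sum of (Z_i^+)^2 over q independent N(0, 1)."""
    return sum(math.comb(q, i) * chi2_sf(x, i) for i in range(1, q + 1)) / 2 ** q


def solve_decreasing(f, target: float, lo: float = 0.0, hi: float = 200.0) -> float:
    """Bisection for f(x) = target when f is decreasing on [lo, hi]."""
    for _ in range(200):
        mid = (lo + hi) / 2.0
        if f(mid) > target:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2.0


def wald_critical_value(q: int, alpha: float) -> float:
    """Critical value C with P(W > C) = alpha (requires alpha < 1 - 2**-q)."""
    return solve_decreasing(lambda x: wald_limit_sf(x, q), alpha)


def level_of_standard_test(q: int, alpha: float) -> float:
    """Exact asymptotic level of the region {W_n > chi2_q(1 - alpha)}."""
    chi2_quantile = solve_decreasing(lambda x: chi2_sf(x, q), alpha)
    return wald_limit_sf(chi2_quantile, q)


for q in (1, 2, 3):
    print(q, round(wald_critical_value(q, 0.05), 4), round(100 * level_of_standard_test(q, 0.05), 2))
```

For q = 1 the computed critical value at the 5% level is the χ²₁ quantile of order 0.90, and the exact level of the standard test is 2.5%, in line with Section 8.3.3.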

Score Test

For the hypothesis (8.26) that all the α coefficients of an ARCH(q) model are equal to zero, the score statistic R_n can be simplified. To work within the linear regression framework, write

$$\frac{\partial l_n(\hat\theta_{n|2})}{\partial\theta} = n^{-1}X'Y,$$

where Y is the vector of length n of the 'dependent' variable 1 − ε_t²/ω̂, where X is the n × (q + 1) matrix of the constant ω̂⁻¹ (in the first column) and of the 'explanatory' variables ε²_{t−i}/ω̂ (in column i + 1, with the convention ε_t = 0 for t ≤ 0), and ω̂ = θ̂_{n|2}⁽¹⁾ = n⁻¹ Σ_{t=1}^n ε_t². Estimating J(θ₀) by n⁻¹X′X and κ_η − 1 by n⁻¹Y′Y, we obtain

$$R_n = n\,\frac{Y'X(X'X)^{-1}X'Y}{Y'Y},$$

and one recognizes n times the coefficient of determination in the linear regression of Y on the columns of X. Since this coefficient is not changed by linear transformation of the variables (see Exercise 5.11), we simply have R_n = nR², where R² is the coefficient of determination in the regression of ε_t² on a constant and q lagged values ε²_{t−1}, …, ε²_{t−q}. Under the null hypothesis of conditional homoscedasticity, R_n asymptotically follows the χ²_q law.

The previous simple forms of the Wald and score tests are obtained with estimators of J which exploit the particular form of the matrix under the null. Note that there exist other versions of these tests, obtained with other consistent estimators of J. The different versions are equivalent under the null, but can have different asymptotic behaviors under the alternative.

8.3.5 Asymptotic Comparison of the Tests

The Wald and score tests that we have just defined are in general consistent, that is, their powers converge to 1 when they are applied to a wide class of conditionally heteroscedastic processes. An asymptotic study will be conducted via two different approaches: Bahadur’s approach compares the rates of convergence to zero of the p-values under fixed alternatives, whereas Pitman’s approach compares the asymptotic powers under a sequence of local alternatives, that is, a sequence of alternatives tending to the null as the sample size increases.

Bahadur’s Approach

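Let S_W(t) = P(W > t) and S_R(t) = P(R > t) be the asymptotic survival functions of the two test statistics, under the null hypothesis H₀ defined by (8.26). Consider, for instance, the Wald test. Under the alternative of an ARCH(q) which does not satisfy H₀, the p-value of the Wald test S_W(W_n) converges almost surely to zero as n → ∞ because

$$\frac{W_n}{n} = \sum_{i=1}^{q} \hat\alpha_i^2 \to \sum_{i=1}^{q} \alpha_{0i}^2 \ne 0.$$

The p-value of a test is typically equivalent to exp{−nc/2}, where c is a positive constant called the Bahadur slope. Using the fact that

$$\log S_W(x) \sim \log P(\chi^2_q > x) \sim -\frac{x}{2}, \qquad x \to \infty, \tag{8.31}$$

and that W_n/n converges almost surely to Σ_{i=1}^q α_0i², the (approximate¹) Bahadur slope of the Wald test is thus

$$\lim_{n \to \infty} -\frac{2}{n}\log S_W(W_n) = \sum_{i=1}^{q} \alpha_{0i}^2, \quad \text{a.s.}$$

To compute the Bahadur slope of the score test, note that we have the linear regression model ε_t² = ω + α(B)ε_t² + ν_t, where ν_t = (η_t² − 1)σ_t²(θ₀) is the linear innovation of ε_t². We then have

$$\frac{R_n}{n} = R^2 \to 1 - \frac{\operatorname{Var}(\nu_t)}{\operatorname{Var}(\epsilon_t^2)}, \quad \text{a.s.}$$

The previous limit is thus equal to the Bahadur slope of the score test. The comparison of the two slopes favors the score test over the Wald test.

Proposition 8.4 Let (ε_t) be a strictly stationary and nonanticipative solution of the ARCH(q) model (8.27), with E(ε_t⁴) < ∞ and Σ_{i=1}^q α_0i > 0. The score test is considered as more efficient than the Wald test in Bahadur's sense because its slope is always greater than or equal to that of the Wald test, with equality when q = 1.

Example 8.8 (Slopes in the ARCH(1) and ARCH(2) cases) The slopes are the same in the ARCH(1) case because

$$\lim_{n \to \infty} \frac{W_n}{n} = \alpha_{01}^2 \quad \text{and} \quad \lim_{n \to \infty} \frac{R_n}{n} = 1 - \frac{\operatorname{Var}(\nu_t)}{\operatorname{Var}(\epsilon_t^2)} = \alpha_{01}^2.$$

In the ARCH(2) case with fourth-order moment, we have

$$\lim_{n \to \infty} \frac{W_n}{n} = \alpha_{01}^2 + \alpha_{02}^2 \quad \text{and} \quad \lim_{n \to \infty} \frac{R_n}{n} = \alpha_{01}^2 + \alpha_{02}^2 + \frac{2\alpha_{01}^2\alpha_{02}}{1 - \alpha_{02}}.$$

We see that the second limit is always at least as large as the first. Consequently, in Bahadur's sense, the Wald and Rao tests have the same asymptotic efficiency in the ARCH(1) case. In the ARCH(2) case, the score test is, still in Bahadur's sense, asymptotically more efficient than the Wald test for testing the conditional homoscedasticity (that is, α₁ = α₂ = 0).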

Bahadur’s approach is sometimes criticized for not taking account of the critical value of test, and thus for not really comparing the powers. This approach only takes into account the (asymptotic) distribution of the statistic under the null and the rate of divergence of the statistic under the alternative. It is unable to distinguish a two-sided test from its one-sided counterpart (see Exercise 8.8). In this sense the result of Proposition 8.4 must be put into perspective.

Pitman’s Approach

In the ARCH(1) case, consider a sequence of local alternatives H_n(τ) : α₁ = τ/√n. We can show that under this sequence of alternatives,

$$W_n = n\hat\alpha_1^2 \xrightarrow{\mathcal L} (U + \tau)^2\,1_{\{U + \tau > 0\}}, \qquad U \sim \mathcal N(0, 1).$$

Consequently, the local asymptotic power of the Wald test is

$$P(U + \tau > c_1) = 1 - \Phi(c_1 - \tau), \qquad c_1 = \Phi^{-1}(1 - \alpha). \tag{8.32}$$

The score test has the local asymptotic power

$$P\{(U + \tau)^2 > c_2^2\} = 1 - \Phi(c_2 - \tau) + \Phi(-c_2 - \tau), \qquad c_2 = \Phi^{-1}(1 - \alpha/2). \tag{8.33}$$

Note that (8.32) is the power of the test of the assumption H₀ : θ = 0 against the assumption H₁ : θ = τ > 0, based on the rejection region {X > c₁} with only one observation X ~ N(θ, 1). The power (8.33) is that of the two-sided test {|X| > c₂}. The tests {X > c₁} and {|X| > c₂} have the same level, but the first test is uniformly more powerful than the second (by the Neyman-Pearson lemma, {X > c₁} is even uniformly more powerful than any test of level less than or equal to α, for any one-sided alternative of the form H₁). The local asymptotic power of the Wald test is thus uniformly strictly greater than that of Rao's test for testing for conditional homoscedasticity in an ARCH(1) model.
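The comparison between the one-sided and two-sided powers can be tabulated directly. The sketch below assumes that (8.32) is the power P(X > c₁) and (8.33) the power P(|X| > c₂) for a single observation X ~ N(τ, 1), with c₁ = Φ⁻¹(1 − α) and c₂ = Φ⁻¹(1 − α/2) so that both tests have level α. The function name is illustrative.

```python
from statistics import NormalDist

STD_NORMAL = NormalDist()


def local_asymptotic_powers(tau: float, alpha: float = 0.05):
    """Powers of the one-sided test {X > c1} and two-sided test {|X| > c2}, X ~ N(tau, 1)."""
    c1 = STD_NORMAL.inv_cdf(1.0 - alpha)
    c2 = STD_NORMAL.inv_cdf(1.0 - alpha / 2.0)
    wald_power = 1.0 - STD_NORMAL.cdf(c1 - tau)
    score_power = 1.0 - STD_NORMAL.cdf(c2 - tau) + STD_NORMAL.cdf(-c2 - tau)
    return wald_power, score_power


powers = {tau: local_asymptotic_powers(tau) for tau in (0.0, 0.5, 1.0, 2.0, 3.0)}
for tau, (wald_power, score_power) in powers.items():
    print(f"tau = {tau}: Wald {wald_power:.4f}, score {score_power:.4f}")
```

At τ = 0 both powers equal the level α, and for every τ > 0 in the grid the one-sided (Wald) power exceeds the two-sided (score) power.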

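Consider the ARCH(2) case, and a sequence of local alternatives H_n(τ) : α₁ = τ₁/√n, α₂ = τ₂/√n. Under this sequence of alternatives

$$W_n = n(\hat\alpha_1^2 + \hat\alpha_2^2) \xrightarrow{\mathcal L} (U_1 + \tau_1)^2\,1_{\{U_1 + \tau_1 > 0\}} + (U_2 + \tau_2)^2\,1_{\{U_2 + \tau_2 > 0\}},$$

with (U₁, U₂)′ ~ N(0, I₂). Let c₁ be the critical value of the Wald test of level α. The local asymptotic power of the Wald test is

$$P\{(U_1 + \tau_1)^2\,1_{\{U_1 + \tau_1 > 0\}} + (U_2 + \tau_2)^2\,1_{\{U_2 + \tau_2 > 0\}} > c_1\}.$$

Let c₂ be the critical value of the Rao test of level α. The local asymptotic power of the Rao test is

$$P\{(U_1 + \tau_1)^2 + (U_2 + \tau_2)^2 > c_2\},$$

where (U₁ + τ₁)² + (U₂ + τ₂)² follows a noncentral χ² distribution, with two degrees of freedom and noncentrality parameter τ₁² + τ₂². Figure 8.5 compares the powers of the two tests when τ₁ = τ₂.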

Figure 8.5 Local asymptotic power of the Wald test (solid line) and of the Rao test (dotted line) for testing for conditional homoscedasticity in an ARCH(2) model.

Thus, the comparison of the local asymptotic powers clearly favors the Wald test over the score test, counterbalancing the result of Proposition 8.4.

8.4 Diagnostic Checking with Portmanteau Tests

To check the adequacy of a given time series model, for instance an ARMA(p, q) model, it is common practice to test the significance of the residual autocorrelations. In the GARCH framework this approach is not relevant because the process η̃_t = ε_t/σ̃_t is always a white noise (possibly a martingale difference) even when the volatility is misspecified, that is, when ε_t = √h_t η_t with h_t ≠ σ̃_t². To check the adequacy of a volatility model, for instance a GARCH(p, q) of the form (7.1), it is much more fruitful to look at the squared residual autocovariances

\[ \hat r(h) = \frac{1}{n}\sum_{t=|h|+1}^{n} (\hat\eta_t^2 - 1)(\hat\eta_{t-|h|}^2 - 1), \qquad \hat\eta_t^2 = \frac{\epsilon_t^2}{\hat\sigma_t^2}, \]

where |h| < n, σ̂_t = σ̃_t(θ̂_n) is defined by (7.4) and θ̂_n is the QMLE given by (7.9).

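For any fixed integer m, 1 ≤ m < n, consider the statistic r̂_m = (r̂(1), …, r̂(m))′. Let κ̂_η and Ĵ be weakly consistent estimators of κ_η and J. For instance, one can take

\[ \hat\kappa_\eta = \frac{1}{n}\sum_{t=1}^{n} \frac{\epsilon_t^4}{\hat\sigma_t^4}, \qquad \hat J = \frac{1}{n}\sum_{t=1}^{n} \frac{1}{\hat\sigma_t^4}\,\frac{\partial\tilde\sigma_t^2(\hat\theta_n)}{\partial\theta}\,\frac{\partial\tilde\sigma_t^2(\hat\theta_n)}{\partial\theta'}. \]

Define also the m × (p + q + 1) matrix Ĉ_m whose (h, k)th element, for 1 ≤ h ≤ m and 1 ≤ k ≤ p + q + 1, is given by

\[ \hat C_m(h,k) = -\frac{1}{n}\sum_{t=h+1}^{n} (\hat\eta_{t-h}^2 - 1)\,\frac{1}{\hat\sigma_t^2}\,\frac{\partial\tilde\sigma_t^2(\hat\theta_n)}{\partial\theta_k}. \]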

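Theorem 8.2 (Asymptotic distribution of a portmanteau test statistic) Under the assumptions of Theorem 7.2 ensuring the consistency and asymptotic normality of the QMLE,

\[ n\,\hat r_m'\,\hat D^{-1}\,\hat r_m \xrightarrow{\ \mathcal{L}\ } \chi^2_m, \]

with

\[ \hat D = (\hat\kappa_\eta - 1)^2 I_m - (\hat\kappa_\eta - 1)\,\hat C_m \hat J^{-1} \hat C_m'. \]

The adequacy of the GARCH(p, q) model is rejected at the asymptotic level α when

\[ n\,\hat r_m'\,\hat D^{-1}\,\hat r_m > \chi^2_m(1-\alpha). \]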
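As a rough illustration of how the statistic of Theorem 8.2 can be computed, here is a minimal Python sketch for the ARCH(1) case σ_t² = ω + αε²_{t−1}. It assumes the usual form of the test, n r̂_m′D̂⁻¹r̂_m with D̂ = (κ̂_η − 1)² I_m − (κ̂_η − 1) Ĉ_m Ĵ⁻¹ Ĉ_m′, with empirical moments used for κ̂_η, Ĵ and Ĉ_m. The function name, the choice ε₀² = ε₁² for the initial value, and the use of the true parameters in place of the QMLE in the demonstration are all assumptions made for the example.

```python
import numpy as np


def portmanteau_arch1(eps, omega, alpha, m):
    """Portmanteau statistic n * r' D^{-1} r for an ARCH(1) fit (sketch).

    omega and alpha are meant to be the QMLE; eps is the observed series.
    """
    eps = np.asarray(eps, dtype=float)
    n = eps.size
    eps2 = eps ** 2
    # Lagged squares; the unobserved eps_0^2 is replaced by eps_1^2 (assumption).
    lag = np.concatenate(([eps2[0]], eps2[:-1]))
    sigma2 = omega + alpha * lag
    eta2 = eps2 / sigma2            # squared standardized residuals
    s = eta2 - 1.0
    # Row t: (1 / sigma_t^2) * d sigma_t^2 / d theta, with theta = (omega, alpha).
    g = np.column_stack((np.ones(n), lag)) / sigma2[:, None]
    kappa = np.mean(eta2 ** 2)      # empirical fourth moment of the residuals
    J = g.T @ g / n
    # Squared residual autocovariances at lags 1..m.
    r = np.array([np.sum(s[h:] * s[:-h]) / n for h in range(1, m + 1)])
    # Row h of C: -(1/n) * sum_t s_{t-h} * g_t.
    C = np.vstack([-(s[:-h, None] * g[h:]).sum(axis=0) / n
                   for h in range(1, m + 1)])
    D = (kappa - 1.0) ** 2 * np.eye(m) - (kappa - 1.0) * C @ np.linalg.solve(J, C.T)
    stat = n * r @ np.linalg.solve(D, r)
    return stat, r, D


# Demonstration on a simulated ARCH(1) path. The true parameters are plugged in
# as a stand-in for the QMLE, purely to exercise the function.
rng = np.random.default_rng(0)
n, omega0, alpha0 = 2000, 1.0, 0.3
eps = np.empty(n)
prev2 = omega0 / (1.0 - alpha0)     # start at the unconditional variance
for t in range(n):
    eps[t] = np.sqrt(omega0 + alpha0 * prev2) * rng.standard_normal()
    prev2 = eps[t] ** 2

stat, r, D = portmanteau_arch1(eps, omega0, alpha0, m=5)
print(f"portmanteau statistic with m=5: {stat:.3f}")
```

The statistic would then be compared with the (1 − α)-quantile of the χ²_m distribution, available for instance as `scipy.stats.chi2.ppf(1 - alpha, m)`.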

8.5 Application: Is the GARCH(1, 1) Model Overrepresented?

The GARCH(1, 1) model is by far the most widely used by practitioners who wish to estimate the volatility of daily returns. In general, this model is chosen a priori, without implementing any statistical identification procedure. This practice is motivated by the common belief that the GARCH(1, 1) (or its simplest asymmetric extensions) is sufficient to capture the properties of financial series and that higher-order models may be unnecessarily complicated.

We will show that, for a large number of series, this practice is not always statistically justified. We consider daily and weekly series of 11 returns (CAC, DAX, DJA, DJI, DJT, DJU, FTSE, Nasdaq, Nikkei, SMI and S&P 500) and five exchange rates. The observations cover the period from January 2, 1990 to January 22, 2009 for the daily returns and exchange rates, and from January 2, 1990 to January 20, 2009 for the weekly returns (except for the indices for which the first observations are after 1990). We begin with the portmanteau tests defined in Section 8.4. Table 8.4 shows that the ARCH models (even with large order q) are generally rejected, whereas the GARCH(1, 1) is only occasionally rejected. This table only concerns the daily returns, but similar conclusions hold for the weekly returns and exchange rates. The portmanteau tests are known to be omnibus tests, powerful for a broad spectrum of alternatives. As we will now see, for the specific alternatives for which they are built, the tests defined in Section 8.3 (Wald, score and likelihood ratio) may be much more powerful.

Table 8.4 Portmanteau test p-values for adequacy of the ARCH(5) and GARCH(1, 1) models for daily returns of stock market indices, based on m squared residual autocovariances. p-values less than 5% are in bold, those less than 1% are underlined.

Table 8.5 p-values for tests of the null of a GARCH(1, 1) model against the GARCH(1, 2), GARCH(1, 3), GARCH(1, 4) and GARCH(2, 1) alternatives, for returns of stock market indices and exchange rates. p-values less than 5% are in bold, those less than 1% are underlined.

The GARCH(1, 1) model is chosen as the benchmark model, and is successively tested against the GARCH(1, 2), GARCH(1, 3), GARCH(1, 4) and GARCH(2, 1) models. In each case, the three tests (Wald, score and likelihood ratio) are applied. The empirical p-values are displayed in Table 8.5. This table shows that: (i) the results of the tests strongly depend on the alternative; (ii) the p-values of the three tests can be quite different; (iii) for most of the series, the GARCH(1, 1) model is clearly rejected. Point (ii) is not surprising because the asymptotic equivalence between the three tests is only shown under the null hypothesis or under local alternatives. Moreover, because of the positivity constraints, it is possible (see, for instance, the DJU) that the estimated GARCH(1, 2) model satisfies α̂₂ = 0 with . In this case, when the estimators lie at the boundary of the parameter space and the score is strongly positive, the Wald and LR tests do not reject the GARCH(1, 1) model, whereas the score test does reject it. In other situations, the Wald or LR test rejects the GARCH(1, 1) whereas the score test does not (see, for instance, the DAX for the GARCH(1, 4) alternative). This study shows that it is often relevant to employ several tests and several alternatives. The conservative approach of Bonferroni (rejecting if the minimal p-value multiplied by the number of tests is less than a given level α) leads to rejection of the GARCH(1, 1) model for 16 out of the 24 series in Table 8.5. Other procedures, less conservative than that of Bonferroni, could also be applied (see Wright, 1992) without changing the general conclusion.
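The Bonferroni rule mentioned above is simple to apply in practice. The following sketch is only an illustration: the function name is hypothetical and the p-values in the example are made up, not taken from Table 8.5.

```python
def bonferroni_reject(p_values, alpha=0.05):
    """Reject the null if (smallest p-value) x (number of tests) < alpha."""
    if not p_values:
        raise ValueError("need at least one p-value")
    # Bonferroni-adjusted p-value, capped at 1.
    adjusted = min(1.0, min(p_values) * len(p_values))
    return adjusted < alpha, adjusted


# Hypothetical p-values for one series tested against four alternatives.
example = [0.004, 0.030, 0.210, 0.650]
reject, adjusted = bonferroni_reject(example)
print(f"adjusted p-value = {adjusted:.3f}, reject GARCH(1, 1): {reject}")
```

Because the adjustment multiplies the smallest p-value by the number of tests, a single strongly significant test is enough to reject, while several moderately small p-values are not.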

In conclusion, this study shows that the GARCH(1, 1) model is certainly overrepresented in empirical studies. The tests presented in this chapter are easily implemented and lead to selection of GARCH models that are more elaborate than the GARCH(1,1).

8.6 Proofs of the Main Results*

Proof of Theorem 8.1

We will split the proof into seven parts.

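(a) Asymptotic normality of the score vector. When θ₀ ∈ ∂Θ, the function σ_t²(θ) can take negative values in a neighborhood of θ₀, and ℓ_t(θ) = ε_t²/σ_t²(θ) + log σ_t²(θ) is then undefined in this neighborhood. Thus the derivative of ℓ_t(·) does not exist at θ₀. By contrast, the right derivatives exist, and the vector ∂ℓ_t(θ₀)/∂θ of the right partial derivatives is written as an ordinary derivative. The same convention is used for the higher-order derivatives, as well as for the right derivatives of l_n, ℓ̃_t and l̃_n at θ₀. With these conventions, the formulas for the derivatives of the criterion remain valid:

\[ \frac{\partial \ell_t(\theta)}{\partial\theta} = \left\{1 - \frac{\epsilon_t^2}{\sigma_t^2}\right\}\frac{1}{\sigma_t^2}\frac{\partial\sigma_t^2}{\partial\theta}, \qquad \frac{\partial^2 \ell_t(\theta)}{\partial\theta\,\partial\theta'} = \left\{1 - \frac{\epsilon_t^2}{\sigma_t^2}\right\}\frac{1}{\sigma_t^2}\frac{\partial^2\sigma_t^2}{\partial\theta\,\partial\theta'} + \left\{2\frac{\epsilon_t^2}{\sigma_t^2} - 1\right\}\frac{1}{\sigma_t^2}\frac{\partial\sigma_t^2}{\partial\theta}\,\frac{1}{\sigma_t^2}\frac{\partial\sigma_t^2}{\partial\theta'}. \tag{8.34} \]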

It is then easy to see that J = E ∂²ℓ_t(θ₀)/∂θ∂θ′ exists under the moment assumption B7. The ergodic theorem immediately yields

(8.35)

where J is invertible, by assumptions B4 and B5 (cf. Proof of Theorem 7.2). The convergence (8.5) then directly follows from Slutsky’s lemma and the central limit theorem given in Corollary A.1.

(b) Uniform integrability and continuity of the second-order derivatives. It will be shown that, for all ε > 0, there exists a neighborhood V(θ₀) of θ₀ such that, almost surely,

(8.36)

and

(8.37)

Using elementary derivative calculations and the compactness of Θ, it can be seen that

with

where K > 0 and 0 < ρ < 1. Since sup_θΘ, assumption B7 then entails that

In view of (8.34), the Hölder and Minkowski inequalities then show (8.36) for every neighborhood of θ₀. The ergodic theorem entails that

This expectation decreases to 0 when the neighborhood V(θ₀) decreases to the singleton {θ₀}, which shows (8.37).

(c) Convergence in probability of to θ₀ at rate . In view of (8.35), for n large enough, defines a norm. The definition (8.6) of entails that . The triangular inequality then implies that

where the last equality comes from the convergence in law of (Z_n, J_n) to (Z, J). This entails that .

(d) Quadratic approximation of the objective function. A Taylor expansion yields

where

and θ* is between θ and θ₀. Note that (8.37) implies that, for any sequence (θ_n) such that θ_n − θ₀ = o_P(1), we have R_n(θ_n) = o_P(‖θ_n − θ₀‖²). In particular, in view of (c), we have R_n{θ_{J_n}(Z_n)} = o_P(n⁻¹). Introducing the vector Z_n defined by (8.3), we can write

and

(8.38)

where

The initial conditions are asymptotically negligible, even when the parameter stays at the boundary. Result (d) of page 160 remaining valid, we have for any sequence (θ_n) such that θ_n → θ₀ in probability.

(e) Convergence in probability of θ̂_n to θ₀ at rate n^{1/2}. We know that

when θ₀ lies in the interior of Θ. We will show that this result remains valid when θ₀ ∈ ∂Θ. Theorem 7.1 applies. In view of (d), the almost sure convergence of θ̂_n to θ₀ and of J_n to the nonsingular matrix J, we have

and

Since l̃_n(·) is minimized at θ̂_n, we have

It follows that

where the last inequality follows from . The triangular inequality then yields

Thus

(f) Approximation of We have

where the first line comes from the definition of , the second line comes from (8.38), and the inequality in the third line follows from the fact that l̃_n(·) is minimized at θ̂_n, the final equality having been shown in (d). In view of (8.8), we conclude that

(8.39)

(g) Approximation of √n(θ̂_n − θ₀) by λ_n^Λ. The vector λ_n^Λ, which is the projection of Z_n on Λ with respect to the scalar product ⟨x, y⟩_{J_n} = x′J_n y, is characterized (see Lemma 1.1 in Zarantonello, 1971) by

(see Figure 8.1). Since θ̂_n ∈ Θ and Λ = lim ↑ √n(Θ − θ₀), we have almost surely √n(θ̂_n − θ₀) ∈ Λ for n large enough. The characterization then entails

Using (8.39), this yields

which entails (8.9), and completes the proof.
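The projection used in step (g) can be illustrated in the simplest setting. The sketch below assumes a two-dimensional parameter with only the second component on the boundary, so that the cone is Λ = ℝ × [0, ∞), and it uses the scalar product ⟨x, y⟩_J = x′Jy for a fixed positive definite matrix J. The function names and the numerical values are hypothetical. The code also checks the standard characterization of the projection λ of Z on a closed convex cone: ⟨Z − λ, λ⟩_J = 0 and ⟨Z − λ, x⟩_J ≤ 0 for all x in Λ.

```python
def inner(x, y, J):
    """Scalar product <x, y>_J = x' J y for 2-dimensional vectors."""
    return sum(x[i] * J[i][j] * y[j] for i in range(2) for j in range(2))


def project_on_half_plane(z, J):
    """Projection of z on Lambda = R x [0, inf) with respect to <., .>_J."""
    z1, z2 = z
    if z2 >= 0:
        return (z1, z2)  # z already belongs to the cone
    # Second component set to 0; first component minimizes the J-distance to z.
    return (z1 + J[0][1] * z2 / J[0][0], 0.0)


J = [[2.0, 1.0], [1.0, 3.0]]   # hypothetical positive definite matrix
z = (1.0, -2.0)                # hypothetical point outside the cone
lam = project_on_half_plane(z, J)
resid = (z[0] - lam[0], z[1] - lam[1])

print("projection:", lam)
print("<z - lam, lam>_J =", inner(resid, lam, J))
for x in [(-3.0, 4.0), (5.0, 0.0), (0.0, 1.0), (2.0, 2.5)]:
    print(f"<z - lam, x>_J for x={x}:", inner(resid, x, J))
```

Note that when J is not diagonal the projection shifts the unconstrained component as well, which is why the positivity constraints affect the asymptotic law of all the components of the estimator and not only of those on the boundary.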

Proof of Proposition 8.3

The first result is an immediate consequence of Slutsky’s lemma and of the fact that under H₀,

To show (8.23) in the standard case where θ₀ lies in the interior of Θ, the asymptotic distribution is established by showing that R_n − W_n = o_P(1). This equation does not hold true in our testing problem H₀ : θ = θ₀, where θ₀ is on the boundary of Θ. Moreover, the asymptotic distribution of W_n is not χ²_{d₂}. A more direct proof is thus necessary.

Since is a consistent estimator of , > 0, we have, for n large enough, > 0 and

Let

where . We again obtain

(8.40)

Since

(8.41)

a Taylor expansion shows that

where a =^c b means a = b + c. The last d₂ components of this vectorial relation yield

(8.42)

and the first d₁ components yield

using

(8.43)

We thus have

(8.44)

Using successively (8.40), (8.42) and (8.43), we obtain

Let

where W₁ and W₂ are vectors of respective sizes d₁ and d₂, and J₁₁ is of size d₁ × d₁. Thus

where the last equality comes from Exercise 6.7. Using (8.44), the asymptotic distribution of R_n is thus that of

which follows a χ²_{d₂} distribution because it is easy to check that

We have thus shown (8.23).

Now we show (8.24). Using (8.43) and (8.44), several Taylor expansions yield

and

By subtraction,

It can be checked that

Thus, the asymptotic distribution of L_n is that of

Moreover, it can easily be verified that

It follows that

which gives the first equality of (8.24). The second equality follows using Exercise 8.2.

Proof of Proposition 8.4

Since Cov(v_t, ), we have

and the result follows from (see Proposition 2.2).

Proof of Theorem 8.2

We first study the asymptotic impact of the unknown initial values on the statistic r̂_m. Introduce the vector r_m = (r(1), …, r(m))′, where

Let s_t(θ) (s̃_t(θ)) be the random variable obtained by replacing η_t by η_t(θ) = ε_t/σ_t(θ) (η̃_t(θ) = ε_t/σ̃_t(θ)) in s_t. The vectors r_m(θ) and r̃_m(θ) are defined similarly, so that r_m = r_m(θ₀) and r̂_m = r̃_m(θ̂_n). Write a =^c b when a = b + c. Using (7.30) and the arguments used to show (d) on page 160, it can be shown that, as n → ∞,

(8.45)

We now show that the asymptotic distribution of r̂_m is a function of the joint asymptotic distribution of r_m and of the QMLE. By the arguments used to show (c) on page 160, it can be shown that there exists a neighborhood V(θ₀) of θ₀ such that

(8.46)

Using (8.45) and the fact that √n(θ̂_n − θ₀) = O_P(1), a Taylor expansion of r_m(·) around θ̂_n and θ₀ shows that

for some θ* between θ̂_n and θ₀. Using (8.46), the ergodic theorem, the strong consistency of the QMLE, and a second Taylor expansion, we obtain

where

For the next to last equality, we use the fact that . It follows that

(8.47)

We now derive the asymptotic distribution of (r_m, θ̂_n − θ₀). In the proof of Theorem 7.2, it is shown that

(8.48)

as n → ∞, where

Note that where s_{t−1:t−m} = (s_{t−1}, …, s_{t−m})′. In view of (8.48), the central limit theorem applied to the martingale difference shows that

(8.49)

where

Using (8.47) and (8.49) together, we obtain

We now show that D is invertible. Because the law of η_t² is nondegenerate, we have κ_η > 1. We thus have to show the invertibility of

If the previous matrix is singular then there exists λ = (λ₁, …, λ_m)′ such that λ ≠ 0 and

(8.50)

with μ = λ′C_mJ⁻¹. Note that μ = (μ₁, …, μ_{p+q+1})′ ≠ 0. Otherwise λ′s_{−1:−m} = 0 a.s., which implies that there exists j ∈ {1, …, m} such that s_{−j} is measurable with respect to σ{s_t, t ≠ −j}. This is impossible because the s_t are independent and nondegenerate by assumption A3 on page 144 (see Exercise 11.3). Denoting by R_t any random variable measurable with respect to σ{η_u, u ≤ t}, we have

and

Thus (8.50) entails that

Solving this quadratic equation in shows that either which is impossible by arguments already given, or λ₁α₁ = 0. Let λ′_2:m = (λ₂, …, λ_m)′. If λ₁ = 0 then (8.50) implies that

Taking the expectation with respect to σ{η_t, t ≤ −2}, it can be seen that in the previous equality. Thus we have

which entails α₁ = μ₂ = 0, because P {(λ₂,…, λ_m)′s_−2:−m = 0} < 1 (see Exercise 8.12). For GARCH (p, 1) models, it is impossible to have α₁ = 0 by assumption A4. The invertibility of D is thus shown in this case. In the general case, we show by induction that (8.50) entails α₁ = … α_p.

It is easy to show that D̂ → D in probability (and even almost surely) as n → ∞. The conclusion follows.

8.7 Bibliographical Notes

It is well known that when the parameter is at the boundary of the parameter space, the maximum likelihood estimator does not necessarily satisfy the first-order conditions and, in general, does not admit a limiting normal distribution. The technique, employed in particular by Chernoff (1954) and Andrews (1997) in a general framework, involves approximating the quasi-likelihood by a quadratic function, and defining the asymptotic distribution of the QML as that of the projection of a Gaussian vector on a convex cone. Particular GARCH models are considered by Andrews (1997, 1999) and Jordan (2003). The general GARCH(p, q) case is considered by Francq and Zakoïan (2007). A proof of Theorem 8.1, when the moment assumption B7 is replaced by assumption B7′ of Remark 8.2, can be found in the latter reference. When the nullity of GARCH coefficients is tested, the parameter is at the boundary of the parameter space under the null, and the alternative is one-sided. Numerous works deal with testing problems where, under the null hypothesis, the parameter is at the boundary of the parameter space. Such problems have been considered by Chernoff (1954), Bartholomew (1959), Perlman (1969) and Gouriéroux, Holly and Monfort (1982), among many others. General one-sided tests have been studied by, for instance, Rogers (1986), Wolak (1989), Silvapulle and Silvapulle (1995) and King and Wu (1997). Papers dealing more specifically with ARCH and GARCH models are Lee and King (1993), Hong (1997), Demos and Sentana (1998), Andrews (2001), Hong and Lee (2001), Dufour et al. (2004) and Francq and Zakoïan (2009b).

The portmanteau tests based on the squared residual autocovariances were proposed by McLeod and Li (1983), Li and Mak (1994) and Ling and Li (1997). The results presented here closely follow Berkes, Horváth and Kokoszka (2003a). Problems of interest that are not studied in this book are the tests on the distribution of the iid process (see Horváth, Kokoszka and Teyssière, 2004; Horváth and Zitikis, 2006).

Concerning the overrepresentation of the GARCH(1, 1) model in financial studies, we mention Stărică (2006). This paper highlights, on a very long S&P 500 series, the poor performance of the GARCH(1, 1) in terms of prediction and modeling, and suggests a nonstationary dynamics of the returns.

8.8 Exercises

8.1 (Minimization of a distance under a linear constraint)

Let J be an n × n invertible matrix, let x₀ be a vector of ℝⁿ, and let K be a full-rank p × n matrix, p ≤ n. Solve the problem of the minimization of Q(x) = (x − x₀)′J(x − x₀) under the constraint Kx = 0.

8.2 (Minimization of a distance when some components are equal to zero)

Let J be an n × n invertible matrix, x₀ a vector of ℝⁿ and p < n. Minimize Q(x) = (x − x₀)′J(x − x₀) under the constraints x_{i₁} = ⋯ = x_{i_p} = 0 (x_i denoting the ith component of x, and assuming that 1 ≤ i₁ < … < i_p ≤ n).

8.3 (Lagrangian or method of substitution for optimization with constraints)

Compare the solutions (C.13) and (C.14) of the optimization problem of Exercise 8.2, with

and the constraints

(a) x₃ = 0,

(b) x₂ = x₃ = 0.

8.4 (Minimization of a distance under inequality constraints)

Find the minimum of the function

under the constraints λ₂ ≥ 0 and λ₃ ≥ 0, when

(i) Z = (−2, 1, 2)′,

(ii) Z = (−2,−1,2)′,

(iii) Z = (−2, 1, −2)′,

(iv) Z = (−2, −1, −2)′.

8.5 (Influence of the positivity constraints on the moments of the QMLE)

Compute the mean and variance of the vector λ^Λ defined by (8.16). Compare these moments with the corresponding moments of Z = (Z₁, Z₂, Z₃)′.

8.6 (Asymptotic distribution of the QMLE of an ARCH in the conditionally homoscedastic case)

For an ARCH(q) model, compute the matrix Σ involved in the asymptotic distribution of the QMLE in the case where all the α_0i are equal to zero.

8.7 (Asymptotic distribution of the QMLE when an ARCH(1) is fitted to a strong white noise)

Let θ̂ = (ω̂, α̂)′ be the QMLE in the ARCH(1) model when the true parameter is equal to (ω₀, α₀) = (ω₀, 0) and when . Give an expression for the asymptotic distribution of √n(θ̂ − θ₀) with the aid of

Compute the mean vector and the variance matrix of this asymptotic distribution. Determine the density of the asymptotic distribution of √n(ω̂ − ω₀). Give an expression for the kurtosis coefficient of this distribution as a function of κ_η.

8.8 (One-sided and two-sided tests have the same Bahadur slopes)

Let X₁, …, X_n be a sample from the N(θ, 1) distribution. Consider the null hypothesis H₀ : θ = 0. Denote by Φ the N(0, 1) cumulative distribution function. By the Neyman–Pearson lemma, we know that, for alternatives of the form H₁ : θ > 0, the one-sided test of rejection region

is uniformly more powerful than the two-sided test of rejection region

(moreover, C is uniformly more powerful than any other test of level α or less). Although we just have seen that the test C is superior to the test C* in finite samples, we will conduct an asymptotic comparison of the two tests, using the Bahadur and Pitman approaches.

• The asymptotic Bahadur slope c(θ) is defined as the almost sure limit of − 2/n times the logarithm of the p-value under P_θ, when the limit exists. Compare the Bahadur slopes of the two tests.

• In the Pitman approach, we define a local power around θ = 0 as being the power at τ/√n. Compare the local powers of C and C*. Compare also the local asymptotic powers of the two tests for non-Gaussian samples.

8.9 (The local asymptotic approach cannot distinguish the Wald, score and likelihood ratio tests)

Let X₁, …, X_n be a sample of the N(θ, τ²) distribution, where θ and τ² are unknown. Consider the null hypothesis H₀ : θ = 0 against the alternative H₁ : θ > 0. Consider the following three tests:

• , where

is the Wald statistic;

• . where

is the Rao score statistic;

• , where

is the likelihood ratio statistic.

Give a justification for these three tests. Compare their local asymptotic powers and their Bahadur slopes.

8.10 (The Wald and likelihood ratio statistics have the same asymptotic distribution)

Consider the case d₂ = 1, that is, the framework of Section 8.3.3 where only one coefficient is equal to zero. Without using Remark 8.3, show that the asymptotic laws W and L defined by (8.22) and (8.24) are such that

8.11 (For testing conditional homoscedasticity, the Wald and likelihood ratio statistics have the same asymptotic distribution)

Repeat Exercise 8.10 for the conditional homoscedasticity test (8.26) in the ARCH(q) case.

8.12 (The product of two independent random variables is null if and only if one of the two variables is null)

Let X and Y be two independent random variables such that XY = 0 almost surely. Show that either X = 0 almost surely or Y = 0 almost surely.

¹ The term ‘approximate’ is used by Bahadur (1960) to emphasize that the exact survival function (t) is approximated by the asymptotic survival function S_W (t). See also Bahadur (1967) for a discussion on the exact and approximate slopes.
