Chapter 3 Further Thinning-based Models for Count Time Series
After having introduced important tasks and approaches for analyzing count time series, we shall now return to the question of how to model the underlying process. The main characteristic of the INAR(1) model in Section 2.2 is the use of the binomial thinning operator as a substitute for the multiplication, to be able to transfer the AR(1) recursion to the count data case. In Section 3.1, we shall see that this approach can also be used to define higher-order ARMA-like models. Furthermore, different types of thinning operation have been developed for such models; see Section 3.2. Finally, various thinning-based models to deal with count time series with a finite range (Section 3.3) and multivariate count time series (Section 3.4) are also available in the literature.
3.1 Higher-order INARMA Models
The INAR(1) model, as introduced in Section 2.2, was developed as an integer-valued counterpart to the conventional AR(1) model, mainly by replacing the multiplication in the AR(1) recursion with the binomial thinning operator. This idea is not limited to the first-order autoregressive case, but can also be used to mimic higher-order ARMA models; see Appendix B. The resulting models are then referred to as INARMA models.
While both the INAR(1) and the INMA(1) recursion involve only one thinning operation, the higher-order models need more than one thinning operation at a time. As an example, the counterpart to the full MA(q) model, the INMA(q) model, is defined by a recursion of the form

X_t = ε_t + β_1 ∘_t ε_{t−1} + ... + β_q ∘_t ε_{t−q},    (3.2)

where the thinnings at time t are performed independently of each other (cf. Example 3.1.1). Here, a time index t has been added below the operator "∘" to emphasize the fact that these are the thinnings being executed at time t. Now focusing on one particular innovation, say ε_t, it becomes clear that this innovation is altogether involved in q thinnings, namely β_1 ∘_{t+1} ε_t, ..., β_q ∘_{t+q} ε_t. Since the thinning operations are probabilistic (in contrast to the multiplication used for an MA(q) model), the joint distribution of these thinnings has to be considered – that is, the conditional distribution of (β_1 ∘_{t+1} ε_t, ..., β_q ∘_{t+q} ε_t) given ε_t – thereby leading to different types of models of the same model order q.
Until now, a total of four different INMA(q) models have been proposed in the literature, each having slightly different interpretations and probabilistic properties; see Al-Osh & Alzaid (1988), McKenzie (1988), Brännäs & Hall (2001) and Weiß (2008b). To be more precise, the marginal properties are already fixed by definition (3.2), namely:

E[X_t] = μ_ε (1 + β_1 + ... + β_q),
V[X_t] = σ_ε² (1 + β_1² + ... + β_q²) + μ_ε (β_1(1−β_1) + ... + β_q(1−β_q)),

where μ_ε = E[ε_t] and σ_ε² = V[ε_t]. The joint distributions, however, differ between the different types of INMA(q) processes, as we shall see below. But first, let us look at an example: as for the INAR(1) model (Section 2.2.2), the Poisson distribution plays an important role.
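To make recursion (3.2) concrete, the following minimal Python sketch simulates a path of the INMA(q) independence model with Poisson innovations. The function names and the NumPy-based setup are our own illustrative choices, not part of the cited literature:

```python
import numpy as np

rng = np.random.default_rng(seed=1)

def thin(x, beta):
    # Binomial thinning beta o x: sum of x i.i.d. Bernoulli(beta) variables.
    return rng.binomial(x, beta)

def simulate_inma_independence(n, betas, lam):
    # INMA(q) independence model with Poisson(lam) innovations:
    # X_t = eps_t + beta_1 o eps_{t-1} + ... + beta_q o eps_{t-q},
    # all thinnings performed independently of each other (McKenzie, 1988).
    q = len(betas)
    eps = rng.poisson(lam, size=n + q)     # includes q pre-sample innovations
    x = np.empty(n, dtype=int)
    for t in range(n):
        recent_first = eps[t:t + q][::-1]  # eps_{t-1}, ..., eps_{t-q}
        x[t] = eps[t + q] + sum(thin(e, b) for e, b in zip(recent_first, betas))
    return x
```

For a long simulated path, the sample mean should be close to the theoretical marginal mean μ_ε(1 + β_1 + ... + β_q) stated above.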
To be able to define different types of INMA(q) models, we follow the approach in Weiß (2008b) and look at the individual counting series. Let Z_{t,j}^{(s)} ∈ {0, 1} be the jth counting variable of the thinning applied to ε_s at time t ∈ {s+1, ..., s+q}, with j ∈ {1, ..., ε_s}. These might be interpreted as indicators for the individuals being introduced to the considered system at time s ("generation s"). If Z_{t,j}^{(s)} = 1, then the jth individual of generation s is active at time t, where each individual has "lifetime" q.
In view of the general definition (3.2), the q-dimensional vectors (Z_{s+1,j}^{(s)}, ..., Z_{s+q,j}^{(s)}) (each corresponding to one specific individual) have to be i.i.d., but their components might be dependent, thus restricting the activation of an individual during its lifetime. The independence model by McKenzie (1988), for instance, assumes the components of each such vector to be mutually independent (so there are no further restrictions concerning the activation of an individual), while the sale model by Brännäs & Hall (2001) requires Z_{s+1,j}^{(s)} + ... + Z_{s+q,j}^{(s)} ≤ 1 and defines the vector (Z_{s+1,j}^{(s)}, ..., Z_{s+q,j}^{(s)}, 1 − Σ_{i=1}^q Z_{s+i,j}^{(s)}) to be multinomially distributed according to Mult(1; β_1, ..., β_q, 1 − β_1 − ... − β_q); see Example A.3.3. So the sale model assumes that each individual becomes active at most once during its lifetime; an example would be if the "individuals" are perishable goods being produced at day s, and they become "active" when they are sold during their shelf-life. Weiß (2008b) analyzed the serial dependence structure of all these INMA(q) models; among other things, he showed that

Cov[X_t, X_{t−k}] = σ_ε² β_k + Σ_{m=1}^{q−k} ( (σ_ε² − μ_ε) β_m β_{m+k} + μ_ε E[Z_{s+m,j}^{(s)} Z_{s+m+k,j}^{(s)}] )  for k = 1, ..., q.

So the autocovariance function for different INMA(q) models differs according to the last term. For example, for the independence model by McKenzie (1988), we have E[Z_{s+m,j}^{(s)} Z_{s+m+k,j}^{(s)}] = β_m β_{m+k}, while E[Z_{s+m,j}^{(s)} Z_{s+m+k,j}^{(s)}] = 0 for the sale model, because it is impossible that an individual becomes active twice.
Two types of INAR(p) models have been proposed by Alzaid & Al-Osh (1990) and Du & Li (1991), both being based on the recursion

X_t = α_1 ∘ X_{t−1} + ... + α_p ∘ X_{t−p} + ε_t,    (3.5)

where α_1 + ... + α_p < 1 is assumed. Obviously, the conditional distribution of (α_1 ∘ X_{t−1}, ..., α_p ∘ X_{t−p}) given (X_{t−1}, ..., X_{t−p}) now has to be specified. Du & Li (1991) assume conditional independence (by analogy to the INMA(q) independence model), while Alzaid & Al-Osh (1990) assume a conditional multinomial distribution (by analogy to the INMA(q) sale model):

(α_1 ∘ X_t, ..., α_p ∘ X_t) given X_t is distributed according to Mult(X_t; α_1, ..., α_p);

see Example A.3.3. Further specialized INAR(p) models can be defined by refining Equation 3.5 by analogy to the INMA(q) case; that is, by considering the counting series of the thinnings applied to X_s ("generation s") at times s+1, ..., s+p and by specifying the joint distribution of the corresponding counting variables.
Let us now look at properties of INAR(p) models. The stationary marginal mean is always given by μ = μ_ε / (1 − α_1 − ... − α_p). For the DL-INAR(p) model, the variance γ(0) = V[X_t] satisfies

V[X_t] = Σ_{i,j=1}^p α_i α_j γ(|i−j|) + μ Σ_{i=1}^p α_i(1−α_i) + σ_ε²,    (3.6)

where γ(·) denotes the autocovariance function; see Silva & Oliveira (2005) for higher-order joint moments. The ACF is obtained from the conventional AR(p) Yule–Walker equations (see (B.13)); that is,

ρ(k) = α_1 ρ(|k−1|) + ... + α_p ρ(|k−p|)  for k ≥ 1,    (3.7)

as shown by Du & Li (1991). For the AA-INAR(p) model, in contrast, Alzaid & Al-Osh (1990) derived an ARMA-like autocorrelation structure, which seems to be the main reason why the DL-INAR(p) model is usually preferred in practice. But there are also reasons why the AA-INAR(p) model is attractive: if it has Poisson innovations, its observations are also Poisson distributed and the whole process is time-reversible (Alzaid & Al-Osh, 1990; Schweer, 2015), which is analogous to the Gaussian AR(p) case. The DL-INAR(p) process, in contrast, is time-reversible only in trivial cases (Schweer, 2015), and equidispersed innovations generally do not imply equidispersed observations (Weiß, 2013a). Note that for p = 1, (3.6) simplifies to

V[X_t] = (α(1−α) μ + σ_ε²) / (1 − α²).
The conditional mean and variance of the DL-INAR(p) process are given by

E[X_t | X_{t−1}, X_{t−2}, ...] = α_1 X_{t−1} + ... + α_p X_{t−p} + μ_ε,
V[X_t | X_{t−1}, X_{t−2}, ...] = α_1(1−α_1) X_{t−1} + ... + α_p(1−α_p) X_{t−p} + σ_ε².    (3.8)
The transition probabilities of the DL-INAR(p) process, which is Markovian of order p, can be computed by utilizing the fact that the conditional distribution is a convolution of p binomial distributions and the innovations' distribution (Drost et al., 2009); see (3.12) below for an illustration. Mixing and weak dependence properties of the DL-INAR(p) process are discussed by Doukhan et al. (2012, 2013), and a frequency-domain analysis is considered by Silva & Oliveira (2005).
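The DL-INAR(p) recursion is straightforward to simulate, since the thinnings are conditionally independent. The following sketch (our own illustration, with Poisson innovations as a convenient choice) generates a stationary path after a burn-in period:

```python
import numpy as np

rng = np.random.default_rng(seed=2)

def simulate_dl_inar(n, alphas, lam, burn_in=500):
    # DL-INAR(p) model (Du & Li, 1991) with Poisson(lam) innovations:
    # X_t = alpha_1 o X_{t-1} + ... + alpha_p o X_{t-p} + eps_t,
    # all thinnings performed independently of each other.
    p = len(alphas)
    x = [0] * p                            # arbitrary pre-sample values
    for _ in range(n + burn_in):
        survivors = sum(rng.binomial(x[-i], a) for i, a in enumerate(alphas, 1))
        x.append(survivors + rng.poisson(lam))
    return np.array(x[-n:])
```

For a long path, the sample mean should be close to μ_ε/(1 − α_1 − ... − α_p), and the empirical ACF should agree with the Yule–Walker equations (3.7).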
In a nutshell, while the INAR(1) model, with its intuitive interpretation and its simple stochastic properties, is very attractive for applications, higher-order extensions are not straightforward. Therefore, in Chapter 4, we discuss an alternative concept for ARMA-like count process models.
To illustrate the potential of some of the extensions mentioned in Remark 3.1.7, let us look at simulated sample paths. The first two parts of Figure 3.4 show paths of non-stationary Poisson INAR(1) processes, where the thinning parameter is kept constant in time, but the innovations' mean varies in time as a function of z_t, the covariate information at time t (Freeland & McCabe, 2004a). Such trend and seasonality are often observed in practice; corresponding real-data examples are presented in Examples 5.1.6 and 5.1.7. Figure 3.4c shows a sample path with a piecewise pattern, which was generated by a simple SET extension of the Poisson INAR(1) model with one threshold and delay 1; see Monteiro et al. (2012) for further background.
3.2 Alternative Thinning Concepts
The idea behind a thinning operation (and the related time series models) can be modified in diverse ways; see the surveys by Weiß (2008a) and Scotto et al. (2015). Such modified thinning concepts allow for different stochastic properties and alternative interpretation schemes. We shall pick out two of these alternative thinning concepts here, but many further approaches are described in Weiß (2008a) and Scotto et al. (2015).
In view of the BPIs according to Remark 2.1.1.2, the generalized thinning operation, as proposed by Latour (1998), appears reasonable:

α ∘ X := Σ_{j=1}^X Z_j,    (3.14)

where the random variables Z_j (counting series) are allowed to have the full range N_0 = {0, 1, 2, ...} instead of only {0, 1}. Here, the Z_j are required to have mean α and variance β. Since the Z_j may now become larger than 1, the interpretation (2.4) as "survival indicators" is no longer appropriate, but they can be understood as describing a reproduction mechanism: as in Remark 2.1.1.2, Z_j might be the number of children being generated by the jth individual of the population behind X.
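The reproduction interpretation translates directly into code. In the following sketch, the offspring mechanism is passed in as a sampler function (`offspring_sampler` is our own illustrative argument, not notation from the literature); with Poisson offspring we obtain a counting series whose range exceeds {0, 1}:

```python
import numpy as np

rng = np.random.default_rng(seed=3)

def gen_thin(x, offspring_sampler):
    # Generalized thinning (Latour, 1998): each of the x individuals behind X
    # reproduces independently; the counting series Z_j may take any value in
    # {0, 1, 2, ...}. 'offspring_sampler(k)' draws the numbers of children
    # for k parents.
    return int(offspring_sampler(x).sum()) if x > 0 else 0

# Poisson(0.5) offspring: the conditional mean is 0.5 * x, as for binomial
# thinning with alpha = 0.5, but the thinned value may exceed x itself.
draws = [gen_thin(8, lambda k: rng.poisson(0.5, size=k)) for _ in range(100_000)]
```

Note that binomial thinning is recovered as the special case where the offspring sampler draws Bernoulli variables.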
Using this generalized thinning operation (which includes binomial thinning as a special case), Latour (1998) extended the INAR(p) model (3.5) by Du & Li (1991) to the GINAR(p) model, which is defined as

X_t = α_1 ∘ X_{t−1} + ... + α_p ∘ X_{t−p} + ε_t,    (3.15)

where again α_1 + ... + α_p < 1 is assumed. In (3.15), a time index for the thinning operations is omitted for the sake of readability, but it is understood that all thinnings are performed independently, as in the model by Du & Li (1991).
Although (3.15) generalizes (3.5), the formula for the marginal mean as well as the Yule–Walker equations (3.7) still hold, while formula (3.6) for the variance has to be modified (Latour, 1998):

V[X_t] = Σ_{i,j=1}^p α_i α_j γ(|i−j|) + μ Σ_{i=1}^p β_i + σ_ε²,    (3.16)

where β_i denotes the variance of the counting series of the ith thinning.
A particular instance of generalized thinning, which has received considerable interest in the literature, is the negative binomial thinning operator "∗" of Ristić et al. (2009), for which the Z_j are geometrically distributed with mean α (hence variance α(1+α); see Example A.1.5). Therefore, α ∗ X given X is NB-distributed due to the additivity of the NB distribution (Example A.1.4). Ristić et al. (2009) use this operation to construct a first-order process having a geometric marginal distribution, and refer to it as the new geometric INAR(1) process, abbreviated as NGINAR(1). Also the Poisson INARCH models, discussed later in Section 4.1, might be understood as particular GINAR models using Poisson thinning (Rydberg & Shephard, 2000; Weiß, 2015a).
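A small experiment illustrates negative binomial thinning via the NB identity stated above: a sum of x i.i.d. geometric variables with mean α follows an NB distribution with mean αx (here parametrized through NumPy's `negative_binomial`, which counts failures before x successes; this implementation choice is ours). Compared with binomial thinning at the same α, the conditional mean agrees, but the conditional variance is larger:

```python
import numpy as np

rng = np.random.default_rng(seed=4)

def nb_thin(x, alpha):
    # Negative binomial thinning (Ristic et al., 2009): the counting series
    # are geometric with mean alpha, so that, given X = x, the thinned value
    # is NB-distributed with mean alpha * x.
    if x == 0:
        return 0
    return rng.negative_binomial(x, 1.0 / (1.0 + alpha))

# Same conditional mean as binomial thinning, but larger conditional variance:
x, alpha, reps = 10, 0.4, 200_000
nb = np.array([nb_thin(x, alpha) for _ in range(reps)])
bi = rng.binomial(x, alpha, size=reps)
```

Here both samples have mean αx = 4, while the negative binomial thinning produces visibly more spread, reflecting its larger counting-series variance α(1+α).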
Another family of thinning operations assumes the counting series to be Bernoulli distributed, but now the thinning probability is allowed to be random itself. The resulting thinning operation is then called random coefficient (RC) thinning, and it has been applied in the context of count time series modeling by a number of authors, including Joe (1996) and Zheng et al. (2007). As a specific instance (Joe, 1996), we consider the case where the random thinning probability α_t follows a Beta(a, b) distribution with a, b > 0; that is, where E[α_t] = a/(a+b) and V[α_t] = ab / ((a+b)²(a+b+1)). Then the conditional distribution of α_t ∘ X given X is a beta-binomial distribution (see Example A.2.2), and the thinning operation is referred to as beta-binomial thinning accordingly.
A counterpart to the INAR(1) model from Definition 2.1.1.1 using a general random coefficient thinning operation "∘_t" (that is, where the distribution of the random thinning probability α_t on [0, 1] is only required to have mean α and a certain variance σ_α², but is not further specified) was investigated by Zheng et al. (2007). Their RCINAR(1) model is defined by

X_t = α_t ∘ X_{t−1} + ε_t.    (3.17)

While the (conditional) mean and ACF remain as in the INAR(1) case (Section 2.2.1), the effect of the additional uncertainty manifests itself in the (conditional) variance (Zheng et al., 2007):

V[X_t | X_{t−1}] = (α(1−α) − σ_α²) X_{t−1} + σ_α² X_{t−1}² + σ_ε².    (3.18)

Note that V[X_t | X_{t−1}] is a quadratic function of X_{t−1}, and the observations are overdispersed even if the innovations are equidispersed.
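The overdispersion effect is easy to check by simulation. The sketch below (our own illustration; Poisson innovations and a Beta distribution for the random thinning probability are convenient choices) draws the thinning probability anew at each time step:

```python
import numpy as np

rng = np.random.default_rng(seed=5)

def simulate_rcinar1(n, a, b, lam, burn_in=500):
    # RCINAR(1) model (Zheng et al., 2007) with beta-binomial thinning:
    # at each t, the thinning probability alpha_t is drawn anew from
    # Beta(a, b), so alpha = a / (a + b); innovations are Poisson(lam).
    x, out = 0, []
    for _ in range(n + burn_in):
        x = rng.binomial(x, rng.beta(a, b)) + rng.poisson(lam)
        out.append(x)
    return np.array(out[-n:])

x = simulate_rcinar1(50_000, a=2.0, b=3.0, lam=2.0)   # alpha = 0.4
```

Despite the equidispersed Poisson innovations, the simulated observations show an index of dispersion clearly above 1, in line with the quadratic conditional variance.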
Joe (1996) and Sutradhar (2008) considered the instance of the RCINAR(1) model where beta-binomial thinning "∘_t" is used. If the beta parameters of the thinning and the NB parameters of the innovations are matched appropriately, then the stationary marginal distribution of

X_t = α_t ∘ X_{t−1} + ε_t,

omitting the time index at the thinning operation, is also negative binomial. So innovations and observations are from the same family of distributions and have a unique index of dispersion, both being completely analogous to the case of a Poisson INAR(1) model (Section 2.2.2).
Zheng et al. (2006) extended the RCINAR(1) model to the pth-order RCINAR(p) model, analogous to the INAR(p) model of Du & Li (1991); that is, with thinnings being performed independently (Section 3.1).
3.3 The Binomial AR Model
In many applications, it is known that the observed count data cannot become arbitrarily large; their range has a natural upper bound that can never be exceeded. The models discussed up to now – the INAR(1) model from Definition 2.1.1.1 and all its extensions – can only be applied to count processes having the infinite range N_0. As an example, if we want to guarantee that X_t does not become larger than some N, then the range of the innovation ε_t at time t would have to be restricted to {0, ..., N − α ∘ X_{t−1}}, which would contradict the innovations' i.i.d. assumption. So different solutions are required for "finite-valued" counts.
One such solution for time series of counts with the range {0, ..., N}, where the upper limit N ∈ N is fixed, is the binomial AR(1) model, which was proposed by McKenzie (1985) together with the INAR(1) model. It replaces the INAR(1)'s innovation term by an additional thinning, β ∘ (N − X_{t−1}), such that this term cannot become larger than N − X_{t−1} (Definition 3.3.1):

X_t = α ∘ X_{t−1} + β ∘ (N − X_{t−1}),  where β := π(1−ρ) and α := β + ρ,

with π ∈ (0, 1), ρ ∈ (max{−π/(1−π), −(1−π)/π}, 1), and all thinnings performed independently of each other.
The condition on ρ guarantees that the derived parameters satisfy α, β ∈ (0, 1); that is, these parameters can indeed serve as thinning probabilities.
The binomial AR(1) model of Definition 3.3.1 is easy to interpret (Weiß, 2009a). Suppose that we have a system of N mutually independent units, each being either in state "1" or state "0". Let X_t be the number of units in state "1" at time t. Then α ∘ X_{t−1} is the number of units still being in state "1" at time t, with individual transition probability α ("survival probability"), while β ∘ (N − X_{t−1}) is the number of units which moved from state "0" to state "1" at time t, with individual transition probability β ("revival probability").
It is known that the binomial AR(1) process is a stationary, ergodic and φ-mixing finite Markov chain (also see Appendix B.2.2). Its marginal distribution is Bin(N, π), and the (truly positive) 1-step-ahead transition probabilities are given by

p(k | l) = Σ_{m=max(0, k+l−N)}^{min(k, l)} C(l, m) α^m (1−α)^{l−m} · C(N−l, k−m) β^{k−m} (1−β)^{N−l−k+m},    (3.21)

where C(n, k) denotes the binomial coefficient. Closed-form expressions for the corresponding h-step-ahead regression properties are obtained by replacing ρ by ρ^h (Weiß & Pollett, 2012). Also note that the eigenvalues of the transition matrix according to (3.21) are just given by ρ^0, ρ^1, ..., ρ^N (Weiß, 2009a). So the speed of convergence of the process towards its stationary distribution is determined by the value of |ρ| according to the Perron–Frobenius theorem (Remark B.2.2.1). The ACF of the binomial AR(1) model is of AR(1)-type, given by ρ(k) = ρ^k for k ≥ 0 (note that ρ might become negative). Closed-form expressions for higher-order joint moments of such a process are provided by Weiß & Kim (2013); the binomial AR(1) process is also time-reversible (McKenzie, 1985).
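The following sketch simulates a binomial AR(1) path from the recursion of Definition 3.3.1 (the function name and NumPy setup are our own illustration). For a long path, the sample mean and variance should match the Bin(N, π) marginal, and the lag-1 autocorrelation should be close to ρ:

```python
import numpy as np

rng = np.random.default_rng(seed=6)

def simulate_bin_ar1(n, N, pi, rho, burn_in=500):
    # Binomial AR(1) model (McKenzie, 1985):
    # X_t = alpha o X_{t-1} + beta o (N - X_{t-1}),
    # with beta = pi * (1 - rho) and alpha = beta + rho; marginal Bin(N, pi).
    beta = pi * (1.0 - rho)
    alpha = beta + rho
    x, out = rng.binomial(N, pi), []
    for _ in range(n + burn_in):
        x = rng.binomial(x, alpha) + rng.binomial(N - x, beta)
        out.append(x)
    return np.array(out[-n:])

x = simulate_bin_ar1(50_000, N=10, pi=0.3, rho=0.5)
```

Note that, by construction, every simulated value stays within {0, ..., N}, in contrast to the unbounded INAR(1) recursion.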
A pth-order autoregressive version, which uses a probabilistic mixing approach analogous to the CINAR(p) model from Remark 3.1.5 and thus preserves the binomial marginal distribution, was proposed by Weiß (2009b). For the resulting binomial AR(p) model, defined by
completely analogous to (3.9), the special case of independent thinnings is again relevant. It leads to a pth-order Markov process, the ACF of which satisfies the Yule–Walker equations
3.24
The conditional mean and transition probabilities are given by
3.25
Higher-order (factorial) moments follow in an analogous way to (3.11),
3.26
The binomial AR(1) model and its pth-order extension have a binomial marginal distribution; in particular, their binomial index of dispersion according to (2.3) equals 1. To allow for time-dependent and finite-range counts with extra-binomial variation, Weiß & Kim (2014) proposed replacing the binomial thinning operations in Definition 3.3.1 (or in (3.23), respectively) by beta-binomial ones (see Section 3.2). The resulting beta-binomial AR(1) model is characterized by the recursion

X_t = α ∘ X_{t−1} + β ∘ (N − X_{t−1}),    (3.27)

where both (now beta-binomial) thinnings use one and the same dispersion parameter. Many stochastic properties are analogous to those of the binomial AR(1) model, but the beta-binomial AR(1) model exhibits extra-binomial variation (Weiß & Kim, 2014):
3.28
The transition probabilities are obtained from (3.21) by replacing the involved binomial distributions by beta-binomial ones (Example A.2.2). They can be used to compute the stationary marginal distribution as well as any h-step-ahead forecasting distribution, exactly as described in Appendix B.2.1, since the beta-binomial AR(1) process is a finite Markov chain. The conditional mean is the same as in (3.22) (thus we also have the same ACF), while the conditional variance is now a quadratic function of X_{t−1} (Weiß & Kim, 2014):
3.29
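To see the effect of replacing binomial by beta-binomial thinnings, the binomial AR(1) sketch from above can be adapted as follows. The precision-style Beta parametrization via `theta` is our own illustrative choice (smaller `theta` meaning more extra variation), not the parametrization of Weiß & Kim (2014):

```python
import numpy as np

rng = np.random.default_rng(seed=7)

def bb_thin(x, p, theta):
    # Beta-binomial thinning: the thinning probability is itself drawn from
    # Beta(p * theta, (1 - p) * theta), which has mean p; the thinning is
    # then performed binomially with that probability.
    if x == 0:
        return 0
    return rng.binomial(x, rng.beta(p * theta, (1.0 - p) * theta))

def simulate_bb_ar1(n, N, pi, rho, theta, burn_in=500):
    # Beta-binomial AR(1) sketch (cf. Weiss & Kim, 2014): the binomial AR(1)
    # recursion with both binomial thinnings replaced by beta-binomial ones.
    beta = pi * (1.0 - rho)
    alpha = beta + rho
    x, out = rng.binomial(N, pi), []
    for _ in range(n + burn_in):
        x = bb_thin(x, alpha, theta) + bb_thin(N - x, beta, theta)
        out.append(x)
    return np.array(out[-n:])

x = simulate_bb_ar1(50_000, N=10, pi=0.3, rho=0.4, theta=5.0)
```

In a long simulated path, the sample mean stays near Nπ, but the sample variance clearly exceeds the binomial value Nπ(1−π), illustrating the extra-binomial variation.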
Extensions of the binomial AR(1) model using state-dependent parameters (also see Remark 3.1.7) have been proposed by Weiß & Pollett (2014), for example to describe binomial underdispersion, and by Möller (2016) for SET-type models.
3.4 Multivariate INARMA Models
In many applications, we do not observe a single feature over time, but instead a number of related features simultaneously, thus leading to a multivariate time series. If these multivariate observations are truly real-valued, then, for example, vector autoregressive moving-average (VARMA) models might be appropriate for describing the data; see Appendix B.4.2 for a brief summary. Here, we consider the count data case again; that is, a d-variate time series with the range being N_0^d (or a subset of it). For such multivariate count time series, a number of thinning-based models (designed as counterparts to the VARMA model) have been discussed in the literature; see Section 4 in Scotto et al. (2015) for a survey. These approaches mainly differ in the way of defining a multivariate type of thinning operation. The most widely discussed approach, which is defined in close analogy to conventional matrix multiplication, is matrix-binomial thinning, which was introduced by Franke & Subba Rao (1993).
Let X be a d-dimensional count random vector; that is, having the range N_0^d. Let A = (α_{ij}) be a d × d matrix of thinning probabilities. Then Franke & Subba Rao (1993) define the ith component of the d-dimensional count random vector A ∘ X by

(A ∘ X)_i := Σ_{j=1}^d α_{ij} ∘ X_j,    (3.31)

where the univariate binomial thinning operations (Section 2.2.1) are performed independently of each other. A generalized version of this operator, using univariate generalized thinnings (3.14) instead of binomial ones, was proposed by Latour (1997). The (i, j)th of these thinnings satisfies E[α_{ij} ∘ X_j | X_j] = α_{ij} X_j and V[α_{ij} ∘ X_j | X_j] = β_{ij} X_j (for binomial thinning, we have β_{ij} = α_{ij}(1 − α_{ij})). The operator satisfies E[A ∘ X | X] = AX; Latour (1997) also provides the corresponding conditional covariance matrix.
In analogy to Definition B.4.2.1, a multivariate extension of the INAR(1) model according to Definition 2.1.1.1 (the MINAR(1) model) is given by the recursion

X_t = A ∘ X_{t−1} + ε_t,    (3.32)

where (ε_t) is an i.i.d. d-dimensional count process with finite mean and covariance matrix; see Franke & Subba Rao (1993), Latour (1997) and Pedeli & Karlis (2013a). Latour (1997) showed that the condition

det(I − A z) ≠ 0 for all |z| ≤ 1   (Condition (B.25) in Appendix B.4.2),

where I denotes the d × d identity matrix, still guarantees the existence of a unique stationary solution. The marginal mean is given by μ = (I − A)^{−1} E[ε_t], and the autocovariance function is obtained from a slightly modified version of (B.30),
3.33
If the thinning matrix A of the MINAR(1) model (3.32) is not a diagonal matrix, then the component processes are generally not univariate INAR(1) processes. Therefore, Pedeli & Karlis (2011, 2013b) concentrate on the diagonal case of matrix-binomial thinning (thus also reducing the number of model parameters) such that the cross-correlation between the components is solely caused by the innovations. Note that (3.33) simplifies in this case, because A becomes diag(α_1, ..., α_d):
3.34
where δ_{ij} denotes the Kronecker delta. As particular instances of the MINAR(1) model with diagonal matrix-binomial thinning, they consider the cases with ε_t stemming from a multivariate Poisson or negative binomial distribution (Examples A.3.1 and A.3.2). The corresponding component processes then follow the univariate Poisson INAR(1) (Section 2.2.2) or NB-INAR(1) model (Example 2.1.3.3), respectively. Note that Pedeli & Karlis (2011, 2013b) use a slightly different parametrization for the MNB distribution than in Example A.3.2, with parameters obtained from corresponding moment relations.
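A bivariate MINAR(1) process with diagonal thinning is simple to simulate. In the following sketch (our own illustration), the cross-correlation is injected through bivariate Poisson innovations constructed by trivariate reduction, i.e. by adding a shared Poisson term to two independent ones:

```python
import numpy as np

rng = np.random.default_rng(seed=8)

def simulate_diag_minar1(n, alphas, lam, lam0, burn_in=500):
    # Bivariate MINAR(1) with diagonal matrix-binomial thinning
    # (cf. Pedeli & Karlis, 2011): each component is thinned separately,
    # and cross-correlation enters only through the innovations. The
    # bivariate Poisson innovations use trivariate reduction:
    # eps_i = U_i + W with U_i ~ Poi(lam) independent and W ~ Poi(lam0) shared.
    x, out = np.zeros(2, dtype=np.int64), []
    for _ in range(n + burn_in):
        w = rng.poisson(lam0)
        eps = rng.poisson(lam, size=2) + w
        x = np.array([rng.binomial(x[0], alphas[0]),
                      rng.binomial(x[1], alphas[1])]) + eps
        out.append(x)
    return np.array(out[-n:])

x = simulate_diag_minar1(50_000, alphas=(0.4, 0.3), lam=1.0, lam0=1.0)
```

Each component behaves like a univariate Poisson INAR(1) process with innovation mean λ + λ_0, while the shared term W induces positive cross-correlation between the two series.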
Finally, let us turn to the case of multivariate counts having a finite range; that is, a range of the form {0, ..., N}^d. At first glance, it appears to be possible to combine Definition 3.3.1 for a binomial AR(1) model with the concept of matrix-binomial thinning (3.31), and to define a model according to the recursive scheme X_t = A ∘ X_{t−1} + B ∘ (N·1 − X_{t−1}). However, if the matrices A, B are non-diagonal, it may happen that the upper limit is violated, since, for example, the first component of such a recursion might become larger than N. So we would have to restrict ourselves to diagonal thinning matrices to ensure the upper limit N. But then, due to the absence of an innovations term, the component series would be independent of each other. So, in summary, a non-trivial multivariate binomial AR(1) model using matrix-binomial thinning is not possible.
For this reason, restricting to the bivariate case, Scotto et al. (2014) introduced another type of thinning operation, based on the bivariate binomial distribution of Type II (Example A.3.5). Let (X_1, X_2) be a bivariate count random variable with range {0, ..., N}². The bivariate binomial thinning operation "∘" is defined by requiring the conditional distribution of the thinned pair, given (X_1, X_2), to be bivariate binomial of Type II,    (3.35)

where the thinning parameters have to satisfy the restrictions given in Example A.3.5. Note that, in contrast to matrix-binomial thinning, bivariate binomial thinning (3.35) generates cross-correlation (if the dependence parameter is non-zero) and preserves the upper bound at the same time. Furthermore, the marginals of the bivariate thinning just correspond to univariate binomial thinnings of X_1 and X_2, respectively.
Using this operation, Scotto et al. (2014) defined the bivariate binomial AR(1) model with upper limit N by extending Definition 3.3.1 as follows: let (π_i, ρ_i) for i = 1, 2 be separate sets of parameters according to Definition 3.3.1, and derive the thinning probabilities (α_i, β_i) accordingly. Choose the dependence parameters such that the two joint thinnings in (3.36) become valid BVB-parametrizations. Then the bivariate binomial AR(1) process is defined by the recursion
3.36
where the thinnings are again performed independently of each other. By construction, the components of this process are just univariate binomial AR(1) processes with parameters (π_1, ρ_1) and (π_2, ρ_2), respectively, and these components are cross-correlated with each other (Scotto et al., 2014):
3.37
To numerically compute these and further properties of the model, it should be noted that the bivariate process constitutes a finite Markov chain (so (B.4) holds) with transition probabilities
3.38
where "bvb" abbreviates the pmf of the bivariate binomial distribution as given in Example A.3.5. Note that this process is even a primitive Markov chain, and hence φ-mixing with a uniquely determined stationary marginal distribution. Further properties and a data example are provided by Scotto et al. (2014).