Search in book...
Toggle Font Controls
Create new playlist

Name your new playlist

Playlist description (optional)
Sign In

Email address

Password

Forgot Password?

or

Continue with Facebook

Continue with Google
Sign Up

Full Name

Email address

Confirm Email Address

Password

or

Continue with Facebook

Continue with Google

Appendix A

Examples of Count Distributions

This appendix provides a brief survey of common models for count data and their stochastic properties. In particular, we discuss dispersion properties and types of generating functions, the latter being useful tools when analyzing count models. The presented models are divided into univariate and multivariate models, and into models for an infinite range like $c0A-math-001$ or a finite range like $c0A-math-002$ .

A.1 Count Models for an Infinite Range

The most famous model for a count random variable having the range $c0A-math-003$ (full set of non-negative integers) is the Poisson distribution.

The equidispersion property of the Poisson distribution can be stated equivalently in terms of the index of dispersion $c0A-math-012$ according to (2.1), by saying that the Poisson distribution always satisfies $c0A-math-013$ (a related approach focussing on the probability for observing a zero, based on the zero index (2.2), is considered below). Values for $c0A-math-014$ deviating from 1, in turn, express a violation of the Poisson model.

Example A.1.2 (Compound Poisson distribution)

We adapt the notation and definitions from Chapter XII in Feller (1968). Let $c0A-math-015$ be independent and identically distributed (i.i.d.) random variables with the range being contained in $c0A-math-016$ ; let $c0A-math-017$ denote the upper limit of the range (we allow the case $c0A-math-018$ ). Denote the pgf of the $c0A-math-019$ (compounding distribution) by $c0A-math-020$ ; in the case of a finite range with upper limit $c0A-math-021$ and $c0A-math-022$ , $c0A-math-023$ becomes the polynomial $c0A-math-024$ .

Let $c0A-math-025$ be Poisson distributed, independently of $c0A-math-026$ Then $c0A-math-027$ is said to be compound Poisson distributed; that is, $c0A-math-028$ . The generating functions of $c0A-math-029$ are given by

while the pmf can be computed recursively as (Kemp, 1967, “Panjer recursion”)

If the $c0A-math-030$ th moment $c0A-math-031$ of the $c0A-math-032$ exists, then the $c0A-math-033$ th cumulant of $c0A-math-034$ is given by

This implies that the variance-mean ratio $c0A-math-035$ of $c0A-math-036$ equals $c0A-math-037$ ; that is, the $c0A-math-038$ -distribution is equidispersed iff $c0A-math-039$ (note that the $c0A-math-040$ -distribution coincides with the $c0A-math-041$ -distribution). Otherwise, we have $c0A-math-042$ ; that is, any $c0A-math-043$ -distribution with $c0A-math-044$ is overdispersed.

The compound Poisson distributions are uniquely characterized by the property of being infinitely divisible (Feller, 1968); that is, their pgf satisfies that $c0A-math-045$ is itself a pgf for all $c0A-math-046$ . The CP distribution is also additive; see Lemma 2.1.3.2 for more details.

Note that in the literature, there are several other names for the compound Poisson distribution: for example, it is sometimes referred to as the Poisson-stopped sum distribution because the summation $c0A-math-047$ is stopped by the Poisson random variable $c0A-math-048$ .

The family of $c0A-math-049$ -distributions includes a number of important special cases. Besides $c0A-math-050$ , the distributions of Examples A.1.3–A.1.6 also belong to the CP-family.

The negative binomial distribution belongs to the $c0A-math-061$ -family.

Example A.1.5 (Geometric distribution)

As mentioned in Example A.1.4, the special case $c0A-math-078$ of a negative binomial distribution with $c0A-math-079$ is commonly referred to as a geometric distribution, abbreviated as $c0A-math-080$ . It can be interpreted as the number of failures until observing the first success in a sequence of i.i.d. replications of a Bernoulli experiment (see also Example A.2.1), that is, an experiment with only two outcomes (“failure” or “success”) and with success probability $c0A-math-081$ . Properties of the geometric distribution follow immediately from Example A.1.4.

In some applications, not the number of failures but the number of trials (waiting time) until the first success is relevant, which corresponds to shifting $c0A-math-082$ to $c0A-math-083$ . We refer to $c0A-math-084$ as following a shifted geometric distribution. Among others, we have

All distributions in Examples A.1.2–A.1.6 are overdispersed. The opposite phenomenon, $c0A-math-094$ – that is, a variance smaller than the mean – is referred to as underdispersion. The $c0A-math-095$ -distribution of Example A.1.6 might be extended to also cover underdispersion, by allowing $c0A-math-096$ to be negative. But then it is necessary to truncate the range of the GP-distribution, where the degree of truncation depends on the actual values of the model parameters. The truncation of the range also causes the problem that the probabilities according to the above pmf no longer sum to 1, and otherwise exact properties become only approximate (Johnson et al., 2005, p. 336). An analogous criticism – that is, that essential properties of the distribution are known only approximately – also applies to the families of Efron's double Poisson (DP) distributions (Johnson et al., 2005, Section 11.1.8) and of Conway–Maxwell (COM) Poisson distributions (Shmueli et al., 2005).

Therefore, in Weiß (2013a), two other distribution families are recommended for underdispersed counts: the Good distribution and the power-law weighted Poisson (PL) distribution.

Another characteristic property of the Poisson distribution (Example A.1.1) is the probability of observing a zero: the zero index $c0A-math-114$ according to (2.2) equals 0 for the Poisson distribution, but may differ otherwise. For a compound Poisson distribution $c0A-math-115$ , for instance, we obtain (Example A.1.2)

A.1

which equals 0 iff $c0A-math-117$ , and which satisfies $c0A-math-118$ otherwise (zero inflation; note that $c0A-math-119$ is the mean of the compounding distribution). A more flexible approach towards zero modification is summarized in the following example.

If the parent distribution in Example A.1.9 is the Poisson distribution $c0A-math-127$ , then the distribution obtained for $c0A-math-128$ is said to be the zero-inflated Poisson distribution, $c0A-math-129$ .

We conclude this section with a brief look at a distribution for a slightly different type of infinite and integer-valued range: the full range of integers $c0A-math-130$ .

A.2 Count Models for a Finite Range

The distributions surveyed up until now all have unlimited ranges. In some applications, however, it is known in advance that an upper bound $c0A-math-145$ exists; this can never be exceeded. Then the distributions discussed in the sequel become relevant. Thse have a finite range of the form $c0A-math-146$ .

Example A.2.1 (Binomial distribution)

A random variable $c0A-math-147$ with range $c0A-math-148$ for an $c0A-math-149$ is binomially distributed according to $c0A-math-150$ with $c0A-math-151$ if its distribution is given by

The case $c0A-math-152$ is referred to as the Bernoulli distribution (see also Example A.1.5). The mean and variance of the $c0A-math-153$ -distribution are given by $c0A-math-154$ and $c0A-math-155$ ; that is, we have $c0A-math-156$ . The pgf equals

so factorial moments are computed as

The binomial distribution $c0A-math-157$ can be regarded as the distribution of the sum of $c0A-math-158$ independent Bernoulli trials with success probability $c0A-math-159$ ; that is, of the number of successes among $c0A-math-160$ replications of the Bernoulli experiment. This implies the additivity of the binomial distribution: if $c0A-math-161$ , $c0A-math-162$ and both are independent, then $c0A-math-163$ . Note that a similar property holds for the Poisson distribution (Example A.1.1). In fact, both distributions are related to each other by the Poisson limit theorem: for $c0A-math-164$ but keeping $c0A-math-165$ fixed, the binomial distribution $c0A-math-166$ approaches $c0A-math-167$ . These and further properties are provided in Johnson et al. (2005, Chapter 3).

We have seen that in a Poisson sense, the binomial distribution is underdispersed. However, since we are concerned with a different type of random phenomenon anyway – one with a finite range – it is more appropriate to evaluate the dispersion behavior in terms of the binomial index of dispersion $c0A-math-168$ according to (2.3), with the binomial distribution satisfying $c0A-math-169$ .

A distribution with $c0A-math-170$ shows “overdispersion with respect to a binomial model”, a phenomenon that we refer to as extra-binomial variation.

Another distribution with extra-binomial variation is the Markov binomial distribution, which is briefly discussed in the context of Equations 9.1 and 9.2 .

The probability of observing a zero equals $c0A-math-185$ in the binomial case. Increasing this probability according to the approach in Example A.1.9, with $c0A-math-186$ , leads to the zero-inflated binomial distribution, $c0A-math-187$ . However, the $c0A-math-188$ distribution according to Example A.2.2 is also easily seen to be zero-inflated with respect to a binomial distribution, since the zero probability is obtained as

A.3 Multivariate Count Models

Although this book is primarily concerned with univariate count data, a few models for multivariate count data are sketched here. Much more information can be found in the book by Johnson et al. (1997). For the particular case of bivariate distributions, the book by Kocherlakota & Kocherlakota (1992) is also recommended.

Example A.3.1 (Multivariate Poisson distribution)

The simplest way of defining a multivariate version of the Poisson distribution of Example A.1.1 is by assuming $c0A-math-189$ independent random variables $c0A-math-190$ with $c0A-math-191$ for $c0A-math-192$ . Then

A.2

is said to be multivariately Poisson distributed according to $c0A-math-194$ . Due to the additivity of the Poisson distribution, each component is itself univariately Poisson distributed, namely $c0A-math-195$ , but the common summand $c0A-math-196$ causes the components to be cross-correlated:

Because of the mutual independence of $c0A-math-197$ , we have $c0A-math-198$ , so the joint pgf is given by

with $c0A-math-199$ . Sometimes, one more generally defines a random variable $c0A-math-200$ with range $c0A-math-201$ to be multivariately Poisson distributed if its pgf is of the form

where $c0A-math-202$ . In the special case $c0A-math-203$ – that is, in the case of the bivariate Poisson distribution – the two approaches are equivalent. The corresponding pmf is given by

For the general $c0A-math-204$ -variate case as well as for recursive schemes for pmf computation, see Tsiamyrtzis & Karlis (2004). These and further properties are provided in Johnson et al. (1997, Chapter 37)

The approach of Example A.3.1 for constructing a $c0A-math-205$ -variate distribution could be applied to other univariate additive count models as well.

Extending the binomial distribution (Example A.2.1), with its finite range, is more demanding. The most famous multivariate binomial distribution is the multinomial distribution.

Example A.3.3 (Multinomial distribution)

Instead of summing $c0A-math-214$ independent Bernoulli trials, the multinomial distribution $c0A-math-215$ , with $c0A-math-216$ and probabilities $c0A-math-217$ such that $c0A-math-218$ , is defined by summing $c0A-math-219$ independent copies of a binary random vector $c0A-math-220$ , where exactly one of the components takes the value 1 and all others are equal to 0. So the possible range of $c0A-math-221$ consists of the unit vectors $c0A-math-222$ , where $c0A-math-223$ is defined by $c0A-math-224$ (Kronecker delta; $c0A-math-225$ has a one in its $c0A-math-226$ th component) for $c0A-math-227$ . The probabilities $c0A-math-228$ are assumed.

As a result, $c0A-math-229$ has the range $c0A-math-230$ , and its pmf is given by

The covariance matrix equals

Each component $c0A-math-231$ of $c0A-math-232$ is binomially distributed according to $c0A-math-233$ . These and further properties are provided in Johnson et al. (1997, Chapter 35).

Note that it sometimes simplifies the notation if one omits one of the $c0A-math-234$ components; if we write $c0A-math-235$ , where $c0A-math-236$ and $c0A-math-237$ , then we assume the extended vector $c0A-math-238$ to be distributed according to $c0A-math-239$ .

The use of the multinomial distribution as a multivariate extension of the binomial distribution is limited by the fact that each binomial component $c0A-math-240$ has the same population size $c0A-math-241$ , and that the sum of the components has to be equal to this value $c0A-math-242$ . The importance of the multinomial distribution is in the fact that the binary random vectors $c0A-math-243$ can be understood as a binarization of a categorical random variable $c0A-math-244$ with range $c0A-math-245$ by defining $c0A-math-246$ if $c0A-math-247$ . Then $c0A-math-248$ represents the realized absolute frequencies of $c0A-math-249$ independent replications of $c0A-math-250$ , and $c0A-math-251$ gives the corresponding relative frequencies (proportions).

Remark A.3.4 (Compositional data)

If the number $c0A-math-252$ of replications becomes infinitely large, $c0A-math-253$ , then the vector of random proportions becomes a continuous random vector with the $c0A-math-254$ -part unit simplex as its range,

The corresponding data, which express the “proportions of some whole” (Aitchison, 1986, p. 1), are referred to as compositional data (CoDa); books on this topic are the ones by Aitchison (1986) and Pawlowsky-Glahn & Buccianti (2011). We shall not further discuss CoDa in this book since we are concentrating on the discrete-valued case, but the relation between the multinomial setup and CoDa could be useful, for example for approximating the multinomial distribution for large $c0A-math-255$ by an appropriate CoDa model, or for generalizing the multinomial distribution by analogy to the beta-binomial approach from Example A.2.2: the vector $c0A-math-256$ of multinomial probabilities also belongs to the simplex $c0A-math-257$ and might be assumed to be random itself. If, for instance, the most basic CoDa distribution, the Dirichlet distribution, is used for this purpose (see Chapter 49 in Kotz et al. (2000) and Chapter 2 in Ng et al. (2011) for a comprehensive survey), then one obtains the Dirichlet-multinomial distribution as a counterpart to the beta-binomial distribution; see Section 35.13 in Johnson et al. (1997) for further details.

Another type of generalized multinomial distribution is the Markov multinomial distribution, which is briefly discussed in Example 9.1.2.3 .

Example A.3.5 (Bivariate binomial distributions)

A more flexible generalization, but restricted to the bivariate case, is the bivariate binomial distribution of type II (Kocherlakota & Kocherlakota, 1992). It starts by defining the bivariate Bernoulli distribution with parameters $c0A-math-258$ and $c0A-math-259$ through the bivariate random vectors $c0A-math-260$ with range $c0A-math-261$ , the joint probabilities $c0A-math-262$ of which have to satisfy

Instead of $c0A-math-263$ , one may consider

as a third parameter besides $c0A-math-264$ , which has to satisfy the restriction

If $c0A-math-265$ are $c0A-math-266$ i.i.d. bivariately Bernoulli-distributed random variables, then the sample sum $c0A-math-267$ is said to follow a bivariate binomial distribution of Type I, abbreviated as $c0A-math-268$ . As a result, the components $c0A-math-269$ are univariately binomially distributed according to $c0A-math-270$ , but still have the unique sample size $c0A-math-271$ for each component. To overcome this limitation and to allow for varying sample sizes $c0A-math-272$ with $c0A-math-273$ , one defines two further random variables $c0A-math-274$ such that $c0A-math-275$ are independent, and where $c0A-math-276$ and $c0A-math-277$ . Then $c0A-math-278$ is said to follow a bivariate binomial distribution of Type II, abbreviated to $c0A-math-279$ . Its pmf is given by

Because of the additivity of the binomial distribution, the $c0A-math-280$ th component satisfies $c0A-math-281$ for $c0A-math-282$ , and the cross-correlation between $c0A-math-283$ and $c0A-math-284$ is given by $c0A-math-285$ , which might take positive or negative values.

More details on these and other multivariate discrete distributions are provided by Johnson et al. (1997) and Kocherlakota & Kocherlakota (1992).

..................Content has been hidden....................

You can't read the all page of ebook, please click here login for view all page.

Table of Contents for Appendix A: Examples of Count Distributions

Create new playlist

Sign In

Sign Up

Examples of Count Distributions

A.1 Count Models for an Infinite Range

A.2 Count Models for a Finite Range

A.3 Multivariate Count Models

Table of Contents for
Appendix A: Examples of Count Distributions