Search in book...
Toggle Font Controls
Create new playlist

Name your new playlist

Playlist description (optional)
Sign In

Email address

Password

Forgot Password?

or

Continue with Facebook

Continue with Google
Sign Up

Full Name

Email address

Confirm Email Address

Password

or

Continue with Facebook

Continue with Google

Chapter 3

Discrete Lifetime Models

Abstract

This chapter is concerned with the current and prospective tools that enable the representation of lifetime data through some model. A major component of such a formulation is to identify the probability distribution of the underlying lifetime. We discuss some possible lifetime distributions and their properties in this chapter. Often, a family of distributions is initially chosen and then a member that adequately describes the features of the data is assumed to be the model. We begin with the Ord family comprising of the binomial, Poisson, negative binomial, hypergeometric, negative hypergeometric and beta-Pascal distributions. The power series family and its subfamily, the Lerch distributions, are considered next. Among the distributions discussed in this class are the Hurwitz zeta, Zipf, Zipf-Manelbrot, Good, geometric, uniform, discrete Pareto, Estoup, Lotka, logarithmic and the zeta models. This is followed by considering the Abel series family consisting of generalized Poisson, quasi-binomial I, quasi-negative binomial, quasi-binomial II, and quasi-logarithmic series distributions. Further, the Lagrangian family, with special reference to the Geeta and generalized geometric models, are also discussed. The reliability characteristics of each of the above families such as hazard rate, mean residual life, characterizations based on relationships between reliability functions, recurrence relations, etc. are discussed. Of special interest in reliability modelling in discrete time is the development of discretized versions of acclaimed continuous life distributions. In this category, we review the works on discrete Weibull I, half-logistic, geometric, inverse-Weibull, generalized exponential, gamma and Lindley models and their reliability aspects. We conclude the discussions by presenting other models that do not belong to the above classifications like the discrete Weibull II and III and the S-distributions.

Keywords

Ord family; Power series distributions; Abel series; Discrete versions of continuous models; Discrete Weibull reliability properties

3.1 Introduction

The main emphasis of this chapter is to present a discussion of the current and prospective tools that enable the representation of lifetime data through some model. A major component in such a formulation is the distribution of the underlying lifetime. Probability distributions facilitate characterization of the uncertainty prevailing in the data set by identification of the patterns of variation. By summarizing the observations into a mathematical form that contains a few unknown parameters, distributions provide a means to the best possible understanding to the basic data generating mechanism. Lifetime distributions, being a probabilistic description of the behaviour of the length of life, is to a marked extent depends on the mode of failure of the device under study. Choice of an appropriate distribution for the data set depends largely on our knowledge about the physical characteristics of the process that give rise to the observations. In some cases, the relationship between the failure mechanism and various concepts encountered in Chapter 2 can be forged. However, many situations, unlike the case of the normal distribution where the central limit theorem affords a bridge between theory and practice, the discrete distributions we confront with are difficult to justify on practical grounds. The real position is that within the range of observations we have in hand, more than one distribution may pass well established goodness-of-fit tests in many cases. Characterization properties, being unique to a specified distribution, are the only exact tools that can identify the correct model. When the sample observations provide evidence of characteristic properties exhibited by reliability functions such as hazard rate, mean residual life, etc., pertaining to a particular model with high accuracy, there is reasonable justification to choose that model. If such a model passes the goodness-of-fit test also, then we have a match between observations and the chosen distribution both in terms of comparable physical properties and statistical validity. In Chapter 2, several examples of this nature were illustrated for modelling real data with some specified distributions. The subsequent sections of this chapter will to present various discrete distributions along with their properties that can be used for modelling and analysis of lifetime data. For a detailed discussion of various aspects of reliability modelling, we refer to Blischke and Murthy (2000).

3.2 Families of Distributions

In the presence of lack of understanding of the data generating mechanism due to insufficient details about the physical characteristics, the investigator has to be content with finding the best approximating distribution from a collection of candidate models. Based on a preliminary assessment of the features of the observations at disposal, mathematical formulation of a suitable model is initially made. A family of distributions that contains enough members with different shapes and varying characteristics can be useful aids in the process of empirical modelling described above.

3.2.1 Ord Family

The family of distributions has its origin in 1895 when Pearson introduced a system bearing his name in the continuous case. A discrete version of the Pearson system, defined by the difference equation

$f (x) - f (x - 1) = \frac{(a - x) f (x - 1)}{b_{0} + b_{1} x + b_{2} x (x - 1)},$

(3.1)

was studied by Ord (1967a, 1967b), which is referred to as the Ord family in the sequel, where x takes values in a subset T of integers.

The members of the system with non-negative integers as the values of X are:

(i) binomial

$f (x) = (\begin{matrix} n \\ x \end{matrix}) p^{x} {(1 - p)}^{n - x}, x = 0, 1, 2, \dots, n; 0 < p < 1, q = 1 - p;$

(3.2)

(ii) negative binomial

$f (x) = (\begin{matrix} k + x - 1 \\ x \end{matrix}) p^{k} {(1 - p)}^{x}, x = 0, 1, 2, \dots;$

(3.3)

(iii) Poisson

$f (x) = e^{- λ} \frac{λ^{x}}{x!}, x = 0, 1, 2, \dots, λ > 0;$

(3.4)

(iv) hypergeometric

$f (x) = (\begin{matrix} M \\ x \end{matrix}) (\begin{matrix} N - M \\ n - x \end{matrix}) / (\begin{matrix} N \\ n \end{matrix}), x = 0, 1, 2, \dots, n;$

(3.5)

(v) negative hypergeometric (beta-binomial)

$f (x) = (\begin{matrix} k + x - 1 \\ x \end{matrix}) (\begin{matrix} N - k - x \\ M - x \end{matrix}) / (\begin{matrix} N \\ M \end{matrix}), x = 0, 1, \dots, M;$

(3.6)

(vi) beta-Pascal

$f (x) = \frac{A}{k + A} (\begin{matrix} k + x - 1 \\ x \end{matrix}) (\begin{matrix} A + B - 1 \\ A \end{matrix}) / (\begin{matrix} k + A + B + x - 1 \\ k + A \end{matrix}),$

(3.7)

$x = 0, 1, 2, \dots$ ; $A = M - 1$ ; $B = N - M$ .

The first four moments of the family are given by

$\begin{matrix} μ_{1}^{'} = (b_{1} + a - 1) / (1 - 2 b_{2}), \\ μ_{2} = (b_{1} + a - 1) {a b_{2} + (1 - b_{2}) (b_{1} - 2 b_{2})} / {(1 - 2 b_{2})}^{2} (1 - 3 b_{2}) \\ μ_{3} = μ_{2} {4 b_{2} (a + 1 - b_{1} - b_{2}) - 4 b_{1} - 3} / (1 - 2 b_{2}) (1 - 4 b_{2}), \end{matrix}$

and

$\begin{matrix} μ_{4} = 3 μ_{2}^{2} + \frac{μ_{2}}{(1 - 4 b_{2}) (1 - 5 b_{2})} [{(1 - b_{2}) (1 - 2 b_{2})} \\ + 6 {(b_{1}^{2} - b_{2}^{2}) + (b_{2} - b_{1}) - 2 b_{2} (a + b_{1} - 1)} \\ + b_{2} μ_{2} {1 - 2 b_{2} {(1 - 3 b_{2})}^{- 1}}] . \end{matrix}$

These will be required for analysing the descriptive characteristics of the distribution, estimating parameters by the method of moments, and in computing standard errors of statistics. All distributions of the system are either J shaped or unimodal. A crucial issue of interest when families are considered for model selection, is the criteria to distinguish its members. Ord (1972) considered the quantities $I = \frac{μ_{2}}{μ}$ , which is the variance mean ratio called index of dispersion, and $S = \frac{μ_{3}}{μ_{2}}$ , and proposed a diagram of $(S, I)$ . In this diagram, the line $S = 2 I - 1$ defines the binomial or Poisson or negative binomial distributions as $I <, =, > 1$ . The other distributions are obtained by analyzing $T = S - 2 I + 1$ . The region $S < 2 I - 1$ gives the beta-binomial, $S > 2 I - 1$ corresponds to the hypergeometric or beta-Pascal according to whether $S < 1$ or >1 in addition. Since the parameters of the family are completely determined by the first three moments, the $(S, I)$ diagram can be used to locate a suitable distribution for a given data, by replacing the population moments with the corresponding sample moments. Ord (1972) also proposed the plot of $u_{x} = \frac{x p_{x}}{p_{x - 1}}$ against x, where $p_{x}$ is the sample frequency. Since successive $u_{x}$ values are dependent, a smoothing $v_{x} = \frac{1}{2} (u_{x} + u_{x + 1})$ is has also been suggested. For example, the binomial distribution satisfies

$u_{x} = \frac{(n + 1) p}{q} - \frac{p}{q} x$

in the population, so that the plot of $(x, \frac{x p_{x}}{p_{x - 1}})$ must show an approximate line. An interesting property of the Ord family is that the truncated version of X at either end also belongs to this family.

Various reliability functions, like the hazard rate, for the Ord family can not be expressed in simple forms. However, the members of the family admit a simple relationship between the hazard rate and the mean residual life. Ahmed (1991) has shown that X has binomial distribution in (3.2) if and only if

$m^{⁎} (x) = n p + q x h (x) .$

(3.8)

Since the mean residual life $m (x) = m^{⁎} (x) - x$ , the relationship between $m (x)$ and $h (x)$ characterizing the binomial distribution is obvious. On similar lines, Osaki and Li (1988) have shown that X follows negative binomial distribution

$f (x) = (\begin{matrix} n - 1 \\ x - 1 \end{matrix}) p^{x} q^{n - x}, n ⩾ x, x = 1, 2, \dots,$

if and only if

$E (X | X > m) = μ + (m + 1 - x) h (m + 1) / p$

for all integers $m ⩾ x - 1$ . They have also shown that X has Poisson distribution in (3.4) if and only if

$m^{⁎} (x) = μ + \frac{(1 + x)}{λ} h (x + 1) .$

A more general result for the Ord family in (3.1) rewritten as

$f (x + 1) - f (x) = - \frac{(x + d) f (x)}{b_{0} + b_{1} x + b_{2} x^{2}},$

was characterized by Nair and Sankaran (1991) through the identity

$m^{⁎} (x) = μ + (a_{0} + a_{1} x + a_{2} x^{2}) h (x + 1),$

(3.9)

where

$b_{i} = \frac{a_{i}}{2 a_{2} + 1} and d = \frac{a_{1} - a_{2} - μ}{2 a_{2} + 1}, i = 0, 1, 2 .$

The values of $a_{0}, a_{1}$ and $a_{2}$ for the distributions in (3.2)–(3.7) are listed in Table 3.1.

Table 3.1

Values of a₀,a₁ and a₂ in (3.9) for the Ord family

Distribution	$a_{0}$	$a_{1}$	$a_{2}$
binomial	q	q	0
Poisson	1	1	0
negative binomial	$\frac{1}{p}$	$\frac{1}{p}$	0
hypergeometric	$\frac{1}{2 + 2 n + 2 M - N}$	$\frac{N + 2 - M}{2 + 2 n + 2 M - 1}$	$\frac{N - M - n}{2 + 2 n + 2 M - 1}$
negative hypergeometric	$\frac{N - k}{N + 1 - M}$	$\frac{N - k - 1}{N + 1 - M}$	$\frac{1}{M - N - 1}$
beta-Pascal	$\frac{- B}{A + 2 K + 1}$	$\frac{- 1 + B}{A + 2 K + 1}$	$\frac{- 1}{A + 2 K + 1}$

Figs 3.1A, 3.1B and 3.1C show the hazard rates of negative binomial, hypergeometric and negative hypergeometric distributions for various values of the parameters.

Figure 3.1 Hazard rate functions for (A) negative binomial distribution, (B) hypergeometric distribution, and (C) negative hypergeometric distribution.

Various distributional properties of the individual models along with results on inference and applications have been discussed in Johnson et al. (1992). Further, characterization problems of the family in terms of truncated moments have been addressed by Glanzel et al. (1984) and Glanzel (1987, 1991). Of particular interest is the result

$E (X^{2} | X ⩾ x) = P (x) E (X | x ⩾ x) + Q (x)$

(3.10)

in Glanzel (1991), where $P (x)$ and $Q (x)$ are polynomials of degree at most one, with real coefficients. Obviously, (3.10) leads to a property of the variance residual life in relation to the mean residual life.

There are two related families in connection with (3.1) that deserves special mention. One is the Katz (1965) system defined by

$\frac{f (x + 1)}{f (x)} = \frac{α + β x}{1 + x}, x = 0, 1, 2 \dots, α > 0, β < 1 .$

(3.11)

It is easy to see that (3.11) is a special case of (3.1) when $b_{0} = b_{1}$ , $b_{2} = 0$ . This restricted model contains the binomial ( $β < 0$ ), Poisson ( $β = 0$ ) and negative binomial ( $0 < β < 1$ ) distribution as particular members. These members can be discriminated by performing a test of hypothesis $H_{0} : μ_{2} = μ$ against $H_{1} : μ_{2} > μ$ ( $μ_{2} < μ$ ), with $H_{0}$ if accepted gives Poisson, otherwise $H_{1} : μ_{2} > μ$ ( $μ_{2} < μ$ ) gives negative binomial (binomial). The test is based on the statistic

$Z = \frac{S^{2} - \bar{x}}{\bar{x}}$

which is closely approximated by a normal distribution $N (\frac{β}{1 - β}, \frac{2}{n})$ , where n is the sample size, and $S^{2}$ and $\bar{x}$ are the variance and mean of the sample. Following (3.9), for the Katz family, we see that

$m^{⁎} (x) = μ {(1 - β)}^{- 1} (1 + x) h (x + 1)$

is satisfied for all x, and conversely. Sudheesh and Nair (2010) established the following characteristic properties for the Ord and Katz families, respectively:

$σ^{2} (x) = E (c_{0} + c_{1} X + c_{2} X^{2} | X > x) + (μ - m (x)) (m (x) - x - 1),$

(3.12)

where

$c_{0} = μ + \frac{b_{0} - b_{1} + b_{2}}{1 - 2 b_{2}}, c_{1} = \frac{b_{1} - 1}{1 - 2 b_{2}}, c_{2} = \frac{b_{2}}{1 - 2 b_{2}},$

and

$σ^{2} (x) = \frac{α + β m (x) + (μ - m (x)) (m (x) - x - 1)}{(1 - β)} .$

(3.13)

For example, in the Poisson case, we have

$σ^{2} (x) = λ + (λ - m (x)) (m (x) - x - 1),$

or alternatively in terms of the variance and the hazard rate as

$σ^{2} (x) = λ + (x + 1) (x + 1 - λ) h (x + 1) - {(x + 1)}^{2} h^{2} (x + 1) .$

Let τ be the class of real-valued functions $C (X)$ of X. Then, for the Ord family, we have

$\inf_{C (x) \in τ} \frac{V (C (X))}{E^{2} [(d_{0} + d_{1} X + d_{2} X^{2}) Δ C (X)]} = 1,$

where $d_{i} = c_{i} σ^{- 1}$ and Δ is the forward difference operator, $i = 0, 1, 2$ . In particular, for the Katz family, we have

$\inf_{C (X) \in τ} \frac{σ^{2} (1 - β) V [C (X)]}{E^{2} ((α + β X) Δ C (X))} = 1$

provides the lower bound to the variance of a random function $C (X)$ . Under some regularity conditions, Nair and Sudheesh (2008) showed that these bounds reduce to the Cramer-Rao and Chapman-Robbins inequalities. The moments of the Katz family are

$\begin{matrix} μ = \frac{α}{1 - β}, μ_{2} = \frac{α}{{(1 - β)}^{2}}, \\ μ_{3} = μ_{2} (1 + β) and μ_{4} = 3 μ_{2}^{2} + μ_{2} \frac{{(β + 2)}^{2}}{{(1 - β)}^{2}} . \end{matrix}$

It has a probability generating function

$P (t) = {(\frac{1 - β t}{1 - β})}^{- \frac{α}{β}} .$

An extension of the Ord family has been provided by Sindhu (2002) wherein the difference equation in (3.1) has the modified form

$\frac{f (x + 1) - f (x)}{f (x)} = \frac{d_{0} + d_{1} x + d_{2} x^{2}}{k_{0} + k_{1} x + k_{2} x^{2}}$

(3.14)

with $d_{0}, d_{1}, d_{2}, k_{0}, k_{1}$ and $k_{2}$ as real constants. Besides containing all members of the Ord family, the system in (3.14) contains several other distributions such as the confluent hypergeometric distribution of Bhattacharya (1966), Borel-Tanner model (Tanner, 1953), and the Haight distribution (Haight, 1961).

Theorem 3.1

A necessary and sufficient condition for the distribution of X to belong to the family in (3.14) is that

$\begin{matrix} d_{2} E (X^{2} | X ⩾ x) + (d_{1} + 2 k_{2}) E (X | X ⩾ x) + d_{0} + k_{1} - k_{2} \\ + (k_{0} + k_{1} x + k_{2} x^{2}) h (x + 1) = 0 . \end{matrix}$

Remark 3.1

An extension of Glanzel's result in (3.10) is also possible from the above theorem. The relevant result is that X follows (3.14) if an only if

$d_{2} E (X^{3} | X ⩾ x) = A (x) E (X^{2} | X ⩾ x) + B (x) E (X | X ⩾ x) + C (x),$

where $A (x) = d_{2} x + q$ with q as a real constant and $B (x)$ and $C (x)$ are polynomials of degree at most one with real coefficients.

The approach for relationships between reliability functions in reversed time is the same and very similar results emerge. Corresponding to (3.9), the identity between $r (x)$ and $λ (x)$ for the Ord family is

$r (x + 1) = μ - (a_{0} + a_{1} x + a_{2} x^{2}) λ (x)$

with $a_{0} = μ + \frac{b_{0} - b_{1} + b_{2}}{1 - 2 a_{2}}$ , $a_{1} = \frac{b_{1} - 1}{1 - 2 b_{2}}$ and $a_{2} = \frac{b_{2}}{1 - 2 b_{2}}$ .

Gupta et al. (1997) have given a formula for computing the hazard rate as

$\frac{1}{h (x)} = \sum_{i = 0}^{\infty} \prod_{u = x}^{x + i} \frac{f (u + 1)}{f (u)}$

(3.15)

when the ratio of two successive probabilities are known. By utilizing of this, for the Katz family, we have

$h (x) = {[1 + \frac{x! (k - 1)!}{(k + x - 1)!} {(\begin{matrix} k + x \\ 1 + x \end{matrix}) β + \dots + (\begin{matrix} k + x + j - 1 \\ x + j \end{matrix}) β^{j} + \dots}]}^{- 1},$

where $k = \frac{α}{β}$ . An alternative expression is

$h (x) = {[{}_{2}F_{1} (1, \frac{α}{β} + x; 1 + x; β)]}^{- 1},$

(3.16)

where ${}_{2}F_{1}$ is the hypergeometric function defined as

${}_{2}F_{1} (a, b, c; x) = 1 + \frac{a b}{c} x + \frac{a (a + 1) b (b + 1)}{c (c + 1) 2!} x^{2} + \dots,$

(3.17)

with $c \neq 0, - 1, - 2, \dots$ .

For an elaborate study on the orthogonal polynomials in the cumulative Ord family and their application to variance bounds, one may refer to the recent work of Afendras et al. (2017).

3.2.2 Power Series Family

The power series family of distributions were initially studied by Kosambi (1949) and Noack (1950). Any distribution that can be represented as

$f (x) = \frac{a (x) θ^{x}}{A (θ)}, x = 0, 1, \dots, θ > 0,$

(3.18)

where $a (x) > 0$ and $A (θ) = \sum a (x) θ^{x}$ , is said to be of power series form. In (3.18), $A (θ)$ is called the series function. Sometimes, (3.18) is also referred to as the discrete linear exponential family in view of the specification of (3.18) as

$f (x) = a (x) b (α) e^{- α x} .$

Initially, Noack (1950) considered $f (x)$ with support as the whole set of non-negative integers. Patil (1962) extended the support to any arbitrary non-empty subset T of non-negative integers such that

$A (θ) = \sum_{x \in T} a (x) θ^{x},$

(3.19)

with $a (x) > 0$ , $θ ⩾ 0$ , so that $A (θ)$ is positive, finite and differentiable. Then, (3.18) for $x \in T$ is called a generalized power series distribution (GPSD). The system in (3.19) includes the binomial, negative binomial, Poisson and logarithmic series distributions. A truncated GPSD is also a GPSD.

A further extension of the GPSD has been introduced by Gupta (1974) who defined the modified power series distributions (MPSD), specified by the probability mass function of the form

$f (x) = \frac{a (x) {(g (θ))}^{x}}{B (θ)}, x \in T,$

(3.20)

where $a (x) > 0$ , and $g (θ)$ and $B (θ)$ are positive finite and differentiable. When $g (θ)$ is invertible, MPSD reduces to the GPSD. Apart from all distributions belonging to the GPSD family, the modified version contains more distributions, such as the generalized binomial distribution of Jain and Consul (1971). Being the more general form, the properties of (3.20) will be mentioned so that the results for GPSD can be deduced from them. The moments of (3.20) are

$\begin{matrix} μ & = E (X) = \frac{B^{'} (θ) g (θ)}{B (θ) g^{'} (θ)} = \frac{g (θ)}{g^{'} (θ)} \frac{d}{d θ} [\log B (θ)], \\ μ_{2} & = \frac{g (θ)}{g^{'} (θ)} \frac{d μ}{d θ}, \end{matrix}$

and

$μ_{r + 1} = \frac{g (θ)}{g^{'} (θ)} \frac{d μ_{r}}{d θ} + r μ_{2} μ_{r - 1}, r = 1, 2, \dots$

The factorial moments can be calculated from the recurrence relation

$μ_{(r + 1)} = \frac{g (θ)}{g^{'} (θ)} \frac{d μ_{(r)}}{d θ} + μ_{(r)} μ - r μ_{(r)} .$

Consul (1995) has developed characterizations of the exponential family of distributions

$f (x) = \exp [x p (α) + q (α) + s (x)]$

(3.21)

which contains numerous discrete probability mass functions and probability density functions of continuous random variables for various choices of $p (α)$ , $q (α)$ and $S (x)$ . In the discrete case, the translations $a (x) = \exp [s (x)]$ , $B (θ) = \exp [- q (α)]$ and $g (θ) = \exp [p (α)]$ reduce (3.21) to the form (3.20). His main result can be stated in our notation as

$m^{⁎} (x) = μ + \frac{g (θ)}{g^{'} (θ)} \frac{\partial}{\partial θ} (\log S (x + 1)) .$

(3.22)

As examples, the Lagrangian Poisson distribution

$\begin{matrix} f (x) = \frac{{(1 + α x)}^{x - 1}}{x!} θ^{x} \exp [- θ (1 + α x)], x = 0, 1, \dots, \\ θ > 0, 0 ⩽ α ⩽ θ^{- 1}, \end{matrix}$

(3.23)

satisfies the property

$m^{⁎} (x) = \frac{θ}{1 - α θ} + {(- α + \frac{1}{θ})}^{- 1} \frac{\partial}{\partial θ} \log S (x + 1);$

the Lagrangian negative binomial with probability function

$\begin{matrix} f (x) = \frac{n}{n + m x} (\begin{matrix} n + m x \\ x \end{matrix}) θ^{x} {(1 - θ)}^{n + m x - x}, x = 0, 1, 2, \dots, \\ 0 < θ < 1, n > 0, m = 0, 1 ⩽ m ⩽ θ^{- 1}, \end{matrix}$

(3.24)

is characterized by

$m^{⁎} (x) = \frac{n θ}{1 - m θ} + \frac{θ (1 - θ)}{1 - m θ} \frac{\partial}{\partial θ} \log S (x + 1), 0 < θ < 1 .$

(3.25)

Notice that (3.24) contains, as special cases, the binomial distribution ( $m = 0$ ), negative binomial ( $m = 1$ ), the Geeta distribution

$f (x) = \frac{1}{α x - 1} (\begin{matrix} α x - 1 \\ x \end{matrix}) θ^{x - 1} {(1 - θ)}^{α x - x}, x = 0, 1, 2 \dots,$

when $n = α - 1$ and x is replaced by $x - 1$ so that the identity in (3.25) is true for $x = 1, 2, \dots, n$ (binomial), for all x (negative binomial), and $x = 2, 3, \dots$ (Geeta). The identity in (3.22) can be used to obtain the lower bound for the variance as

$V (C (X)) ⩾ E^{2} [g (X) Δ C (X)],$

when $C (x)$ is any real-valued function of x (Nair and Sudheesh, 2008) and

$g (x) = \frac{g (θ)}{g^{'} (θ)} \frac{1}{σ f (x)} \frac{\partial S (x + 1)}{\partial θ} .$

3.2.3 Lerch Family

An important sub-family of the MPSD is the Hurwitz-Lerch zeta distributions (HLZD). Various properties of this class, along with their applications to reliability, have been studied by Gupta et al. (2008). The HLZD arises from the general Hurwitz-Lerch zeta function defined by

$ϕ (z, s, a) = \sum_{k = 0}^{\infty} \frac{z^{k}}{{(k + a)}^{s}}, s \in C when | z | < 1, a = C (0, - 1, - 2, \dots)$

(3.26)

and $R (s) > 1$ when $| z | = 1$ and C is the set of complex numbers. Some known special cases of the function (3.26) are the Riemann zeta function

$ζ (s) = \sum_{k = 1}^{\infty} \frac{1}{k^{s}} = \sum_{k = 0}^{\infty} \frac{1}{{(1 + k)}^{s}} = ϕ (1, s, 1),$

(3.27)

Hurwitz zeta function

$ζ (s, a) = \sum_{k = 0}^{\infty} \frac{1}{{(k + a)}^{s}} = ϕ (1, s, a),$

(3.28)

and the polylogarithmic function

$A (z, s) = \sum_{k = 1}^{\infty} \frac{z^{k}}{k^{s}} = z \sum_{k = 0}^{\infty} \frac{z^{k}}{{(1 + k)}^{s}} = z ϕ (z, s, 1) .$

(3.29)

For detailed properties of these functions, we refer to Erdelyi et al. (1953). Zornig and Altmann (1995) introduced the three parameter HLZD as a generalization of the Zipf, Zipf-Mandelbrot and Good distributions mentioned below. Subsequently, the family has found some applications as a model in ecology, linguistics, information sciences, statistical physics and survival analysis.

Aksenov and Savageau (2005) defined the Lerch family of distributions through the probability mass function

$f (x) = \frac{c z^{x}}{{(a + x)}^{s}}, x = 0, 1, 2, \dots,$

(3.30)

where $c^{- 1} = ϕ (z, s, a)$ , the Lerch transcendent (Lerch, 1887), also called the Hurwitz-Lerch zeta function defined in (3.26) for $z > 0$ , $a > 0$ . By virtue of the relationship

$ϕ (z, s, a) = z^{m} ϕ (z, s, m + a) + \sum_{k = 0}^{m - 1} \frac{z^{k}}{{(k + a)}^{s}},$

(3.31)

we can arrive at the distribution function

$F (x) = 1 - z^{x + 1} ϕ (z, s, a + x + 1) / ϕ (z, s, a) .$

The probability generating function and the moment generating function are, respectively,

$P (t) = \frac{ϕ (t z, s, a)}{ϕ (z, s, a)}, | t | ⩽ 1,$

and

$M (t) = \frac{ϕ (z e^{t}, s, a)}{ϕ (z, s, a)} .$

There is unimodality for the family if $s < 0$ and $a ⩾ 1$ with mode at

$m_{0} = 1 + [\frac{1}{z^{1 / s} - 1} - a],$

where $[W]$ is the integer part of W. Furthermore,

$μ = \frac{ϕ (z, s - 1, a)}{ϕ (z, s, a)} - a$

and

$μ_{2} = {(a + μ)}^{2} + \frac{ϕ (z, s - 2, a) - 2 (a + μ) ϕ (z, s - 1, a)}{ϕ (z, s, a)} .$

Regarding the reliability functions, we obtain directly from the definitions that

$h (x) = \frac{1}{{(a + x)}^{s} ϕ (z, s, a + x)}$

(3.32)

and

$λ (x) = \frac{c z^{x}}{ϕ (z, s, a) - z^{x + 1} ϕ (z, s, a + x + 1)} .$

(3.33)

We observe that the expressions in (3.32) leads to some new interesting recurrence relationships and formulae. Since

$ϕ (z, s, a + x) = {[h (x) {(a + x)}^{s}]}^{- 1}$

and

$ϕ (z, s, a + x + 1) = \frac{1}{z} [ϕ (z, s, a + x) - {(a + x)}^{- s}] = {[h (x + 1) {(a + x + 1)}^{s}]}^{- 1}$

from (3.31), we have on eliminating ϕ between the last two equations,

$h (x + 1) = z {(\frac{a + x}{a + x + 1})}^{s} \frac{h (x)}{1 - h (x)},$

(3.34)

a recurrence relation for evaluating $h (x)$ . One may use the initial value $h (0) = c a^{- s}$ in (3.34) to initiate the recurrence.

The probability mass function and the hazard rate function of the family are presented in Figs 3.2 and 3.3 respectively.

Figure 3.2 Probability mass functions of Hurwitz-Lerch distribution.

Figure 3.3 Hazard functions of Hurwitz-Lerch distribution.

To simplify (3.33) further, we apply (Magnus et al., 1966)

$ϕ (z, s, a) - z^{x + 1} ϕ (z, s, a x + 1) = \sum_{k = 0}^{x} \frac{z^{k}}{{(a + k)}^{s}}$

to write a closed-form expression

$\begin{matrix} λ (x) & = \frac{z^{x}}{(a + x^{s} \sum_{k = 0}^{x} \frac{z^{k}}{{(a + k)}^{s}})} \\ = {[\sum_{k = 0}^{x} {(\frac{a + x}{a + x - k})}^{s} \frac{1}{z^{k}}]}^{- 1} . \end{matrix}$

(3.35)

The mean residual life is given by

$m (x) = \frac{1}{z^{x} ϕ (z, s, a + x + 1)} \sum_{t = x + 1}^{\infty} z^{t} ϕ (z, s, a + t) .$

It now follows that

$m (x + 1) z ϕ (z, s, a + x + 2) - m (x) ϕ (z, s, a + x + 1) = \frac{z}{{(a + x + 1)}^{s}},$

$m (x + 1) - m (x) = \frac{z + m (x + 1)}{{(a + x + 1)}^{s} ϕ (z, s, a + x + 1)} .$

(3.36)

Lerch family can be written in the form (3.20) with

$B (θ) = ϕ (z, s, a), a (x) = {(a + x)}^{- s} and g (θ) = z$

as a modified power series family with parameter z in place of $g (θ) = θ$ . Hence, the moments satisfy the recurrence formula

$μ_{r + 1} = z \frac{d μ_{r}}{d z} + r μ_{z} μ_{r - 1}, r = 1, 2, \dots,$

for the central moments, and

$μ_{(r + 1)} = z \frac{d μ_{(r)}}{d z} + μ_{(r)} μ - r μ_{(r)}$

connecting the factorial moments. With the use of the expressions for μ and $μ_{2}$ given above, the central and factorial moments can be computed. In the context of reliability modelling, the following characterization in terms of reliability concepts are useful.

Theorem 3.2

A discrete random variable X with support $(0, 1, 2, \dots)$ belongs to the Lerch family in (3.30) if and only if

$m^{⁎} (x) = μ + z \frac{\partial \log S (x + 1)}{\partial z} .$

(3.37)

Proof

We have

$S (x + 1) = \sum_{t = x + 1}^{\infty} \frac{z^{t}}{{(a + t)}^{s} ϕ (z, s, a)} .$

Differentiating with respect to z, we get

$\frac{\partial S}{\partial z} = \sum_{t = x + 1}^{\infty} \frac{t z^{t - 1}}{{(a + t)}^{s} ϕ (z, s, a)} - \sum_{t = x + 1}^{\infty} \frac{z^{t}}{{(a + t)}^{s} ϕ (z, s, a)} \frac{ϕ^{'} (z, s, a)}{ϕ (z, s, a)}$

and so

$\frac{1}{S (x + 1)} \frac{\partial S (x + 1)}{\partial z} = \frac{1}{z} E (X | X > x) - \frac{ϕ^{'} (z, s, a)}{ϕ (z, s, a)} .$

(3.38)

By virtue of the identity

$ϕ (x, s - 1, a) = (a + z \frac{\partial}{\partial z}) ϕ (z, s, a)$

and the fact that

$μ = \frac{ϕ (z, s - 1, a)}{ϕ (z, s, a)} - a,$

(3.38) simplifies to (3.37). To prove the sufficiency part, condition (3.7) means that

$\sum_{t = x + 1}^{\infty} t f (t) = μ \sum_{t = x + 1}^{\infty} f (t) + z \sum_{t = x + 1}^{\infty} \frac{\partial f (t)}{\partial z} .$

(3.39)

Changing $x + 1$ to x and from the resulting equation, subtracting (3.39), yields

$x f (x) = μ f (x) + z \frac{\partial f (x)}{\partial z},$

$z \frac{\partial \log f (x)}{\partial z} = x - μ = x - \frac{z}{ϕ (z, s, a)} \frac{\partial ϕ (z, s, a)}{\partial z} .$

Upon integrating, we get

$f (x) = \frac{z^{x}}{ϕ (z, s, a)} A (x) .$

Since $\sum_{x = 0}^{\infty} f (x) = 1$ and $A (x) = {(a + x)}^{- s}$ we have the proof. □

Theorem 3.3

A necessary and sufficient condition for X to have Lerch distribution is that

$E [{(a + X)}^{s} | X > x] = \frac{{(a + x + 1)}^{s} h (x + 1)}{1 - z}, | z | < 1, a > 0,$

(3.40)

for all x.

Proof

We have

$f (x) {(a + x)}^{s} = \frac{z^{x}}{ϕ (z, s, a)},$

$\sum_{t = x + 1}^{\infty} f (t) {(a + t)}^{s} = \frac{z^{x + 1}}{(1 - z) ϕ (z, s, a)} .$

Thus,

$\begin{matrix} E [{(a + X)}^{s} | X > x] & = \frac{z^{x + 1}}{(1 - z) S (x + 1) ϕ (z, s, a)} \\ = \frac{{(a + x + 1)}^{s}}{z} h (x + 1), \end{matrix}$

so that the condition is necessary. The sufficiency part follows by retracing the above steps. □

Theorem 3.4

The random variable X has Lerch distribution if and only if

$E (X | X ⩽ x) = μ + z \frac{\partial \log F (x)}{\partial x} .$

(3.41)

Proof

From Theorem 3.2 and the relationship

$F (x) E (X | X ⩽ x) + S (x + 1) E (X | X > x) = μ,$

we obtain the characterization in (3.41). □

Theorem 3.5

The random variable X has Lerch distribution if and only if the variance residual life is given by

$σ^{2} (x) = z \frac{\partial \log S (x + 1)}{\partial z} + z^{2} \frac{\partial^{2} \log S (x + 1)}{\partial z^{2}} .$

(3.42)

Proof

Sudheesh and Nair (2010) have characterized the modified power series family by

$σ^{2} (x) = \frac{g (θ)}{g^{'} (θ)} \frac{\partial m^{⁎} (x, θ)}{\partial θ} .$

(3.43)

Since the Lerch family is a special case when $g (θ) = θ = z$ , we have

$σ^{2} (x) = z \frac{\partial m^{⁎} (x, z)}{\partial z} .$

(3.44)

Substituting (3.37) into (3.44), we obtain (3.42). □

The estimation of parameters of the Lerch family has been addressed by Aksenov and Savageau (2005), who proposed the method of moments and the method of maximum likelihood. Denoting the sample moments by

$p_{r} = \frac{1}{n} \sum_{λ = 1}^{n} x_{i}^{r}$

from a random sample $(x_{1}, \dots, x_{n})$ from the distribution, the equations to be solved in the method of moments are as follows:

$\begin{matrix} \frac{ϕ (\bar{z}, \bar{s} - 1, \bar{a})}{ϕ (\bar{z}, \bar{s}, \bar{a})} - \bar{a} = p_{1}, \\ {\bar{a}}^{2} - 2 \bar{a} \frac{ϕ (\bar{z}, \bar{s} - 1, \bar{a})}{ϕ (\bar{z}, \bar{s}, \bar{a})} + \frac{ϕ (\bar{z}, \bar{s} - 2, \bar{a})}{ϕ (\bar{z}, \bar{s}, \bar{a})} = p_{2}, \\ - {\bar{a}}^{3} + 3 {\bar{a}}^{2} \frac{ϕ (\bar{z}, \bar{s} - 1, \bar{a})}{ϕ (\bar{z}, \bar{s}, \bar{a})} - 3 \bar{a} \frac{ϕ (\bar{z}, \bar{s} - 3, \bar{a})}{ϕ (\bar{z}, \bar{s}, \bar{a})} = p_{3}, \end{matrix}$

to get the estimates $(\bar{z}, \bar{s}, \bar{a})$ of $(z, s, a)$ . The above equations have to be solved numerically. Aksenov and Savageau (2005) also presented formulas for the asymptotic variances and covariances of the estimates obtained by the method of moments and the method of maximum likelihood.

Remark 3.2

(1) The population moments given above are deduced from the general expression for the rth moment about the origin given by

$μ_{r}^{'} = \frac{1}{ϕ (z, s, a)} \sum_{j = 0}^{r} (\begin{matrix} r \\ j \end{matrix}) {(- a)}^{r - j} ϕ (z, s - j, a);$

(2) For calculation of the coefficients of skewness and kurtosis, the formula for the central moments given by

$μ_{r} = \frac{1}{ϕ (z, s, a)} \sum_{j = 0}^{r} {(- 1)}^{r - j} (\begin{matrix} r \\ j \end{matrix}) ϕ (z, s - j, a) {(\frac{ϕ {(z, s - 1, a)}^{r - j}}{ϕ (z, s, a)})}^{r - j}$

will be useful.

Gupta et al. (2008) have studied the Lerch family with positive integers as support. They considered the probability mass function of the form

$f (x) = \frac{z^{x}}{T (z, s, a) {(a + x)}^{s + 1}}, \begin{matrix} x = 1, 2, 3, \dots, s ⩾ 0, \\ 0 ⩽ a ⩽ 1, 0 < z ⩽ 1, \end{matrix}$

where

$T (z, s, a) = \sum_{x = 1}^{\infty} \frac{z^{x}}{{(a + x)}^{s + 1}} = z ϕ (z, s + 1, a + 1) .$

The basic characteristics of this distribution are evaluated from

$\begin{matrix} μ = \frac{T (z, s - 1, a)}{T (z, s, a)} - a, \\ σ^{2} = \frac{T (z, s - 2, a)}{T (x, s, a)} - {(\frac{T (z, s - 1, a)}{T (z, s, a)})}^{2}, \end{matrix}$

and

$μ_{r + 1} = z \frac{d μ_{r}}{d z} + r σ^{2} μ_{r - 1}, r = 1, 2, \dots .$

Other functions of interest are the probability generating function given by

$P (t) = \frac{T (z t, s, a)}{T (z, s, a)},$

the distribution function given by

$F (x) = 1 - z^{x} \frac{T (z, s, a + x)}{T (z, s, a)},$

the survival function given by

$S (x) = z^{k - 1} \frac{T (z, s, a + x + 1)}{T (z, s, a)},$

the hazard rate given by

$h (x) = \frac{z}{{(a + x)}^{s + 1} T (z, s, a + x - 1)},$

and the reversed hazard rate given by

$λ (x) = \frac{z^{k}}{[T (z, s, a) - z^{k} T (z, s, a + x)] {(a + x)}^{s + 1}} .$

The maximum likelihood method is proposed to estimate the parameters with the estimators solved from

$\begin{matrix} E (X) & = \bar{x}, \\ \sum_{i = 1}^{n} \frac{\log (a + x_{i})}{n} & = E (\log (a + X)) \end{matrix}$

and

$\frac{1}{n} \sum_{i = 1}^{n} \frac{1}{a + x_{i}} = E (\frac{1}{a + X}) .$

They further observed that the likelihood equations are the same as the method of moments equations.

Zornig and Altmann (1995) and Kemp (2010) provided a list of members of the Lerch family. We now present some of these distributions and their properties. It may be noted that there is a difference in the expressions for the probability generating functions when the supports are different. For instance,

$\begin{matrix} P (t) & = ϕ (z t, s, a) / ϕ (z, s, a), x = 0, 1, 2, \dots, \\ = t ϕ (z t, s, a) / ϕ (z, s, a), x = 0, 1, 2, \dots, \\ = \frac{t ϕ (z t, s, a) - z^{b} ϕ (t z, s, a + b)}{ϕ (z, s, a) - z^{b} ϕ (z, s, a + b)}, x = 1, 2, \dots, b . \end{matrix}$

Geometric Distribution

The geometric distribution is a member of all the families discussed so far, and hence enjoys the properties of all families. In addition to some of the characteristic properties already discussed in the preceding chapter, we present a few more results here that are relevant to reliability studies. The foremost among them is the no-ageing (lack of memory) property of the geometric lifetimes. It states that X is geometric with probability mass function

$f (x) = q^{x} p, x = 0, 1, 2, \dots,$

(3.45)

if and only if

$P (X ⩾ s + t | X ⩾ s) = P (X ⩾ t)$

(3.46)

for all $t, s = 0, 1, 2, \dots$ , or equivalently,

$P (X = s + t | X ⩾ s) = P (X = t) .$

Writing (3.46) as

$\frac{P (X ⩾ s + t)}{P (X ⩾ s)} = P (X ⩾ t),$

we can recognize the left hand side as the distribution of the residual life $(X - s | X ⩾ s)$ . Thus, the residual life of a geometric lifetime at any age is the same as the original lifetime, thereby, justifying the name ‘no-ageing’ property.

Being a characteristic property, for devices that does not age with time, the geometric law provides a suitable model. This is consistent with our earlier observation that the hazard rate, mean residual life and variance residual life are constant independent of the age of the device. Further, all these properties are equivalent to one another. The notion of no-ageing, also defined as

$S (x + y) = S (x) S (y)$

(3.47)

for all $x, y$ , forms the basis of many ageing concepts discussed in the next chapter. Also, (3.47) suitably extended to higher dimensions is employed to generate various forms of multivariate geometric distributions; see Chapter 7 for details. A simple extension of the geometric law is possible by assuming X to take values $a, a + d, a + 2 d, \dots$ , with

$P (X = a + n d) = q^{x} p, n = 0, 1, 2, \dots,$

(3.48)

in which case also the no-ageing property applies when X is replaced $a + X d$ in (3.46). More interesting results of (3.48), in connection with ageing behaviour of X, can be seen in Chapter 4. Some additional properties of the geometric distribution are as follows:

(i) If $X_{1}, \dots, X_{n}$ are independent geometric random variables with parameter p, then $X_{1} + X_{2} + \dots + X_{n}$ is distributed as negative binomial with parameters $(n, p)$ ;
(ii) If $X_{1}, \dots, X_{n}$ are independent geometric random variables with parameters $p_{1}, p_{2}, \dots, p_{n}$ respectively, then $X_{(1)} = \min (X_{1}, \dots, X_{n})$ is a geometric random variable with parameter $p = 1 - \prod_{i = 1}^{n} (1 - p_{i})$ ;
(iii) If $X_{1}$ and $X_{2}$ are independent random variables with same parameter p, then $X_{(2)} = \max (X_{1}, X_{2})$ has distribution

$\begin{matrix} P (X_{(2)} = x) & = P (X_{1} = x, X_{2} < x) \\ + P (X_{2} = x, X_{1} < x) \\ + P (X_{1} = X_{2} = x) \\ = 2 q^{x} p (1 - q^{x}) + q^{2 x} p^{2} \\ = 2 q^{x} p - q^{2 x} p - q^{2 x + 1} p . \end{matrix}$

The method of proof can be extended readily to the case of n variables.
(iv) If X is negative binomial with

$f (x | K = k) = (\begin{matrix} k + x - 1 \\ k - 1 \end{matrix}) q^{x} p^{k}$

and K has geometric distribution

$P (K = y) = p_{1} q_{1}^{y - 1}, y = 1, 2, \dots,$

then the mixture (unconditional distribution of X) is geometric $(\frac{p p_{1}}{1 - p q_{1}})$ . Thus, geometric distribution arises as a mixture.
(v) Let Y be a continuous random variable with exponential distribution

$f_{1} (y) = λ e^{- λ y}, y > 0, λ > 0 .$

If we denote by X and U, the integer part (largest integer not exceeding Y) and the fractional part of Y, Steutel and Thiemann (1989) have shown that X and U are independent and further that X has geometric distribution in (3.45) with $p = 1 - e^{- λ}$ ; see also Deng and Chhikkara (1990). Arising from the above result, it is also shown that the order statistics $X_{k : n}$ , $k = 1, 2, \dots, n$ , from the geometric distribution satisfies

$X_{k : n} \overset{d}{=} \sum_{j = 0}^{k - 1} X_{1 : n - j} + [\sum_{j = 0}^{k - 1} \frac{U_{n - j}}{n - j}],$

where $\overset{d}{=}$ denotes equality in distribution, all variables on the right hand side are independent, and $[Y] = X$ as stated above. Also,

$\sum_{j = 0}^{k - 1} X_{1 : n - j} ⩽ X_{k : n} ⩽ X_{1 : n - j} + n - 1$

in which the inequalities denote the usual stochastic orderings discussed in Chapter 4.

It is more informative to pursue the above results in a more general framework. Let Y be a continuous random variable with probability density function $f_{1} (y)$ satisfying $P (Y \in D) = 1$ , where D is an open subset of $R$ , the one-dimensional Euclidean space. If D can be divided into mutually exclusive regions $R_{1}, \dots, R_{k}$ with corresponding regions $S_{1}, \dots, S_{k}$ such that $T_{i} (Y) : R_{i} \to S_{i}$ , $i = 1, 2, \dots, k$ , are injective and continuously differentiable. Then $U = T (Y)$ has probability density function

$g (u) = {\begin{matrix} \sum_{i = 1}^{k} \frac{f_{1} (T_{i}^{- 1} (u))}{J (T_{i}^{- 1} (u))}, & u \in T (R_{i},) \\ 0, & u \notin T (R_{i}) . \end{matrix}$

Here, J stands for the absolute Jacobian of transformation and $T^{- 1} : T (D) \to D$ is the inverse of T (see Hoffman-Jorgensen, 1994). Applying the above transformation theorem with $D = R$ , $T (Y) = U$ , $T (D) = [0, 1)$ , then $J = 1$ for every $x = 1, 2, \dots$ . When Y is a continuous random variable with X as integer part and U as fractional part, we have the following result.

Theorem 3.6

The distribution of U is continuous with probability density function

$g (u) = \sum_{x = - \infty}^{\infty} f_{1} (x + u), 0 < u < 1 .$

Example

Starting with the exponential case

$g (u) = \sum_{x = 0}^{\infty} λ e^{- λ (x + u)} = \frac{λ e^{- λ u}}{1 - e^{- λ}}, 0 < u < 1,$

which is again the exponential distribution truncated at unity.

The application of the above result in reliability analysis is quite clear. On the one hand, we can convert the data on exponential to geometric and vice versa. Also by simulating exponential observations, their integer parts will be geometric. Some of the reliability functions from one distribution can be expressed in terms of the other. For example, in the exponential case, the mean residual life function is

$\begin{matrix} m (y) & = E (Y - y | Y > y) = E (Y) \\ = E (X + U) = E (X) + E (U) \\ = E (X - x | X > x) + E (U) - 1 \\ = \frac{1}{p} + (\frac{1}{λ} - \frac{e^{- λ}}{1 - e^{- λ}}) - 1 . \end{matrix}$

Thus, the difference between the mean residual lives of the exponential and geometric lifetimes is $\frac{1}{λ} - \frac{e^{- λ}}{1 - {\bar{e}}^{λ}}$ . Even if failure times are observed at integer values, a very good approximation to continuous measurement can be made by virtue of the above result, from the estimated value of p, employing the equation $p = 1 - e^{- λ}$ .

(vi) The other distributional properties are:

$\begin{matrix} probability generating function P (t) = p {(1 - q t)}^{- 1}, \\ mean μ = \frac{q}{p}, variance μ_{2} = \frac{q}{p^{2}}, \end{matrix}$

and the rth factorial moment is

$μ_{(r)} = r! {(\frac{q}{p})}^{r} .$

Theorem 3.7

For any discrete random variable X, we have

$E (X h (X)) ⩾ \frac{2 μ^{2}}{E (X^{2}) + μ}$

with equality holding if and only if X is geometric.

Proof

We have

$E (X h (X)) = \sum_{x = 0}^{\infty} \frac{x f^{2} (x)}{S (x)} .$

From Cauchy-Schwarz inequality, we find

$(\sum_{x = 0}^{\infty} \frac{x f^{2} (x)}{S (x)}) (\sum_{x = 0}^{\infty} x S (x)) ⩾ {(\sum_{x = 0}^{\infty} x f (x))}^{2} = μ^{2} .$

(3.49)

By some straight forward calculations, we then get

$\sum x S (x) = \frac{E (X^{2}) + μ}{2} .$

Hence, from (3.49), we have the inequality stated in the theorem. The equality sign in (3.49) holds if and only if

${[x S (x)]}^{\frac{1}{2}} = K {(\frac{x}{S (x)})}^{\frac{1}{2}} f (x),$

which gives $h (x) =$ a constant, characterizing the geometric law. □

(viii) Geometric distribution is a member of the Lerch family with $s = 0$ and $a = 1$ . The reliability properties follow from the general formulas for this family and also from our discussions in Chapter 2.

Discrete Uniform Distribution

The discrete uniform distribution arises from (3.30) when $z = 1$ , $s = 0$ and $a = 1$ , with probability mass function

$f (x) = \frac{1}{b}, x = 1, \dots, b .$

(3.50)

It has distribution function $F (x) = \frac{x}{b}$ and survival function $S (x) = \frac{b - x + 1}{b}$ . Various distributional characteristics are as follows:

$\begin{matrix} μ = \frac{(b + 1) (2 b + 1)}{6}, μ_{2} = \frac{b^{2} - 1}{12}, \\ P (t) = \frac{t (1 - t^{b})}{b (1 - t)}, t \neq 1 . \end{matrix}$

If $X_{1}, \dots, X_{n}$ are independent random variables with distribution in (3.50), then $X_{(1)} = \min X_{i}$ and $X_{(n)} = \max X_{i}$ , $i = 1, 2, \dots, n$ have respective probability mass functions

$P (X_{(1)} = x) = \frac{{(b - x + 1)}^{n} - {(b - x)}^{n}}{b^{n}}$

and

$P (X_{(n)} = x) = \frac{x^{n} - {(x - 1)}^{n}}{b^{n}} .$

Among the reliability functions, the hazard rate is $h (x) = \frac{1}{b - x + 1}$ and the mean residual life function is $m (x) = \frac{(b - x + 1)}{2}$ , giving

$h (x) m (x) = \frac{1}{2},$

a constant, characterizing the uniform distribution as a special case of the negative hypergeometric distribution (see Chapter 1). The variance residual life is found to be

$σ^{2} (x) = \frac{(b - x + 1) (b - x - 1)}{12} = \frac{m (x) (m (x) - 1)}{3} .$

Also, the functions in reversed time turns out to be

$\begin{matrix} λ (x) = \frac{1}{x}, a reciprocal linear function, \\ r (x) = \frac{x}{2}, a linear function of x, \end{matrix}$

with their product

$λ (x) r (x) = \frac{1}{2},$

a constant. Further, the reversed variance residual life becomes

$v (x) = \frac{x (x - 2)}{12} = \frac{r (x) (r (x) - 1)}{3},$

which is also a characterization of the discrete uniform distribution.

Discrete Pareto

The discrete Pareto distribution, also known as the Zipf distribution and as Riemann zeta distribution, is specified by the probability mass function (Fig. 3.4)

$f (x) = C x^{- (s + 1)}, x = 1, 2, \dots, s > 0,$

(3.51)

where

$C = {[\sum_{x = 1}^{\infty} x^{- (s + 1)}]}^{- 1} = \frac{1}{ζ (s + 1)} = ϕ (1, s + 1, 1),$

and $ζ (\cdot)$ is the Riemann zeta function defined earlier in (3.27). As a model of random phenomenon, the distribution in (3.51) have been used in literature in different contexts. It is used to model the size or ranks of objects chosen randomly from certain type of populations, for example, the frequency of words in long sequences of text approximately obeys the discrete Pareto law. Zipf (1949) law states that out of a population of N elements, the frequency of elements of rank $(s + 1)$ is (3.51) and hence the name Zipf distribution, when used in linguistics. Seal (1947) has used it to model the number of insurance policies per individual, and as the distribution of surnames by Fox and Lasker (1983); see also Good (1953) and Haight (1966). Various distributional characteristics are obtained as special cases of the corresponding results of the Lerch family. In particular, moments of (3.51) are given by

$μ_{r}^{'} = \frac{ζ (s - r + 1)}{ζ (s + 1)}, r < s,$

and the moments are infinite for $r ⩾ s$ . The maximum likelihood estimator $\hat{s}$ is obtained by solving

$\frac{1}{n} \sum_{i = 1}^{n} \log x_{i} = - \frac{ζ^{'} (\hat{s} + 1)}{ζ (\hat{s} + 1)} .$

Some special cases of the discrete Pareto are the Estoup model, given by

$f (x) = \frac{c}{x}, x = 1, 2, \dots, b,$

(3.52)

(who noted that the rank r and frequency f in a French text were related by a hyperbolic law which translates in to the form (3.52)) and the Lotka law given by

$f (x) = \frac{c}{x^{2}},$

where $c = \frac{6}{π^{2}}$ . Lotka law results from the finding that the number of authors making n contributions is about $n^{- a}$ of those making one contribution, in which empirically a is nearly equal to two. When the value of x in (3.51) is truncated at b, we obtain

$f (x) = \frac{1}{H (b, s) x^{s}}, x = 1, 2, \dots, b,$

(3.53)

which is a proper distribution with $H (b, s) = \sum_{x = 1}^{b} {(\frac{1}{x})}^{s}$ . In this case,

$F (x) = \frac{H (x, s)}{H (b, s)} and S (x) = \frac{H (b, s) - H (x - 1, s)}{H (b, s)}$

and accordingly

$λ (x) = \frac{H (x, s)}{x^{s}} and h (x) = \frac{1}{x^{s} [H (b, s) - H (x - 1, s)]} .$

The mean and variance are given by

$μ = \frac{H (b, s - 1)}{H (b, s)} and μ_{2} = \frac{H (b, s - 2) H (b, s) - H^{2} (b, s - 1)}{H^{2} (b, s)},$

and the moment generating function is given by

$M (t) = \frac{1}{H (b, s)} \sum_{j = 1}^{b} \frac{e^{j t}}{j^{s}} .$

A distribution that is unrelated to the above, but bears the name ‘zeta distribution’ was introduced by Haight (1969) with probability mass function

$f (x) = {(2 x - 1)}^{- σ} - {(2 x + 1)}^{- σ}, x = 1, 2, \dots, σ > 0 .$

The distribution and survival functions are, respectively,

$F (x) = 1 - {(2 x + 1)}^{- σ}$

and

$S (x) = {(2 x - 1)}^{- σ} .$

Thus, the hazard and reversed hazard functions have simple closed form expressions as

$h (x) = 1 - {(\frac{2 x - 1}{2 x + 1})}^{σ}$

and

$λ (x) = [(\frac{2 x + 1}{2 x - 1}) - 1] / [{(2 x + 1)}^{σ} - 1] .$

Further, the mean residual life and reversed mean residual life are found to be

$m (x) = μ + \frac{g (x) h (x)}{1 - h (x)} - x$

and

$r (x) = x - μ + g (x) λ (x),$

where

$g (x) = \frac{μ (1 - {(2 x + 1)}^{- σ}) - ζ_{x} (σ)}{{(2 x - 1)}^{- σ} - {(2 x + 1)}^{- σ},}$

$ζ_{x} (σ) = \sum_{t = 1}^{x} \frac{1}{t^{σ}}$ and $μ = (1 - 2^{- σ}) ζ (σ)$ is the mean.

Figure 3.4 Hazard functions of discrete Pareto distribution.

The variance of the distribution is

$μ_{2} = (1 - 2^{1 - σ}) ζ (σ - 1) - μ^{2} .$

Notice that the mean is infinite for $σ ⩽ 1$ and the variance is infinite for $σ ⩽ 2$ . Some further literature on discrete Pareto versions can be seen in Lin and Hu (2001) and Shan (2005).

Figs 3.4–3.7 show hazard rate functions of discrete Pareto, Estoup, Lotka and truncated discrete Pareto distributions.

Figure 3.5 Hazard functions of Estoup distribution.

Figure 3.6 Hazard functions of Lotka distribution.

Figure 3.7 Hazard functions of truncated discrete Pareto.

Hurwitz-Zeta Distribution

A second major distribution belonging to the Lerch family is the Hurwitz-zeta distribution defined by

$f (x) = \frac{c}{{(a - 1 + x)}^{s}}, x = 1, 2, \dots, s > 1,$

(3.54)

with $c^{- 1} = ζ (s, a) = ϕ (1, s, a)$ being the Hurwitz-zeta function defined in (3.28). This distribution is mainly used in linguistics in connection with ranking problems. We note that

$ζ (s, a) - ζ (s, a + 1) = a^{- s},$

so that

$F (x) = 1 - \frac{ζ (s, a + x)}{ζ (s, a)} and S (x) = \frac{ζ (s, a + x - 1)}{ζ (s, a)} .$

Accordingly, the reliability functions have the expressions

$\begin{matrix} h (x) = {[{(a - 1 + x)}^{s} ζ (s, a + x - 1)]}^{- 1}, \\ λ (x) = {[{(a - 1 + x)}^{- s} (ζ (s, a) - ζ (s, a + x))]}^{- 1} . \end{matrix}$

Also,

$\begin{matrix} ζ (s, a + x - 1) = {[{(a + x - 1)}^{s} h (x)]}^{- 1} \\ ζ (s, a + x) = {[{(a + x)}^{s} h (x + 1)]}^{- 1}, \end{matrix}$

along with

$ζ (s, a + x - 1) - ζ (s, a + x) = {(a + x - 1)}^{- s},$

leads to

$h (x + 1) = {(\frac{a + x - 1}{a + x})}^{s} \frac{h (x)}{1 - h (x)}, x = 1, 2, \dots$

(3.55)

Formula (3.55) enables the evaluation of successive hazard rates starting from $h (1) = \frac{1}{a^{s} ζ (s, a)}$ . The expression for $λ (x)$ given above admits further simplification on applying the relationship

$ζ (s, a) - ζ (s, a + k) = \sum_{n = 0}^{k - 1} {(a + n)}^{- s},$

which leads to a compact expression of the form

$λ (x) = \frac{{(a + x - 1)}^{s}}{\sum_{n = 0}^{x - 1} {(a + n)}^{- s}} .$

Consequently, we have the recurrence relation

$λ (x + 1) = λ (x) {[λ (x) + {(\frac{a + x}{a + x + 1})}^{s}]}^{- 1},$

which can be determined iteratively by setting the initial value as $λ (1) = 1$ . We can write the mean residual life as

$\begin{matrix} m (x) & = \frac{1}{ζ (s, a + x)} \sum_{t = x + 1}^{\infty} ζ (s, a + t - 1) \\ = \frac{1}{ζ (s, a + x)} \sum_{t = x}^{\infty} ζ (s, a + t) . \end{matrix}$

Hence,

$m (x) ζ (s, a + x) - m (x + 1) ζ (s, a + x + 1) = ζ (s, a + x) .$

By virtue of

$ζ (s, a + x + 1) = ζ (s, a + x) - {(a + x)}^{- s},$

(3.56)

we have a recurrence relation

$m (x + 1) = \frac{ζ (s, a + x) (m (x) - 1)}{ζ (s, a + x) - {(a + x)}^{- s}} .$

The reversed mean residual life is given by

$r (x) = \frac{1}{ζ (s, a) - ζ (s, a + x - 1)} \sum_{t = 1}^{x - 1} ζ (s, a) - ζ (s, a + x) .$

Further simplification yields

$r (x + 1) = 1 + (\frac{\sum_{t = 1}^{x - 1} {(a + t - 1)}^{- s}}{\sum_{t = 1}^{x} {(a + t - 1)}^{- s}}) r (x) .$

The probability mass function and hazard rate function of Hurwitz-zeta distribution are presented in Figs 3.8 and 3.9, respectively.

Figure 3.8 Probability mass functions of Hurwitz-zeta distribution.

Figure 3.9 Hazard functions of Hurwitz-zeta distribution.

A special case of the Hurtwitz-zeta model that is in use is the Zipf-Mandelbrot law (Mandelbrot, 1959, 1966) specified by

$f (x) = \frac{c}{(a - 1 + x)}, x = 1, 2, \dots, n,$

obtained when $s = 1$ in (3.54). See Zornig and Altmann (1995) for a discussion on the distribution.

A third category of distribution in the Lerch family makes use of the general term in the polylogarithmic function $A (z, s)$ defined in (3.29).

Accordingly, the general form of the probability mass function in such cases is the Good distribution (Good, 1953) given by

$f (x) = \frac{C z^{x}}{x^{s}}, x = 1, 2, \dots, 0 < x < 1,$

(3.57)

with $C = A (z, s) = z ϕ (z, s, 1)$ . An important member in the above form, repeatedly used in the sequel, is the logarithmic series distribution (Fig. 3.8).

Setting $s = 1$ in (3.57), we have the probability mass function of the distribution as

$f (x) = \frac{C z^{x}}{x}, x = 1, 2, \dots, 0 < x < 1,$

(3.58)

where $C = - \log (1 - z)$ . It is also member of the GPSD discussed above. It is easily seen that

$f (x) = \frac{z (x - 1)}{x} f (x - 1), x = 2, 3, \dots, 0 < x < 1 .$

(3.59)

Historical remarks and the origin of the logarithmic series distribution and its properties can be found in Johnson et al. (1992). Regarding distributional properties, it may be noted that the probability generating function has the form

$P (t) = \frac{\log (1 - z t)}{\log (1 - z)} .$

The factorial moment of order r is

$μ_{(r)} = - \log {(1 - z)}^{- 1} z^{r} (r - 1)! {(1 - z)}^{- r}$

from which the mean and variance are obtained as

$\begin{matrix} μ & = - \log {(1 - z)}^{- 1} z {(1 - z)}^{- 1}, \\ μ_{2} & = - \log (1 - z) {(1 - z)}^{- 2} (1 + z \log (1 - z)) . \end{matrix}$

Various central moments can be found from the above value of $μ_{2}$ and the recurrence relation

$μ_{r + 1} = (1 - z) \frac{d μ_{r}}{d z} + r μ_{2} μ_{r - 1} .$

From the recurrence relation for $f (x)$ mentioned in (3.58), the ratio $\frac{f (x + 1)}{f (x)}$ is always less than unity, so that $f (x)$ is always a decreasing function. The distribution has a long tail and it resembles the geometric distribution for large values of x. We can identify the logarithmic series distribution by plotting $\frac{x f_{x}}{(x - 1) f_{x - 1}}$ against x, where $f_{x}$ is the observed frequency of x (see Eq. (3.58)). The plot is expected to give a straight line parallel to the x-axis with intercept z. In problems of estimation of z, the maximum likelihood estimator $\hat{z}$ is obtained by solving the equation

$\bar{x} = \frac{\hat{z}}{- (1 - \hat{z}) \log (1 - \hat{z})},$

with $\bar{x}$ being the mean of a random sample drawn from the distribution. The reversed hazard rate function is

$λ (x) = \frac{z^{x}}{x \sum_{t = 1}^{x} \frac{z^{t}}{t}}$

giving

$λ (x + 1) = \frac{z x λ (x)}{z x λ (x) + x + 1} .$

(3.60)

On the other hand, the hazard rate function is the reciprocal of an infinite series, viz.,

$h (x) = {[\sum_{t = 0}^{\infty} \frac{x z^{t}}{x + t}]}^{- 1} .$

Hence,

$h (x + 1) = \frac{z x h (x)}{(x + 1) (1 - h (x))} .$

Fig. 3.10 gives hazard rate functions of Good distribution.

Figure 3.10 Hazard functions of Good distribution.

The rest of the reliability functions characterizing the logarithmic distribution are given in the following theorem.

Theorem 3.8

A discrete random variable X with the support of $(1, 2, \dots)$ follows the logarithmic series distribution if and only if, for all x,

$m^{⁎} (x) = \frac{1 + x}{1 - z} h (x + 1) .$

(3.61)

Proof

Assume that the relationship (3.61) is true for the random variable X. Then,

$\frac{1}{S (x + 1)} \sum_{t = x + 1}^{\infty} t f (t) = \frac{1 + x}{1 - x} \frac{f (x + 1)}{S (x + 1)},$

$\sum_{t = x + 1}^{\infty} t f (t) = \frac{1 + x}{1 - z} f (x + 1) .$

This gives

$x f (x) = \frac{x}{1 - z} f (x) - \frac{1 + x}{1 - z} f (x + 1)$

and

$\frac{f (x + 1)}{f (x)} = \frac{z x}{1 + x}, x = 1, 2, \dots$

(3.62)

Consequently, from (3.58), X has logarithmic series distribution. The converse is proved by retracing the steps from (3.62). □

From (3.61), the mean residual life function of X is determined as

$m (x) = \frac{1 + x}{1 - x} h (x + 1) - x$

with the expression for $h (x)$ given above.

Some general aspects of various distributions and their applications, other than those connected with reliability modelling, are given in the works of Vilaplana (1987), Gut (2005), Doray and Luong (1995, 1997), Kulasekera and Tonkyn (1992) and Panaretos (1989). Figs 3.1A, B, C and Fig. 3.3 provide the shapes of the hazard rate functions of the modified power series and Lerch families.

3.2.4 Abel Series Distributions

The family of Abel series distributions is named after the Norwegian mathematician Niels Henrik Abel (1802–1829) who found the series whose general term specifies the probability mass function of the family. Abel polynomials (Comptet, 1994) are defined in terms of sequence $〈 p_{x} (θ) 〉$ , where

$p_{x} (θ) = α_{x} (θ, λ) = \frac{θ {(θ + x λ)}^{x - 1}}{x!}, x = 0, 1, 2, \dots,$

(3.63)

with the first term $α_{0} = 1$ and the second term $α_{1} = θ$ . Successive differentiation of (3.63) with respect to θ yields

$\frac{d^{r}}{d θ^{r}} α_{x} (θ, λ) = α_{x - r} (θ + r λ, λ), r = 0, 1, 2, \dots .$

(3.64)

For a fixed λ, not related to θ, $α_{x} (θ, λ)$ forms the basis of a set of polynomials in θ. Hence, for a finite, positive and successively differentiable function $P (θ) = P (θ, λ)$ in θ, we have

$P_{1} (θ) = \sum_{x = 0}^{\infty} α_{x} (θ, λ) g (- x λ),$

(3.65)

where

$g (- x λ) = {[\frac{d^{x}}{d θ^{x}} P_{1} (θ)]}_{θ = x λ} .$

Accordingly, restricting the parameter space to $(θ, λ) \in Θ \times Λ$ for which the terms in the expansion (3.65) are non-negative, we conclude that

$f (x) = \frac{α_{x} (θ, λ) g (- x λ)}{P_{1} (θ)}, x = 0, 1, 2, \dots,$

(3.66)

qualifies to be a probability mass function of a discrete random variable X. The distribution in (3.66) will be designated as the Abel series distribution (ASD), with series function $P_{1} (θ)$ . It may be noted that the truncated version of the ASD is also an ASD.

Some special notation and functions are required to describe the properties of ASD, as given in Charalambides (1990). They are the shift operator E and the usual differential operator D combined to give

$D E^{- λ} a (u) = \frac{d a (u - λ)}{d u} .$

The Bell partition polynomials

$B_{n} = B_{n} (a_{1}, a_{2}, \dots, a_{n}; b_{1}, b_{2}, \dots, b_{n})$

are defined by

$B_{n} = \sum \frac{n! b_{k}}{k_{1}! \dots k_{n}!} {(\frac{a_{1}}{1!})}^{k_{1}} {(\frac{a_{2}}{2!})}^{k_{2}} \dots {(\frac{a_{n}}{n!})}^{k_{n}}, n = 1, 2, \dots, B_{0} = 1,$

where the summation is extended over all partitions of n over $k_{1}, \dots, k_{n} ⩾ 0$ satisfying $k_{1} + 2 k_{2} + \dots + n k_{n} = n$ and $k = k_{1} + k_{2} + \dots + k_{n}$ is the number of parts of the partition. The special cases $f_{k} = 1$ , $k = 1, 2, \dots$ , and $f_{k} = {(s)}_{k} = s (s - 1) \dots (s - k + 1)$ , $k = 1, 2, \dots$ , for real s of the Bell polynomials are denoted by $B_{n}^{'} (a_{1}, a_{2}, \dots a_{n})$ and $B_{n}^{(s)} (a_{1}, a_{2}, \dots a_{n})$ . Also, let

$a (t) = \sum_{k = 0}^{\infty} a_{k} t^{k} / k!$

be the generating function of the sequence $(a_{k})$ . With the above notations Charalambides (1990) has then shown that

(i) the probability generating function of ASD is

$P (t) = {[P_{1} (θ)]}^{- 1} {\exp [θ q^{- 1} (t D E^{- λ}) P_{1} (u)]}_{u = 0},$

where $q (w) = w e^{- λ w}$ , $q^{- 1}$ is the inverse of q and $D E^{- λ} a (u) = \frac{d a (μ - λ)}{d u}$ ;
(ii) the factorial moments are given by

$μ_{(r)} (θ) = {[P_{1} (θ)]}^{- 1} {[B_{r}^{'} (Q_{1} \dots Q_{r}) E^{θ} P (u)]}_{u = 0}, r = 0, 1, 2, \dots,$

with

$\begin{matrix} Q_{m} & = {(- 1)}^{m - 1} θ λ^{m - 1} D^{m} {(1 - λ D)}^{- m} B_{m - 1}^{(- m)} (C_{1}, C_{2}, \dots, C_{m - 1}), \\ m = 1, 2, \dots, \end{matrix}$

where

$C_{k} = (k + 1 - λ D) {(1 - λ D)}^{- 1} / (k + 1), k = 1, 2, \dots .$

Various basic members of the ASD are obtained by the Abel series expansions of the exponential, binomial and logarithmic functions. We will discuss these distributions in some detail.

Generalized Poisson Distribution

Take

$P_{1} (θ) = e^{θ}, \frac{d^{r} P_{1} (θ)}{d θ^{r}} = e^{θ}$

and consequently $g (- x λ) = e^{- x λ}$ . Thus, we have the probability mass function as

$\begin{matrix} f (x; θ, λ) = \frac{θ {(θ + x λ)}^{x - 1}}{x!} e^{- (θ + x λ)}, \\ x = 0, 1, 2, \dots, θ > 0, 0 < λ < 1, \end{matrix}$

(3.67)

representing the generalized Poisson distribution of Consul and Jain (1973a, 1973b) who obtained it from the Lagrangian expansion. A comprehensive study of this distribution is available in Consul (1989). By way of properties of the distribution, we have:

$\begin{matrix} (i) f (x) = \frac{{(θ + x λ)}^{x - 1} e^{- λ}}{{(θ + x λ - λ)}^{x - 2}} f (x - 1); \\ (ii) μ = \frac{θ}{1 - λ}, μ_{2} = \frac{θ}{{(1 - λ)}^{3}}; \\ (iii) (1 - λ) μ_{r + 1}^{'} = θ μ_{r}^{'} + θ \frac{\partial μ_{r}^{'}}{\partial θ} + λ \frac{\partial μ_{r}^{'}}{\partial λ}, r = 0, 1, 2, \dots, \end{matrix}$

(3.68)

and

$μ_{r + 1} = \frac{r θ}{{(1 - λ)}^{3}} μ_{r - 1} + \frac{1}{1 - λ} {[\frac{\partial μ_{r} (t)}{\partial t}]}_{t = 1}, r = 1, 2, 3, \dots,$

where $μ_{r} (t)$ is $μ_{r}$ with λ and θ replaced by λt and θt, respectively.

The distribution in (3.67) is a modified power series distribution. We can estimate the parameters by the method of moments as

$\hat{θ} = \sqrt{\frac{m_{1}^{3}}{m_{2}}}$

and

$\hat{α} = \sqrt{\frac{m_{2}}{m_{1}^{3}}} - \frac{1}{m_{1}},$

where $α = \frac{λ}{θ}$ and $m_{i}$ is the ith sample moment, $i = 1, 2$ .

Regarding reliability characteristics, which has not been studied so far, we first observe that the hazard rate function is

$h (x) = \frac{{(θ + x λ)}^{x - 1 e^{- λ x}}}{x! \sum_{t = x}^{\infty} \frac{{(θ + t λ)}^{t - 1}}{t!} e^{- t λ}} .$

(3.69)

Also, (3.69) satisfies the relationship

$h (x + 1) = \frac{{(θ + λ + x λ)}^{x}}{{(θ + x λ)}^{x - 1} (x + 1)} \frac{h (x)}{(1 - h (x))}$

(3.70)

from which successive values of $h (x)$ can be calculated starting with $h (0) = e^{- θ}$ .

Secondly, the mean residual life function is obtained from the theorem due to Consul (1995) given below, with parametrization as $[θ, α)$ .

Theorem 3.9

A discrete random variable X has generalized Poisson distribution if and only if

$m (x) = μ [1 + \frac{\partial S (x + 1)}{\partial θ}] - x,$

for $x = 0, 1, 2, \dots$ .

Reliability functions in reversed time obey similar properties. We have

$λ (x) = \frac{{(θ + λ x)}^{x - 1} e^{- λ x}}{x! \sum_{t = 0}^{x} \frac{{(θ + λ t)}^{t - 1}}{t!} e^{- λ t}}$

and therefrom

$\frac{1}{λ (x + 1)} = 1 + \frac{1}{λ (x)} \frac{(x + 1) e^{λ} {(θ + λ x)}^{x - 1}}{θ (θ + λ + λ x)} .$

Differentiating the distribution function

$F (x; θ, α) = \sum_{t = 0}^{x} \frac{{(1 + α t)}^{t - 1}}{t!} θ^{t} e^{- θ (1 + α t)}, 0 ⩽ α < \frac{1}{θ}, α = \frac{λ}{θ},$

and simplifying the resulting expression, we obtain

$E (X | X < x + 1) = μ [1 + \frac{\partial \log F}{\partial θ}] .$

Thus, the reversed mean residual life has the expression

$r (x + 1) = x + 1 - μ (1 + \frac{\partial \log F}{\partial θ}) .$

When $λ = 0$ $(α = 0)$ , all the above results reduce to those of the Poisson distribution. In the Poisson process, the probability of occurrence of a single event remains constant, whereas in the generalized Poisson case the probability depends on the previous occurrence. On the basis of experimental evidence, the generalized case appears to model situations in which failures occur rarely in short periods of time where the frequency of such events are of interest over longer duration of time.

Quasi-Binomial Distribution I

Set $P_{1} (p) = {(p + q)}^{n}$ ,

$g (- x λ) = n (n - 1) \dots (n - x + 1) {(1 - p - x λ)}^{n - x}$

and

${(p + q)}^{n} = \sum_{x = 0}^{n} (\begin{matrix} n \\ x \end{matrix}) p {(p + x λ)}^{x - 1} {(1 - p - x λ)}^{x - 1} .$

When $q = 1 - p$ , we have the probability mass function

$\begin{matrix} f (x) = (\begin{matrix} m \\ x \end{matrix}) p {(p + x λ)}^{x - 1} {(1 - p - x λ)}^{n - x}, x = 0, 1, 2, \dots, n, \\ 0 ⩽ p ⩽ 1, - \frac{p}{n} < λ < \frac{1 - p}{n} \end{matrix}$

(3.71)

A random variable X having its as probability mass function as in (3.71) is said to have a quasi-binomial distribution I (QBD-I). The probabilities of X are computed from the recurrence relation

$\begin{matrix} f (x + 1) = \frac{(n - x) (p + λ x)}{(x + 1) (1 - p - x λ)} {(1 + \frac{λ}{p + x λ})}^{x} \\ {(1 - \frac{λ}{1 - p - x λ})}^{n - x - 1} f (x) . \end{matrix}$

QBD-I is unimodal and the values of the parameter λ substantially affects various characteristics of the distribution. Expressions for the mean and variance are

$μ = E (X) = p \sum_{k = 0}^{n - 1} \frac{n!}{(n - k)!} λ^{k}$

and

$\begin{matrix} μ_{2} = \sum_{k = 0}^{n - 2} (k + 1) p^{2} λ^{k} \frac{n!}{(n - k - 2)!} \\ + \frac{1}{2} p λ \sum_{k = 0}^{n - 2} (k + 1) (k + 4) λ^{4} \frac{n!}{(n - k - 2)!} + μ - μ^{2}, \end{matrix}$

respectively. When $λ > (<) 0$ , the mean of the QBD 1 is larger (smaller) than the mean of the binomial. Further distributional aspects and estimation of the parameters have been discussed by Consul and Famoye (2006).

The hazard rate function of QBD 1 is given by

$h (x) = \frac{(\begin{matrix} n \\ x \end{matrix}) p {(p + x λ)}^{x - 1} {(1 - p - x λ)}^{n - x}}{\sum_{t = x}^{n} (\begin{matrix} n \\ t \end{matrix}) p {(p + t λ)}^{t - 1} {(1 - p - t λ)}^{n - t}},$

and there exists a recurrence formula of the form

$\frac{1}{h (x)} = 1 + \frac{1}{h (x + 1)} [\frac{n - x}{x + 1} (\frac{p + x λ + λ}{1 - p - x λ}) {(1 + \frac{λ}{p + x λ})}^{x - 1} {(1 - \frac{λ}{1 - p - x λ})}^{n - x - 1}]$

(3.72)

with $h (0) = f (0) = {(1 - p)}^{n}$ . Likewise, the reversed hazard rate satisfies the recurrence formula

$\frac{1}{λ (x + 1)} = 1 + \frac{1}{λ (x)} [\frac{x + 1}{n - x} (\frac{1 - p - x λ}{p + x λ + λ}) {(1 + \frac{λ}{p + x λ})}^{1 - x} {(1 - \frac{λ}{1 + p - x λ})}^{x + 1 - n}]$

with $λ (0) = 1$ .

The probability mass functions and hazard rate functions of quasi-binomial distribution are presented in Figs 3.11 and 3.12, respectively.

Figure 3.11 Probability mass functions of quasi-binomial distribution-I.

Figure 3.12 Hazard rate functions of quasi-binomial distribution-I.

Quasi-Negative Binomial Distribution

Taking $P_{1} (a) = {(b - a)}^{- n}$ , we have

$P_{1} (a) = \sum_{x = 0}^{\infty} (\begin{matrix} n + x - 1 \\ x \end{matrix}) a {(a + x λ)}^{x - 1} {(b + x λ)}^{- n - x} .$

(3.73)

Thus,

$f (x; a, λ) = (\begin{matrix} n + x - 1 \\ x \end{matrix}) a {(a + x λ)}^{x - 1} {(b + x λ)}^{- n - x},$

$\begin{matrix} f (x) & = (\begin{matrix} n + x - 1 \\ x \end{matrix}) p {(p + x ϕ)}^{x - 1} {(q + x ϕ)}^{- n - x}, \\ x = 0, 1, 2, \dots, p + x ϕ < 1, q - p = 1, \end{matrix}$

(3.74)

defines a probability distribution, which is called the quasi-negative binomial distribution (QNBD). In (3.74), $p = \frac{a}{b - a}$ , $q = \frac{b}{b - a}$ and $ϕ = \frac{λ}{b - a}$ in the original expansion. Some special cases of (3.73) are as follows:

(i) the negative binomial when $ϕ = 0$ ;
(ii) the quasi-geometric distribution (QGD)

$f (x) = p {(p + x ϕ)}^{x - 1} {(q + x ϕ)}^{- 1 - x}, x = 0, 1, 2, \dots,$

when $n = 1$ and
(iii) the geometric distribution when $n = 1$ and $ϕ = 0$ .

The probability mass function, for successive values of x, satisfies the recursive formula

$\begin{matrix} f (x + 1) = \frac{n + x}{x + 1} (\frac{p + x ϕ + ϕ}{q + x ϕ}) {(1 + \frac{ϕ}{p + x ϕ})}^{x - 1} \\ {(1 + \frac{ϕ}{q + x ϕ})}^{- n - x} f (x), \end{matrix}$

and the moments are evaluated from the generating function

$P (t) = {(P_{1} (a))}^{- 1} {[\exp a q^{- 1} (t \frac{d P_{1} (μ - λ)}{d u})]}_{u = 0} .$

Eq. (3.74) yields the survival function as

$S (x) = \sum_{t = x}^{\infty} (\begin{matrix} n + t - 1 \\ t \end{matrix}) p {(p + t ϕ)}^{t - 1} {(q + t ϕ)}^{- n - t} .$

(3.75)

From (3.74) and (3.75), we find that the hazard rate function can be computed from the recursive formula

$\frac{1}{h (x)} = 1 + \frac{n + x}{x} \frac{{(1 + \frac{ϕ}{p + x ϕ})}^{x - 1}}{{(1 + \frac{ϕ}{1 + p + x ϕ})}^{n + x}} {(1 + \frac{1}{p + x ϕ + ϕ})}^{- 1} \frac{1}{h (x + 1)} .$

Similarly, the reversed hazard rate function satisfies the recursive formula

$\frac{1}{λ (x + 1)} = 1 + \frac{1}{λ (x)} \frac{(x + 1)}{(n + x)} (\frac{q + x ϕ + ϕ}{p + x ϕ + ϕ}) \frac{{(1 + \frac{ϕ}{q + x ϕ})}^{x + n}}{{(1 + \frac{ϕ}{p + x ϕ})}^{x - 1}} .$

When $n = 1$ , we have the results for the quasi-geometric law. Models that lead to QNBD as well as various properties of QNBD have been discussed by Bilal and Hassan (2006) and Hassan and Bilal (2008). They have found the mean and variance to be

$μ = n p_{2} F_{0} [1, n + 1, -; ϕ]$

and

$\begin{matrix} μ_{2} = & n p_{2} F_{0} [1, n + 1, -; ϕ] + p (p + 2 ϕ) n (n + 1)_{2} F_{0} [2, n + 2, -; ϕ] \\ + p ϕ^{2} n (n + 1) (n + 2)_{2} F_{0} [3, n + 3, -; ϕ] - μ^{2}, \end{matrix}$

respectively, where

${}_{2}F_{0} [1, n + 1, -; ϕ] = \sum_{j = 0}^{\infty} i^{(j)} {(n + 1)}^{(j)} \frac{ϕ^{j}}{j!} .$

Quasi-Logarithmic Series Distribution

In this case, an extension of the usual logarithmic series distribution is obtained by taking $P_{1} (a) = - \log (1 - a)$ . Then, the Abel series expansion yields

$- \log (1 - a) = \sum_{x = 1}^{\infty} \frac{{(1 - λ x)}^{x}}{x} a {(a + x λ)}^{x - 1}$

so that

$f (x) = \frac{θ a {(a + x λ)}^{x - 1}}{x} {(1 - x λ)}^{- x}, 0 < a < 1, θ^{- 1} = - \log (1 - a),$

$x = 1, 2, 3, \dots$ , is a probability mass function, representing the quasi-logarithmic distribution. The hazard rate function becomes

$h (x) = \frac{{(a + x λ)}^{x - 1}}{x {(1 - x λ)}^{x}} / \sum_{t = x}^{\infty} \frac{{(a + t λ)}^{t - 1}}{t {(1 - t λ)}^{t}} .$

Recurrence relations for $h (x)$ and $λ (x)$ can be obtained in the usual manner. For instance, a recursive relation for the hazard function is given by

$\begin{matrix} \frac{1}{h (x)} = 1 + \frac{x}{λ + 1} (\frac{a + λ + x λ}{1 - λ - x λ}) {(1 + \frac{λ}{a + x λ})}^{x - 1} \\ {(1 + \frac{λ}{a + x λ})}^{- x} \frac{1}{h (x + 1)} . \end{matrix}$

Quasi-Binomial Distribution II

The quasi-binomial distribution II (QBD II) can be derived in different ways. We obtain it here as a property of Abel polynomials and the corresponding series expansion. A sequence $〈 s_{n} (a) 〉$ is of the binomial type if

$s_{n} (a + b) = \sum_{x = 0}^{n} (\begin{matrix} n \\ x \end{matrix}) s_{x} (a) s_{n - x} (b)$

for all b. With the choice of

$s_{n} (θ) = θ {(θ + n λ)}^{n - 1} = n! α_{n} (θ, λ),$

(3.76)

we have

$\frac{d^{x}}{d θ^{x}} α_{n} (θ + b λ) = α_{n - x} (θ + b + x λ, λ)$

from (3.64).

Hence,

$g (- x λ) = α_{n - x} (b, λ) .$

Using the expansion in (3.66), we have

$α_{n} (θ + b, λ) = \sum_{x = 0}^{n} α_{x} (θ, λ) α_{n - x} (b, λ)$

so that

$\frac{1}{s_{n} (θ + b, λ)} \sum_{x = 0}^{n} (\begin{matrix} n \\ x \end{matrix}) s_{a} (θ, λ) s_{n - x} (b, λ) = 1 .$

Substituting for $s_{n}$ from (3.76), we see that

$f (x) = (\begin{matrix} n \\ x \end{matrix}) \frac{θ b}{θ + b} \frac{{(θ + x λ)}^{x - 1} {[b + (n - x) λ]}^{n - x - 1}}{{(θ + b + n λ)}^{n - 1}},$

(3.77)

$x = 0, 1, 2, \dots, n$ , $θ > 0$ , $b > 0$ and $λ > - \frac{θ}{n}$ . The distribution defined by (3.77) is called QBD II. When $λ = 0$ , it reduces to the binomial model. A reparametrization of $p = θ (θ + b + n λ)$ and $α = λ {(θ + b + n λ)}^{- 1}$ provide another QBD II version

$\begin{matrix} f (x) = (\begin{matrix} n \\ x \end{matrix}) \frac{(1 - p - n α) p}{(1 - n α)} {(p + α x)}^{x - 1} {(1 - p - α x)}^{n - x - 1}, \\ 0 < p < 1, - \frac{p}{n} < α < \frac{(1 - p)}{n}, x = 0, 1, 2, \dots, n . \end{matrix}$

(3.78)

The mean and variance of the form in (3.77) are

$μ = \frac{n θ}{θ + b}$

and

$μ_{2} = \frac{n^{2} θ b}{{(θ + b)}^{2}} - \frac{n (n - 1) θ b}{θ + b} \sum_{i = 0}^{n - 2} \frac{{(n - 2)}_{(i)} λ^{i}}{{(θ + b + n α)}^{i + 1}},$

respectively. From (3.78), we can write expressions for the hazard rate and reversed hazard rate functions, respectively, as

$h (x) = \frac{(\begin{matrix} n \\ x \end{matrix}) {(p + α x)}^{x - 1} {(1 - p - α x)}^{n - x - 1}}{\sum_{t = x}^{n} (\begin{matrix} n \\ t \end{matrix}) {(p + α t)}^{t - 1} {(1 - p - α t)}^{n - t - 1}}$

and

$λ (x) = \frac{(\begin{matrix} n \\ x \end{matrix}) {(p + α x)}^{x - 1} {(1 - p - α x)}^{n - x - 1}}{\sum_{t = 0}^{n} (\begin{matrix} n \\ t \end{matrix}) {(p + α t)}^{t - 1} {(1 - p - α t)}^{m - t - 1}} .$

They satisfy the recursive relationships

$\frac{1}{h (x)} = 1 + \frac{1}{h (x + 1)} \frac{n - x}{x + 1} (\frac{p + α + α x}{1 - p - α x}) {(1 + \frac{α}{p + α x})}^{x - 1} {(1 - \frac{α}{1 - p - α x})}^{n - x - 2},$

and

$\frac{1}{λ (x + 1)} = 1 + \frac{1}{λ (x)} \frac{x + 1}{n - x} (\frac{1 - p - α - α x}{p + α x}) {(1 + \frac{α}{p + α x})}^{- x} {(1 - \frac{α}{1 - p - α x})}^{x + 1 - n} .$

Figs 3.13 and 3.14 provide probability mass functions and hazard rate functions of quasi-binomial distribution II.

Figure 3.13 Probability mass functions of quasi-binomial distribution-II.

Figure 3.14 Hazard rate functions of quasi-binomial distribution-II.

The Abel series distributions presented above have been obtained by Nandi and Das (1994). They have proposed estimates of the parameters of the distributions by equating the proportion of zeros and ones in the sample with $f (0)$ and $f (1)$ in the case of QBD I, QNBD and the generalized Poisson, and equating the sample mean and proportion of zeros with the population mean and $f (0)$ for QBD II. With real data, it has also been shown that various distributions become useful in practice as satisfactory models. Chapter 4 of Consul and Famoye (2006) and the references therein provide more information on distributional properties, characterizations and application of the Abel series distributions in the general framework of Lagrangian expansions.

3.2.5 Lagrangian Family

Consider a function $g (z)$ which is successively differentiable satisfying $g (0) \neq 0$ and $g (1) = 1$ . Then, the numerically smallest root $z = l (t)$ of the transformation $z = t g (z)$ defines a probability generating function with Lagrangian expansion

$z = P (t) = \sum_{x = 1}^{\infty} \frac{t^{x}}{x!} {[\frac{d^{x - 1}}{d t^{x - 1}} g {(z)}^{x}]}_{z = 0}$

(3.79)

provided that the derivative inside the square brackets is non-negative for all $x ⩾ 0$ . The corresponding probability mass function is

$f (x) = \frac{1}{x!} {[\frac{d^{x - 1}}{d z^{x - 1}} {(g (z))}^{x}]}_{z = 0}$

(3.80)

which constitutes the basic Lagrangian distributions. Detailed study of the properties, applications and estimation problems connected with this family is available in works of Mohanty (1966), Consul and Shenton (1972, 1973, 1975), and the books by Consul (1989) and Consul and Famoye (2006). We, therefore, represent here only these results that are most pertinent to reliability modelling and analysis. A large number of distributions belong to the Lagrangian family, of which we discuss only those for which the support is appropriate and where some useful and simple results on various reliability functions can be presented.

Chronologically, the earliest distribution is that of Borel specified by

$f (x) = \frac{{(x λ)}^{x - 1}}{x!} e^{- x λ}, x = 1, 2, \dots .$

(3.81)

By transferring the random variable X in (3.81) to $Y = X - 1$ , we have the generalized Poisson distribution presented earlier in (3.67). Since details of the Borel distribution can be evaluated in terms of these of the generalized Poisson, we abstain from further discussions here.

Haight Distribution

The Haight (1961) distribution has probability mass function

$f (x) = \frac{1}{2 s - 1} (\begin{matrix} 2 x - 1 \\ x \end{matrix}) q^{x} p^{x - 1}, x = 1, 2, \dots, 0 < q = 1 - p < 1 .$

(3.82)

From the probability generating function

$P (t) = q {(1 - p t)}^{- 1},$

the basic characteristics such as the mean, variance and higher moments can all be easily derived. The hazard rate function is

$h (x) = \frac{(\begin{matrix} 2 x - 1 \\ x \end{matrix}) q^{x} p^{x - 1}}{(2 x - 1) \sum_{t = x}^{\infty} \frac{1}{2 t - 1} (\begin{matrix} 2 t - 1 \\ t \end{matrix}) q^{t} p^{t - 1}}, x = 1, 2, \dots .$

As usual, $h (x)$ can be evaluated recursively as

$\frac{1}{h (a)} = 1 + \frac{(4 x - 2) p q}{(x + 1) h (x + 1)}, x = 1, 2, \dots,$

(3.83)

with $h (1) = q$ . Differentiating the survival function

$S (x + 1) = \sum_{t = 1}^{x + 1} \frac{1}{2 t - 1} (\begin{matrix} 2 t - 1 \\ t \end{matrix}) q^{t} p^{t - 1}$

with respect to p and rearranging the terms, we get

$E (X | X > x) = \frac{p q}{q - p} [\frac{1}{p} + \frac{\partial \log S (x + 1)}{\partial p}] .$

(3.84)

Accordingly, the mean residual life function is

$m (x) = \frac{p q}{q - p} [\frac{1}{p} + \frac{\partial \log S (x + 1)}{\partial}] - x, x = 1, 2, \dots .$

(3.85)

Differentiating $S (x + 1)$ twice with respect to p, we get

$\frac{\partial^{2} S (x + 1)}{\partial p^{2}} = \sum_{t = x + 1}^{\infty} {\frac{1}{2 t - 1} (\begin{matrix} 2 t - 1 \\ t \end{matrix}) [\frac{(t - 1) (t - 2) q^{t} p^{t - 1}}{p^{2}}] - t (t - 1) [\frac{2}{p q} - \frac{1}{q^{2}}] p^{t - 1} q^{t}}$

which leads to

$\frac{1}{S (x + 1)} \frac{\partial^{2} S (x + 1)}{\partial p^{2}} = \frac{2}{p^{2}} - \frac{2 q^{2} + {(q - p)}^{2}}{p^{2} q^{2}} E (X | X > x) + \frac{{(q - p)}^{2}}{p^{2} q^{2}} E (X^{2} | X > x)$

and to

$\begin{matrix} E (X^{2} | X > x) \\ = \frac{p^{2} q^{2}}{{(q - p)}^{2}} [\frac{1}{S (x + 1)} \frac{\partial^{2} S (x + 1)}{\partial p^{2}} + \frac{2 q^{2} + {(q - p)}^{2}}{p^{2} q^{2}} E (X | X > x)] . \end{matrix}$

(3.86)

Now, the variance residual life can be computed from (3.84) and (3.86), as

$σ^{2} (x) = E (X^{2} | X > x) - E^{2} (X | X > x) .$

With the aid of the distribution function

$F (x) = \sum_{t = 1}^{x} \frac{1}{2 t - 1} (\begin{matrix} 2 t - 1 \\ t \end{matrix}) q^{t} p^{t - 1},$

some similar manipulations yield reliability functions in reversed time. The expressions are

$\begin{matrix} \frac{1}{λ (x + 1)} = 1 + \frac{x + 1}{2 (2 x - 1) x q p} \frac{1}{λ (x)}, \\ r (x) = x - \frac{p q}{q - p} [\frac{1}{p} + \frac{\partial \log F (x)}{\partial p}], x = 1, 2, \dots \end{matrix}$

and

$v (x) = E (X^{2} | X ⩽ x) - E^{2} (X | X ⩽ x),$

where

$E (X | X ⩽ x) = \frac{p q}{q - p} [\frac{1}{p} + \frac{\partial \log F (x)}{\partial}]$

and

$E (X^{2} | X ⩽ x) = \frac{p^{2} q^{2}}{{(q - p)}^{2}} [\frac{\partial^{2} F (x)}{\partial p^{2}} - \frac{2}{p^{2}} + \frac{2 q^{2} + {(q - p)}^{2}}{p^{2} q^{2}} E (X | X ⩽ x)] .$

Geeta Distribution

A discrete random variable X is said to have Geeta distribution with parameters θ and α if

$\begin{matrix} f (x) = \frac{1}{α x - 1} (\begin{matrix} α x - 1 \\ x \end{matrix}) θ^{x - 1} {(1 - θ)}^{α x - x}, \\ x = 1, 2, \dots, 0 < θ < 1, 1 < α < θ^{- 1} . \end{matrix}$

The distribution is L-shaped and unimodal with

$μ = (1 - \hat{θ}) {(1 - α θ)}^{- 1}$

and

$μ_{2} = (α - 1) θ (1 - θ) {(1 - α θ)}^{- 3} .$

A recurrence relation is satisfied by the central moments of the form

$μ_{r + 1} = θ μ \frac{d μ_{r}}{d θ} + r μ_{2} μ_{r - 1}, r = 1, 2, \dots .$

Estimation of the parameters can be done by the method of moments or by the maximum likelihood method. Moment estimates are

$\hat{μ} = \bar{x} and \hat{α} = \frac{S^{2} - \bar{x} (\bar{x} - 1)}{S^{2} - {\bar{x}}^{2} (\bar{x} - 1)}$

with $\bar{x}$ and $s^{2}$ being the sample mean and variance. The maximum likelihood estimates are

${\hat{μ}}_{M} = \bar{x}$

and $\hat{α}$ is iteratively obtained by solving the equation

$\frac{(α - 1) \bar{x}}{α \bar{x} - 1} = \exp [- \frac{1}{n \bar{x}} \sum_{x = 2}^{k} \sum_{i = 2}^{x} \frac{x f_{x}}{x - i}]$

with $n = \sum_{x = 1}^{k} f_{x}$ as the sample size and $f_{x}$ is the frequency of $x = 1, 2, \dots, k$ . For details, see Consul (1990). As the Geeta model is a member of the MPSD, the reliability properties can be readily obtained from those of the MPSD discussed in Section 3.2.2.

Generalized Geometric Distribution II

Famoye (1997) obtained the model

$\begin{matrix} f (x) = \frac{1}{x} (\begin{matrix} m x \\ x - 1 \end{matrix}) θ^{x - 1} {(1 - θ)}^{m x - x + 1}, \\ x = 1, 2, \dots, 0 < θ < 1, 1 ⩽ m ⩽ θ^{- 1}, \end{matrix}$

(3.87)

which is a generalization of the geometric distribution, as (3.87) becomes the geometric law when $m = 1$ . It is obtained from the Lagrangian expansion of the generating function of the geometric distribution. The mean and variance are

$μ = {(1 - θ m)}^{- 1} and μ_{2} = m θ (1 - θ) {(1 - θ m)}^{- 3} .$

Other distributional properties are derived from the central moments that satisfy the recurrence formula

$μ_{r + 1} = θ (1 - θ) {(1 - θ m)}^{- 1} \frac{d μ_{r}}{d θ} + r μ_{r - 1} μ_{2}, r = 1, 2, \dots,$

using the value of $μ_{2}$ given above. For any given set of values of x, the hazard rate function satisfies from the recursive relationship

$\frac{1}{h (x)} = 1 + \frac{θ {(1 - θ)}^{m - 1}}{(x + 1) h (x + 1)} \frac{Γ (m x + m + 1) Γ (m x - x + 2)}{Γ (n x + 1) Γ (m x + m - x + 1)}, x = 1, 2, \dots,$

with $h (1) = {(1 - θ)}^{m}$ . Working as in the earlier cases, the mean residual life satisfies the relationship

$m (x) = μ [\frac{\partial \log S (x + 1)}{\partial θ} + \frac{1}{θ (1 - θ)}] - x .$

Also,

$\begin{matrix} \frac{1}{S (x + 1)} \frac{\partial^{2} S (x + 1)}{\partial θ^{2}} = \frac{E (X - 1) (X - 2)}{θ^{2}} \\ - 2 \frac{E (X - 1) (m X - X + 1)}{θ (1 - θ)} + \frac{E (m X - x) (m X - x + 1)}{{(1 - θ)}^{2}} . \end{matrix}$

Simplifying, we obtain

$\begin{matrix} E (X^{2} | X > x) = \frac{1}{S (x + 1)} {(1 - θ)}^{- 2} \\ \times [\frac{\partial^{2} S (x + 1)}{\partial θ^{2}} - (3 θ^{2} + 2 θ m - 8 θ^{2} + 2 θ - 3) E (X | X > x) - 2 (1 - θ)] . \end{matrix}$

This, along with

$E (X | X > x) = μ [\frac{\partial \log S}{\partial θ} + \frac{1}{θ (1 - θ)}],$

gives an expression for the variance residual life. The reversed hazard rate is derived from the recurrence relation

$\frac{1}{λ (x + 1)} = 1 + λ (x) \frac{(x + 1) Γ (m x + m - x + 1) Γ (m x + 1)}{Γ (m x + m + 1) Γ (m x + 2 - x) θ {(1 - θ)}^{m - 1}} .$

By adopting calculations similar to those of the mean residual life, we have the reversed mean residual life as

$r (x) = x - μ [\frac{\partial \log F (x)}{\partial x} + \frac{1}{θ (1 - θ)}] .$

Finally, the estimates of the parameters can be obtained by the moment method, or the maximum likelihood method, or by equating the sample mean and frequency at $x = 1$ with the mean and $n f (1)$ , where n is the sample size.

There are several other families of discrete distributions discussed in the literature like the generalized hypergeometric family (Kemp, 1968a, 1968b), Gould series distributions (Charalambides, 1986), factorial series distributions (Berg, 1974) and their further extensions in various directions. Many of the members of these families overlap with those discussed here already. Furthermore, the reliability functions in other cases are of complicated forms, but can be obtained by some of the methods already detailed here.

3.3 Discrete Analogues of Continuous Distributions

Several discrete distributions that are members of various families have been presented in the preceding section. They can serve as black-box models in the sense that their derivations were not based on the physical properties of the failure mechanism resulting in lifetimes. In a slightly different scenario, in this section, we consider various forms of discrete models that are obtained from popular continuous life distributions. This can be accomplished in different ways. Let Y be a continuous random variable representing lifetime with survival function $\bar{F} (x) = P (Y ⩾ x)$ . If times are grouped into unit intervals, the discrete variable $X = [Y]$ , the largest integer less than or equal to Y, has its probability mass function as

$\begin{matrix} f (x) & = P (X = x) = P (x ⩽ Y < x + 1) \\ = \bar{F} (x) - \bar{F} (x + 1), x = 0, 1, 2, \dots \end{matrix}$

(3.88)

The survival function $S (x)$ of X and $\bar{F} (x)$ of Y will be the same at all integer points. As a simple example, when Y is exponential with $\bar{F} (x) = \exp [- λ x]$ , we have

$\begin{matrix} f (x) & = \exp (- λ x) - \exp (- λ (x + 1)) \\ = q^{x} - q^{x + 1} = p q^{x}, p = 1 - q, q = e^{- λ}, \end{matrix}$

which represents the geometric distribution.

Discrete Weibull Distribution Taking $\bar{F} (x) = \exp {[- (\frac{x}{α})]}^{β}$ and applying (3.88), we have

$S (x) = q^{x^{β}}, q = \exp [- \frac{1}{α^{β}}], x = 0, 1, 2, \dots, 0 < q < 1 .$

(3.89)

This will be referred to as the discrete Weibull distribution I, first proposed by Nakagawa and Osaki (1975). For the model in (3.89), we readily find

$f (x) = q^{x^{β}} - q^{{(x + 1)}^{β}}$

and

$h (x) = 1 - q^{{(x + 1)}^{β} - x^{β}}, x = 0, 1, 2, \dots .$

The reversed hazard rate is

$λ (x) = \frac{q^{x^{β}} - q^{{(x + 1)}^{β}}}{1 - q^{{(x + 1)}^{β}}} = \frac{1 - q^{x^{β}}}{1 - q^{{(x + 1)}^{β}}} - 1 .$

Two special cases of interest are the geometric when $β = 1$ and the discrete Rayleigh when $β = 2$ . There are no closed-form expressions for the mean, variance, and higher-order moments. Alikhan et al. (1989) noted that

$P (X = 1) = 1 - q and P (X = 2) = q - q^{2 β},$

and then proposed estimating the parameters q and β by equating the theoretical and estimated probabilities from the data of the observations $X = 1$ and $X = 2$ . Thus, these estimates are given by

$\hat{q} = S_{n} (1) and \hat{β} = \frac{1}{\log 2} \frac{\log S_{n} (2)}{\log S_{n} (1)},$

where $S_{n} (x)$ is the empirical survival function. One can use two other values of X instead of 1 and 2, depending on the number of observations in the sample that are equal to the chosen integer values. The larger the frequencies of the numbers chosen, the better will be the estimates. Bracquemond and Gaudoin (2003) proposed the method of maximum likelihood for the estimation of q and β and observed that the estimates are biased, but of good quality for large values of q. One can use the distribution as an approximation to the continuous counterpart. Further, a shock model interpretation is also available, with X denoting the number of shocks survived by a device and q is the probability of surviving more than one shock. It can also be noted that (3.89) satisfies the extension of the lack of memory property that

$P (X ⩾ {(x^{β} + y^{β})}^{\frac{1}{β}} | X ⩾ x) = P (X ⩾ y) .$

Discrete Half-Logistic Distribution The half-logistic model in the continuous case has survival function (Balakrishnan, 1985)

$\bar{F} (x) = \frac{2}{1 + \exp (\frac{x}{σ})}, x > 0, σ > 0 .$

Accordingly, the discrete half-logistic distribution is specified by the survival function

$S (x) = \frac{2}{1 + \exp (\frac{x}{σ})}, x = 0, 1, 2, \dots,$

(3.90)

or by the probability mass function

$f (x) = \frac{2 (e^{\frac{1}{σ}} - 1) e^{\frac{x}{σ}}}{(1 + e^{\frac{x}{σ}}) (e^{\frac{x + 1}{σ}} - 1)}, x = 0, 1, 2, \dots .$

The mean and variance of the distribution in (3.90) are

$μ = 2 \sum_{n = 1}^{\infty} {(1 + e^{\frac{n}{σ}})}^{- 1} and μ_{2} = \frac{2 (e^{\frac{1}{σ}} - 1)}{{(1 + e^{\frac{1}{σ}})}^{2}} + 2 \sum_{n = 1}^{\infty} \frac{2 n + 1}{1 + e^{\frac{n + 1}{σ}}} - μ^{2} .$

The hazard rate and reversed hazard rate functions of the distribution in (3.90) can be derived as follows:

$\begin{matrix} h (x) = \frac{(e^{\frac{1}{σ}} - 1) e^{\frac{x}{σ}}}{1 + e^{\frac{x + 1}{σ}}}, \\ λ (x) = \frac{2 (e^{\frac{1}{σ}} - 1) e^{\frac{x}{σ}}}{(1 + e^{\frac{x}{σ}}) (e^{\frac{x + 1}{σ}} - 1)}, x = 0, 1, \dots . \end{matrix}$

There is only one parameter (σ) involved in (3.90) which can be estimated by the method of maximum likelihood.

Geometric Weibull Distribution This is a three-parameter model based on the continuous exponential Weibull distribution

$\bar{F} (x) = \exp [- λ x - {λ {(x - a)}^{+}}^{β}]$

introduced by Zacks (1984), where ${(x - a)}^{+} = \max (0, x - a)$ . Of the three parameters, $λ > 0$ represents scale, $β > 0$ the shape and $a > 0$ is the change-point. The model was proposed in a practical situation when the hazard rate is constant until time a and thereafter it is increasing like a Weibull hazard rate. In the discrete analogue also, the same physical meaning prevails with

$S (x) = \exp [- λ x - {λ {(x - a)}^{+}}^{β}]$

(3.91)

and

$f (x) = \exp [- λ x - {λ {(x - a)}^{+}}^{β}] - \exp [- λ (x + 1) - {λ {(x + 1 - a)}^{+}}^{β}] .$

We then have

$h (x) = 1 - \exp [- λ + λ^{β} ({{(x - a)}^{+}}^{β} - {{(x - a + 1)}^{+}}^{β})]$

as the hazard rate.

Telescopic Distributions A more general family of distributions result if we consider the discrete analogue of Y with survival function

$\bar{F} (x) = \exp [- α k_{θ} (x)], x > 0,$

when $k_{θ} (x)$ is a strictly increasing function with $k_{θ} (0) = 0$ and $k_{θ} (x) \to \infty$ and $x \to \infty$ . It includes the exponential, Rayleigh, Weibull, linear-exponential, and Gompertz distributions as special cases. Corresponding to $\bar{F} (x)$ above, we have the distribution of X as

$f (x) = q^{k_{θ} (x)} - q^{k_{θ} (x + 1)}, x = 0, 1, 2, \dots, 0 < q < 1, q = e^{- α} .$

(3.92)

For the model in (3.92), it follows (Roknabadi et al., 2009) that

$\begin{matrix} S (x) & = q^{k_{θ} (x)}, \\ h (x) & = 1 - q^{k_{θ} (x + 1) - k_{θ} (x)}, \\ λ (x) & = q^{k_{θ} (x) - k_{θ} (x + 1)} / 1 - q^{k_{θ} (x + 1)}, \\ m (x) & = \sum_{i = x}^{\infty} q^{k_{θ} (i + 1) - k_{θ} (x)}, \\ r (x) & = \sum_{i = 1}^{x} \frac{1 - q^{k_{θ} (i)}}{1 - q^{k_{θ} (x + 1)}}, \end{matrix}$

and the alternative hazard rate is

$h_{1} (x) = (k_{θ} (x - 1) - k_{θ} (x)) \log q .$

Discrete Inverse Weibull Distribution Jazi et al. (2010) considered the inverse Weibull law

$\bar{F} (x) = 1 - \exp [- a x^{- b}], x > 0,$

to derive the discrete inverse Weibull distribution with survival function

$S (x) = 1 - q^{x^{- b}}, x = 1, 2, \dots,$

and probability mass function

$f (x) = {\begin{matrix} q, & x = 1, \\ q^{x^{- b}} - q^{{(x - 1)}^{- b}}, & x = 2, 3, \dots, 0 < q < 1, b > 0 . \end{matrix}$

(3.93)

The distribution in (3.93) has decreasing $f (x)$ for $q ⩾ \frac{\log 2}{2^{- b} - 1}$ and unimodal with mode at 2, otherwise. When β becomes small, the tail becomes longer. The mean and variance do not have closed-form expressions. The hazard rate function and the alternative hazard rate function are, respectively,

$h (x) = \frac{q^{x^{- b}} - q^{{(x - 1)}^{- b}}}{1 - q^{{(x - 1)}^{- b}}}$

and

$h_{1} (x) = \log (\frac{1 - q^{{(x - 1)}^{- b}}}{1 - q^{x^{- b}}}), x = 1, 2, \dots$

In order to estimate the parameters Jazi et al. (2010) considered the method of proportions, method of moments, a heuristic algorithm and an inverse Weibull probability plots. The discrete version of the probability plot is given by

$y = - β x + \log (- \log q),$

which is a straight line and accordingly β and q can be estimated by a simple linear regression model fitted to the data.

Discrete Generalized Exponential Distribution Recalling that the generalized exponential distribution has its survival function as

$\bar{F} (x) = 1 - {(1 - e^{- λ x})}^{α}, x > 0,$

Nekoukhou et al. (2013) derived its discretized version as

$\begin{matrix} f (x) & = {(1 - p^{x + 1})}^{α} - {(1 - p^{x})}^{α}, α > 0, 0 < p < 1 \\ = \sum_{j = 1}^{\infty} {(- 1)}^{j + 1} (\begin{matrix} α \\ j \end{matrix}) p^{j x} (1 - p^{j}) \end{matrix}$

(3.94)

with support $(0, 1, 2, \dots)$ . For integer values of α, the sum in (3.94) terminates at α. The distribution is unimodal for all α and p. There is a closed form expression for the moments given by

$E (X^{r}) = \sum_{j = 1}^{\infty} {(- 1)}^{j + 1} (\begin{matrix} α \\ j \end{matrix}) \sum_{t = 1}^{r} S (r, t) t! {(\frac{p^{j}}{1 - p^{j}})}^{t}, r = 1, 2, \dots,$

where

$S (r, t) = \frac{1}{t!} \sum_{j = 0}^{t - 1} {(- 1)}^{j} (\begin{matrix} t \\ j \end{matrix}) {(t - j)}^{r}$

is the Stirling number of the second kind. In particular, we have

$μ = \sum_{j = 1}^{\infty} {(- 1)}^{j + 1} (\begin{matrix} α \\ j \end{matrix}) \frac{p^{j}}{1 - p^{j}}$

and

$μ_{2} = \sum_{j = 1}^{\infty} {(- 1)}^{j + 1} (\begin{matrix} α \\ j \end{matrix}) \frac{p^{j} (1 + p^{j})}{{(1 - p^{j})}^{2}} - μ^{2} .$

The parameters are estimated by the method of maximum likelihood by solving the equations

$\sum_{j = 1}^{n} \frac{{(1 - p^{x_{j} + 1})}^{α} \log (1 - p^{x_{j} + 1}) - {(1 - p^{x_{j}})}^{α} \log (1 - p^{x_{j}})}{{(1 - p^{x_{j} + 1})}^{α} - {(1 - p^{x_{j}})}^{α}} = 0$

and

$α \sum_{j = 1}^{n} {(1 - p^{x_{j}})}^{α - 1} \frac{p^{x_{j - 1}} x_{j} - {(1 - p^{x_{j} + 1})}^{α} p^{x_{j}}}{{(1 - p^{x_{j} + 1})}^{α} - {(1 - p^{x_{j}})}^{α}} (x_{j} + 1) = 0 .$

Using the survival function

$S (x) = 1 - {(1 - p^{x})}^{α},$

we can write the hazard rate and reversed hazard rate functions as

$h (x) = \frac{{(1 - p^{x + 1})}^{α} - {(1 - p^{x})}^{α}}{1 - {(1 - p^{x + 1})}^{α}}$

and

$λ (x) = \frac{{(1 - p^{x + 1})}^{α} - {(1 - p^{x})}^{α}}{{(1 - p^{x + 1})}^{α}} = 1 - {(\frac{1 - p^{x}}{1 - p^{x + 1}})}^{α}, x = 0, 1, \dots .$

The alternative hazard rate becomes

$h_{1} (x) = \log (\frac{1 - {(1 - p^{x})}^{α}}{1 - {(1 - p^{x + 1})}^{α}}), x = 0, 1, \dots .$

Discrete Gamma Distribution Chakraborty and Chakravarty (2012) considered the well-known two-parameter gamma distribution with survival function

$\bar{F} (x) = \frac{1}{θ^{k} Γ (k)} \int_{x}^{\infty} u^{k - 1} e^{- \frac{u}{θ}} d u = \frac{Γ (k, \frac{x}{θ})}{Γ (k)},$

where

$Γ (k, \frac{x}{θ}) = \int_{\frac{x}{θ}}^{\infty} \frac{1}{θ^{k}} u^{k - 1} e^{- u} d u$

is the upper incomplete gamma function, to derive the discrete gamma model with probability mass function

$f (x) = \frac{Γ (k, \frac{x}{θ}) - Γ (k, \frac{x + 1}{θ})}{Γ (k)}, k, θ > 0, x = 0, 1, \dots .$

(3.95)

As particular cases, we have the one-parameter discrete gamma when $θ = 1$ and the geometric when $θ = 1$ , $k = 1$ . The distributional characteristics can be ascertained from the moments satisfying the recursive formula

$μ_{r}^{'} = \sum_{j = 1}^{r - 1} {(- 1)}^{j + 1} (\begin{matrix} r - 1 \\ j \end{matrix}) g^{r - j} (k, \frac{x}{θ}) + g^{r - 1} (k, \frac{x}{θ}) - μ_{r - 1}^{'}$

where

$g^{r} (k, \frac{x}{θ}) = \sum_{x = 1}^{\infty} x^{r} Γ (k, \frac{x}{θ}) / Γ (k) .$

In particular, we have

$μ = g^{0} (k, \frac{x}{θ})$

and

$μ_{2} = 2 g^{1} (k, \frac{x}{θ}) - g^{0} (k, \frac{x}{θ}) - {[g^{0} (k, \frac{x}{θ})]}^{2} .$

From a random sample of failure times $(x_{1}, \dots, x_{n})$ , estimates of θ and k can be obtained by solving the likelihood equations

$\sum_{j = 1}^{n} \frac{\int_{x_{j} / θ}^{x_{j + 1 / θ}} u^{k - 1} e^{- u} \log u d u}{Γ (k, \frac{x_{j}}{θ}) - Γ (k, \frac{x_{j + 1}}{θ})} = n ψ (k)$

and

$\sum_{j = 1}^{n} \frac{Γ (k + 1, \frac{x_{j}}{θ}) - Γ (k + 1, \frac{x_{j + 1}}{θ})}{Γ (k, \frac{x_{j}}{θ}) - Γ (k, \frac{x_{j} + 1}{θ})} = n k$

with $ψ (k) = \frac{d \log Γ (k)}{d k}$ , being the digamma function.

The reliability functions of interest are the survival function

$S (x) = \frac{Γ (k, \frac{x}{θ})}{Γ (k)},$

the hazard rate function

$h (x) = 1 - \frac{Γ (k, \frac{x + 1}{θ})}{Γ (k, \frac{x}{θ})},$

the alternative hazard rate function

$h_{1} (x) = \log \frac{Γ (k, \frac{x}{θ})}{Γ (k, \frac{x + 1}{θ})},$

and the reversed hazard rate function

$λ (x) = \frac{Γ (k, \frac{x}{θ}) - Γ (k, \frac{x + 1}{θ})}{Γ (k) - Γ (k, \frac{x}{θ})} .$

The hazard rate functions of discrete gamma distribution are presented in Fig. 3.15.

Figure 3.15 Hazard rate functions of discrete gamma distribution.

Discrete Lindley Distribution The Lindley distribution, in the continuous case, has its survival function as

$\bar{F} (x) = \frac{e^{- θ x} (1 + θ + θ x)}{1 + θ}, x > 0, θ > 0 .$

Accordingly, the discrete version takes on the form

$f (x) = \frac{λ^{x}}{1 - \log λ} [λ \log λ + (1 - λ) (1 - \log λ^{x + 1})], x = 0, 1, 2, \dots$

(3.96)

and $λ = e^{- θ}$ . From the probability generating function

$P (t) = \frac{(λ - 1) (t λ - 1) + (1 - 2 λ + t λ^{2}) \log λ}{{(1 - t λ)}^{2} (1 - \log λ)},$

it is easy to obtain

$μ = \frac{λ [(λ - 2) \log λ + 1 - λ]}{{(1 - λ)}^{2} (1 - \log λ)}$

and

$μ_{2} = \frac{λ [{(1 - λ)}^{2} - (3 - 4 λ + λ^{2}) \log λ + (2 - 3 λ) \log^{2} λ]}{{(1 - λ)}^{4} {(1 - \log λ)}^{2}} .$

We also have

$S (x) = \frac{λ^{x} (1 - (1 + x) \log λ)}{1 - \log λ},$

(3.97)

$F (x) = \frac{1 - λ^{x + 1} + ((2 + x) λ^{x + 1} - 1) \log λ}{1 - \log λ},$

(3.98)

so that

$\begin{matrix} h (x) & = \frac{λ \log λ + (λ - 1) (\log λ^{x + 1} - 1)}{1 - (1 + x) \log λ}, \\ λ (x) & = \frac{λ^{x} [(λ \log λ + (λ - 1) (\log λ^{x + 1} - 1))]}{1 - λ^{x + 1} [(2 + x) λ^{x + 1} - 1] \log λ}, \end{matrix}$

(3.99)

and

$h_{1} (x) = \log \frac{(2 + x) \log λ - 1}{λ (3 + x) \log λ - 1}, x = 0, 1, \dots .$

(3.100)

Fig. 3.16 presents hazard rate functions of discrete Lindley distribution.

Figure 3.16 Hazard rate functions of discrete Lindley distribution.

The maximum likelihood estimate of λ can be obtained as the solution of the equation

$n \bar{x} + \frac{n}{1 - \log λ} + \sum_{j = 1}^{n} \frac{(λ + x_{j}) λ \log λ - (x_{j} + 1) (1 - λ)}{λ \log λ + (1 - λ) (1 - (x_{j} + 1) \log λ)} = 0,$

where $\bar{x}$ is the mean of a random sample of size n from the discrete Lindley distribution.

Besides the above results on the discrete Lindley distribution described in Gomez-Deniz and Calderin-Ojeda (2011), there are some other results on the reliability characteristics. We note that

$\sum_{t = x}^{\infty} S (t) = \frac{λ^{x}}{{(1 - λ)}^{2}} [(1 - λ) (1 - x \log λ) - \log λ] .$

Hence, the mean residual life function can be expressed in a simple form as

$m (x) = \frac{1 - \log λ}{{(1 - λ)}^{2}} [\frac{(1 - λ) (1 - \log λ - x \log λ) - \log λ}{1 - \log λ - λ \log λ - x λ \log λ}]$

(3.101)

which is a homographic function of x. Moreover, as a corollary to Eq. (3.9) characterizing the Ord family, we have the following theorem.

Theorem 3.10

A random variable X, taking values in $(0, 1, \dots)$ , will be distributed as discrete Lindley distribution if and only if

$m^{⁎} (x) = μ (λ \log λ + (1 - λ) (1 - \log λ) - x (1 - λ) \log λ) h (x + 1)$

for all x.

Unlike in the case of most discrete models, the variance residual life has also a closed-form expression given by

$\begin{matrix} σ^{2} (x) = \frac{2 (1 - \log λ)}{{(1 - λ)}^{4}} \frac{(1 - λ) (1 - λ - 2 \log λ) - \log λ - x (1 - λ) \log λ}{1 - (x + 2) \log λ} \\ - m (x) (m (x) - 1) \end{matrix}$

(3.102)

with $m (x)$ as given in (3.101).

The reversed mean residual life has the expression

$r (x) = x - \frac{λ (1 - λ^{x})}{1 - λ} - \frac{\log λ}{{(1 - λ)}^{2}} [(1 - λ^{x}) λ (2 - λ) + x (1 - λ) λ^{x}] .$

A characterization of the distribution by relationship between $r (x)$ and $λ (x)$ can be derived along the lines of Theorem 3.3.

Theorem 3.11

The random variable X, taking values in $(0, 1, \dots)$ , will have discrete Lindley distribution if and only if

$r (x) = μ - [λ \log λ + (1 - λ) (1 - \log λ) - x (1 - λ) \log λ] λ (x)$

for all $x = 0, 1, 2, \dots$ .

Further, the odds functions are

$w (x) = \frac{λ^{x + 1} (1 - (2 + x) \log λ)}{1 - λ^{x + 1} + [(2 + x) λ^{x + 1} - 1]} \log λ$

and $\bar{ω} (x) = {[ω (x)]}^{- 1}$ .

A second approach to construct discrete analogues of continuous distributions is to use the formula

$P (X = x) = \frac{g (x)}{\sum_{x = 0}^{\infty} g (t)},$

(3.103)

where $g (x)$ is the probability density function of a continuous random variable. When $g (x) = C e^{- \frac{x}{θ}} x^{α}$ , for example, (3.103) results in

$f (x) = \frac{q^{x} x^{α}}{\sum_{t = 1}^{\infty} q_{t}^{t^{α}}}, x = 1, 2, \dots, q = e^{- \frac{1}{θ}},$

the Good distribution in (3.57). For a detail study of this distribution, we refer to Kulasekera and Tonkyn (1992). Obviously, the geometric distribution can also be derived from the exponential density function in this manner. See Sato et al. (1999) for an application of such a model, and also the convolution of their discrete exponential in the form of a negative binomial law. Lai and Wang (1995) obtained an analogue of the power distribution with density function

$g (x) = \frac{α x^{α - 1}}{β^{α}}, 0 ⩽ x ⩽ β, β > 0,$

in the form

$f (x) = \frac{x^{α}}{\sum_{x = 0}^{b} x^{α}}, x = 0, 1, \dots, b,$

with hazard rate

$h (x) = \frac{x^{α}}{\sum_{t = x}^{b} t^{α}}$

and reversed hazard rte

$λ (x) = \frac{x^{α}}{\sum_{t = 0}^{x} t^{α}} .$

Lai (2013) has also suggested construction of new models by discretizing the continuous hazard rate function or the alternative hazard rate function. Further discussion on the above methods and models derived therefrom can be seen in Kemp (2008) and Krishna and Pundir (2009).

3.4 Some Other Models

In this section, we present some other models arising from a variety of considerations.

Discrete Weibull Distribution II Stein and Dattero (1984) introduced a second form of Weibull distribution by specifying its hazard rate function as

$h (x) = {\begin{matrix} {(\frac{x}{m})}^{β - 1}, & x = 1, 2, \dots, m, \\ 0, & x = 0 or x > m . \end{matrix}$

The probability mass function and survival function are derived from $h (x)$ using the formulas in Chapter 2 to be

$f (x) = {(\frac{x}{m})}^{β - 1} \prod_{i = 1}^{x - 1} [1 - {(\frac{i}{m})}^{β - 1}], x = 1, 2, \dots, m,$

and

$S (x) = \prod_{i = 1}^{\min (x, m)} [1 - {(\frac{x}{m})}^{α - 1}] .$

(3.104)

By comparison, the discrete Weibull I has survival function of the same form as the continuous counterpart, while discrete Weibull II has the same form for the hazard rate function. Stein and Dattero (1984) have pointed out that a series system with two components that are independent and identically distributed have a distribution of the form in (3.104).

Discrete Weibull Distribution III A third type of Weibull distribution proposed by Padgett and Spurrier (1985) is specified by

$\begin{matrix} f (x) = (1 - \exp [- c {(x + 1)}^{α}]) \exp [- c \sum_{j = 0}^{x} j^{α}], \\ x = 0, 1, 2, \dots, c > 0, - \infty < α < \infty, \end{matrix}$

(3.105)

with

$S (x) = \exp [- c \sum_{j = 0}^{x} j^{α}] .$

Accordingly, the hazard function is

$h (x) = 1 - \exp [- c x^{α}] .$

For $α = 0$ , (3.105) reduces to the geometric case. An important advantage of this model is that the hazard rate is flexible, in the sense that it can assume different shapes. In view of the complex nature of the probability mass function, the maximum likelihood estimates becomes computationally tedious and intensive. Bracquemond and Gaudoin (2003) have pointed out that the quality of the maximum likelihood estimate of c, as regards bias, increases with c, while the bias for α is small except for very small samples.

“S” Distribution Bracquemond and Gaudoin (2003) derived the “S” distribution based on some physical characteristics of the failure pattern through a shock-model interpretation. They have assumed a system in which on each demand a shock can occur with probability p and not occur with probability $(1 - p)$ . The number of shocks $N_{x}$ at the xth demand is such that the hazard rate is an increasing function of $N_{x}$ satisfying

$h_{N} (x) = 1 - π^{N_{x}}, 0 < π ⩽ 1 .$

Then, the survival function, given $N_{x}$ , is

$\begin{matrix} S (x) & = P (X > x) = E ((P (X > x) | N_{x}, x ⩾ 1) \\ = E [π^{\sum_{i = 1}^{\infty} N_{i}}] . \end{matrix}$

Further, if $U_{x} = N_{x} - N_{x - 1}$ , the $U_{x}$ 's are independent Bernoulli (p) random variables, so that

$S (x) = \prod_{i = 1}^{x} E (π^{(x - i + 1) U_{i}}) .$

This leads to the “S” distribution specified by the probability mass function

$f (x) = p (1 - π^{x}) \prod_{i = 1}^{x - 1} (1 - p + p π^{i})$

and the survival function

$S (x) = \prod_{i = 1}^{x - 1} (1 - p + p π^{i}) .$

The interpretation given to the parameters is that p is the probability of a shock and π is the probability of surviving such a shock. We then have

$h (x) = p (1 - π^{x}) .$

Various properties of the distribution as well as the estimation issues have not been studied yet.

There are two other distributions proposed by Salvia and Bollinger (1982) and their generalizations by Padgett and Spurrier (1985), which are essentially particular cases of the models already discussed. Alzaatreh et al. (2012) provided a general method for deriving new distributions from continuous or discrete models. Let $G (x)$ be the distribution function of a random variable Y which may be continuous or discrete and $a (x)$ be the probability density function of a continuous random variable T taking values in $[0, \infty)$ . Then, a new distribution can be defined by the distribution function

$F (x) = \int_{0}^{- \log (1 - G (x))} a (t) d t = A (- \log (1 - G (x))),$

where $A (\cdot)$ is the distribution function of T. When Y is discrete, the new distribution has probability mass function

$f (x) = A (- \log (1 - G (x))) - A (- \log (1 - G (x - 1))) .$

For instance, when X has a geometric (p) distribution, the corresponding distribution arising from a continuous distribution with $G (\cdot)$ as the distribution function is

$f (x) = G (c (x + 1)) - G (c x),$

where $c = - \log p$ . Another category of models arise when they are required to satisfy certain specific properties for their reliability characteristics, such as bathtub shaped hazard rate functions. Such distributions will be taken up later on in Chapter 5. Expressions for hazard rate function for some distributions are presented in Table 3.2 for ease reference.

Table 3.2

Hazard rate functions
Distribution	$f (x)$	$h (x)$
binomial	$(\begin{matrix} n \\ x \end{matrix}) p^{x} q^{n - x}$	${1 + \frac{1}{(\begin{matrix} n \\ x \end{matrix}) θ^{x}} [{(1 + θ)}^{n} - \sum_{j = 0}^{x} (\begin{matrix} n \\ x \end{matrix}) θ^{j}]}^{- 1}$ , $θ = \frac{p}{q}$ , $q = 1 - p$
Poisson	$e^{- λ} \frac{λ^{x}}{x!}$	${1 + \frac{x!}{λ^{x}} (e^{λ} - 1 - \sum_{j = 1}^{x} \frac{λ^{j}}{j!})}^{- 1}$
negative binomial	$(\begin{matrix} k + x - 1 \\ x \end{matrix}) p^{k} q^{x}$	${1 + {(\begin{matrix} k + x - 1 \\ x \end{matrix})}^{- 1} q^{x} p^{k} [1 - \sum_{j = 0}^{x} (\begin{matrix} k + j - 1 \\ j \end{matrix}) p^{k} q^{j}]}^{- 1}$
Haight zeta	(2x−1)^−σ − (2x+1)^−σ	$\frac{{(2 x - 1)}^{- σ} - {(2 x + 1)}^{- σ}}{1 - {(2 x - 1)}^{σ}}$ , x = 2,3,…
geometric	q ^x p	p
Waring	$\frac{(a - b) {(b)}_{x}}{{(a)}_{x + 1}}$	$\frac{a - b}{a + x}$
negative hypergeometric	$\frac{(\begin{matrix} - 1 \\ x \end{matrix}) (\begin{matrix} - k \\ n - x \end{matrix})}{(\begin{matrix} - 1 - k \\ n \end{matrix})}$	$\frac{k}{k + n - x}$
uniform	$f (x) = \frac{1}{b}$ , x = 1,…,b	$\frac{1}{b - x + 1}$
arithmetic	$\frac{x}{b (b + 1)}$ , x = 1,2,…,b	$\frac{(b + x) (b - x + 1)}{b (b + 1)}$
S	$p (1 - π^{2}) \prod_{i = 1}^{x - 1} (q + p π^{i})$	p(1 − π^x)
Weibull I	$q^{x^{β}} - q^{{(x + 1)}^{β}}$	$1 - q^{{(x + 1)}^{β} - x^{β}}$
Weibull II	${(\frac{x}{m})}^{β - 1} \prod_{i = 1}^{x - 1} (1 - {(\frac{i}{n})}^{β - 1})$	${(\frac{x}{m})}^{β - 1}$
Weibull III	$(1 - \exp (- c {(x + 1)}^{α}))$ $\exp [- \sum_{j = 0}^{x} j^{α}]$	$\exp [- c x^{α}]$
power	$\frac{x^{α}}{\sum_{x = 0}^{b} x^{α}}$ , x = 0,1,…,b	$\frac{x^{α}}{\sum_{t = x}^{b} t^{α}}$
Lindley	$\frac{λ^{x}}{1 - \log λ} [λ \log λ + (1 - λ) (1 - (x + 1) \log λ)]$	$\frac{λ \log λ + (λ - 1) ((x + 1) \log λ - 1)}{1 - (x + 1) \log λ}$
gamma	$\frac{Γ (k, \frac{x}{θ}) - Γ (k, \frac{x + 1}{θ})}{Γ (k)}$	$1 - \frac{Γ (k, \frac{x + 1}{θ})}{Γ (k, \frac{x}{θ})}$
quasi binomial II	$(\begin{matrix} n \\ x \end{matrix}) \frac{(1 - p - n α) p}{1 - n α} {(p + α x)}^{x - 1} {(q - α x)}^{n - x - 1}$	$\frac{(\begin{matrix} n \\ x \end{matrix}) {(p + α x)}^{x - 1} {(q - α x)}^{n - x - 1}}{\sum_{k = x}^{n} (\begin{matrix} n \\ t \end{matrix}) {(p + α t)}^{t - 1} {(q - α t)}^{n - t - 1}}$
quasi logarithmic	$\frac{θ a {(a + x λ)}^{x - 1} {(1 - x λ)}^{- x}}{x}$	$\frac{{(a + x λ)}^{x - 1}}{x {(1 - x λ)}^{x} \sum_{t = x}^{\infty} \frac{{(a + t λ)}^{t - 1}}{t {(1 - t λ)}^{t}}}$
generalized Poisson	$\frac{θ {(θ + x λ)}^{x - 1}}{x!} e^{- (θ + x λ)}$	$\frac{{(θ + x λ)}^{x - 1} e^{- λ x}}{x \sum_{t = x}^{\infty} \frac{{(θ + t λ)}^{t - 1}}{t!} e^{- λ t}}$
Hurtwitz-zeta	$\frac{C}{{(a - 1 + x)}^{s}}$	${[{(a - 1 + x)}^{s} ζ (s, a + x - 1)]}^{- 1}$
Lerch	$\frac{C z^{x}}{{(a + x)}^{s}}$	${[{(a + x)}^{s} ϕ (z, s, a + x)]}^{- 1}$
linear hazard rate	${\begin{matrix} (1 - a) \dots (1 - a - b (x - 1)) (a + b x) \\ x = 1, 2, \dots, n \\ a x = 0, α + β n = 1 \end{matrix}$	a + bx