Chapter 2

Basic Reliability Concepts

Abstract

The basic reliability functions that can be used to model lifetime data and explain the failure patterns are the topics of discussion in this chapter. We begin with the conventional hazard rate defined as the ratio of the probability mass function to the survival function. This is followed up by an alternative hazard function introduced to overcome certain limitations of the conventional rate. Properties of both these hazard rates and their interrelationships are discussed. Then, the concept of residual life distribution and its characteristics like the mean, variance and moments are discussed. Various identities connecting the hazard rates, mean residual life function and various residual functions are derived, and some special relationships are employed for characterizing discrete life distributions. We then work out two problems to demonstrate how the characteristic properties enable the identification of the life distribution. Also, the role of partial moments in the context of reliability modelling is examined. Various concepts in reversed time have been of interest in reliability and related areas. Accordingly, reversed hazard rate, reversed residual mean life and reversed variance life are all defined and their interrelationships and characterizations based on them are reviewed. In the case of finite range distributions, it is shown that all the concepts in reversed time can assume constant values and these are related to the reversed lack of memory property characteristic of the reversed geometric law. Along with the traditional reliability functions, the notion of odds functions can also play a role in reliability modelling and analysis. We explain the relevant results in this connection. The log-odds functions and rates and their applications are also studied.
Mixture distributions and weighted distributions also appear as models in certain situations, and the hazard rates and reversed hazard rates for these two cases are derived and are subsequently used to characterize certain lifetime distributions.

Keywords

Hazard rate; Mean residual life; Partial moments; Concepts in reversed time; Odds function; Log odds rate characterizations

2.1 Introduction

As mentioned in the last chapter, the reliability of a device is the probability that the device performs its intended function for a given period of time under conditions specified for its operation. When the device does not perform its function satisfactorily, we say that it has failed. When the random variable X represents the lifetime of a device, the observation on X is realized as the time of failure. The primary concern in reliability theory is then to understand the pattern in which failures occur for different devices and under varying operating environments. This is often done by analyzing the observed failure times or ages at failure with the help of a model that satisfactorily represents the predominant features of the data. One direct method is to find a probability distribution that provides a reasonable fit to the observations. Sometimes, there may exist more than one distribution that can pass an appropriate goodness-of-fit test. In any case, it is more desirable to find a probability model that manifests certain physical properties of the failure mechanism. In reliability theory, some basic concepts that help in the study of failure patterns have been developed. The objective of this chapter is to define such concepts and discuss their properties and inter-relationships. Two important aspects that necessitate the study of these concepts are (a) various functions considered in this context determine the life distribution uniquely, so that the knowledge of their functional form is equivalent to that of the distribution itself, and (b) it should be easier to deal with these functions than the distribution function or probability density function of the corresponding distributions.

2.2 Hazard Rate Function

Let X be a discrete random variable assuming values in $N = (0, 1, \dots)$ with probability mass function $f (x)$ and survival function $S (x) = P (X ⩾ x)$ . We will think of X as the random lifetime of a device that can fail only at times (ages) in N. The hazard rate function of X is defined as

$h (x) = \frac{f (x)}{S (x)}$

(2.1)

at points x for which $S (x) > 0$ . Treated as a function of x, the hazard rate is also called failure rate, instantaneous death rate, force of mortality and intensity function in other disciplines such as survival analysis, actuarial science, demography, extreme value theory and bio-sciences. Although in the continuous case, the concept of hazard rate dates back to historical studies in human mortality, its discrete version came up much later in the works of Barlow and Proschan (1965), Cox (1972) and Kalbfleisch and Prentice (2002), to mention a few.

When X has a finite support $(0, 1, \dots, n)$ , $n < \infty$ , then $h (n) = 1$ . As a convention we take $h (x) = 1$ for $x > n$ . The hazard function $h (x)$ is interpreted as the conditional probability of the failure of the device at age x, given that it did not fail before age x. Thus, $0 ⩽ h (x) ⩽ 1$ . The interpretation and boundedness of the discrete hazard rate are thus different from those of the continuous case.

We see from (2.1) that $h (x)$ is determined from $f (x)$ or $S (x)$ . The converse that the hazard rate function determines the distribution of X uniquely is also true. To see this, we note that

$h (x) = \frac{S (x) - S (x + 1)}{S (x)},$

$\frac{S (x + 1)}{S (x)} = 1 - h (x) .$

So,

$S (x) = \begin{cases} \prod_{t = 0}^{x - 1} (1 - h (t)), & x ⩾ 1, \\ 1, & x = 0 . \end{cases}$

(2.2)

Eq. (2.2) reveals also that $h (x)$ can be used as a tool to model the life distribution. When a functional form for $h (x)$ is assumed as model for a given data set, one has to ensure that the assumed form conforms to the hazard rate function of a distribution. The following theorem is useful in this regard.

Theorem 2.1

A necessary and sufficient condition that $h : N \to [0, 1]$ is the hazard rate function of a distribution with support N is that $h (x) \in [0, 1]$ for $x \in N$ and $\sum_{t = 0}^{\infty} h (t) = \infty$ .

If the probability mass function is required from (2.1) and (2.2), we see that

$f (x) = h (x) \prod_{t = 0}^{x - 1} (1 - h (t)) .$

(2.3)

The above results have appeared repeatedly in several papers; see, for example, Gupta (1979), Shaked et al. (1995) and Kemp (2004).
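The passage from a hazard rate to the survival and probability mass functions in (2.2) and (2.3) is easy to mechanize. The following is a minimal Python sketch, not taken from the cited papers; the function names `survival_from_hazard` and `pmf_from_hazard` and the value `p = 0.3` are illustrative assumptions.

```python
from typing import Callable, List


def survival_from_hazard(h: Callable[[int], float], x_max: int) -> List[float]:
    """Return [S(0), ..., S(x_max)] with S(x) = prod_{t<x} (1 - h(t)), as in (2.2)."""
    surv = [1.0]  # S(0) = 1 by definition
    for t in range(x_max):
        surv.append(surv[-1] * (1.0 - h(t)))
    return surv


def pmf_from_hazard(h: Callable[[int], float], x_max: int) -> List[float]:
    """Return [f(0), ..., f(x_max)] with f(x) = h(x) S(x), as in (2.3)."""
    surv = survival_from_hazard(h, x_max)
    return [h(x) * surv[x] for x in range(x_max + 1)]


# Illustration with a constant hazard rate p (hypothetical value).
p = 0.3
geom_surv = survival_from_hazard(lambda x: p, 5)
geom_pmf = pmf_from_hazard(lambda x: p, 5)
```

With a constant hazard rate the sketch returns $q^{x}$ and $q^{x} p$ , which is the geometric law considered next in Example 2.1.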

Example 2.1

Let X follow the geometric distribution with parameter p ( $G (p)$ ) specified by

$f (x) = q^{x} p, x \in N, 0 < p < 1, q = 1 - p .$

(2.4)

Then,

$S (x) = q^{x}$

so that $h (x) = p$ , a constant for all x.

Example 2.2

Consider the Waring distribution with parameters a and b ( $W (a, b)$ ) having probability mass function

$f (x) = \frac{(a - b) {(b)}_{x}}{{(a)}_{x + 1}}, x \in N, a > b,$

(2.5)

developed by Irwin (1968, 1975a, 1975b, 1975c), where

${(b)}_{x} = b (b + 1) \dots (b + x - 1) = \frac{Γ (b + x)}{Γ (b)}$

is the Pochhammer symbol. In studying the properties of this distribution, we need the Waring expansion

$\frac{1}{x - a} = \frac{1}{x} + \frac{a}{x (x + 1)} + \frac{a (a + 1)}{x (x + 1) (x + 2)} + \dots$

(2.6)

which converges for $x > a$ . Then,

$\begin{aligned} S (x) & = (a - b) \sum_{t = x}^{\infty} \frac{{(b)}_{t}}{{(a)}_{t + 1}} \\ & = (a - b) \frac{{(b)}_{x}}{{(a)}_{x}} \left[\frac{1}{a + x} + \frac{b + x}{(a + x) (a + x + 1)} + \dots\right] \\ & = \frac{{(b)}_{x}}{{(a)}_{x}}, \quad \text{by (2.6).} \end{aligned}$

(2.7)

Hence, we get in this case

$h (x) = \frac{a - b}{a + x} .$

(2.8)

Example 2.3

A special case of the negative hyper geometric law with parameters n and k is defined by the probability mass function

$f (x) = \frac{(\begin{matrix} - 1 \\ x \end{matrix}) (\begin{matrix} - k \\ n - x \end{matrix})}{(\begin{matrix} - 1 - k \\ n \end{matrix})}, x = 0, 1, \dots, n, k > 0 .$

(2.9)

Then,

$S (x) = \sum_{t = x}^{n} f (t) = \sum_{t = x}^{n} (\begin{matrix} k + n - t - 1 \\ n - t \end{matrix}) / (\begin{matrix} k + n \\ n \end{matrix})$

(2.10)

on using the identity

$(\begin{matrix} - p \\ k \end{matrix}) = {(- 1)}^{k} (\begin{matrix} p + k - 1 \\ k \end{matrix}) .$

So,

$S (x) = \sum_{t = 0}^{n - x} (\begin{matrix} k + t - 1 \\ t \end{matrix}) / (\begin{matrix} k + n \\ n \end{matrix}) .$

Now to find the sum on the right hand side, the combinatorial expression (Riordan, 1968)

$\sum_{x = 0}^{n} (\begin{matrix} a + n - x - 1 \\ n - x \end{matrix}) = (\begin{matrix} a + n \\ n \end{matrix})$

is employed in order to obtain

$S (x) = (\begin{matrix} k + n - x \\ n - x \end{matrix}) / (\begin{matrix} k + n \\ n \end{matrix}) .$

(2.11)

We then find

$h (x) = \frac{(\begin{matrix} k + n - x - 1 \\ n - x \end{matrix})}{(\begin{matrix} k + n - x \\ n - x \end{matrix})} = \frac{k}{k + n - x} .$

The distribution in (2.11) will be denoted by NH $(n, k)$ . For more functional forms of $h (x)$ that characterize various distributions, see Table 3.2.

Remark 2.1

The geometric, Waring and negative hyper-geometric models form a set of models possessing some attractive properties for their reliability characteristics, in much the same way as the exponential, Pareto II and rescaled beta distributions in the continuous case. The results in the above examples show that the models (2.4), (2.5) and (2.9) have hazard rates of the form

$h (x) = {(l x + m)}^{- 1}$

(2.12)

with $l = 0$ for geometric, $l = \frac{1}{a - b} > 0$ for Waring, and $l = - \frac{1}{k} < 0$ for negative hyper-geometric distributions.
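The reciprocal-linear form (2.12) can be checked numerically from the survival functions (2.7) and (2.11). The sketch below is only an illustration: the parameter values are hypothetical, the helper names are not from the text, and $(c)_{x}$ is read as the rising factorial $c (c + 1) \dots (c + x - 1)$ , which is the reading under which (2.7) and (2.8) hold.

```python
from math import comb


def rising(c: float, x: int) -> float:
    """Rising factorial (c)_x = c (c+1) ... (c+x-1), with (c)_0 = 1."""
    out = 1.0
    for j in range(x):
        out *= c + j
    return out


def hazard_from_survival(S, x: int) -> float:
    """h(x) = 1 - S(x+1)/S(x), valid wherever S(x) > 0."""
    return 1.0 - S(x + 1) / S(x)


a, b = 5.0, 2.0   # hypothetical Waring parameters, a > b
n, k = 6, 3       # hypothetical NH parameters (integer k keeps math.comb usable)


def waring_S(x: int) -> float:
    return rising(b, x) / rising(a, x)            # Eq. (2.7)


def nh_S(x: int) -> float:
    return comb(k + n - x, n - x) / comb(k + n, n)  # Eq. (2.11), 0 <= x <= n


# Reciprocals of the hazard rates; both sequences should be linear in x.
waring_inv_h = [1.0 / hazard_from_survival(waring_S, x) for x in range(5)]
nh_inv_h = [1.0 / hazard_from_survival(nh_S, x) for x in range(n)]
```

In this sketch the successive differences of `waring_inv_h` are constant and positive, while those of `nh_inv_h` are constant and negative, in line with the sign of $l$ in (2.12).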

Remark 2.2

Using (2.3), it can be seen that a reciprocal linear hazard rate function in (2.12) characterizes the above three distributions. A direct proof of this fact is available in Xekalaki (1983).

Remark 2.3

When Remark 2.1 is employed in a practical problem, it should be borne in mind that the support of X is N. Thus, the fact that the geometric distribution has a constant hazard rate means that a device with such a lifetime distribution does not age. However, when the support of X is $(0, 1, \dots, n)$ with $n < \infty$ , $h (x) = p$ for $x < n$ leads to

$f (x) = \begin{cases} q^{x} p, & x = 0, 1, 2, \dots, n - 1, \\ q^{n}, & x = n . \end{cases}$

If X has bounded support ( $0, 1, 2, \dots, n$ ) and (2.12) is satisfied with $f (0) > 0$ , then $m > 1$ , $l < 0$ and $n = \frac{1 - m}{l}$ . This follows from the facts that at $x = 0$ , $h (0) = \frac{1}{m} < 1$ , and at $x = n$ , $h (n) = 1$ gives $l n + m = 1$ .

The sum of the hazard rates from 0 through $x - 1$ is of interest in reliability theory and is called the cumulative hazard rate, defined by

$H (x) = \sum_{t = 0}^{x - 1} h (t) .$

(2.13)

Also define $H (0) = 0$ . Graphically, the cumulative hazard rate represents the area under the step function representing $h (x)$ . Definition (2.13) does not satisfy properties analogous to the continuous case in which the cumulative hazard rate satisfies the identity

$H (x) = - \log \bar{F} (x), \bar{F} (x) = P (X > x) .$

Therefore, Cox and Oakes (1984) proposed an alternative definition of cumulative hazard rate in the form

$H_{1} (x) = - \log S (x) .$

(2.14)

This means that

$H_{1} (x) = - \log \sum_{t ⩾ x} f (t) = - \log \prod_{t = 0}^{x - 1} (1 - h (t)) .$

(2.15)

If we write

$H_{1} (x) = \sum_{t = 0}^{x - 1} h_{1} (t), x = 1, 2, \dots$

(2.16)

then $H_{1} (x)$ is a cumulative hazard rate corresponding to an alternative hazard rate function defined by

$h_{1} (x) = \log \frac{S (x)}{S (x + 1)}, x = 0, 1, 2, \dots .$

(2.17)

Xie et al. (2002a) advocated the use of (2.17) as the hazard rate function instead of (2.1) by citing the following arguments. In the continuous case, the hazard rate is not a probability, but (2.1) is a conditional probability which is bounded. Consequently, (2.1) cannot increase too fast either linearly or exponentially to provide models of lifetimes of components in the wear-out phase. When cumulative hazard rate is defined as the negative logarithm of the survival function, $- \log S (x) \neq \sum_{t = 0}^{x - 1} h (t)$ . This causes problems in defining discrete ageing concepts that are analogues of their continuous counterparts, such as increasing hazard rate average (see Chapter 4). Thus, discrete ageing concepts based on $h (x)$ may not convey the same meaning as those in the continuous case. Similar problems persist with the construction of proportional hazards models and with series systems. Further details about these are provided in Sections 2.10 and 2.11. It may also be noted that unlike $h (x)$ , the definition of $h_{1} (x)$ does not have any interpretation. Xie et al. (2002a) and Kemp (2004) have obtained the following interrelationships among the two hazard rate functions and the other reliability functions discussed so far:

$\begin{aligned} H_{1} (x) & = - \sum_{t = 0}^{x - 1} \log (1 - H (t + 1) + H (t)), \\ H (x) & = \sum_{t = 0}^{x - 1} (1 - \exp [H_{1} (t) - H_{1} (t + 1)]), \\ S (x) & = \exp [- H_{1} (x)] = \prod_{t = 0}^{x - 1} (1 - H (t + 1) + H (t)), \\ f (x) & = \exp [- H_{1} (x)] - \exp [- H_{1} (x + 1)] \\ & = (H (x + 1) - H (x)) \prod_{t = 0}^{x - 1} (1 - H (t + 1) + H (t)), \\ h (x) & = 1 - \exp [H_{1} (x) - H_{1} (x + 1)] = H (x + 1) - H (x), \end{aligned}$

and

$h (x) = 1 - e^{- h_{1} (x)} .$

(2.18)

Thus, the function $h (x) (H (x))$ determines $h_{1} (x) (H_{1} (x))$ uniquely and hence is useful in characterizing life distributions. We give some examples that compare the expressions of $h (x)$ and $h_{1} (x)$ .

Example 2.4

The discrete Pareto distribution

$S (x) = {(\frac{α}{x + α - 1})}^{β}, x = 1, 2, \dots,$

provides

$h_{1} (x) = β \log [\frac{x + α}{x + α - 1}]$

whereas

$h (x) = 1 - {(\frac{x + α - 1}{x + α})}^{β} .$

Example 2.5

The Waring distribution with

$S (x) = \frac{{(b)}_{x}}{{(a)}_{x}}$

in Example 2.2 gives

$h_{1} (x) = - \log (\frac{b + x}{a + x})$

compared to $h (x) = \frac{a - b}{a + x}$ from (2.8).
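The two hazard rates can be compared numerically. The following sketch, with hypothetical parameter values and illustrative function names, evaluates $h (x)$ and $h_{1} (x)$ for the discrete Pareto law of Example 2.4, checks the link (2.18), and shows that only the sum of the $h_{1}$ values reproduces $- \log S (x)$ .

```python
import math

alpha, beta = 2.0, 1.5   # hypothetical discrete Pareto parameters


def S(x: int) -> float:
    """Survival function of Example 2.4, defined for x = 1, 2, ..."""
    return (alpha / (x + alpha - 1)) ** beta


def h(x: int) -> float:
    """Conventional hazard rate (2.1), written as 1 - S(x+1)/S(x)."""
    return 1.0 - S(x + 1) / S(x)


def h1(x: int) -> float:
    """Alternative hazard rate (2.17)."""
    return math.log(S(x) / S(x + 1))


x_top = 10
# The support starts at 1 here, so the cumulative sums run over t = 1, ..., x_top - 1.
sum_h1 = sum(h1(t) for t in range(1, x_top))
sum_h = sum(h(t) for t in range(1, x_top))
neg_log_S = -math.log(S(x_top))
```

Here `sum_h1` agrees with `neg_log_S` because the logarithms telescope and $S (1) = 1$ , whereas `sum_h` is strictly smaller, which is the discrepancy that motivates $h_{1} (x)$ .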

2.3 Mean Residual Life

The analysis of the lifetime of a device after it has attained age x is of special relevance in reliability and survival analysis. Thus, if X is the original lifetime with survival function $S (x) = P (X ⩾ x)$ , the corresponding residual lifetime after age x is the random variable $X_{x} = (X - x | X > x)$ . From the definition of conditional probability, one can arrive at the distribution of $X_{x}$ as

$S_{x} (t) = \frac{S (x + t + 1)}{S (x + 1)}, t = 0, 1, \dots .$

(2.19)

The mean, variance, partial moments, coefficient of variation and percentiles of the distribution in (2.19) have been discussed extensively in the literature in the continuous case.

The mean residual life of X is defined as

$\begin{aligned} m (x) & = E (X - x | X > x) \\ & = \frac{1}{S (x + 1)} \sum_{t = x + 1}^{\infty} (t - x) f (t) \\ & = \frac{1}{S (x + 1)} \sum_{t = x + 1}^{\infty} S (t), \quad x = - 1, 0, 1, \dots \end{aligned}$

(2.20)

It is easy to see that $m (x)$ is the mean of the distribution in (2.19). When $x = - 1$ ,

$m (- 1) = E (X + 1) = \sum_{t = 0}^{\infty} S (t) = 1 + μ .$

Characterizations of the distribution of X in terms of $m (x)$ and the hazard rate have been studied by Nair and Hitha (1989). From (2.20),

$S (x + 1) m (x) = \sum_{t = x + 1}^{\infty} S (t)$

which leads to

$S (x) m (x - 1) - S (x + 1) m (x) = S (x)$

and to the identity

$h (x) = \frac{1 + m (x) - m (x - 1)}{m (x)}, x = 0, 1, 2, \dots .$

(2.21)

Now, from (2.2), we have

$S (x) = \begin{cases} \prod_{t = 0}^{x - 1} \frac{m (t - 1) - 1}{m (t)}, & x ⩾ 1, \\ 1, & x = 0 . \end{cases}$

(2.22)
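The identities (2.20) and (2.21) can be verified on any small example. The sketch below uses a hypothetical probability mass function on $(0, 1, \dots, 4)$ ; the list `pmf` and the function names are assumptions made only for illustration.

```python
pmf = [0.10, 0.20, 0.30, 0.25, 0.15]   # hypothetical f(0), ..., f(4)
n = len(pmf) - 1

# S[x] = P(X >= x) for x = 0, ..., n + 1, so that S[n + 1] = 0.
S = [sum(pmf[x:]) for x in range(n + 2)]


def m(x: int) -> float:
    """Mean residual life (2.20) for x = -1, 0, ..., n - 1."""
    return sum(S[x + 1:n + 1]) / S[x + 1]


def h_direct(x: int) -> float:
    """Hazard rate from the definition (2.1)."""
    return pmf[x] / S[x]


def h_from_m(x: int) -> float:
    """Hazard rate recovered from the mean residual life via (2.21)."""
    return (1.0 + m(x) - m(x - 1)) / m(x)


mu = sum(x * pmf[x] for x in range(n + 1))
pairs = [(h_direct(x), h_from_m(x)) for x in range(n)]
```

The two entries of each pair coincide, and `m(-1)` equals `1 + mu`. The point $x = n$ is left out because $S (n + 1) = 0$ there and $m (n)$ is not defined.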

Thus, the three functions $h (x)$ , $m (x)$ and $S (x)$ determine each other uniquely. Though $h (x)$ can be determined from $m (x)$ and vice-versa, both have unique features that ensure their necessity in reliability theory. The mean residual life function may exist when the hazard function does not exist and vice-versa, as will be seen in some of the examples that are considered later. While $h (x)$ is a local measure of failure patterns for any x, the mean residual life function depends upon the life history of devices at all ages beyond x and hence the latter is more informative. At the same time, $m (x)$ as a summary measure is highly sensitive to a single long-term survivor in a data set, which is not desirable. There are other interesting properties that distinguish the two concepts as will be seen in the subsequent chapters.

Since $0 ⩽ h (x) ⩽ 1$ , we can note from (2.21) that

$1 + m (x) ⩾ m (x - 1)$

(2.23)

and $m (x) ⩾ 1$ . For many discrete distributions, the expression for $m (x)$ is not of a simple form. In the following examples, we have simple forms for $m (x)$ that have many desirable properties. For more examples of the $m (x)$ function, see Table 2.4.

Example 2.6

For the geometric distribution $G (p)$ in Example 2.1,

$m (x) = \frac{1}{q^{x + 1}} \sum_{t = x + 1}^{\infty} q^{t} = \frac{1}{p}$

is the reciprocal of the hazard rate function so that $h (x) m (x) = 1$ .

Example 2.7

The Waring distribution $W (a, b)$ in Example 2.2 yields

$\begin{aligned} m (x) & = \frac{{(a)}_{x + 1}}{{(b)}_{x + 1}} \sum_{t = x + 1}^{\infty} \frac{{(b)}_{t}}{{(a)}_{t}} \\ & = \frac{{(a)}_{x + 1}}{{(a)}_{x}} \left[\frac{1}{a + x} + \frac{b + x + 1}{(a + x) (a + x + 1)} + \dots\right] \\ & = \frac{a + x}{a - b - 1}, \quad \text{by the Waring expansion in (2.6).} \end{aligned}$

Notice that $m (x)$ is a linearly increasing function and satisfies $h (x) m (x) = \frac{a - b}{a - b - 1} > 1$ .

Example 2.8

In the case of the negative hypergeometric distribution $NH (n, k)$ in Example 2.3, we have

$\begin{aligned} m (x) & = \frac{\binom{k + n}{n}}{\binom{k + n - x - 1}{n - x - 1}} \sum_{t = x + 1}^{n} \frac{\binom{k + n - t}{n - t}}{\binom{k + n}{n}} \\ & = \frac{\binom{k + n - x}{n - x - 1}}{\binom{k + n - x - 1}{n - x - 1}} = \frac{k + n - x}{k + 1} . \end{aligned}$

We can see that $m (x)$ is linear and decreasing in x and $h (x) m (x) = \frac{k}{k + 1} < 1$ .

Nair and Hitha (1989) and Hitha and Nair (1989) have established the following characterizations of the above three models.

Theorem 2.2

A necessary and sufficient condition that a non-negative discrete random variable X has a mean residual life function of the form

$m (x) = l x + m, m > 0,$

is that X is geometric ( $l = 0$ ), or Waring ( $l > 0$ ), or negative hyper-geometric ( $l < 0$ ).

Theorem 2.3

The relationship

$h (x) m (x) = C,$

where C is a constant, is satisfied by a non-negative discrete random variable X if and only if X is distributed as geometric for $C = 1$ , Waring for $C > 1$ , and negative hyper-geometric for $C < 1$ .
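As a quick check of the constant product in Theorem 2.3, the following sketch evaluates $h (x) m (x)$ exactly for a negative hyper-geometric law, using rational arithmetic. The values of `n` and `k` are hypothetical, and an integer `k` is chosen only so that `math.comb` can be used.

```python
from fractions import Fraction
from math import comb

n, k = 6, 3   # hypothetical NH(n, k) parameters


def S(x: int) -> Fraction:
    """Survival function (2.11), extended by S(x) = 0 for x > n."""
    if x > n:
        return Fraction(0)
    return Fraction(comb(k + n - x, n - x), comb(k + n, n))


def h(x: int) -> Fraction:
    """Hazard rate, h(x) = 1 - S(x+1)/S(x)."""
    return 1 - S(x + 1) / S(x)


def m(x: int) -> Fraction:
    """Mean residual life (2.20) for x = -1, ..., n - 1."""
    return sum((S(t) for t in range(x + 1, n + 1)), Fraction(0)) / S(x + 1)


products = [h(x) * m(x) for x in range(n)]
```

Every entry of `products` equals $\frac{k}{k + 1}$ , which is below one as Theorem 2.3 requires for this family.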

Identification of lifetime distributions that are uniquely determined by simple relationships between various reliability concepts has attracted several studies in the past and continues to be a fertile area of research. Many results pertaining to individual distributions and families of distributions are available in this context. Theorem 2.3 belongs to this category concerning three specific distributions and its applications will be discussed later in Chapter 4. A more general result in this connection is the following.

Let $k (x)$ and $c (x)$ be real-valued functions defined on N such that $E (k (X)) = μ_{k}$ and $V (k (X)) = σ_{k}^{2}$ . We define

$m_{k}^{⁎} (x) = E [k (X) | X > x] .$

Further, assume that $E c^{2} (X) < \infty$ and $E [Δ c (X) g (X)] < \infty$ , where Δ is the usual difference operator, $Δ a (x) = a (x + 1) - a (x)$ , and $g (x)$ is some real-valued function.

Theorem 2.4

Nair and Sudheesh, 2008

Let X be a discrete random variable taking values in N and $k (x)$ , $c (x)$ and $g (x)$ be real-valued functions satisfying the above conditions. Then, for every $c (x)$ and some $g (x)$ and $k (x)$ , the following statements are equivalent:

$(i) \frac{f (x + 1)}{f (x)} = \frac{σ_{k} g_{k} (x)}{σ_{k} g_{k} (x + 1) - μ_{k} + k (x + 1)}$

(2.24)

with $g_{k} (0) = \frac{μ_{k} - k (0)}{σ_{k}}$ and $f (0)$ is evaluated from $\sum_{x = 0}^{\infty} f (x) = 1$ ;

$(ii) m_{k}^{⁎} (x) = μ_{k} + σ_{k} \frac{h (x) g (x)}{1 - h (x)};$

(2.25)

$(iii) V (c (X)) ⩾ E^{2} (g (X) Δ c (X)),$

(2.26)

where $h (x)$ in (2.25) is the hazard rate function.

When $k (x) = x$ , we write

$m^{⁎} (x) = E (X | X > x)$

(2.27)

and call it the vitality function, which is in fact a conditional mean life function. It satisfies

$m^{⁎} (- 1) = μ \quad \text{and} \quad m^{⁎} (x) = m (x) + x .$

Eq. (2.25) becomes a relationship between the mean residual life function and hazard rate function of the form

$m (x) = μ - x + σ \frac{h (x) g (x)}{1 - h (x)}$

that characterizes the class of discrete distributions satisfying

$\frac{f (x + 1)}{f (x)} = \frac{σ g (x)}{σ g (x + 1) - μ + x + 1},$

(2.28)

where $σ^{2} = V (X)$ . It is further observed in Nair and Sudheesh (2008) that

(1) the equality in (2.26) holds if and only if $c (x)$ is linear in $k (x)$ ;
(2) $E g (X) = σ_{k}^{- 1} Cov (X, k (X))$ ;
(3) the expression for $g (x)$ is unique for a particular choice of $k (x)$ , but one can have different choices for $g (x)$ for the same distribution when $k (x)$ is different;
(4) for a given $k (x)$ , the corresponding $g (x)$ characterizes the distribution of X.

Specialization of the identity in (2.25) for various distributions and their implications will be considered later in Chapter 3 wherein we discuss discrete lifetime models. The variance inequality is also useful in the unbiased estimation of functions of X.

Remark 2.4

Since $h (x) = 1 - e^{- h_{1} (x)}$ , all the characteristic properties mentioned above can be translated in terms of $m (x)$ and $h_{1} (x)$ .

Though closely related to the mean residual life function, the vitality function introduced in (2.27) has some importance in its own right in lifetime studies. By definition,

$m^{⁎} (x) = \frac{1}{S (x + 1)} \sum_{t = x + 1}^{\infty} t f (t), x = - 1, 0, 1, \dots .$

Some properties of $m^{⁎} (x)$ that are of interest in the sequel are given below. From the definition

$m^{⁎} (x - 1) S (x) - m^{⁎} (x) S (x + 1) = x f (x) .$

Dividing by $S (x)$ and simplifying, we get

$m^{⁎} (x) - m^{⁎} (x - 1) = h (x) [m^{⁎} (x) - x] = h (x) m (x) .$

(2.29)
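Identity (2.29) and the relation $m^{⁎} (x) = m (x) + x$ can be confirmed numerically. The sketch below does so for a hypothetical probability mass function on $(0, 1, \dots, 4)$ ; all names in it are illustrative.

```python
pmf = [0.10, 0.20, 0.30, 0.25, 0.15]   # hypothetical f(0), ..., f(4)
n = len(pmf) - 1
S = [sum(pmf[x:]) for x in range(n + 2)]   # S[x] = P(X >= x), S[n + 1] = 0


def m_star(x: int) -> float:
    """Vitality function E(X | X > x) for x = -1, 0, ..., n - 1."""
    return sum(t * pmf[t] for t in range(x + 1, n + 1)) / S[x + 1]


def m(x: int) -> float:
    """Mean residual life (2.20) on the same range."""
    return sum(S[x + 1:n + 1]) / S[x + 1]


def h(x: int) -> float:
    return pmf[x] / S[x]


# Left and right sides of (2.29) for x = 0, ..., n - 1.
lhs = [m_star(x) - m_star(x - 1) for x in range(n)]
rhs = [h(x) * m(x) for x in range(n)]
```

Since the right side of (2.29) is non-negative, the increments in `lhs` are non-negative as well, so the vitality function cannot decrease.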

Further,

$m^{⁎} (x) > x \quad \text{for all } x \in N .$

Analogous to the presentation on vitality function given in Kupka and Loo (1989), we note that in the integer interval $[0, x]$ , $m^{⁎} ([0, x]) = m^{⁎} (x - 1) - m^{⁎} (- 1)$ represents the increment in the conditional mean life which is achieved by surviving from age 0 to x. Thus, low (high) values of $m^{⁎} ([0, x])$ mean that the device is ageing rapidly (slowly) during $[0, x]$ , and this justifies the name vitality function of the lifetime X. However, it may be noticed that the vitality function is always non-decreasing, unlike the mean residual life function, which can be either decreasing or increasing.

Returning to the reliability functions based on real-valued functions $k (X)$ of X, more characterizations and useful relationships can be seen. Let $F (x)$ be the distribution function of X such that $F (A) = 1$ for some $A > 0$ .

Theorem 2.5

Gupta, 1975

If $E [k (X)] < \infty$ , then

$S (x + 1) = \prod_{y = 0}^{x - 1} \frac{m_{k} (y)}{m_{k} (y + 1) + k (y + 2) - k (y + 1)}, 0 ⩽ x < A,$

if and only if

$m_{k} (y) = E (k (X) | X > y) - k (y + 1), 0 ⩽ y < A .$

In particular, X has geometric distribution $G (p)$ if and only if

$E (X | X > y) = E (X) + y + 1 .$

Ruiz and Navarro (1995) considered discrete random variables taking values in $C = (x_{a}, x_{a + 1}, \dots, x_{b})$ , $x_{i} < x_{i + 1}$ , and defined for a monotonic $k (x)$ , the doubly truncated mean function

$\begin{aligned} M (x, y) & = E [k (X) | x ⩽ X ⩽ y] \\ & = \frac{1}{F (y) - F (x -)} \sum_{x ⩽ x_{i} ⩽ y} f (x_{i}) k (x_{i}) \end{aligned}$

for $(x, y) \in D = \{(s, t) | F (s -) < F (t)\}$ , $F (x) = P (X ⩽ x)$ and $f (x_{i}) = F (x_{i}) - F (x_{i - 1})$ . Then,

$F (x_{i}) = \frac{P_{1} (x_{i}, x_{i})}{P_{1} (x_{i}, x_{i}) + P_{2} (x_{i}, x_{i}) - 1},$

where

$P_{1} (x, y) = \prod_{x_{i} < x} \frac{M (x_{i + 1}, y) - k (x_{i})}{M (x_{i}, y) - k (x_{i})}$

and

$P_{2} (x, y) = \prod_{y < x_{i}} \frac{k (x_{i}) - M (x, x_{i - 1})}{k (x_{i}) - M (x, x_{i})} .$

They also obtained a necessary and sufficient condition for a given function in $(x, y)$ to be a doubly truncated mean function. This result is of relevance in reliability modelling since $M (x, y)$ tends to the mean residual life as $y \to \infty$ and to the reversed mean residual life (Section 2.7) as $x \to 0$ when $k (X) = X$ and the range of X is N.

Example 2.9

For $k (X) = X$ , the relationship

$M (x, y) = \frac{q}{p} + \frac{x q^{x - 1} - (y + 1) q^{y}}{q^{x - 1} - q^{y}}$

characterizes the geometric distribution $G (p)$ with $S (x) = q^{x}$ , $x = 0, 1, 2, \dots$ .

The conditional moments

$m_{r}^{⁎} (x) = E (X^{r} | X > x) = \frac{1}{S (x + 1)} \sum_{t = x + 1}^{\infty} t^{r} f (t)$

satisfy

$h (x) = \frac{m_{r}^{⁎} (x - 1) - m_{r}^{⁎} (x)}{x^{r} - m_{r}^{⁎} (x)}$

(2.30)

so that the distribution of X is determined from $m_{r}^{⁎} (x)$ by virtue of (2.2). Further, since the left hand side of (2.30) is independent of r,

$\frac{m_{r}^{⁎} (x - 1) - m_{r}^{⁎} (x)}{x^{r} - m_{r}^{⁎} (x)} = \frac{m_{r - 1}^{⁎} (x - 1) - m_{r - 1}^{⁎} (x)}{x^{r - 1} - m_{r - 1}^{⁎} (x)}$

is a recurrence relation connecting two consecutive moments.

Glanzel et al. (1984) considered $k (x)$ to be a monotonic function such that $k (x) \neq$ a constant in $(n, n + 1, \dots) \cap N$ for any finite n. Then,

$f (x) = \begin{cases} \left(\prod_{t = 0}^{x - 1} \frac{m_{k} (t) - k (t)}{m_{k} (t + 1) - k (t)}\right) \frac{m_{k} (x + 1) - m_{k} (x)}{m_{k} (x + 1) - k (x)}, & \text{for } x \in N \text{ or } x < n, \\ \prod_{t = 0}^{n - 1} \frac{m_{k} (t) - k (t)}{m_{k} (t + 1) - k (t)}, & \text{for } x = n . \end{cases}$

Further, if $k (x)$ and $g (x)$ are two functions defined on N such that $l (x) = \frac{m_{0} (x)}{m_{k} (x)}$ is strictly monotonic, then the distribution of X has probability mass function $f (x)$ satisfying

$A (x) = \frac{f (x)}{f (0)} = \frac{l (x + 1) - l (x)}{l (1) - l (0)} \prod_{t = 1}^{x} \frac{l (t - 1) k (t - 1) - g (t - 1)}{l (t + 1) k (t) - g (t)}$

and $f (0) = {(1 + \sum_{x = 1}^{\infty} A (x))}^{- 1}$ . For a further refinement of the results on conditional expectations, see Su and Huang (2000). For characterizations of the geometric, Waring and negative hyper-geometric laws by doubly truncated mean residual life, one may refer to Khorashadizadeh et al. (2012).

The alternative hazard rate $h_{1} (x)$ can also be related to $m (x)$ , as shown in Xie et al. (2002a) when $x = 1, 2, 3, \dots$ . From the definition of $m (x)$ in (2.20) and Eq. (2.2), we have

$\begin{aligned} m (x) & = \frac{1}{S (x + 1)} \sum_{t = x + 1}^{\infty} \prod_{u = 0}^{t - 1} (1 - h (u)) \\ & = \frac{1}{S (x + 1)} \sum_{t = x + 1}^{\infty} \exp \left[- \sum_{u = 0}^{t - 1} h_{1} (u)\right], \quad \text{using (2.18)} \\ & = \sum_{t = x + 1}^{\infty} \exp [H_{1} (x + 1) - H_{1} (t)], \quad \text{since } S (x) = e^{- H_{1} (x)} . \end{aligned}$

Example 2.10

Let X be distributed as geometric $G (p)$ . Then,

$h_{1} (x) = \log \frac{S (x)}{S (x + 1)} = - \log q$

and

$H_{1} (x) = - x \log q .$

Thus,

$m (x) = \sum_{t = x + 1}^{\infty} [q^{- (x + 1)} / q^{- t}] = \frac{1}{p},$

as has been observed earlier.

Some relationships involving $h (x)$ and $m (x)$ needed in the subsequent discussions are

$E (\frac{1}{h (X)}) = 1 + μ$

(2.31)

and

$E (\frac{1}{h (X)} | X > x) = \frac{1}{S (x + 1)} \sum_{t = x + 1}^{\infty} S (t) = m (x) .$

(2.32)

The expression $h (X)$ is referred to as the hazard rate at random time. We now prove a characteristic property of the mean of $h (X)$ .

Theorem 2.6

$E (h (X)) ⩾ \frac{1}{1 + μ}$

(2.33)

and the equality holds if and only if X is geometric.

Proof

Consider

$E (h (X)) E (\frac{1}{h (X)}) = [\sum_{x = 0}^{\infty} h (x) f (x)] [\sum_{x = 0}^{\infty} \frac{1}{h (x)} f (x)] .$

Applying the Cauchy-Schwarz inequality for real sequences $〈 a (x) 〉$ and $〈 b (x) 〉$ , viz.,

$\sum {[a (x)]}^{2} \sum {[b (x)]}^{2} ⩾ {[\sum a (x) b (x)]}^{2}$

(2.34)

with $a (x) = {[h (x) f (x)]}^{\frac{1}{2}}$ and $b (x) = {[\frac{1}{h (x)} f (x)]}^{\frac{1}{2}}$ , we readily obtain

$E [h (X)] E [\frac{1}{h (X)}] ⩾ 1 .$

From (2.31), the inequality in (2.33) follows. The inequality in (2.34) becomes equality if and only if $a (x) = k b (x)$ , in which case $h (x) = k$ , $0 < k < 1$ , for all x. A constant hazard rate characterizes the geometric law. □
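The bound (2.33) and the identity (2.31) behind it can be illustrated numerically. The sketch below uses a hypothetical probability mass function on $(0, 1, \dots, 4)$ for the strict inequality, and the closed forms $h (x) = p$ and $μ = q / p$ of the geometric law for the case of equality; the variable names and the value of `p` are assumptions.

```python
pmf = [0.10, 0.20, 0.30, 0.25, 0.15]   # hypothetical f(0), ..., f(4)
n = len(pmf) - 1
S = [sum(pmf[x:]) for x in range(n + 1)]      # S[x] = P(X >= x)
h = [pmf[x] / S[x] for x in range(n + 1)]     # hazard rate (2.1)

mu = sum(x * pmf[x] for x in range(n + 1))
mean_h = sum(h[x] * pmf[x] for x in range(n + 1))       # E h(X)
mean_inv_h = sum(pmf[x] / h[x] for x in range(n + 1))   # E 1/h(X), Eq. (2.31)
bound = 1.0 / (1.0 + mu)                                # right side of (2.33)

# Geometric law: E h(X) = p and mu = q/p, so the bound is attained.
p = 0.3
geom_bound = 1.0 / (1.0 + (1.0 - p) / p)
```

For the hypothetical `pmf` the mean hazard `mean_h` lies strictly above `bound`, while for the geometric law both sides reduce to `p`.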

Theorem 2.7

$E (\frac{1}{m (X)}) ⩾ \frac{1}{E (m (X))}$

(2.35)

and the equality holds if and only if X is geometric.

Proof

$E (\frac{1}{m (X)}) E (m (X)) = [\sum_{x = 0}^{\infty} m (x) f (x)] [\sum_{x = 0}^{\infty} \frac{1}{m (x)} f (x)] .$

Taking $a (x) = {[m (x) f (x)]}^{\frac{1}{2}}$ and $b (x) = {[\frac{f (x)}{m (x)}]}^{\frac{1}{2}}$ and applying the Cauchy-Schwarz inequality, (2.35) follows. For the equality sign to hold, $a (x) = k b (x)$ which implies $m (x) = k$ , a characteristic property of the geometric distribution. □

2.3.1 Modelling Data

Although we have defined the hazard rate function and the mean residual life function in terms of the time to failure of a device, the definitions and properties can also be used to model data on the time to occurrence of an ‘event’ with appropriate modifications in interpretations that best suit the context. Accordingly, the results mentioned so far have been extensively applied in other disciplines as well. A detailed examination of this aspect will be taken up later in Chapter 9. In the present section, we consider some real data and illustrate the application of the characterizations established above in finding suitable models for them.

Example 2.11

In this example, we consider the famous Bortkiewicz data on the number of soldiers of the Prussian army who died of horse-kicks in a period of 20 consecutive years, with the modification of the data value suggested by Cohen (1960). The last observation in Cohen's data is omitted for the analysis; see Table 2.1 for details. From the data, the probability mass function of X, the time of death, is estimated as

$\begin{matrix} \hat{f} (0) = \frac{129}{199} = 0.6482, \hat{f} (1) = \frac{45}{199} = 0.2261, \\ \hat{f} (2) = \frac{22}{199} = 0.1106, \hat{f} (3) = \frac{3}{199} = 0.0151 . \end{matrix}$

Table 2.1

Model for the deaths of soldiers by horse-kicks

No. of deaths per year	Observed frequency	$\hat{m} (x)$	Expected frequency
0	129	1.4926	123.08
1	45	1.4002	53.79
2	22	1.1201	18.35
3	3	1.0000	3.78

These values give the estimate of the mean residual life as

$\hat{m} (x) = \frac{1}{\hat{S} (x + 1)} \sum_{t = x + 1}^{3} \hat{S} (t), x = - 1, 0, 1, 2,$

where $\hat{S} (x) = \sum_{t ⩾ x} \hat{f} (t)$ . The values of $\hat{m} (x)$ can be seen in column 3 of Table 2.1. The next step is to seek a functional form for $\hat{m} (x)$ . Accordingly, a straight line fit is considered for $m (x)$ by the method of least-squares. This yields

$m (x) = 1.34112 - 0.17579 x .$

Since the fitted mean residual life is linearly decreasing, Theorem 2.2 suggests the negative hyper-geometric distribution as a model for the data with $n = 3$ . Comparing with the mean residual life function of the negative hyper-geometric law

$m (x) = \frac{k + n - x}{k + 1},$

we obtain the estimate of k as $\hat{k} = 4.863$ . Thus the proposed model has survival function

$\hat{S} (x) = (\begin{matrix} \hat{k} + n - x \\ n - x \end{matrix}) / (\begin{matrix} \hat{k} + n \\ n \end{matrix})$

with $\hat{k} = 4.863$ and $n = 3$ . The expected probability mass function is

$\hat{f} (x) = \hat{S} (x) - \hat{S} (x + 1) .$

Obviously, the chosen model rests on the assumption of linearity of $m (x)$ . Hence the model is validated by calculating the expected frequencies and then verifying their closeness by the chi-square goodness-of-fit test. The chi-square value of 2.4949 does not reject the model at 5% level of significance.
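The computations of Example 2.11 can be retraced with a short script. The sketch below starts from the observed frequencies in Table 2.1 and is only one possible reading of the procedure: it assumes that the least-squares line is fitted to $\hat{m} (x)$ at $x = - 1, 0, 1, 2$ and that $\hat{k}$ is obtained by matching the fitted intercept with $\frac{k + n}{k + 1}$ . The helper `gen_binom` is an illustrative name for the binomial coefficient with a real upper argument.

```python
observed = [129, 45, 22, 3]     # observed frequencies for x = 0, 1, 2, 3 (Table 2.1)
total = sum(observed)           # 199
n = 3

f_hat = [c / total for c in observed]
S_hat = [sum(f_hat[x:]) for x in range(n + 1)]   # S_hat[x] = P(X >= x)


def m_hat(x: int) -> float:
    """Empirical mean residual life for x = -1, 0, 1, 2."""
    return sum(S_hat[x + 1:]) / S_hat[x + 1]


xs = [-1, 0, 1, 2]
ys = [m_hat(x) for x in xs]

# Ordinary least-squares line ys ~ intercept + slope * xs.
x_bar = sum(xs) / len(xs)
y_bar = sum(ys) / len(ys)
slope = (sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
         / sum((x - x_bar) ** 2 for x in xs))
intercept = y_bar - slope * x_bar

# Assumption: k is estimated by equating the intercept with (k + n)/(k + 1).
k_hat = (n - intercept) / (intercept - 1.0)


def gen_binom(a: float, r: int) -> float:
    """Binomial coefficient C(a, r) for real a and integer r >= 0."""
    out = 1.0
    for j in range(r):
        out *= (a - j) / (j + 1)
    return out


def S_model(x: int) -> float:
    """Fitted NH survival function, with S(x) = 0 beyond n."""
    if x > n:
        return 0.0
    return gen_binom(k_hat + n - x, n - x) / gen_binom(k_hat + n, n)


expected = [total * (S_model(x) - S_model(x + 1)) for x in range(n + 1)]
```

Under these assumptions the fitted line, the value of `k_hat` and the entries of `expected` agree with the figures quoted in the example and in Table 2.1 up to rounding.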

Example 2.12

The results of the well-known Rutherford-Geiger experiment are reproduced in Table 2.3. They describe the number of α-particles emitted from a radioactive substance in 2608 intervals of 7.5 seconds each. As in the previous example, the estimates $\hat{f} (x)$ and $\hat{S} (x)$ of the probability mass function and the survival function, and from them $\hat{m} (x)$ , are computed. In addition, we also require the estimated hazard rate function $\hat{h} (x) = \hat{f} (x) / \hat{S} (x)$ . The observed values of $\hat{h} (x)$ and $\hat{m} (x)$ are shown in Table 2.2.

Table 2.2

Estimated values of $\hat{h} (x)$ , $\hat{m} (x)$ and $\hat{g} (x)$ for the data in Table 2.3

x	$\hat{h} (x)$	$\hat{m} (x)$	$\hat{g} (x)$
0	0.0219	3.9569	2.00
1	0.0795	3.2124	2.06
2	0.1632	2.6437	2.07
3	0.2672	2.2431	1.96
4	0.3695	1.9716	1.87
5	0.4493	1.7642	1.85
6	0.5462	1.6839	1.65
7	0.6615	1.7604	1.61
8	0.5518	1.5576	2.81
9	0.6306	1.5082	2.01
10	0.6223	1.3478	2.23
11	0.6522	1.000	2.12
12	1.000

We also have

$m (- 1) = \sum_{t = 0}^{\infty} \hat{S} (t) = 1 + μ = 4.8702$

so that an estimate of the mean becomes

$\hat{μ} = 3.8702 .$

Neither an examination of the values of $\hat{h} (x)$ and $\hat{m} (x)$ nor their plots in Figs 2.1 and 2.2 exhibit an obvious choice for the functional forms of $h (x)$ and $m (x)$ . In such cases, Theorem 2.4 can be of some assistance in the determination of a suitable model. Since we have the empirical mean residual life from the data, choose $k (x) = x$ so that (2.24) becomes

$m^{⁎} (x) = m (x) + x = μ + \frac{σ h (x) g (x)}{1 - h (x)} .$

Hence, we can calculate $\hat{g} (x)$ from

$\hat{g} (x) = (\frac{\hat{m} (x) + x - \hat{μ}}{\hat{σ}}) (\frac{1}{\hat{h} (x)} - 1) .$

(2.36)

Figure 2.1 Empirical hazard function for Rutherford-Geiger experiment.

Figure 2.2 Empirical mean residual life function for Rutherford-Geiger experiment.
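The quantities $\hat{h} (x)$ , $\hat{m} (x)$ and $\hat{g} (x)$ of (2.36) can be recomputed from the observed frequencies of Table 2.3. The following Python sketch is illustrative only. It assumes that the cells $x = 11$ and $x = 12$ are kept separate and that the sample variance uses the divisor $n - 1$ ; neither choice is stated in the text, so the output need not match Table 2.2 digit for digit.

```python
import math

# Observed frequencies for x = 0, 1, ..., 12 (Table 2.3, tail cells kept separate).
counts = [57, 203, 383, 525, 532, 408, 273, 139, 45, 27, 10, 4, 2]
n_obs = sum(counts)
support = range(len(counts))

mu_hat = sum(x * c for x, c in zip(support, counts)) / n_obs
# Sample variance with divisor n - 1 (an assumption).
var_hat = sum(c * (x - mu_hat) ** 2 for x, c in zip(support, counts)) / (n_obs - 1)
sigma_hat = math.sqrt(var_hat)

def survival(t):
    """Estimated S(t) = P(X >= t)."""
    return sum(counts[t:]) / n_obs

def hazard(x):
    """Estimated h(x) = f(x) / S(x)."""
    return counts[x] / n_obs / survival(x)

def mean_residual_life(x):
    """Estimated m(x) = sum_{t > x} S(t) / S(x+1)."""
    return sum(survival(t) for t in range(x + 1, len(counts))) / survival(x + 1)

def g_hat(x):
    """Formula (2.36)."""
    return (mean_residual_life(x) + x - mu_hat) / sigma_hat * (1 / hazard(x) - 1)

rows = [(x, hazard(x), mean_residual_life(x), g_hat(x)) for x in range(len(counts) - 1)]
for x, h, m, g in rows:
    print(x, round(h, 4), round(m, 4), round(g, 2))
```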

In (2.36), the estimate of the standard deviation σ is taken as the sample standard deviation. The values of $\hat{g} (x)$ so obtained are shown in Table 2.2. Notice that the value at $x = 8$ is markedly larger than the rest. Barring this value, the rest appear to cluster around the average $\bar{g} (x) = 1.94$ . This is the same as assuming $g (x) = c$ , a constant for all x, since the least-squares estimate of c is $\bar{g} (x)$ . Recall from Theorem 2.4 that $g (x)$ uniquely determines the distribution of X and, further, that $g (x) = c$ if and only if the distribution is Poisson with mean $c^{2}$ . Thus, we propose for the data a Poisson distribution

$f (x) = e^{- λ} \frac{λ^{x}}{x!}, x = 0, 1, 2, \dots$

with $λ = c^{2} = 3.8$ . To validate the assumption made on $g (x)$ , we compare the observed and expected frequencies, when $λ = 3.8$ , presented in Table 2.3.

Table 2.3

Goodness-of-fit for Rutherford-Geiger data

x	Observed frequency	Expected frequency	Contribution to $χ^{2}$ -value
0	57	58.4	0.0036
1	203	221.7	1.5773
2	383	421.2	3.4644
3	525	533.6	0.1386
4	532	507.0	1.2327
5	408	385.2	1.3495
6	273	244.1	3.4215
7	139	132.5	0.3189
8	45	62.9	5.0934
9	27	26.6	0.0060
10	10	10.27	0.0039
11, 12	6 (= 4 + 2)	4.4	0.5819
Total	2608	2608	16.2216

The chi-square value obtained above does not reject the hypothesis that the data follow the Poisson distribution. The following facts can be further observed:

(i) The characterization theorems used in Examples 2.11 and 2.12 provide some quick estimates of the parameters of the hypothesized models. In more sophisticated models that require iterative procedures to find estimates, the above estimates may serve as good initial values;
(ii) The contribution to the chi-square value or the discrepancy between the observed and expected frequencies at $x = 8$ is considerably larger compared to the rest. The plot of the hazard function shows that the function is decreasing markedly at this point which is uncharacteristic of the Poisson model where the hazard rate is increasing. If we use the maximum likelihood estimate $\hat{λ} = 3.87$ for the Poisson distribution, the difference is still higher with expected frequency 67. Thus, the $\hat{g} (x)$ value at $x = 8$ is taken as a discordant one and omitted in the calculation of $\bar{g} (x)$ .
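The comparison of observed and expected frequencies under the Poisson model can be sketched as follows. This is an illustration, not the authors' computation: it assumes that the last cell pools all the remaining probability for $x ⩾ 11$ , which may differ from the convention behind Table 2.3, so individual entries and the total can deviate slightly from the tabulated ones.

```python
import math

# Observed frequencies for x = 0..10 and a pooled last cell (x = 11 and 12).
observed = [57, 203, 383, 525, 532, 408, 273, 139, 45, 27, 10, 6]
n_obs = sum(observed)

def poisson_pmf(x, lam):
    return math.exp(-lam) * lam ** x / math.factorial(x)

def expected_frequencies(lam):
    """Expected counts for x = 0..10 plus one pooled tail cell."""
    cells = [n_obs * poisson_pmf(x, lam) for x in range(11)]
    # Assumption: the last cell collects all remaining probability (x >= 11).
    cells.append(n_obs - sum(cells))
    return cells

def chi_square(lam):
    expected = expected_frequencies(lam)
    return sum((o - e) ** 2 / e for o, e in zip(observed, expected))

expected_38 = expected_frequencies(3.8)
contributions = [(o - e) ** 2 / e for o, e in zip(observed, expected_38)]

for x, (e, c) in enumerate(zip(expected_38, contributions)):
    print(x, round(e, 1), round(c, 4))
print(round(chi_square(3.8), 4), round(chi_square(3.87), 4))
```

Calling `chi_square` with a different value of λ, such as the maximum likelihood estimate, shows how the cell at $x = 8$ behaves under alternative fits.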

2.4 Variance Residual Life Function

The variance of the residual life $(X - x | X > x) = X_{x}$ is studied in reliability theory in various contexts. Primarily, its role is to define ageing concepts that are weaker than some ageing criteria based on the hazard rate and the mean residual life. Chapter 4 provides details in this direction. Secondly, variance of residual life has the same role as the usual variance when estimators of mean residual life are discussed. It is also required in the study of coefficient of variation of residual life. Assuming that $E (X^{2}) < \infty$ , we define the variance residual life function as

$σ^{2} (x) = E [{(X - x)}^{2} | X > x] - m^{2} (x),$

(2.37)

where $m (x)$ is the mean residual life function defined in (2.20). Alternatively,

$σ^{2} (x) = E (X^{2} | X > x) - E^{2} (X | X > x) .$

(2.38)

The second factorial moment of the residual life $X_{x}$ , given by

$μ_{(2)} (x) = E [(X - x) (X - x - 1) | X > x],$

will frequently appear in the sequel. We have the following expression for $μ_{(2)} (x)$ .

Theorem 2.8

$μ_{(2)} (x) = \frac{2}{S (x + 1)} \sum_{t = x + 1}^{\infty} \sum_{u = t + 1}^{\infty} S (u) .$

(2.39)

Proof

$\begin{matrix} E (X^{2} | X > x) & = \frac{1}{S (x + 1)} \sum_{t = x + 1}^{\infty} t^{2} [S (t) - S (t + 1)] \\ = x^{2} + \frac{1}{S (x + 1)} \sum_{t = x + 1}^{\infty} (2 t - 1) S (t) . \end{matrix}$

(2.40)

Hence,

$\begin{matrix} E [{(X - x)}^{2} | X > x] & = \frac{1}{S (x + 1)} \sum_{t = x + 1}^{\infty} (2 t - 2 x - 1) S (t), \\ = \frac{1}{S (x + 1)} [2 \sum_{t = x + 1}^{\infty} \sum_{u = t + 1}^{\infty} S (u) + \sum_{t = x + 1}^{\infty} S (t)] \\ = \frac{2}{S (x + 1)} \sum_{t = x + 1}^{\infty} \sum_{u = t + 1}^{\infty} S (u) + E (X - x | X > x), \end{matrix}$

(2.41)

which is equivalent to (2.39). Thus, the variance residual life function can be written in the form

$σ^{2} (x) = \frac{2}{S (x + 1)} \sum_{t = x + 1}^{\infty} \sum_{u = t + 1}^{\infty} S (u) - m (x) (m (x) - 1) .$

(2.42)

An equivalent expression for (2.40), by defining the survival function as $P (X > x)$ , has been obtained in Khorashadizadeh et al. (2010). □

Example 2.13

For the geometric law considered in Example 2.6, we have

$\begin{matrix} S (x) = q^{x}, m (x) = \frac{1}{p}, \\ \sum_{u = t + 1}^{\infty} S (u) = \frac{q^{t + 1}}{p} and \frac{1}{S (x + 1)} \sum_{t = x + 1}^{\infty} \frac{q^{t + 1}}{p} = \frac{q}{p^{2}} \end{matrix}$

so that from (2.42), we have

$σ^{2} (x) = \frac{2 q}{p^{2}} - \frac{1}{p} (\frac{1}{p} - 1) = \frac{q}{p^{2}},$

which is the same as the variance of X.
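A quick numerical check of this example evaluates (2.42) directly with the infinite sums truncated at a large index. The sketch below is illustrative; the function name and the truncation point `upper` are assumptions, chosen so that the neglected geometric tail is negligible.

```python
def variance_residual_life(survival, x, upper):
    """Evaluate (2.42) with the infinite sums truncated at `upper`."""
    s = [survival(t) for t in range(upper + 2)]
    # tail[t] = sum_{u >= t} S(u), truncated at upper + 1.
    tail = [0.0] * (upper + 3)
    for t in range(upper + 1, -1, -1):
        tail[t] = tail[t + 1] + s[t]
    m = tail[x + 1] / s[x + 1]                      # mean residual life m(x)
    double_sum = sum(tail[t + 1] for t in range(x + 1, upper + 1))
    return 2 * double_sum / s[x + 1] - m * (m - 1)

p = 0.3
q = 1 - p
values = [variance_residual_life(lambda t: q ** t, x, upper=400) for x in range(5)]
print(values, q / p ** 2)
```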

There exist some inter-relationships between the reliability functions discussed so far.

Theorem 2.9

$σ^{2} (x + 1) - σ^{2} (x) = h (x + 1) [σ^{2} (x + 1) - m (x + 1) (m (x) - 1)] .$

(2.43)

Proof

From (2.42), we have

$\begin{matrix} [σ^{2} (x) & + m (x) (m (x) - 1)] S (x + 1) \\ - [σ^{2} (x + 1) + m (x + 1) (m (x + 1) - 1)] S (x + 2) \\ = 2 S (x + 2) m (x + 1) . \end{matrix}$

Dividing by $S (x + 2)$ and using (2.3), we get

$σ^{2} (x) + m (x) (m (x) - 1) = (1 - h (x + 1)) [σ^{2} (x + 1) + m^{2} (x + 1) + m (x + 1)] .$

(2.44)

Since

$h (x + 1) = \frac{1 + m (x + 1) - m (x)}{m (x + 1)},$

further simplification of (2.44) yields (2.43). □

Remark 2.5

The identity in (2.43) can be expressed in terms of $σ^{2} (x)$ and $m (x)$ as

$\frac{σ^{2} (x)}{m (x) - 1} = \frac{σ^{2} (x + 1)}{m (x + 1)} + m (x + 1) - m (x) + 1 .$

Theorem 2.10

$σ^{2} (x) = E [m (X) (m (X - 1) - 1) | X > x], x = 0, 1, 2, \dots$

(2.45)

Proof

Starting from (2.43), we have

$σ^{2} (x + 1) - σ^{2} (x) = \frac{f (x + 1)}{S (x + 1)} [σ^{2} (x + 1) - m (x + 1) (m (x) - 1)],$

$S (x + 1) σ^{2} (x) - S (x + 2) σ^{2} (x + 1) = m (x + 1) (m (x) - 1) f (x + 1) .$

Adding the above identity for $x, x + 1, x + 2, \dots$ , we get

$S (x + 1) σ^{2} (x) = \sum_{t = x + 1}^{\infty} m (t) (m (t - 1) - 1) f (t)$

which is the same as (2.45). □

Remark 2.6

Theorem 2.10 can be employed to find quick estimates of the variance residual life when the estimated mean residual life is known.
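As an illustration of this remark, the following Python sketch computes the variance residual life in two ways for a small finite distribution: directly from the conditional distribution, and from the mean residual life through (2.45). It also checks the identity (2.43). The pmf is the set of horse-kick relative frequencies of Example 2.11, used here only as a convenient test case, and the convention $m (b) = 0$ at the upper end point b of the support is an assumption made so that the last term of the sum is defined.

```python
from fractions import Fraction

# Illustrative pmf on {0, 1, 2, 3}: relative frequencies from Example 2.11.
f = [Fraction(c, 199) for c in (129, 45, 22, 3)]
b = len(f) - 1

def S(t):
    """S(t) = P(X >= t)."""
    return sum(f[max(t, 0):], Fraction(0))

def m(x):
    """Mean residual life; set to 0 at x = b, where no residual life remains."""
    if x >= b:
        return Fraction(0)
    return sum(S(t) for t in range(x + 1, b + 1)) / S(x + 1)

def h(x):
    return f[x] / S(x)

def var_direct(x):
    """Variance of X - x given X > x, from the conditional pmf."""
    mass = S(x + 1)
    mean = sum((t - x) * f[t] for t in range(x + 1, b + 1)) / mass
    second = sum((t - x) ** 2 * f[t] for t in range(x + 1, b + 1)) / mass
    return second - mean ** 2

def var_from_mrl(x):
    """Right-hand side of (2.45)."""
    return sum(m(t) * (m(t - 1) - 1) * f[t] for t in range(x + 1, b + 1)) / S(x + 1)

checks_245 = [var_direct(x) == var_from_mrl(x) for x in range(0, b)]
checks_243 = [
    var_direct(x + 1) - var_direct(x)
    == h(x + 1) * (var_direct(x + 1) - m(x + 1) * (m(x) - 1))
    for x in range(0, b - 1)
]
print(checks_245, checks_243)
```

Exact rational arithmetic is used so that the two sides can be compared with `==` rather than with a tolerance.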

Unlike the cases of hazard rate and mean residual life functions, there is no inversion formula that expresses the survival function in terms of the variance residual life. Also, there are only a few standard distributions for which $σ^{2} (x)$ has simple tractable forms, as could be seen from Table 2.3. Therefore, characterizations of life distributions involving $σ^{2} (x)$ take the form of its relationship with other concepts. Hitha and Nair (1989) have shown that

$\frac{σ^{2} (x)}{m (x) (m (x) - 1)} = C, a constant,$

if and only if X is geometric for $C = 1$ , negative hyper-geometric for $C > 1$ , and Waring for $C < 1$ . We also have a more general result, due to Sudheesh and Nair (2010), satisfied by a class of distributions. We retain the notation used for Theorem 2.4.

Theorem 2.11

Let $k (X)$ be a real-valued function of X that is non-decreasing and satisfying the following conditions:

(i) $E [k^{2} (X)]$ and $E [| Δ k (X) | g_{k} (X)]$ are finite;
(ii) $f (0) > 0$ and the support of X is an integer interval;
(iii) $Δ k (x) \neq 0$ for all x.

Then, X has distribution specified by

$\frac{f (x + 1)}{f (x)} = \frac{σ_{k} g_{k} (x)}{σ_{k} g_{k} (x + 1) - μ_{k} + k (x + 1)}$

for some real-valued function $g_{k} (x)$ if and only if for all x

$\begin{matrix} V (k (X) | X > x) = σ_{k} E (Δ k (X) g_{k} (X) | X > x) \\ + (μ_{k} - m_{k} (x)) (m_{k} (x) - k (x + 1)), \end{matrix}$

where $V (\cdot)$ stands for variance.

Remark 2.7

Setting $k (X) = X$ , we have

$σ^{2} (x) = σ E [g (X) | X > x] + (μ - m^{⁎} (x)) (m^{⁎} (x) - x - 1)$

(2.46)

for all distributions in which the probability mass function satisfies

$\frac{f (x + 1)}{f (x)} = \frac{σ g (x)}{σ g (x + 1) - μ + x + 1} .$

Remark 2.8

From Eqs (2.39) and (2.42), it is evident that the property

$\frac{μ_{(2)} (x)}{m (x) (m (x) - 1)} = k,$

a positive constant, characterizes the geometric, Waring and negative hyper-geometric models.

2.5 Upper Partial Moments

A concept that is closely related to moments of residual life is that of partial moments, which can also be interpreted as moments of a different kind of residual life. The rth upper partial moment of X about a point x is defined as

$α_{r} (x) = E ({(X - x)}_{+}^{r}), r = 0, 1, \dots; x = 0, 1, 2, \dots,$

(2.47)

where ${(X - x)}_{+} = \max (X - x, 0)$ . In the case of discrete models, it is sometimes more convenient to work with factorial partial moments defined as

$α_{(r)} (x) = E ({(X - x)}_{+ (r)}), X > x + r - 1$

(2.48)

where

$t_{(r)} = t (t - 1) \dots (t - r + 1)$

(2.49)

is the descending factorial expression. By virtue of the relationship

${(X - x)}_{+}^{r} = \sum_{k = 0}^{r} S (r, k) {(X - x)}_{+ (k)},$

where $S (r, k)$ is the Stirling number of the second kind, $α_{r}$ can be computed in terms of $α_{(r)}$ . Nair et al. (2000) have studied several properties of $α_{(r)}$ and their implications to reliability modelling. First, we note that

$α_{1} (x) = α_{(1)} (x) = \sum_{t = x + 1}^{\infty} S (t),$

and hence

$S (x) = α_{1} (x - 1) - α_{1} (x), x = 0, 1, \dots,$

and

$f (x) = α_{1} (x - 1) - 2 α_{1} (x) + α_{1} (x + 1) .$

The upper partial moments satisfy the recurrence formula

$α_{(r)} (x) - α_{(r)} (x + 1) = r α_{(r - 1)} (x + 1), r = 1, 2, \dots .$

Thus, any one partial moment sequence, particularly $〈 α_{1} (x) 〉$ , $x = 0, 1, 2, \dots$ , determines all other partial moments. Since only the first two partial moments are of importance in reliability analysis we concentrate on their properties. Notice that

$α_{1} (x) - α_{1} (x + 1) = S (x + 1)$

(2.50)

so that

$m (x) = \frac{α_{1} (x)}{α_{1} (x) - α_{1} (x + 1)}$

(2.51)

and

$1 - h (x + 1) = \frac{S (x + 2)}{S (x + 1)} = \frac{α_{1} (x + 1) - α_{1} (x + 2)}{α_{1} (x) - α_{1} (x + 1)} .$

(2.52)

From (2.51), the ratio of the upper partial means at consecutive values gives

$\frac{α_{1} (x + 1)}{α_{1} (x)} = \frac{m (x) - 1}{m (x)} .$

Thus, from Theorem 2.2, we deduce the following result.

Theorem 2.12

If $E (X) < \infty$ , then

$\frac{α_{1} (x + 1)}{α_{1} (x)} = \frac{(l - 1) + a x}{l + a x}, l > 1,$

if and only if X has geometric, Waring, negative hyper-geometric distributions when $a = 0$ , $a > 0$ , and $a < 0$ , respectively.

The second factorial partial moment is

$\begin{matrix} α_{(2)} (x) & = E {[(X - x) (X - x - 1)]}^{+} = S (x + 1) μ_{(2)} (x) \\ = 2 \sum_{t = x + 1}^{\infty} \sum_{u = t + 1}^{\infty} S (u), \end{matrix}$

while

$E [{(X - x)}^{2} | X > x] = \frac{α_{2} (x)}{α_{1} (x) - α_{1} (x + 1)} .$

Thus, the variance residual life is calculated as

$σ^{2} (x) = \frac{α_{2} (x) (α_{1} (x) - α_{1} (x + 1)) - α_{1}^{2} (x)}{{[α_{1} (x) - α_{1} (x + 1)]}^{2}} .$
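The relations of this section between the partial moments and the reliability functions can be verified numerically. The sketch below is illustrative: the pmf is an arbitrary choice, and the partial moments are computed by direct summation rather than by any recurrence.

```python
from fractions import Fraction

# Illustrative pmf on {0, 1, 2, 3, 4} (arbitrary choice).
f = [Fraction(k, 15) for k in (5, 4, 3, 2, 1)]
b = len(f) - 1

def alpha(r, x):
    """r-th upper partial moment E[(X - x)_+^r], r >= 1, by direct summation."""
    return sum((t - x) ** r * f[t] for t in range(max(x + 1, 0), b + 1))

def S(t):
    """Survival function recovered as alpha_1(t - 1) - alpha_1(t)."""
    return alpha(1, t - 1) - alpha(1, t)

def mrl(x):
    """Mean residual life from (2.51)."""
    return alpha(1, x) / (alpha(1, x) - alpha(1, x + 1))

def vrl(x):
    """Variance residual life from alpha_1 and alpha_2."""
    d = alpha(1, x) - alpha(1, x + 1)
    return (alpha(2, x) * d - alpha(1, x) ** 2) / d ** 2

recovered_pmf = [
    alpha(1, x - 1) - 2 * alpha(1, x) + alpha(1, x + 1) for x in range(b + 1)
]
print(recovered_pmf == f, mrl(0), vrl(0))
```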

Example 2.14

Let

$f (x) = (a - b) \frac{{(b)}_{x}}{{(a)}_{x + 1}}, x = 0, 1, 2, \dots, a > b .$

Then, from Example 2.2, we have

$\begin{matrix} S (x) & = \frac{{(b)}_{x}}{{(a)}_{x}}, \\ α_{1} (x) & = \sum_{t = x + 1}^{\infty} \frac{{(b)}_{t}}{{(a)}_{t}} = \frac{{(b)}_{x + 1}}{{(a)}_{x}} [\frac{1}{a + x} + \frac{b + x + 1}{(a + x) (a + x + 1)} + \dots] \\ & = \frac{{(b)}_{x + 1}}{(a - b - 1) {(a)}_{x}}, \\ α_{(2)} (x) & = 2 \sum_{t = x + 1}^{\infty} \sum_{u = t + 1}^{\infty} S (u) \\ & = 2 \sum_{t = x + 1}^{\infty} \frac{{(b)}_{t + 1}}{(a - b - 1) {(a)}_{t}} = \frac{2 {(b)}_{x + 2}}{{(a)}_{x} (a - b - 1) (a - b - 2)} . \end{matrix}$

The ratios of partial moments admit simple forms in this case. For example,

$\frac{α_{1} (x + 1)}{α_{1} (x)} = \frac{b + x + 1}{a + x}$

and

$\frac{α_{(2)} (x + 1)}{α_{(2)} (x)} = \frac{b + x + 2}{a + x},$

both being homographic functions. A similar discussion of the properties is possible when the ascending factorial expression

$x^{(r)} = x (x + 1) \dots (x + r - 1)$

is used instead of $x_{(r)}$ in (2.48); for details see Priya et al. (2000).
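The closed form of $α_{1} (x)$ in Example 2.14 and the homographic ratio of consecutive partial means can be checked numerically. The sketch below is an illustration under assumptions: the parameter values are arbitrary, and the infinite sum is truncated at `upper`, which is harmless here because the terms decay like a high negative power of t when $a - b$ is large.

```python
def waring_survival(a, b, upper):
    """S(t) = (b)_t / (a)_t for t = 0..upper, via S(t+1) = S(t) (b+t)/(a+t)."""
    s = [1.0]
    for t in range(upper):
        s.append(s[-1] * (b + t) / (a + t))
    return s

a, b, upper = 9.0, 2.0, 20000   # illustrative parameters and truncation point
s = waring_survival(a, b, upper)

def alpha1_numeric(x):
    """Truncated version of alpha_1(x) = sum_{t > x} S(t)."""
    return sum(s[x + 1:])

def alpha1_closed(x):
    # (b)_{x+1} / ((a - b - 1) (a)_x) = S(x) (b + x) / (a - b - 1)
    return s[x] * (b + x) / (a - b - 1)

ratios = [alpha1_numeric(x + 1) / alpha1_numeric(x) for x in range(5)]
for x in range(5):
    print(x, alpha1_numeric(x), alpha1_closed(x), ratios[x], (b + x + 1) / (a + x))
```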

So far, we have considered reliability functions specified by the survival function $S (x)$ . A parallel theory, built on the event $X ⩽ x$ and hence on the distribution function, has also been of interest in the reliability literature. The resulting functions, referred to as reliability functions in reversed time, are found to be useful in modelling and analysis of lifetime data and also in other fields of study. We discuss some important functions in this category in the next few sections.

2.6 Reversed Hazard Rate

The reversed hazard rate of X is defined as

$λ (x) = P (X = x | X ⩽ x) = \frac{f (x)}{F (x)} .$

(2.53)

Thus, $λ (x)$ in the discrete case is interpreted as the conditional probability that a device fails at age x, given that its lifetime is at most x. Being a conditional probability, $0 ⩽ λ (x) ⩽ 1$ . Keilson and Sumita (1982), who first defined the reversed hazard rate in continuous time, called it the dual failure function by the property that X has reversed hazard rate $λ (x)$ , $- \infty ⩽ a ⩽ x ⩽ b < \infty$ if and only if the random variable −X has a hazard rate $λ (- x)$ on $(- b, - a)$ .

Finkelstein (2002) observed that, in reliability, one often works with non-negative random variables and therefore the above duality is not applicable. Further, the upper point of support is generally infinite. Thus, the properties of reversed hazard rate of non-negative random variables with infinite support cannot be formally obtained from those of the hazard rates. This makes a study of $λ (x)$ necessary in its own right. Since

$λ (x + 1) = \frac{f (x + 1)}{F (x + 1)} = \frac{F (x + 1) - F (x)}{F (x + 1)},$

it follows that

$F (x) = \prod_{t = x + 1}^{\infty} (1 - λ (t)) .$

(2.54)

Notice further that $λ (0) = 1$ and

$f (x) = λ (x) \prod_{t = x + 1}^{\infty} (1 - λ (t)) .$

(2.55)

Thus, $λ (x)$ determines the life distribution uniquely; see Nair and Asha (2004) for details and examples.

Example 2.15

We say that X follows arithmetic distribution if it has a probability mass function of the form

$f (x) = \frac{2 x}{b (b + 1)}, x = 1, 2, \dots, b .$

The corresponding distribution function is

$F (x) = \frac{x (x + 1)}{b (b + 1)},$

and hence

$λ (x) = \frac{2}{x + 1},$

a reciprocal linear function.
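Conversely, the distribution can be rebuilt from its reversed hazard rate through (2.54) and (2.55). The Python sketch below illustrates this for the arithmetic distribution; the value of b is arbitrary, and the infinite product is stopped at b because $λ (t) = 0$ beyond the support.

```python
from fractions import Fraction

b = 6  # illustrative upper end of the support 1..b

def reversed_hazard(x):
    """lambda(x) = 2 / (x + 1) for the arithmetic distribution."""
    return Fraction(2, x + 1)

def cdf_from_reversed_hazard(x):
    """(2.54), with the product stopped at b."""
    value = Fraction(1)
    for t in range(x + 1, b + 1):
        value *= 1 - reversed_hazard(t)
    return value

F = {x: cdf_from_reversed_hazard(x) for x in range(0, b + 1)}
f = {x: reversed_hazard(x) * F[x] for x in range(1, b + 1)}   # (2.55)
print(F, f)
```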

In the continuous case, Block et al. (1998) have shown that for an absolutely continuous random variable X with interval of support $(a, b)$ , $- \infty ⩽ a < b ⩽ \infty$ , if the reversed hazard rate is a constant $c > 0$ for all $a < x < b$ , then $a = - \infty$ , $b < \infty$ and

$F (x) = \exp [c (x - b)], x < b,$

and conversely. Thus, there does not exist an absolutely continuous distribution with constant reversed hazard rate on the positive real axis. We will now show that, in the discrete case, the reversed hazard rate can be constant when a subset of the set of nonnegative integers is taken as the support of X.

Example 2.16

Let X be distributed with probability mass function

$f (x) = \begin{cases} {(1 + c)}^{- b}, & x = 0, \\ c {(1 + c)}^{x - b - 1}, & x = 1, 2, \dots, b, \end{cases} \quad c > 0,$

(2.56)

where b is a finite positive integer greater than unity. Then,

$F (x) = \begin{cases} {(1 + c)}^{x - b}, & x = 0, 1, 2, \dots, b, \\ 1, & x > b . \end{cases}$

(2.57)

The reversed hazard rate in this case is

$λ (x + 1) = \frac{c {(1 + c)}^{x - b}}{{(1 + c)}^{x + 1 - b}} = \frac{c}{1 + c}, for all x = 0, 1, \dots, b - 1,$

and $λ (0) = 1$ .

Remark 2.9

The terms in (2.56), for $x = 1, 2, \dots$ , are in geometric progression with a common ratio $(1 + c)$ and therefore, the successive terms are increasing, as opposed to the usual geometric distribution where the monotonicity is in the opposite direction. We will term (2.56) as the reversed geometric distribution with parameter c.

This distribution has an important role in the sequel. Since it does not form part of the standard distributions discussed in the next chapter, some properties of the model (2.56) are presented here. First,

$\frac{f (x + 1)}{f (x)} = \begin{cases} c, & x = 0, \\ c + 1, & x = 1, 2, \dots, b - 1 . \end{cases}$

Thus, the distribution can be identified in practice for data for which the ratios of the successive frequencies, except the first, are nearly constant. The mean and variance are given by

$\begin{matrix} μ & = E (X) = c {(1 + c)}^{- b} \sum_{t = 1}^{b} t {(1 + c)}^{t - 1} \\ & = \frac{b c - 1}{c} + \frac{1}{c {(1 + c)}^{b}} \end{matrix}$

and

$\begin{matrix} σ^{2} & = c {(1 + c)}^{- b} \sum_{t = 1}^{b} t^{2} {(1 + c)}^{t - 1} - μ^{2} \\ & = \frac{1}{c} + \frac{2}{c^{2}} + b^{2} - \frac{2 b}{c} - (\frac{1}{c} + \frac{2}{c^{2}}) {(1 + c)}^{- b} - μ^{2} . \end{matrix}$

For the geometric distribution, the hazard rate is constant which is equivalent to the lack of memory property. An analogous property in a reverse sense is satisfied by the reversed geometric law, as the next theorem shows.

Theorem 2.13

Let X be a discrete random variable with a finite set $(0, 1, \dots, b)$ as support. Then,

$P (X ⩽ t | X ⩽ t + s) = P (X ⩽ 0 | X ⩽ s)$

(2.58)

for all $t, s$ in the support of X if and only if X has reversed geometric distribution.

Proof

Eq. (2.58) can be restated as

$F (t + s) F (0) = F (t) F (s) .$

(2.59)

Substituting (2.57) in (2.59), we readily have the ‘if’ part. Conversely, (2.59) is equivalent to the functional equation

$a (t + s) = a (t) a (s),$

where $a (t) = \frac{F (t)}{F (0)}$ . To solve for $a (t)$ , we set $s = 1$ , so that

$a (t + 1) = a (t) a (1) .$

Iterating for t,

$a (t) = {[a (1)]}^{t} for all t = 1, 2, \dots, b,$

and so

$F (x + 1) = \frac{{[F (1)]}^{x + 1}}{{[f (0)]}^{x}} .$

(2.60)

Thus,

$f (x + 1) = \frac{p^{x + 1}}{{[f (0)]}^{x}} - \frac{p^{x}}{{[f (0)]}^{x - 1}}, p = F (1) .$

Summing for $x = 0, 1, \dots, b - 1$ and using the fact that $\sum_{x = 0}^{b} f (x) = 1$ , we get $f (0) = p^{\frac{b}{b - 1}}$ . Inserting this value of $f (0)$ in (2.60), we get

$F (x) = p^{\frac{b - x}{b - 1}} .$

Since $F (1) < 1$ , we should have $p < 1$ and setting $p^{\frac{1}{b - 1}} = {(1 + c)}^{- 1}$ , the distribution in (2.57) is recovered, as needed. □

Remark 2.10

The reversed lack of memory property implies that, given that the lifetime of a device is at most x, the probability that the device fails at any age in $[0, x]$ is the same. Further, the distribution of such a lifetime is governed by the reversed geometric law.

Remark 2.11

From the above discussions, we observe that the following are equivalent:

(i) X has constant reversed hazard rate;
(ii) X follows reversed geometric law;
(iii) X has reversed lack of memory property.

Further discussions and applications of these results can be seen in Nair and Sankaran (2013).
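The equivalences listed in Remark 2.11 can be illustrated numerically for the reversed geometric law. The sketch below is only an illustration: the values of c and b are arbitrary, and the reversed lack of memory property (2.58) is checked for those pairs $(t, s)$ with $t + s ⩽ b$ , which is an assumption about the intended range.

```python
from fractions import Fraction

c, b = Fraction(1, 2), 5  # illustrative parameter values

def F(x):
    """Distribution function (2.57) of the reversed geometric law on 0..b."""
    return (1 + c) ** (x - b) if x <= b else Fraction(1)

def f(x):
    return F(x) - (F(x - 1) if x > 0 else 0)

# Reversed hazard rate f(x) / F(x) for x = 1..b.
reversed_hazards = [f(x) / F(x) for x in range(1, b + 1)]

# Reversed lack of memory: P(X <= t | X <= t + s) = P(X <= 0 | X <= s).
memory_checks = [
    F(t) / F(t + s) == F(0) / F(s)
    for t in range(b + 1)
    for s in range(b + 1 - t)
]
print(reversed_hazards, all(memory_checks))
```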

The definition in (2.53), when applied to the continuous case, has the form $λ (x) = \frac{d \log F (x)}{d x}$ . This property is not shared in the discrete case. For reasons similar to those explained in Section 2.2 regarding the alternative hazard rate, a second definition for reversed hazard rate can be put forward as

$λ_{1} (x) = \log \frac{F (x)}{F (x - 1)}, x = 1, 2, \dots .$

(2.61)

In this case,

$F (x - 1) = F (x) e^{- λ_{1} (x)},$

$F (x) = \exp [- \sum_{t = x + 1}^{\infty} λ_{1} (t)] .$

Also,

$f (x) = \exp [- \sum_{t = x + 1}^{\infty} λ_{1} (t)] [1 - e^{- λ_{1} (x)}], x = 1, 2, \dots,$

and $f (0)$ is determined from

$f (0) = 1 - \sum_{x = 1}^{\infty} f (x) .$

Example 2.17

The reversed geometric distribution specified by

$F (x) = {(1 + c)}^{x - b}, x = 0, 1, \dots, b,$

has

$λ_{1} (x) = \log \frac{{(1 + c)}^{x - b}}{{(1 + c)}^{x - b - 1}} = \log (1 + c),$

which is a constant for all x.

From the definitions in (2.61) and (2.53), the relationship between $λ (x)$ and $λ_{1} (x)$ is found to be

$λ (x) = 1 - e^{- λ_{1} (x)} .$

(2.62)

It may be noticed that $λ_{1} (0) = \infty$ , which is consistent with the value $λ (0) = 1$ .
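The alternative reversed hazard rate (2.61), the reconstruction of $F (x)$ from it, and the relationship (2.62) can be checked numerically. The sketch below uses the arithmetic distribution of Example 2.15 with an arbitrary b, and restricts $λ_{1} (x)$ to $x = 2, \dots, b$ , where $F (x - 1) > 0$ .

```python
import math

b = 6  # illustrative upper end of the support 1..b

def F(x):
    """Arithmetic distribution function; F(0) = 0."""
    return x * (x + 1) / (b * (b + 1))

def lambda1(x):
    """Alternative reversed hazard rate (2.61), for x = 2..b."""
    return math.log(F(x) / F(x - 1))

def F_from_lambda1(x):
    """F(x) = exp(-sum_{t > x} lambda_1(t)), the sum stopping at b."""
    return math.exp(-sum(lambda1(t) for t in range(x + 1, b + 1)))

# Conventional reversed hazard rate f(x) / F(x) for comparison with (2.62).
lam = {x: (F(x) - F(x - 1)) / F(x) for x in range(2, b + 1)}
for x in range(2, b + 1):
    print(x, lam[x], 1 - math.exp(-lambda1(x)))
```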

2.7 Reversed Mean Residual Life

A second measure of interest in reversed time is the reversed mean residual life. Suppose a device has failed before attaining age t. Then, the random variable ${}_{t}X = t - X | (X < t)$ is the time elapsed since the device has failed, conditioned on the fact that its lifetime is less than t, and this is referred to as the reversed residual life or inactivity time of X. It is easy to see that ${}_{t}X$ has the distribution function

${}_{t}F_{x} = \frac{F (t - 1) - F (t - x - 1)}{F (t - 1)}, x = 0, 1, 2, \dots .$

(2.63)

The mean of this distribution is called the reversed mean residual life or mean inactivity time, and is denoted by $r (x)$ . One can also define $r (x)$ as

$r (x) = E (x - X | X < x) = \frac{1}{F (x - 1)} \sum_{t = 0}^{x - 1} (x - t) f (t), x = 1, 2, \dots .$

(2.64)

We define $r (0) = 0$ . Note also that $r (1) = 1$ . Goliforushani and Asadi (2008) have established the following properties of $r (x)$ :

(i) $x - r (x)$ is an increasing function with $0 ⩽ r (x) ⩽ x$ ;
(ii) $μ = x - F (x - 1) r (x) + S (x) (m (x - 1) - 1)$ ;
(iii) $r (x)$ cannot be a decreasing function for all x;
(iv)

$λ (x) = \frac{1 + r (x) - r (x + 1)}{r (x)}, x = 1, 2, \dots;$

(v) If X has a finite support as $(0, 1, 2, \dots, b)$ , then

$F (x) = F (0) \prod_{t = 1}^{x} (\frac{r (t)}{r (t + 1) - 1}), x = 1, 2, \dots, b,$

(2.65)

$F (0) = {[\prod_{t = 1}^{b} \frac{r (t)}{r (t + 1) - 1}]}^{- 1};$

(2.66)

(vi) If $λ (x)$ is decreasing function of x, then $r (x)$ is an increasing function of x;
(vii)

$r (x) = \frac{\sum_{t = 1}^{x} F (t - 1)}{F (x - 1)} .$
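Some of the properties above can be verified on a small example. The Python sketch below is illustrative: the pmf is an arbitrary choice, $r (x)$ is computed directly as the conditional expectation $E (x - X | X < x)$ , and the distribution function is rebuilt backwards from $F (b) = 1$ using the ratio $F (x - 1) / F (x) = (r (x + 1) - 1) / r (x)$ implied by property (iv).

```python
from fractions import Fraction

# Illustrative pmf on {0, 1, ..., 4} (arbitrary choice).
f = [Fraction(k, 15) for k in (1, 2, 3, 4, 5)]
b = len(f) - 1

def F(x):
    return sum(f[: x + 1], Fraction(0))

def r(x):
    """Reversed mean residual life E(x - X | X < x), for x = 1..b+1."""
    return sum((x - t) * f[t] for t in range(x)) / F(x - 1)

def reversed_hazard(x):
    return f[x] / F(x)

# Property (iv) and property (vii).
check_iv = [
    reversed_hazard(x) == (1 + r(x) - r(x + 1)) / r(x) for x in range(1, b + 1)
]
check_vii = [
    r(x) == sum(F(t - 1) for t in range(1, x + 1)) / F(x - 1)
    for x in range(1, b + 2)
]

# Rebuild F backwards from F(b) = 1.
rebuilt = {b: Fraction(1)}
for x in range(b, 0, -1):
    rebuilt[x - 1] = rebuilt[x] * (r(x + 1) - 1) / r(x)
print(all(check_iv), all(check_vii), rebuilt)
```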

It is of interest to mention here that the notions of residual life and inactivity time have been extended to general coherent systems and their properties, mixture representations and stochastic orderings of various forms have been discussed by numerous authors; see, for example, Navarro et al. (2008, 2013), Zhang (2010), Goliforushani and Asadi (2011), Goliforushani et al. (2012), and Parvardeh and Balakrishnan (2013, 2014).

Example 2.18

For the geometric distribution $G (p)$ , we have

$F (x) = 1 - q^{x + 1},$

and hence

$\begin{matrix} r (x) & = \frac{1}{F (x - 1)} [F (0) + \dots + F (x - 1)] \\ = \frac{x p - q (1 - q^{x})}{p (1 - q^{x})} . \end{matrix}$

Example 2.19

Consider the discrete uniform distribution

$f (x) = \frac{1}{b}, x = 1, \dots, b .$

In this case,

$F (x) = \frac{x}{b}$

and

$r (x) = \frac{b}{x - 1} (\frac{1}{b} + \frac{2}{b} + \dots + \frac{x - 1}{b}) = \frac{x}{2} .$

Notice that in this case $λ (x) = \frac{1}{x}$ and that the product of $r (x)$ and $λ (x)$ is $\frac{1}{2}$ for all x.

In the case of geometric distribution, the hazard rate $h (x)$ is constant and so is the mean residual life function $m (x)$ , the two being related by $m (x) h (x) = 1$ . We will show that a different scenario exists in the relationship between $λ (x)$ and $r (x)$ . For the reversed geometric law, $λ (x) = \frac{c}{1 + c}$ , whereas

$r (x) = \frac{1 + c}{c} [1 - \frac{1}{{(1 + c)}^{x}}]$

which is a strictly increasing function of x. However, we can recover a distribution with constant reversed mean residual life by modifying the probability at $x = 0$ .

Theorem 2.14

A random variable with the support $(0, 1, 2, \dots, b)$ has reversed mean residual life

$r (x) = \frac{c + 1}{c}, c > 0, x = 2, 3, \dots, b,$

(2.67)

if and only if its distribution is given by

$F (x) = \begin{cases} c^{- 1} {(1 + c)}^{1 - b}, & x = 0, \\ {(1 + c)}^{x - b}, & x = 1, 2, \dots, b . \end{cases}$

(2.68)

Proof

For the distribution in (2.68), we have

$\begin{matrix} r (x) & = {(1 + c)}^{b - x + 1} [\frac{1}{c {(1 + c)}^{b - 1}} + {(1 + c)}^{1 - b} + \dots + {(1 + c)}^{x - 1 - b}] \\ = {(1 + c)}^{b - x + 1} [\frac{1}{c {(1 + c)}^{b - 1}} + \frac{(1 + c)}{c} \frac{({(1 + c)}^{x - 1} - 1)}{{(1 + c)}^{b}}] \\ = \frac{1 + c}{c} . \end{matrix}$

The converse follows upon noting from (2.67) that

$\frac{r (x)}{r (x + 1) - 1} = (1 + c)$

and then applying (2.65) and the fact that $\sum_{0}^{b} f (x) = 1$ . □

Remark 2.12

The probability mass function corresponding to (2.68) is

$f (x) = \begin{cases} c^{- 1} {(1 + c)}^{1 - b}, & x = 0, \\ (c - 1) c^{- 1} {(1 + c)}^{1 - b}, & x = 1, \\ c {(1 + c)}^{x - b - 1}, & x = 2, 3, \dots, b . \end{cases}$

Accordingly, we have

$λ (x) = \begin{cases} 1, & x = 0, \\ (c - 1) c^{- 1}, & x = 1, \\ c {(c + 1)}^{- 1}, & x = 2, 3, \dots, b . \end{cases}$

Remark 2.13

In comparison with the reversed geometric model, there is a difference in the value of $λ (1)$ for the model in (2.68). It may also be noted that

$λ (x) r (x) = 1, x = 2, 3, \dots, b,$

for the distribution in (2.68). This is a characteristic property as evidenced by the identity

$λ (x) r (x) = 1 + r (x) - r (x + 1)$

which gives

$r (x + 1) - r (x) = 0$

for all x, or $r (x)$ is a constant.

The reversed mean residual life can also be related to the alternative reversed hazard rate. From (2.62) and the identity in (iv) above, we have

$λ_{1} (x) = \log \frac{r (x)}{r (x + 1) - 1}, x = 1, 2, \dots .$

In certain problems, it is more convenient to deal with the conditional mean

$r^{⁎} (x) = E (X | X ⩽ x) = x + 1 - r (x + 1) .$

The corresponding results for $r (x)$ are easily derived from those of $r^{⁎} (x)$ by using the above identity.

Relationships between $λ (x)$ and $r (x)$ of a different nature than those indicated above, which characterize families of discrete distributions, have been proposed in the literature. The main results reviewed here are from Gupta et al. (2006) and Nair and Sudheesh (2008). We retain the same notation as in Section 2.3. Let $k (x)$ be a real-valued function such that $E k^{2} (X) < \infty$ . Then, the probability mass function of X will be of the form

$\frac{f (x + 1)}{f (x)} = \frac{σ_{k} g_{k} (x)}{σ_{k} g_{k} (x + 1) - μ_{k} + k (x + 1)},$

(2.69)

where $μ_{k}$ and $σ_{k}$ are the mean and standard deviation of $k (X)$ satisfying $σ_{k} g_{k} (0) = μ_{k} - k (0)$ for some real-valued function $g_{k} (\cdot)$ if and only if

$r_{k}^{⁎} (x) = μ_{k} - σ_{k} g_{k} (x) λ (x)$

(2.70)

with $r_{k}^{⁎} (x) = E (k (X) | X ⩽ x)$ . The $g_{k}$ function appearing in the above relationships is unique for a particular distribution and often assumes simple forms. Special cases of the form in (2.69), which include various families like the discrete Pearson and Katz families and several individual distributions, are discussed in Chapter 3. In practice, the formulas in (2.69) and (2.70) will work if we replace $σ_{k} g_{k} (x)$ by a function $a_{k} (x)$ , which can be determined from the data, without actually using $σ_{k}$ .

When any two of the functions $h (x)$ , $λ (x)$ and $F (x)$ are known, the third can be determined from the identity

$F (x) = h (x) {[h (x) + λ (x) - λ (x) h (x)]}^{- 1} .$

Similarly, we also have

$S (x) = λ (x) {[h (x) + λ (x) - λ (x) h (x)]}^{- 1}$

and

$f (x) = λ (x) h (x) {[h (x) + λ (x) - λ (x) h (x)]}^{- 1} .$

From the last three forms, we arrive at

$\frac{f (x + 1)}{f (x)} = \frac{h (x + 1) (1 - h (x))}{h (x)} = \frac{λ (x + 1)}{λ (x) [1 - λ (x + 1)]},$

(2.71)

which will be useful in later chapters.
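The three identities expressing $F (x)$ , $S (x)$ and $f (x)$ through $h (x)$ and $λ (x)$ , together with the ratio formula (2.71), can be confirmed on any finite distribution. The sketch below is illustrative, with an arbitrary pmf and with $S (x) = P (X ⩾ x)$ as elsewhere in this chapter.

```python
from fractions import Fraction

f = [Fraction(k, 10) for k in (1, 2, 3, 4)]  # illustrative pmf on {0, 1, 2, 3}

def F(x):
    return sum(f[: x + 1], Fraction(0))

def S(x):
    """S(x) = P(X >= x)."""
    return sum(f[x:], Fraction(0))

def h(x):
    return f[x] / S(x)

def lam(x):
    return f[x] / F(x)

def denom(x):
    return h(x) + lam(x) - lam(x) * h(x)

identity_checks = [
    (
        F(x) == h(x) / denom(x),
        S(x) == lam(x) / denom(x),
        f[x] == lam(x) * h(x) / denom(x),
    )
    for x in range(len(f))
]
ratio_checks = [
    f[x + 1] / f[x]
    == h(x + 1) * (1 - h(x)) / h(x)
    == lam(x + 1) / (lam(x) * (1 - lam(x + 1)))
    for x in range(len(f) - 1)
]
print(identity_checks, ratio_checks)
```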

2.8 Reversed Variance Residual Life

Just as the mean of the reversed residual life ${}_{x}X$ , the variance of ${}_{x}X$ is also an important function in reliability analysis. It is called the reversed variance residual life or variance inactivity time, and is denoted by $v (x)$ . In algebraic manipulations, different expressions for $v (x)$ have been employed. These are

$\begin{matrix} v (x) & = V (x - X | X < x) = V (X | X < x) \\ = E (X^{2} | X < x) - E^{2} (X | X < x) \end{matrix}$

(2.72)

and

$v (x) = E [{(x - X)}^{2} | X < x] - r^{2} (x) .$

(2.73)

Further,

$\begin{matrix} E ({(x - X)}^{2} | X < x) & = \frac{1}{F (x - 1)} \sum_{t = 0}^{x - 1} {(x - t)}^{2} f (t) \\ & = \frac{1}{F (x - 1)} \sum_{t = 0}^{x - 1} {(x - t)}^{2} [F (t) - F (t - 1)] \\ & = \frac{2}{F (x - 1)} \sum_{t = 1}^{x} (x - t + 1) F (t - 1) - r (x) \\ & = \frac{2}{F (x - 1)} \sum_{t = 1}^{x} \sum_{u = 1}^{t} F (u - 1) - r (x), \end{matrix}$

where $F (- 1) = 0$ , so that

$v (x) = \frac{2}{F (x - 1)} \sum_{t = 1}^{x} \sum_{u = 1}^{t} F (u - 1) - r (x) (r (x) + 1) .$

(2.74)

Example 2.20

Consider the geometric distribution $G (p)$ in Example 2.2, for which $F (x) = 1 - q^{x + 1}$ . Then, the conditional probability mass function of $X | (X < x)$ is given by

$f_{x} (t) = \frac{p q^{t}}{1 - q^{x}}, t = 0, 1, \dots, x - 1 .$

(2.75)

We use (2.72) to calculate $v (x)$ . For this, from (2.75), we have

$\sum_{t = 0}^{x - 1} p q^{t} = 1 - q^{x} .$

Differentiating with respect to q, we get

$\sum t p q^{t - 1} = \sum_{0}^{x - 1} q^{t} - x q^{x - 1} .$

Simplifying so as to make the left hand side an expected value, we get

$\sum_{t = 0}^{x - 1} \frac{t p q^{t}}{1 - q^{x}} = E (X | X < x) = \frac{q}{p} - \frac{x q^{x}}{1 - q^{x}} .$

(2.76)

Differentiating (2.76) again with respect to q and rearranging the terms in the same manner, we get

$E (X^{2} | X < x) = \frac{q (1 + q) - q^{x} ((1 + q) (p x + q) + x (x - 1) p^{2})}{p^{2} (1 - q^{x})} .$

(2.77)

The variance is now found from (2.72).
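The closed forms (2.76) and (2.77) can be checked against direct summation over the conditional probability mass function (2.75). The Python sketch below is illustrative; the function names and the value of p are arbitrary choices.

```python
def conditional_moments(p, x):
    """E(X | X < x) and E(X^2 | X < x) from (2.76) and (2.77)."""
    q = 1 - p
    first = q / p - x * q ** x / (1 - q ** x)
    second = (
        q * (1 + q) - q ** x * ((1 + q) * (p * x + q) + x * (x - 1) * p ** 2)
    ) / (p ** 2 * (1 - q ** x))
    return first, second

def direct_moments(p, x):
    """The same moments by summing over the conditional pmf (2.75)."""
    q = 1 - p
    weights = [p * q ** t / (1 - q ** x) for t in range(x)]
    first = sum(t * w for t, w in enumerate(weights))
    second = sum(t * t * w for t, w in enumerate(weights))
    return first, second

p = 0.4
v = {}
for x in range(1, 8):
    first, second = conditional_moments(p, x)
    v[x] = second - first ** 2   # reversed variance residual life via (2.72)
print(v)
```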

Various reliability functions in reversed residual lifetime for some distributions are exhibited in Table 2.4 and the graph of these functions for arithmetic distribution is presented in Fig. 2.3.

Table 2.4

The reversed hazard rate, reversed mean residual life and reversed variance residual life for some distributions

Distribution	$F (x)$	$λ (x)$	$r (x)$	$v (x)$
geometric	$1 - q^{x + 1}$	$\frac{q^{x} p}{1 - q^{x + 1}}$	$\frac{x p - q (1 - q^{x})}{p (1 - q^{x})}$	(2.76) and (2.77)
uniform	$\frac{x}{b}$ , $x = 1, 2, \dots, b$	$\frac{1}{x}$	$\frac{x}{2}$	$\frac{x (x - 2)}{12}$
arithmetic	$\frac{x (x + 1)}{b (b + 1)}$ , $x = 1, 2, \dots, b$	$\frac{2}{x + 1}$	$\frac{x + 1}{3}$	$\frac{(x + 1) (x - 2)}{18}$
reversed geometric	${(1 + c)}^{x - b}$ , $x = 0, 1, \dots, b$	$\frac{c}{1 + c}$	$\frac{1 + c}{c} [1 - \frac{1}{{(1 + c)}^{x}}]$	$\frac{[{(1 + c)}^{x} - 1] [{(1 + c)}^{x - 1} + 1] - 2 c x {(1 + c)}^{x - 1}}{c^{2} {(1 + c)}^{2 (x - 1)}}$

Figure 2.3 Reliability functions in reversed time for the arithmetic distribution.

In the continuous case, identities that connect the three functions $λ (x)$ , $r (x)$ and $v (x)$ have been established. The corresponding results in the discrete case are

$v (x + 1) - v (x) = λ (x) [r (x) (r (x + 1) - 1) - v (x)]$

(2.78)

and

$\frac{v (x + 1)}{r (x + 1) - 1} + r (x + 1) - 1 = \frac{v (x)}{r (x)} + r (x) .$

(2.79)

Eqs (2.78) and (2.79) are employed in finding $v (x)$ when the others are known, especially in characterization problems and also in the discussions on the monotonicity of the reversed variance residual life function.
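Both identities can be verified numerically for any finite-range law. The sketch below does so for the arithmetic distribution, computing $λ (x)$, $r (x)$ and $v (x)$ directly from the probability mass function and testing the identities in the forms $v (x + 1) - v (x) = λ (x) [r (x) (r (x + 1) - 1) - v (x)]$ and (2.79). The helper names are hypothetical and the choice $b = 8$ is arbitrary.

```python
from fractions import Fraction

def arithmetic_pmf(b):
    """f(t) = 2t / (b(b+1)), t = 1, ..., b, so that F(x) = x(x+1) / (b(b+1))."""
    return {t: Fraction(2 * t, b * (b + 1)) for t in range(1, b + 1)}

def reversed_functions(pmf, x):
    """Return (lambda(x), r(x), v(x)) for a pmf given as {value: probability}."""
    F_x = sum(p for t, p in pmf.items() if t <= x)
    below = {t: p for t, p in pmf.items() if t < x}
    F_below = sum(below.values())
    lam = pmf.get(x, Fraction(0)) / F_x               # f(x) / F(x)
    mean = sum(t * p for t, p in below.items()) / F_below
    second = sum(t * t * p for t, p in below.items()) / F_below
    return lam, x - mean, second - mean ** 2          # r(x) = E(x - X | X < x)

def identities_hold(pmf, x):
    """Check the two difference identities at a single point x >= 2."""
    lam, r, v = reversed_functions(pmf, x)
    _, r_next, v_next = reversed_functions(pmf, x + 1)
    eq_278 = v_next - v == lam * (r * (r_next - 1) - v)
    eq_279 = v_next / (r_next - 1) + r_next - 1 == v / r + r
    return eq_278 and eq_279
```

For `arithmetic_pmf(8)` the identities hold at every `x` from 2 to 7, and `reversed_functions(pmf, 5)` returns the Table 2.4 values $λ = 1/3$, $r = 2$ and $v = 1$.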

As in the case of the usual mean and variance residual lives, we have

$r (x) = E [\frac{1}{λ (X)} | X < x]$

and

$v (x) = E [r (X) (r (X + 1) - 1) | X < x] .$

There are some special relationships between $λ (x)$ , $r (x)$ and $v (x)$ that characterize certain families of distributions. Some important results in this connection are presented in the next two theorems.

Theorem 2.15

For a random variable X with the support $(1, 2, \dots, b)$ , the relationship

$λ (x) r (x) = c, x = 2, 3, \dots, b, 0 < c < 1,$

(2.80)

holds if and only if the distribution of X is

$F (x) = \frac{(b - 1)! {(θ)}_{x}}{(x - 1)! {(θ)}_{b}}, x = 1, 2, \dots, b,$

(2.81)

with $θ = \frac{c}{1 - c}$ , a positive integer.

Proof

Suppose (2.80) holds for X. From

$λ (x) = \frac{1 + r (x) - r (x + 1)}{r (x)},$

we have the difference equation

$r (x + 1) - r (x) = 1 - c .$

Iterating for x, with the boundary condition $r (2) = 1$ , we get

$r (x + 1) = c + (1 - c) x$

and hence

$\frac{r (x + 1) - 1}{r (x)} = \frac{x - 1}{θ + x - 1} .$

Substituting in (2.65), we obtain (2.81). To prove the converse, from (2.81), we have

$λ (x) = \frac{θ}{θ + x - 1} = \frac{c}{c + (1 - c) (x - 1)}$

so that $r (x) λ (x) = c$ . □

Theorem 2.16

The random variable X in Theorem 2.15 satisfies the property

$\frac{v (x)}{r (x) (r (x) - 1)} = k,$

(2.82)

a constant in $(0, 1]$ , if and only if its distribution is (2.81) when $k < 1$ and the distribution is (2.68) when $k = 1$ .

Proof

First, we note that

$\begin{matrix} r_{2} (x) = E [{(x - X)}^{2} | X < x] = \frac{1}{F (x - 1)} \sum_{t = 1}^{x - 1} {(x - t)}^{2} f (t) \\ = \frac{2}{F (x - 1)} \sum_{t = 1}^{x - 1} (x - t) F (t) - r (x), \end{matrix}$

$[r_{2} (x) + r (x)] F (x - 1) = 2 \sum_{t = 1}^{x - 1} (x - t) F (t) .$

(2.83)

Changing x to $x + 1$ in (2.83) and then subtracting (2.83), we get

$r_{2} (x + 1) r (x) - r_{2} (x) (r (x + 1) - 1) - r (x) (r (x + 1) - 1) = r (x) r (x + 1) .$

(2.84)

When (2.82) is satisfied with $k = 1$ , from

$v (x) + r^{2} (x) = r_{2} (x)$

and (2.82), we obtain

$r_{2} (x) = r (x) (r (x) - 1) + r^{2} (x) .$

Substituting in (2.84) and simplifying, we get

$r (x + 1) - r (x) = 0$

and hence $r (x)$ is a constant and the distribution is (2.68). On the other hand, when the distribution is (2.68), $r (x) = \frac{1 + c}{c}$ and then (2.74) yields

$\begin{matrix} v (x) & = \frac{2 (c + 1)}{c F (x - 1)} [\sum_{t = 2}^{x} {(1 + c)}^{t - 1 - b} + \frac{{(1 + c)}^{t - b}}{c}] - \frac{{(1 + c)}^{2}}{c^{2}} - \frac{1 + c}{c} \\ = \frac{2 (c + 1)}{c {(1 + c)}^{x - 1}} [(1 + c) ({(1 + c)}^{x - 1} - 1) \frac{1 + c}{c}] - \frac{{(1 + c)}^{2}}{c^{2}} - \frac{1 + c}{c} \\ = \frac{c + 1}{c^{2}} . \end{matrix}$

Also, $r (x) [r (x) - 1] = \frac{c + 1}{c^{2}}$ , so that the theorem is proved when $k = 1$ . In the more general case, when $k < 1$ , assume that X has distribution (2.81). Then,

$r (x) = c + (1 - c) (x - 1) = \frac{θ + x - 1}{θ + 1} .$

(2.85)

Further, in formula (2.74), we have

$\begin{matrix} \frac{2}{F (x - 1)} \sum_{t = 1}^{x} \sum_{u = 1}^{t} F (u - 1) & = \frac{2}{F (x - 1)} \sum_{t = 2}^{x} r (t) F (t - 1) \\ = \frac{2 (b - 1)!}{F (x - 1) (θ + 1) {(θ)}_{b}} \sum_{t = 2}^{x} \frac{(θ + t - 1) {(θ)}_{t - 1}}{(t - 2)!} \\ = \frac{2 (b - 1)!}{(θ + 1) {(θ)}_{b} F (x - 1)} \sum_{t = 2}^{x} \frac{{(θ)}_{t}}{(t - 2)!} . \end{matrix}$

(2.86)

Upon using the combinatorial identity

$\sum_{j = 0}^{r} (\begin{matrix} j + a - 1 \\ a - 1 \end{matrix}) = (\begin{matrix} r + a \\ a \end{matrix}),$

we have

$\sum_{t = 2}^{x} \frac{{(θ)}_{t}}{(t - 2)!} = \frac{θ (θ + 1) (x + θ)!}{(θ + 2)! (x - 2)!} .$

Substituting for $F (x - 1)$ from (2.81) and simplifying (2.86), we get

$\frac{2}{F (x - 1)} \sum_{t = 1}^{x} \sum_{u = 1}^{t} F (u - 1) = \frac{2 (θ + x - 1) (θ + x)}{(θ + 1) (θ + 2)} .$

Now from (2.85),

$r (x) [r (x) + 1] = \frac{(2 θ + x) (θ + x - 1)}{{(θ + 1)}^{2}}$

and so the formula (2.74) yields

$v (x) = \frac{θ (θ + x - 1) (x - 2)}{{(θ + 1)}^{2} (θ + 2)} .$

Thus,

$\frac{v (x)}{r (x) (r (x) - 1)} = \frac{θ}{θ + 2} = k < 1 .$

Conversely, under the hypothesis of the theorem, we have

$v (x) = k r (x) (r (x) - 1) .$

With the aid of (2.79), we write

$k \frac{r (x + 1) (r (x + 1) - 1)}{r (x + 1) - 1} + r (x + 1) - 1 = \frac{k r (x) (r (x) - 1)}{r (x)} + r (x) .$

The last equation leads to

$r (x + 1) - r (x) = \frac{1 - k}{1 + k}$

which on solving using the boundary condition $r (2) = 1$ , gives

$r (x) = 1 + \frac{1 - k}{1 + k} (x - 2) .$

Setting $k = \frac{c}{2 - c}$ so that $0 < c < 1$ as stipulated, we have

$r (x) = c + (1 - c) (x - 1)$

and hence the distribution is (2.81). □

Remark 2.14

We can write the distribution function in (2.81) also as

$F (x) = \frac{(\begin{matrix} θ + x - 1 \\ x - 1 \end{matrix})}{(\begin{matrix} θ + b - 1 \\ b - 1 \end{matrix})}, x = 1, 2, \dots, b .$

(2.87)

Remark 2.15

Eq. (2.81) represents a family of finite range distributions that contains the uniform distribution for $θ = 1$ and the arithmetic law for $θ = 2$ .
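The constants in Theorems 2.15 and 2.16 can be confirmed numerically for a member of the family (2.81). The following sketch builds $F (x)$ from the rising factorial ${(θ)}_{x} = θ (θ + 1) \cdots (θ + x - 1)$ and computes the reversed functions from first principles. The function names and the values $θ = 3$, $b = 10$ are illustrative assumptions.

```python
from fractions import Fraction
from math import factorial

def rising(theta, n):
    """Rising factorial (theta)_n = theta (theta + 1) ... (theta + n - 1)."""
    out = Fraction(1)
    for j in range(n):
        out *= theta + j
    return out

def cdf(theta, b):
    """List [F(0), F(1), ..., F(b)] for the family (2.81), with F(0) = 0."""
    F = [Fraction(0)]
    for x in range(1, b + 1):
        F.append(Fraction(factorial(b - 1)) * rising(theta, x)
                 / (factorial(x - 1) * rising(theta, b)))
    return F

def reversed_functions(F, x):
    """lambda(x), r(x), v(x) for x >= 2, computed from the distribution function."""
    f = [F[t] - F[t - 1] for t in range(1, len(F))]   # f[t - 1] = P(X = t)
    lam = f[x - 1] / F[x]
    mean = sum(t * f[t - 1] for t in range(1, x)) / F[x - 1]
    second = sum(t * t * f[t - 1] for t in range(1, x)) / F[x - 1]
    return lam, x - mean, second - mean ** 2
```

With `F = cdf(3, 10)`, the product $λ (x) r (x)$ equals $θ / (θ + 1) = 3/4$ for all $x ⩾ 2$, and the ratio $v (x) / [r (x) (r (x) - 1)]$ equals $θ / (θ + 2) = 3/5$ for all $x ⩾ 3$ (at $x = 2$ both $v$ and $r - 1$ vanish). Setting `theta` to 1 or 2 reproduces the uniform and arithmetic distribution functions.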

Arising from the characteristics of the finite range laws discussed above, we have some further characterizations. These results can be proved by invoking the Cauchy-Schwarz inequality, as done in Theorems 2.6 and 2.7.

Theorem 2.17

Let X be a discrete random variable with the support $(0, 1, \dots, b)$ . Then:

(i) $E (\frac{1}{λ (X)}) ⩾ \frac{1}{E (λ (X))}$ with the equality holding if and only if X has reversed geometric law;
(ii) $E (\frac{1}{r (X)}) ⩾ \frac{1}{E (r (X))}$ and the equality holds if and only if X follows distribution (2.68).

Theorem 2.18

Let X be a discrete random variable with the support $(1, 2, \dots, b)$ . Then,

$E (\frac{λ (X)}{r (X)}) ⩾ \frac{1}{E (\frac{λ (X)}{r (X)})}$

if and only if the distribution of X is (2.81).

2.9 Odds Function

Along with the traditional reliability functions presented so far, there has been some interest in discovering the potential of the odds function and the log-odds function in reliability analysis. The motivations for considering these two functions are that (i) they are easy to compute and interpret, (ii) their estimation is relatively simple, and (iii) the behaviour of other reliability functions can be ascertained through them.

The concept of odds ratio originated from gambling, wherein the odds of an event A against another event B is defined as the ratio $P (A) / P (B)$ . In reliability theory, we can take the event A as survival beyond age x and B as failure by age x. Then, the odds ratio of the events becomes a function of x. The odds ratio for surviving age x is defined as

$ω (x) = \frac{P (X > x)}{P (X ⩽ x)} = \frac{S (x + 1)}{F (x)}, x = - 1, 0, 1, \dots,$

(2.88)

and it is called the odds function for survival. Similarly, the odds function for failure by age x is

$\bar{ω} (x) = \frac{F (x)}{S (x + 1)} .$

From the definitions, it follows that

(i) $ω (- 1) = \infty$ , $ω (\infty) = 0$ and $ω (x)$ is decreasing,
(ii) $\bar{ω} (- 1) = 0$ , $\bar{ω} (\infty) = \infty$ and $\bar{ω} (x)$ is increasing.

Odds functions ω and $\bar{ω}$ are important tools in survival analysis and medical studies in developing models for survival data and in comparing a treatment group with a control group. We refer to Collett (1994) and Kirmani and Gupta (2001) for further details. There has not been much study about the role of odds functions in reliability analysis, especially in the discrete case. We note that

$ω (x) = \frac{1}{F (x)} - 1, x = 0, 1, \dots,$

and

$\bar{ω} (x) = \frac{1}{S (x + 1)} - 1, x = 0, 1, \dots, \bar{ω} (- 1) = 0 .$

Accordingly, both $ω (x)$ and $\bar{ω} (x)$ determine the distribution of X uniquely through the expressions

$F (x) = \frac{1}{1 + ω (x)} = \frac{\bar{ω} (x)}{1 + \bar{ω} (x)}$

(2.89)

and

$S (x + 1) = \frac{ω (x)}{1 + ω (x)} = \frac{1}{1 + \bar{ω} (x)} .$

(2.90)

It is easy to see that the hazard and reversed hazard rates are

$h (x) = \frac{\bar{ω} (x) - \bar{ω} (x - 1)}{1 + \bar{ω} (x)}, x = 0, 1, 2, \dots,$

(2.91)

and

$λ (x) = \frac{ω (x - 1) - ω (x)}{1 + ω (x - 1)} .$

(2.92)

We can express the monotonicities of $h (x)$ and $λ (x)$ in terms of $ω (x)$ and $\bar{ω} (x)$ .

Theorem 2.19

(i) $h (x)$ is decreasing $\Rightarrow λ (x)$ is decreasing $\Rightarrow ω (x)$ is convex;

(ii) $λ (x)$ is increasing $\Rightarrow h (x)$ is increasing $\Rightarrow \bar{ω} (x)$ is convex.

Proof

$h (x)$ is decreasing

$\begin{matrix} \Rightarrow \frac{f (x + 1)}{S (x + 1)} - \frac{f (x)}{S (x)} ⩽ 0 \\ \Rightarrow \frac{f (x + 1)}{1 - F (x)} - \frac{f (x)}{1 - F (x - 1)} ⩽ 0 \\ \Rightarrow λ (x + 1) \frac{F (x + 1)}{1 - F (x)} - λ (x) \frac{F (x)}{1 - F (x - 1)} ⩽ 0 . \end{matrix}$

Since $F (x)$ is non-decreasing, $\frac{F (x + 1)}{1 - F (x)} ⩾ \frac{F (x)}{1 - F (x - 1)}$ and so $λ (x + 1) ⩽ λ (x)$ , which proves that $λ (x)$ is decreasing. Next,

$\begin{matrix} ω (x + 1) - ω (x) & = \frac{1 - F (x + 1)}{F (x + 1)} - \frac{1 - F (x)}{F (x)} = \frac{1}{F (x + 1)} - \frac{1}{F (x)} \\ = {(\prod_{x + 2}^{\infty} (1 - λ (t)))}^{- 1} - {(\prod_{x + 1}^{\infty} (1 - λ (t)))}^{- 1} \\ = - λ (x + 1) \prod_{x + 1}^{\infty} {(1 - λ (t))}^{- 1} . \end{matrix}$

Similarly,

$ω (x) - ω (x - 1) = \frac{S (x + 1)}{F (x)} - \frac{S (x)}{F (x - 1)} = - λ (x) {[\prod_{x}^{\infty} (1 - λ (t))]}^{- 1} .$

Thus,

$\begin{matrix} (ω (x + 1) - ω (x)) - (ω (x) - ω (x - 1)) \\ = [λ (x) λ (x + 1) + (λ (x) - λ (x + 1))] {[\prod_{x}^{\infty} (1 - λ (t))]}^{- 1} \end{matrix}$

which is non-negative whenever $λ (x)$ is decreasing, and so $ω (x)$ is convex.

The proof of (ii) is similar. □

Example 2.21

Consider the uniform distribution in Example 2.2. In this case,

$ω (x) = \frac{b - x}{x}; \bar{ω} (x) = \frac{x}{b - x} .$

The hazard and reversed hazard rates are obtained by using (2.91) and (2.92) as

$h (x) = \frac{1}{b - x + 1} \text{ and } λ (x) = \frac{1}{x} .$

Notice that $λ (x)$ is decreasing while $h (x)$ is increasing, so the converse of the first implication in (ii), namely that $h (x)$ increasing implies $λ (x)$ increasing, is not true. However, it can be easily verified that $\bar{ω} (x)$ is convex.
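The calculations of Example 2.21 can be reproduced in a few lines. In the sketch below (function names and the value $b = 10$ are illustrative), the hazard rate is recovered from $\bar{ω}$ via (2.91), and the reversed hazard rate is computed as $[ω (x - 1) - ω (x)] / [1 + ω (x - 1)]$, which is $f (x) / F (x)$ rewritten using $ω (x) = 1 / F (x) - 1$.

```python
from fractions import Fraction

b = 10  # uniform distribution on 1, 2, ..., b (illustrative choice)

def F(x):
    return Fraction(x, b)

def omega(x):
    """Odds for survival, defined here for x = 1, ..., b."""
    return 1 / F(x) - 1

def omega_bar(x):
    """Odds for failure, defined here for x = 0, ..., b - 1."""
    return F(x) / (1 - F(x))

def hazard(x):
    """h(x) from the failure odds, x = 1, ..., b - 1."""
    return (omega_bar(x) - omega_bar(x - 1)) / (1 + omega_bar(x))

def reversed_hazard(x):
    """lambda(x) = f(x) / F(x) from the survival odds, x = 2, ..., b."""
    return (omega(x - 1) - omega(x)) / (1 + omega(x - 1))

def second_difference(x):
    """Second difference of omega_bar; non-negative values indicate convexity."""
    return omega_bar(x + 1) - 2 * omega_bar(x) + omega_bar(x - 1)
```

The functions return $h (x) = 1 / (b - x + 1)$ and $λ (x) = 1 / x$, and `second_difference(x)` is positive for $x = 1, \dots, b - 2$, in line with the convexity of $\bar{ω} (x)$ noted in the example.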

The concepts based on residual life and reversed residual life can also be expressed in terms of odds functions. Recall that the survival function of the residual life $(X - x | X > x)$ is

$S_{x} (t) = \frac{S (x + t + 1)}{S (x + 1)}, t = 0, 1, \dots .$

The residual odds functions are then

$ω_{x} (t) = \frac{1}{F_{x} (t)} - 1 = \frac{S (x + t + 2)}{S (x + 1) - S (x + t + 2)} = \frac{1 - F_{x} (t)}{F_{x} (t)},$

$F_{x} (t) = \frac{1}{1 + ω_{x} (t)}$

and similarly

${\bar{ω}}_{x} (t) = \frac{1}{S_{x} (t + 1)} - 1 \text{ or } S_{x} (t + 1) = \frac{1}{1 + {\bar{ω}}_{x} (t)} .$

In terms of $\bar{ω} (x)$ and $ω (x)$ ,

${\bar{ω}}_{x} (t) = \frac{\bar{ω} (x + t + 1) - \bar{ω} (x)}{1 + \bar{ω} (x)}$

and

$ω_{x} (t) = \frac{1 - F (x + t + 1)}{F (x + t + 1) - F (x)} = \frac{[1 + ω (x)] ω (x + t + 1)}{ω (x) - ω (x + t + 1)} .$

Also, the mean residual life function is

$m (x) = \frac{1}{S (x + 1)} \sum_{t = x + 1}^{\infty} S (t) = (1 + \bar{ω} (x)) \sum_{t = x + 1}^{\infty} \frac{1}{1 + \bar{ω} (t - 1)} .$

(2.93)

Eq. (2.93) leads to the recurrence relation

$m (x - 1) = 1 + \frac{1 + \bar{ω} (x - 1)}{1 + \bar{ω} (x)} m (x), x = 0, 1, 2, \dots$

(2.94)

Theorem 2.20

X has decreasing (increasing) mean residual life if and only if

$m (x) ⩽ (⩾) \frac{1 + \bar{ω} (x)}{\bar{ω} (x) - \bar{ω} (x - 1)} .$

Proof

By means of (2.94), we can write

$\begin{matrix} X \text{ has decreasing mean residual life} \Leftrightarrow m (x - 1) - m (x) ⩾ 0 \\ \Leftrightarrow 1 + (\frac{1 + \bar{ω} (x - 1)}{1 + \bar{ω} (x)} - 1) m (x) ⩾ 0 \\ \Leftrightarrow m (x) ⩽ \frac{1 + \bar{ω} (x)}{\bar{ω} (x) - \bar{ω} (x - 1)} . \end{matrix}$

In terms of the odds functions, the reversed mean residual life is

$r (x) = \frac{1}{F (x - 1)} \sum_{t = 1}^{x} F (t - 1) = (1 + ω (x - 1)) \sum_{t = 1}^{x} \frac{1}{1 + ω (t - 1)}$

and so

$r (x + 1) = 1 + \frac{1 + ω (x)}{1 + ω (x - 1)} r (x), x = 0, 1, \dots$

□

In the same manner as we have proved Theorem 2.3, we note the following.

Theorem 2.21

X has increasing reversed mean residual life if and only if, for all x,

$r (x) ⩽ \frac{1 + ω (x - 1)}{ω (x - 1) - ω (x)} .$

Instead of considering the odds functions, it is sometimes beneficial to use the odds rate defined as

${\bar{ω}}_{R} (x) = \bar{ω} (x) - \bar{ω} (x - 1) .$

(2.95)

The function ${\bar{ω}}_{R} (x)$ is interpreted as the rate at which $\bar{ω} (x)$ changes at age x. It has the following properties:

(i) ${\bar{ω}}_{R} (x) = \frac{1}{S (x + 1)} - \frac{1}{S (x)} = \frac{h (x)}{S (x + 1)}$ ;
(ii) The sequence of rates $〈 {\bar{ω}}_{R} (x) 〉$ determine the distribution of X uniquely as

$S (x) = {[1 + \sum_{t = 0}^{x - 1} {\bar{ω}}_{R} (t)]}^{- 1};$

(2.96)

(iii) If ${\bar{ω}}_{R} (x)$ is decreasing ( $\bar{ω}$ is concave), then $h (x)$ is decreasing;
(iv) If $h (x)$ is increasing, then ${\bar{ω}}_{R} (x)$ is also increasing.

Example 2.22

Assume that ${\bar{ω}}_{R} (0) = 0$ and ${\bar{ω}}_{R} (x) = \frac{b}{(b - x) (b - x + 1)}$ , $x = 1, 2, \dots, b - 1$ . Writing

${\bar{ω}}_{R} (x) = \frac{b}{b - x} - \frac{b}{b - x + 1},$

the sum telescopes to

$\sum_{t = 1}^{x - 1} {\bar{ω}}_{R} (t) = \frac{b}{b - x + 1} - 1 = \frac{x - 1}{b - x + 1} .$

Employing formula (2.96), we have

$S (x) = \frac{b - x + 1}{b},$

the survival function of the discrete uniform distribution.
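Formula (2.96) translates directly into code. The sketch below is a hypothetical helper, not taken from the cited work; it rebuilds $S (x) = P (X ⩾ x)$ from a sequence of odds rates and applies it to the rates of this example, taking the support as $1, 2, \dots, b$ so that the rate at age 0 is zero.

```python
from fractions import Fraction

def survival_from_odds_rates(odds_rate, x):
    """S(x) = [1 + sum_{t=0}^{x-1} odds_rate(t)]^{-1}, as in formula (2.96)."""
    total = sum((odds_rate(t) for t in range(x)), Fraction(0))
    return 1 / (1 + total)

def uniform_odds_rate(t, b):
    """Odds rate of the uniform law on 1..b; zero at t = 0 (no mass at 0)."""
    if t == 0:
        return Fraction(0)
    return Fraction(b, (b - t) * (b - t + 1))

b = 12  # illustrative range
survival = [survival_from_odds_rates(lambda t: uniform_odds_rate(t, b), x)
            for x in range(1, b + 1)]
```

Each entry of `survival` equals $(b - x + 1) / b$, which is $P (X ⩾ x)$ for the discrete uniform distribution on $1, \dots, b$.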

Property (iii) of ${\bar{ω}}_{R} (x)$ given above tells us that the class of distributions with decreasing ${\bar{ω}}_{R} (x)$ is a subset of the decreasing failure rate class and also that a sufficient condition for failure rate to be decreasing is that $\bar{ω} (x)$ is concave. Thus, we have a stronger condition that assists in characterizing and modelling failure time data. Further, when $h (x)$ is increasing, ${\bar{ω}}_{R} (x)$ is also increasing, providing an alternative proof of (ii) in Theorem 2.3. For detailed discussion of the results in this section, we refer to Nair and Sankaran (2015b).

2.10 Log-odds Functions and Rates

The log-odds functions and rates were introduced by Zimmer et al. (1998) as an alternative reliability measure and further propagated in the work of Wang et al. (2003, 2008). These functions were proposed as an alternative to the hazard rate in situations in which the device considered has high reliability or the corresponding hazard rate is non-monotone. When X is a discrete lifetime, Khorashadizadeh et al. (2013a) defined the log-odds function as

$\bar{L} (x) = \log \frac{F (x)}{S (x + 1)} = \log \bar{ω} (x), x = 0, 1, 2, \dots,$

(2.97)

and the corresponding log-odds rate as

${\bar{L}}_{R} (x) = \bar{L} (x) - \bar{L} (x - 1) .$

They have shown the following:

(a) ${\bar{L}}_{R} (x) = λ_{1} (x) + h_{1} (x)$ , where $λ_{1}$ and $h_{1}$ are the alternative reversed hazard and hazard rates of X discussed in Section 2.2;
(b) $F (x)$ is characterized by

$F (x) = \frac{e^{a (x) + b}}{1 + e^{a (x) + b}}, x = 0, 1, \dots,$

where $a (x) = \sum_{t = 1}^{x} {\bar{L}}_{R} (t)$ and $b = \log \frac{F (0)}{S (1)}$ ;
(c) In terms of $Y = \log X$ , we have

${\bar{L}}_{Y} (x) = {\bar{L}}_{X} (e^{x}),$

and hence

${\bar{L}}_{R, Y} (x) = \sum_{t = 1}^{x} (λ_{1} (x) + h_{1} (x)) - \log \frac{F_{X} (x e^{- 1})}{S_{X} {((x + 1) e^{- 1})}^{+ b}},$

where $b_{1} = \log \frac{f_{X} (0)}{1 - f_{X} (0)}$ ;
(d) X has increasing log-odds rate in terms of x ( $y = \log x$ ) if and only if $\bar{L} (x)$ is convex with respect to x ( $y = \log x$ ).

From the above discussions, it is clear that ${\bar{L}}_{R} (x)$ is directly related to the alternative hazard and reversed hazard rates whereas ${\bar{ω}}_{R} (x)$ is related to the usual hazard rate. There are other mathematical properties that make $\bar{ω} (x)$ and $ω (x)$ more desirable than their logarithms.
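Properties (a) and (b) can be illustrated numerically. The sketch below uses the geometric distribution function $F (x) = 1 - q^{x + 1}$ and assumes that the alternative rates take the forms $h_{1} (x) = \log [S (x) / S (x + 1)]$ and $λ_{1} (x) = \log [F (x) / F (x - 1)]$, and that $a (x)$ in (b) is the cumulative sum of the log-odds rates. All function names and the value $q = 0.7$ are illustrative.

```python
import math

def geometric_cdf(x, q=0.7):
    """F(x) = 1 - q^(x+1), x = 0, 1, ..."""
    return 1 - q ** (x + 1)

def log_odds(F, x):
    """Log-odds function: log of F(x) / S(x+1), with S(x+1) = 1 - F(x)."""
    return math.log(F(x) / (1 - F(x)))

def log_odds_rate(F, x):
    """Log-odds rate: increment of the log-odds function at x >= 1."""
    return log_odds(F, x) - log_odds(F, x - 1)

def sum_of_alternative_rates(F, x):
    """lambda_1(x) + h_1(x), using S(x) = 1 - F(x-1) (assumed forms)."""
    h1 = math.log((1 - F(x - 1)) / (1 - F(x)))
    lam1 = math.log(F(x) / F(x - 1))
    return lam1 + h1

def cdf_from_rates(F, x):
    """Rebuild F(x) from the log-odds at 0 and the rates at 1, ..., x."""
    b = log_odds(F, 0)
    a = sum(log_odds_rate(F, t) for t in range(1, x + 1))
    e = math.exp(a + b)
    return e / (1 + e)
```

For each $x ⩾ 1$, `log_odds_rate` agrees with `sum_of_alternative_rates` up to rounding, and `cdf_from_rates` returns the original value of $F (x)$.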

Theorem 2.22

If X is a discrete random variable with odds function $\bar{ω} (x)$ and odds rate ${\bar{ω}}_{R} (x)$ , then there exists a random variable $X^{⁎}$ with the same support as X and with alternative hazard rate $h_{1}^{⁎} (x) = {\bar{ω}}_{R} (x)$ . The survival functions $S (x)$ of X and $S^{⁎} (x)$ of $X^{⁎}$ are related by

$S (x) = {(1 - \log S^{⁎} (x))}^{- 1} .$

(2.98)

Proof

By virtue of the properties of $\bar{ω} (x)$ ,

$S^{⁎} (x) = \exp [- \bar{ω} (x - 1)], x = 0, 1, 2, \dots$

(2.99)

is a survival function. Let $X^{⁎}$ denote the corresponding random variable. From the representation in (2.14), the cumulative hazard rate of $X^{⁎}$ is

$H_{1}^{⁎} (x) = \sum_{0}^{x - 1} h_{1}^{⁎} (t),$

(2.100)

where $h_{1}^{⁎} (t) = \log \frac{S^{⁎} (t)}{S^{⁎} (t + 1)}$ is the alternative hazard rate of $X^{⁎}$ , and it satisfies

$H_{1}^{⁎} (x) = - \log S^{⁎} (x) = \bar{ω} (x - 1)$

(2.101)

Now, from (2.100) and (2.101), we have

$h_{1}^{⁎} (x) = H_{1}^{⁎} (x + 1) - H_{1}^{⁎} (x) = \bar{ω} (x) - \bar{ω} (x - 1) = {\bar{ω}}_{R} (x),$

proving the first part. Using (2.99),

$S^{⁎} (x) = \exp [- \frac{1}{S (x)} + 1]$

so that (2.98) holds. Since there is a one-to-one correspondence between $S (x)$ and $S^{⁎} (x)$ , they have the same support, and this completes the proof. □

Some observations from Theorem 2.22 are as follows:

(a) The converse of Theorem 2.22 is also true; that is, if $S^{⁎} (x)$ is the survival function of a random variable $X^{⁎}$ satisfying (2.98), then $S (x)$ is the survival function of a random variable X for which $h_{1}^{⁎} (x) = {\bar{ω}}_{R} (x)$ ;
(b) Since

${\bar{ω}}_{R} (x) = \frac{h (x)}{S (x + 1)}$

we have

$h^{⁎} (x) = \frac{h (x)}{S (x + 1)},$

$\begin{matrix} h (x) = h^{⁎} (x) \exp [- \sum_{0}^{x} h_{1}^{⁎} (x)], \\ h_{1}^{⁎} (x) = \bar{ω} (x) - \bar{ω} (x - 1); \end{matrix}$

(c) A parallel result that involves the odds function $ω (x)$ and the alternative reversed hazard rate $λ_{1}^{⁎} (x)$ is also possible. To see this, it is easy to recognize that $F^{⁎} (x) = \exp [- ω (x)]$ is a distribution function, with alternative reversed hazard rate

$λ_{1}^{⁎} (x) = \log \frac{F^{⁎} (x)}{F^{⁎} (x - 1)} = ω (x - 1) - ω (x) .$

Since $ω (x)$ is a decreasing function, $ω (x - 1) - ω (x) = ω_{R} (x)$ is the rate at which $ω (x)$ is decreasing and is therefore an odds rate. Thus, there exists a distribution specified by $F^{⁎} (x)$ for which the odds rate $ω_{R} (x)$ of $F (x)$ is the reversed hazard rate of $F^{⁎} (x)$ and the connection between the two is explained in the next theorem.

Theorem 2.23

Corresponding to a discrete random variable X with distribution function $F (x)$ and odds rate $ω_{R} (x)$ , there exists another distribution function $F^{⁎} (x)$ with the same support as $F (x)$ and alternative reversed hazard rate $λ_{1}^{⁎} (x)$ that satisfies

$λ_{1}^{⁎} (x) = ω_{R} (x) .$

(2.102)

Also, $F (x)$ and $F^{⁎} (x)$ are related through

$F (x) = {[1 - \log F^{⁎} (x)]}^{- 1} .$

(2.103)

Conversely, if for two distribution functions $F (x)$ and $F^{⁎} (x)$ (2.103) holds, then $λ_{1}^{⁎} (x) = ω_{R} (x)$ .

Remark 2.16

(1) The distribution function $F^{⁎} (x)$ represents the random variable $X^{⁎}$ of Theorem 2.22;
(2) $ω_{R} (x) = \frac{λ (x)}{F (x - 1)} = λ^{⁎} (x)$ ;
(3) $λ (x) = λ^{⁎} (x) F (x - 1)$ or $λ^{⁎} (x) = λ (x) / \prod_{t = x}^{\infty} (1 - λ (t))$ .

Example 2.23

The distributions of X and $X^{⁎}$ can be mutually characterized. If $X^{⁎}$ is geometric with

$S^{⁎} (x) = q^{x}, x = 0, 1, 2, \dots, 0 < q < 1,$

the distribution of X is

$S (x) = \frac{1}{1 - x \log q}, x = 0, 1, 2, \dots .$

In this case,

$\bar{ω} (x) = - (x + 1) \log q$

and

${\bar{ω}}_{R} (x) = - \log q = h_{1}^{⁎} (x) .$
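A short numerical check of this example follows. The sketch takes $S^{⁎} (x) = q^{x}$, forms $S (x) = 1 / (1 - x \log q)$ from (2.98), and compares the odds rate of X with the alternative hazard rate of $X^{⁎}$, assumed here to be $\log [S^{⁎} (x) / S^{⁎} (x + 1)]$. The value $q = 0.8$ and the function names are illustrative.

```python
import math

q = 0.8  # illustrative parameter of the geometric law of X*

def S_star(x):
    """Survival function of X*: q^x."""
    return q ** x

def S(x):
    """Survival function of X obtained from (2.98)."""
    return 1.0 / (1.0 - math.log(S_star(x)))

def odds_bar(x):
    """Odds for failure of X: 1 / S(x+1) - 1."""
    return 1.0 / S(x + 1) - 1.0

def odds_rate(x):
    """Odds rate of X: increment of the failure odds at x."""
    return odds_bar(x) - odds_bar(x - 1)

def alt_hazard_star(x):
    """Alternative hazard rate of X* (assumed form log S*(x) / S*(x+1))."""
    return math.log(S_star(x) / S_star(x + 1))
```

Both `odds_rate(x)` and `alt_hazard_star(x)` are constant in x and equal to $- \log q$, and `odds_bar(x)` equals $- (x + 1) \log q$.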

Example 2.24

In this example, we present an analysis of real data on the number of deaths following surgery in a hospital in the US classified according to the age of the patients (Mosteller and Tukey, 1977). The survival function is estimated from the sample as

$\hat{S} (x) = \frac{\text{number of observations} ⩾ x}{\text{total number of observations}},$

while

$\hat{F} (x) = \frac{\text{number of observations} ⩽ x}{\text{total number of observations}} .$

The odds functions are then estimated as

$\hat{\bar{ω}} (x) = \frac{1}{\hat{S} (x + 1)} - 1 \text{ and } \hat{ω} (x) = \frac{1}{\hat{F} (x)} - 1 .$

Table 2.5 presents the necessary calculations and Fig. 2.4 shows the shapes of the $ω (x)$ and $\bar{ω} (x)$ curves.

Table 2.5

Estimation of ω(x) and $\bar{ω} (x)$

Age	Number dying	$\hat{S} (x)$	$\hat{F} (x)$	$\bar{ω} (x)$	$ω (x)$
0–4	34	1.0000	0.0556	0.0589	16.9856
5–14	9	0.9444	0.0704	0.0757	13.2100
15–24	23	0.9296	0.1080	0.1211	8.2576
25–34	19	0.8920	0.1391	0.1616	6.1811
35–44	16	0.8609	0.1653	0.1980	5.0505
45–54	59	0.8347	0.2619	0.3548	2.8185
55–64	101	0.7381	0.4272	0.7458	1.3408
65–75	185	0.5728	0.7300	2.7037	0.3699
76–83	97	0.2700	0.8870	7.8496	0.1274
83+	68	0.1112	1.0000	8.0917	0.1236

Figure 2.4 Plots of ω(x) and $\bar{ω} (x)$ .
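The estimates can be recomputed from the "Number dying" column of Table 2.5. The sketch below treats the age groups as ordered classes, takes $\hat{S}$ at the start of a class and $\hat{F}$ at its end, and uses $\hat{\bar{ω}} = \hat{F} / (1 - \hat{F})$, which is the same as $1 / \hat{S} (x + 1) - 1$ with $\hat{S}$ read from the next class. The function name is hypothetical, and values computed this way may differ from the printed table in the later decimals.

```python
counts = [34, 9, 23, 19, 16, 59, 101, 185, 97, 68]  # "Number dying" column

def odds_estimates(counts):
    """Return one tuple (S_hat, F_hat, omega_bar_hat, omega_hat) per age class."""
    total = sum(counts)
    rows = []
    cumulative = 0
    for n in counts:
        s_hat = (total - cumulative) / total        # proportion not yet dead at class start
        cumulative += n
        f_hat = cumulative / total                  # proportion dead by class end
        # The failure odds are infinite in the last class, where F_hat = 1.
        omega_bar = f_hat / (1 - f_hat) if cumulative < total else float("inf")
        omega = 1 / f_hat - 1
        rows.append((s_hat, f_hat, omega_bar, omega))
    return rows

rows = odds_estimates(counts)
```

Plotting the third and fourth components of `rows` against the age classes gives curves of the kind described for Fig. 2.4: the failure odds increase and the survival odds decrease with age.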

2.11 Mixture Distributions

The role of mixture distributions in reliability studies was mentioned earlier in Section 2.1. When the mixture is of the form (1.11), we have

$f (x) = α f_{1} (x) + (1 - α) f_{2} (x)$

and

$S (x) = α S_{1} (x) + (1 - α) S_{2} (x),$

(2.104)

where $f_{1}$ ( $f_{2}$ ) is the probability mass function and $S_{1}$ ( $S_{2}$ ) is the survival function corresponding to $F_{1}$ ( $F_{2}$ ). If $h (x)$ , $k_{1} (x)$ and $k_{2} (x)$ , respectively, denote the hazard rates of f, $f_{1}$ and $f_{2}$ , it is easy to see that

$h (x) = p (x) k_{1} (x) + (1 - p (x)) k_{2} (x),$

(2.105)

where

$p (x) = \frac{α S_{1} (x)}{α S_{1} (x) + (1 - α) S_{2} (x)} .$

Likewise, for the reversed hazard rates λ, ${\bar{λ}}_{1}$ and $\bar{λ_{2}}$ of $f, f_{1}$ and $f_{2}$ , respectively, we have

$λ (x) = p_{1} (x) {\bar{λ}}_{1} (x) + (1 - p_{1} (x)) {\bar{λ}}_{2} (x),$

(2.106)

with

$p_{1} (x) = \frac{α F_{1} (x)}{α F_{1} (x) + (1 - α) F_{2} (x)} .$

Finite mixtures of two components are commonly used in heterogeneous populations in which the elements are classified into two categories. It follows from (2.105) that

$\min (k_{1} (x), k_{2} (x)) ⩽ h (x) ⩽ \max (k_{1} (x), k_{2} (x)),$

and also that

$k_{1} (x) ⩽ h (x) ⩽ k_{2} (x)$

whenever $k_{1} (x) ⩽ k_{2} (x)$ for all x.

Theorem 2.24

If $k_{1} (x) ⩽ k_{2} (x)$ for all x, $p (x)$ is an increasing function.

Proof

We have

$\begin{matrix} p (x + 1) - p (x) & = & \frac{α S_{1} (x + 1)}{α S_{1} (x + 1) + (1 - α) S_{2} (x + 1)} \\ - \frac{α S_{1} (x)}{α S_{1} (x) + (1 - α) S_{2} (x)} . \end{matrix}$

The sign of the previous equation depends on

$\begin{matrix} t (x) & = α S_{1} (x + 1) [α S_{1} (x) + (1 - α) S_{2} (x)] - α S_{1} (x) [α S_{1} (x + 1) + (1 - α) S_{2} (x + 1)] \\ & = (1 - α) α [S_{1} (x + 1) S_{2} (x) - S_{1} (x) S_{2} (x + 1)] \\ & = (1 - α) α [\frac{S_{1} (x + 1)}{S_{1} (x)} - \frac{S_{2} (x + 1)}{S_{2} (x)}] S_{1} (x) S_{2} (x) \\ & = α (1 - α) S_{1} (x) S_{2} (x) [k_{2} (x) - k_{1} (x)] ⩾ 0, \end{matrix}$

since $k_{1} (x) ⩽ k_{2} (x)$ . Thus, $p (x)$ is increasing. This result has the interpretation that when the lifetime distribution is a mixture, the weakest items will die out first.
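The behaviour described in Theorem 2.24 is easy to see in a two-component geometric mixture, where the component hazard rates are the constants $k_{1} = p_{1}$ and $k_{2} = p_{2}$. The sketch below uses illustrative parameter values and hypothetical function names, and checks (2.105) together with the monotonicity of $p (x)$.

```python
from fractions import Fraction

alpha = Fraction(2, 5)                       # mixing proportion (illustrative)
p1, p2 = Fraction(1, 5), Fraction(1, 2)      # component hazard rates, k1 <= k2

def component_survival(p, x):
    """S_i(x) = P(X_i >= x) = q^x for a geometric law on 0, 1, ..."""
    return (1 - p) ** x

def mixture_survival(x):
    return alpha * component_survival(p1, x) + (1 - alpha) * component_survival(p2, x)

def mixture_pmf(x):
    return (alpha * p1 * component_survival(p1, x)
            + (1 - alpha) * p2 * component_survival(p2, x))

def weight(x):
    """p(x): share of the first component among units surviving to age x."""
    return alpha * component_survival(p1, x) / mixture_survival(x)

def mixture_hazard(x):
    """h(x) = f(x) / S(x) computed directly from the mixture."""
    return mixture_pmf(x) / mixture_survival(x)
```

Here `mixture_hazard(x)` coincides with `weight(x) * p1 + (1 - weight(x)) * p2`, lies between $k_{1}$ and $k_{2}$, and `weight(x)` increases with x: the component with the larger hazard rate is depleted first, so the mixture hazard rate drifts down towards $k_{1}$.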

When the distribution of X is indexed by a parameter θ, where θ is the value of a random variable Θ defined on $[0, \infty)$ with distribution function $C (θ)$ , from (1.3), the survival function and the probability mass function of X are given by

$\bar{G} (x) = \int S (x | θ) d C (θ)$

and

$g (x) = \int f (x | θ) d C (θ)$

so that the mixture has its hazard rate as

$\begin{matrix} h^{⁎} (x) & = \frac{\int_{Θ} f (x | θ) d C (θ)}{\int_{Θ} S (x | θ) d C (θ)} \\ = \int_{Θ} h (x | θ) d C (θ | x), \end{matrix}$

where

$h (x | θ) = \frac{f (x | θ)}{S (x | θ)}$

is the conditional hazard rate of X, given $Θ = θ$ , and

$d C (θ | x) = \frac{S (x | θ) d C (θ)}{\int_{Θ} S (x | θ) d C (θ)}$

is the conditional density function of θ, given $X ⩾ x$ . When Θ has the density function $c (θ) = \frac{d C (θ)}{d θ}$ , $c (θ | x)$ is the conditional probability density function of θ, with the same support as that of Θ. In a Bayesian context, $c (θ)$ can be viewed as a prior distribution of θ, and $C (θ | x)$ as the posterior distribution of θ after observing the data on X. Models in which θ is regarded as random are called frailty models, which are extensively discussed in survival analysis. In particular, if a representation of the form

$S_{1} (x) = S_{1} (x | θ) = {[S (x)]}^{θ}$

holds, then from (2.17), the alternative hazard rates of $S_{1}$ , say ${\bar{h}}_{1} (x)$ , and of S, $h_{1} (x)$ , satisfy the relationship

${\bar{h}}_{1} (x) = θ h_{1} (x) .$

We say that a random variable $X_{1}$ with survival function $S_{1} (x)$ is the proportional hazard rates model corresponding to X with survival function $S (x)$ . □

Some similar results exist for reversed hazard rates as well.

Theorem 2.25

In the case of reversed rate functions, if ${\bar{λ}}_{1} (x) > {\bar{λ}}_{2} (x)$ for all x, then $p_{1} (x)$ is an increasing function.

Proof

We have

$\begin{matrix} p_{1} (x + 1) - p_{1} (x) \\ = \frac{α F_{1} (x + 1)}{α F_{1} (x + 1) + (1 - α) F_{2} (x + 1)} - \frac{α F_{1} (x)}{α F_{1} (x) + (1 - α) F_{2} (x)} . \end{matrix}$

The sign of the left hand side depends on

$\begin{matrix} α (1 - α) F_{1} (x + 1) F_{2} (x + 1) [\frac{F_{2} (x)}{F_{2} (x + 1)} - \frac{F_{1} (x)}{F_{1} (x + 1)}] \\ = α (1 - α) F_{1} (x + 1) F_{2} (x + 1) [{\bar{λ}}_{1} (x + 1) - {\bar{λ}}_{2} (x + 1)] \end{matrix}$

which is non-negative since ${\bar{λ}}_{1} (x) > {\bar{λ}}_{2} (x)$ for all x. Hence, $p_{1} (x)$ is an increasing function.

Assuming θ to be random with distribution function $C (θ)$ , the mixture has reversed hazard rate

$\begin{matrix} λ^{⁎} (x) & = \frac{\int_{Θ} f (x | θ) d C (θ)}{\int_{Θ} F (x | θ) d C (θ)} \\ = \int_{Θ} λ (x | θ) d C^{⁎} (θ | x), \end{matrix}$

where

$λ (x | θ) = \frac{f (x | θ)}{F (x | θ)}$

is the conditional reversed hazard rate of X, given $Θ = θ$ , and

$d C^{⁎} (θ | x) = \frac{F (x | θ) d C (θ)}{\int_{Θ} F (x | θ) d C (θ)}$

is the conditional distribution function of θ, given $X ⩽ x$ . As before, if a representation of the form

$F_{1} (x) = F_{1} (x | θ) = {[F (x)]}^{θ}$

holds, then the reversed hazard rates λ of $F_{1}$ and F satisfy the relationship

$λ_{F_{1}} (x) = θ λ_{F} (x) .$

In this case, we say that $F_{1}$ is the reversed proportional hazard rates model of F. □

Nelson (1982) has mentioned that units manufactured in different production periods may have different life distributions due to difference in design, raw materials, handling, etc., and it may therefore be necessary to identify production period, customer environment, etc. that result in poor units, for remedial action on that part of the population. Cox (1959) analyzed data on failure times using mixture models by classifying the cause of failure as identified or not; see also Mendenhall and Hader (1958) and Cheng et al. (1985). Identification of the life distribution is crucial in such cases. One way in which such identification is possible is to use characterization theorems that involve various reliability functions or relationships between them. Nair et al. (1999) proposed certain relationships between the hazard rate and the mean residual life that characterize some mixture distributions for the purpose.

Theorem 2.26

(a) A discrete random variable X taking values in $(0, 1, \dots)$ follows a mixture of geometric laws with probability mass function

$\begin{matrix} f (x) = α p_{1} q_{1}^{x} + (1 - α) p_{2} q_{2}^{x}, \\ 0 ⩽ α ⩽ 1, 0 < p_{i} < 1, q_{i} = 1 - p_{i}, i = 1, 2, \end{matrix}$

(2.107)

if and only if, for all x,

$m (x) = (\frac{1}{p_{1}} + \frac{1}{p_{2}}) - \frac{1}{p_{1} p_{2}} h (x + 1);$

(b) X is a mixture of Waring distributions with probability mass function

$\begin{matrix} f (x) & = & α (a - b_{1}) \frac{{(b_{1})}_{x}}{{(a)}_{x + 1}} + (1 - α) (a - b_{2}) \frac{{(b_{2})}_{x}}{{(a)}_{x + 1}}, \\ a > b_{1}, b_{1} ⩾ b_{2}, b_{2} > 0, \end{matrix}$

if and only if for all x

$\begin{matrix} m (x) & = & \frac{(2 a - b_{1} - b_{2} - 1)}{(a - b_{1} - 1) (a - b_{2} - 1)} (a + x) \\ & & - \frac{(a + x) (a + x + 1)}{(a - b_{1} - 1) (a - b_{2} - 1)} h (x + 1) . \end{matrix}$

Corollary 2.27

When $p_{1} = p_{2} = p$ , the geometric law is characterized by the property

$m (x) = 2 p^{- 1} - p^{- 2} h (x + 1),$

and when $b_{1} = b_{2} = b$ , the Waring distribution

$f (x) = (a - b) \frac{{(b)}_{x}}{{(a)}_{x + 1}}, x = 0, 1, 2, \dots,$

is characterized by the property

$m (x) = \frac{(2 a - 2 b - 1)}{{(a - b - 1)}^{2}} (a + x) - \frac{(a + x) (a + x + 1)}{{(a - b - 1)}^{2}} h (x + 1) .$
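The linear relationship between the mean residual life $m (x)$ and $h (x + 1)$ for the geometric mixture (2.107) can be verified exactly. In the sketch below, the tail sum $\sum_{t ⩾ x + 1} S (t)$ is evaluated in closed form for each geometric component. Parameter values and function names are illustrative.

```python
from fractions import Fraction

alpha = Fraction(3, 10)
p1, p2 = Fraction(1, 5), Fraction(1, 2)
q1, q2 = 1 - p1, 1 - p2

def S(x):
    """Mixture survival function P(X >= x)."""
    return alpha * q1 ** x + (1 - alpha) * q2 ** x

def f(x):
    """Mixture probability mass function (2.107)."""
    return alpha * p1 * q1 ** x + (1 - alpha) * p2 * q2 ** x

def h(x):
    """Hazard rate f(x) / S(x)."""
    return f(x) / S(x)

def m(x):
    """Mean residual life: sum_{t >= x+1} S(t) / S(x+1), geometric tails summed exactly."""
    tail = alpha * q1 ** (x + 1) / p1 + (1 - alpha) * q2 ** (x + 1) / p2
    return tail / S(x + 1)

def linear_form(x):
    """Right-hand side of the characterizing identity in Theorem 2.26(a)."""
    return 1 / p1 + 1 / p2 - h(x + 1) / (p1 * p2)
```

For every x, `m(x)` and `linear_form(x)` are equal as exact fractions. Setting `p1 == p2` gives the single geometric law, for which both sides reduce to $1 / p$.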

In Theorem 2.26, we have used the mean residual life of the mixture of the form in (2.104). According to Definition 2.20, it is calculated as

$m (x) = \frac{1}{S (x + 1)} \sum_{t = x + 1}^{\infty} S (t) = \sum_{t = x + 1}^{\infty} \frac{α S_{1} (t) + (1 - α) S_{2} (t)}{α S_{1} (x + 1) + (1 - α) S_{2} (x + 1)},$

(2.108)

where $S_{1}$ and $S_{2}$ are the survival functions of the component distributions. Eq. (2.108) is expressible as

$m (x) = q (x) m_{1} (x) + (1 - q (x)) m_{2} (x),$

with

$q (x) = \frac{α S_{1} (x + 1)}{α S_{1} (x + 1) + (1 - α) S_{2} (x + 1)} = p (x + 1)$

and $m_{1} (x)$ and $m_{2} (x)$ are the mean residual life functions of the component distributions. When $S (x)$ is indexed by the values of a non-negative continuous random variable Θ, the residual life distribution becomes

$S_{x} (t) = \frac{\int_{Θ} S (x + t + 1 | θ) d C (θ)}{\int_{Θ} S (x + 1 | θ) d C (θ)}$

(2.109)

as a natural corollary of the basic definition in (2.19). Eq. (2.109) is equivalent to

$\begin{matrix} S_{x} (t) & = \frac{\int_{Θ} S_{x} (t | θ) S (x + 1 | θ) d C (θ)}{\int_{Θ} S (x + 1 | θ) d C (θ)} \\ & = \int_{Θ} S_{x} (t | θ) d C (θ | x + 1), \end{matrix}$

where

$C (θ | x + 1) = \frac{\int_{0}^{θ} S (x + 1 | u) d C (u)}{\int_{0}^{\infty} S (x + 1 | u) d C (u)}$

is the distribution function of Θ given $X ⩾ x + 1$ .

Apart from the hazard function and mean residual life function, higher moments of residual life can also be used for characterizing life distributions. Two such results are established in Nair et al. (1999).

Theorem 2.28

(a) If $E (X^{2}) < \infty$ , the identity

$\begin{matrix} μ_{(2)} (x) & = E [(X - x) (X - x - 1) | X > x] = A h (x + 1) + B, \\ \text{where } A & = 2 p_{1}^{- 2} p_{2}^{- 2} (p_{1} p_{2} - p_{1} - p_{2}), \\ B & = 2 p_{1}^{- 2} p_{2}^{- 2} (p_{1}^{2} + p_{2}^{2} + p_{1} p_{2} (1 - p_{1} - p_{2})), 0 < p_{1}, p_{2} < 1, \end{matrix}$

holds if and only if X is distributed as the geometric mixture in (2.107);
(b)

$E [{(X - x)}^{2} | X > x] = A + B h (x) + C (x),$

where

$\begin{matrix} A & = - 2 {(p_{1} p_{2} p_{3})}^{- 1} (p_{1} + p_{2} + p_{3}), \\ B & = 2 {(p_{1} p_{2} p_{3})}^{- 1} \end{matrix}$

and

$C = 2 {(p_{1} p_{2} p_{3})}^{- 1} (p_{1} p_{2} + p_{2} p_{3} + p_{3} p_{1}),$

if and only if X has a three-component geometric mixture

$f (x) = \sum_{i = 1}^{3} α_{i} p_{i} q_{i}^{x}, 0 < α_{i} < 1, \text{ and } \sum_{i = 1}^{3} α_{i} = 1 .$

These authors have pointed out, with the help of simulated data, that Theorem 2.26 is useful in model identification and inference. If the plotted points $(\hat{h} (x + 1), \hat{m} (x))$ fall along a straight line, the distribution is a mixture of geometric laws. A quick estimate of the parameters $p_{1}$ and $p_{2}$ is obtained from the slope and intercept of the fitted line, or more accurately from a least-squares fit of the line. One can estimate α by equating the sample mean with

$E (X) = α q_{1} p_{1}^{- 1} + (1 - α) q_{2} p_{2}^{- 1}$

after substituting the estimate of $p_{1}$ and $p_{2}$ . Nair (1983b) addressed the estimation problem by matching the factorial moments of the sample with those of the population. Denoting by $m_{(1)}$ , $m_{(2)}$ and $m_{(3)}$ the first three sample factorial moments, the equations to be solved are as follows:

$\begin{matrix} α r + (1 - α) s = a_{1} = m_{(1)} \\ α r (r - 1) + (1 - α) s (s - 1) = a_{2} = \frac{m_{(2)}}{2} \\ α r {(r - 1)}^{2} + (1 - α) s {(s - 1)}^{2} = a_{3} = \frac{m_{(3)}}{6} \end{matrix}$

with $r = p_{1}^{- 1}$ and $s = p_{2}^{- 1}$ . After some algebra, the quadratic equation

$(a_{1} + a_{2} - a_{1}^{2}) s^{2} + (a_{1}^{2} + a_{1} a_{2} - a_{1} - 2 a_{2} - a_{3}) s + (a_{1} a_{3} - a_{2}^{2}) = 0$

is obtained, which can be solved for s. Now r and α are calculated from

$r = \frac{a_{1} + a_{2} - a_{1} s}{a_{1} - s} \quad \text{and} \quad α = \frac{a_{1} - s}{r - s} .$

If there are two admissible solutions for s in the above quadratic equation, the one that gives the better fit is chosen as the final estimate.
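The moment-matching steps can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the procedure of Nair (1983b) as published: the function name and the admissibility check are hypothetical, the moment equations are taken in the form $α r {(r - 1)}^{k - 1} + (1 - α) s {(s - 1)}^{k - 1} = a_{k}$ for $k = 1, 2, 3$, and the constant term of the quadratic is taken as $a_{1} a_{3} - a_{2}^{2}$, which is what eliminating α and r from those three equations gives.

```python
import math


def fit_geometric_mixture(a1, a2, a3):
    """Return admissible (alpha, p1, p2) triples from scaled factorial moments.

    a1, a2, a3 are m_(1), m_(2)/2 and m_(3)/6 (hypothetical helper, see text).
    """
    # Coefficients of the quadratic in s = 1/p2 (assumed form, see lead-in).
    qa = a1 + a2 - a1 ** 2
    qb = a1 ** 2 + a1 * a2 - a1 - 2 * a2 - a3
    qc = a1 * a3 - a2 ** 2
    disc = qb * qb - 4 * qa * qc
    if qa == 0 or disc < 0:
        return []  # no real solution: the mixture model does not fit

    roots = {(-qb + sign * math.sqrt(disc)) / (2 * qa) for sign in (1, -1)}
    solutions = []
    for s in roots:
        if s == a1:
            continue  # r would be undefined
        r = (a1 + a2 - a1 * s) / (a1 - s)
        if r == s:
            continue  # alpha would be undefined
        alpha = (a1 - s) / (r - s)
        # Admissible means 0 < p1, p2 < 1 and 0 < alpha < 1.
        if r > 1 and s > 1 and 0 < alpha < 1:
            solutions.append((alpha, 1 / r, 1 / s))
    return solutions
```

For example, $a_{1} = 3$, $a_{2} = 7$, $a_{3} = 19$ correspond to $α = 0.5$ with component parameters 0.5 and 0.25; the sketch returns both labelings of that mixture, and a goodness-of-fit criterion would be needed to choose between genuinely different admissible roots.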

Sometimes, mixtures of a more general form

$f (x) = \sum_{i = 1}^{n} α_{i} f_{i} (x),$

(2.110)

$S (x) = \sum_{i = 1}^{n} α_{i} S_{i} (x),$

where $α_{i}$ are real numbers, not necessarily all positive, satisfying $\sum α_{i} = 1$ are also considered. They are of practical significance, as they represent distributions of order statistics and coherent systems and arise in inference for step-stress models. Discussions of these aspects in the continuous case can be seen in Navarro and Rychlik (2007), Navarro and Shaked (2006), Balakrishnan et al. (2007, 2009), Balakrishnan and Cramer (2008), and Balakrishnan and Xie (2007). When some of the mixing constants are negative, such mixtures are called generalized mixtures. The representation in (2.110) can be expressed (Navarro and Hernandez, 2004) as

$S (x) = α S_{+} (x) + (1 - α) S_{-} (x)$

with $α = \sum_{α_{i} > 0} α_{i} > 1$ ,

$S_{+} (x) = \sum_{α_{i} > 0} α_{i} S_{i} (x) / \sum_{α_{i} > 0} α_{i} \quad \text{and} \quad S_{-} (x) = \sum_{α_{i} < 0} α_{i} S_{i} (x) / \sum_{α_{i} < 0} α_{i} .$

Then,

$S_{+} (x) = α^{- 1} S (x) + α^{- 1} (α - 1) S_{-} (x)$

and the relationship for $n = 2$ becomes

$\min (m (x), m_{2} (x)) ⩽ m_{1} (x) ⩽ \max (m (x), m_{2} (x)),$

where $m_{i} (x)$ is the mean residual life function of $S_{i} (x)$ .
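As a small numerical illustration of the bound for $n = 2$, the following sketch evaluates the mean residual life of a generalized mixture of two geometric laws. The setting is an assumption made for the example only: the components have survival functions $S_{i} (t) = q_{i}^{t}$ (taking $S (t) = P (X ⩾ t)$), so that $m_{i} (x) = 1 / p_{i}$, and the function name is hypothetical.

```python
def mrl_generalized_geometric_mixture(alpha, p1, p2, x):
    """m(x) = E[X - x | X > x] for S(t) = alpha*q1**t + (1 - alpha)*q2**t.

    alpha may exceed 1, so the second mixing weight may be negative.
    """
    q1, q2 = 1.0 - p1, 1.0 - p2
    # Component contributions to S(x + 1); w2 is negative when alpha > 1.
    w1 = alpha * q1 ** (x + 1)
    w2 = (1.0 - alpha) * q2 ** (x + 1)
    # sum_{t > x} q**t = q**(x + 1) / p, so each component contributes w / p.
    return (w1 / p1 + w2 / p2) / (w1 + w2)
```

With $α = 1.5$, $p_{1} = 0.3$ and $p_{2} = 0.6$ the mixture is a proper distribution, and the computed $m (x)$ stays above $m_{1} (x) = 1 / 0.3$, which in turn exceeds $m_{2} (x) = 1 / 0.6$, in agreement with the inequality.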

Recalling the formula for the reversed mean residual life of X as

$r (x) = \frac{1}{F (x - 1)} \sum_{t = 1}^{x} F (t - 1),$

(2.111)

we see that for the two-component mixture we have

$\begin{matrix} F (x) & = α F_{1} (x) + (1 - α) F_{2} (x), \\ r (x) & = p_{1} (x - 1) r_{1} (x) + (1 - p_{1} (x - 1)) r_{2} (x), \end{matrix}$

(2.112)

where $r_{1} (x)$ and $r_{2} (x)$ are the reversed mean residual lives of $F_{1}$ and $F_{2}$ and $p_{1} (\cdot)$ is as defined in (2.106). Eliminating $p_{1} (x)$ from

$r (x + 1) = p_{1} (x) r_{1} (x + 1) + (1 - p_{1} (x)) r_{2} (x + 1)$

and (2.106), a relationship connecting the reversed mean residual life $r (x)$ and the reversed hazard rate $λ (x)$ of the mixture can be expressed in the form

$r (x + 1) = g_{1} (x) λ (x) + g_{2} (x),$

(2.113)

where

$g_{1} (x) = \frac{r_{1} (x) - r_{2} (x)}{{\bar{λ}}_{1} (x) - {\bar{λ}}_{2} (x)} \quad \text{and} \quad g_{2} (x) = \frac{r_{2} (x) {\bar{λ}}_{1} (x) - r_{1} (x) {\bar{λ}}_{2} (x)}{{\bar{λ}}_{1} (x) - {\bar{λ}}_{2} (x)} .$

Remark 2.17

The identity connecting $m (x)$ and $h (x)$ of the mixture also bears the same form

$m (x) = l_{1} (x) h (x + 1) + l_{2} (x),$

where

$\begin{matrix} l_{1} (x) & = \frac{m_{1} (x) - m_{2} (x)}{h_{1} (x + 1) - h_{2} (x + 1)}, \\ l_{2} (x) & = \frac{m_{2} (x) {\bar{h}}_{1} (x + 1) - m_{1} (x) {\bar{h}}_{2} (x + 1)}{{\bar{h}}_{1} (x + 1) - {\bar{h}}_{2} (x + 1)} . \end{matrix}$

(2.114)
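The linear form in the remark can be checked numerically for a two-component geometric mixture. The sketch below is illustrative only and rests on assumptions not fixed by the text shown here: $S (t) = P (X ⩾ t)$, $h (t) = f (t) / S (t)$, $m (x) = \sum_{t > x} S (t) / S (x + 1)$, and component values $h_{i} = p_{i}$ and $m_{i} = 1 / p_{i}$. The coefficient of $h (x + 1)$ is taken as $(m_{1} - m_{2}) / (h_{1} - h_{2})$ and the additive term as $(m_{2} h_{1} - m_{1} h_{2}) / (h_{1} - h_{2})$; the function name and the truncation length are hypothetical.

```python
def check_mixture_identity(alpha, p1, p2, x, n_terms=5000):
    """Return (m(x), slope * h(x + 1) + intercept) for a geometric mixture."""
    q1, q2 = 1.0 - p1, 1.0 - p2

    def S(t):  # survival function, assumed to be P(X >= t)
        return alpha * q1 ** t + (1.0 - alpha) * q2 ** t

    def f(t):  # probability mass function of the mixture
        return alpha * p1 * q1 ** t + (1.0 - alpha) * p2 * q2 ** t

    h_next = f(x + 1) / S(x + 1)
    # Mean residual life from a truncated tail sum of the survival function.
    m = sum(S(t) for t in range(x + 1, x + 1 + n_terms)) / S(x + 1)

    # Component mean residual lives and hazard rates (constant for geometric).
    m1, m2, h1, h2 = 1.0 / p1, 1.0 / p2, p1, p2
    slope = (m1 - m2) / (h1 - h2)
    intercept = (m2 * h1 - m1 * h2) / (h1 - h2)
    return m, slope * h_next + intercept
```

For geometric components the slope reduces to $- 1 / (p_{1} p_{2})$ and the intercept to $(p_{1} + p_{2}) / (p_{1} p_{2})$, so the points $(h (x + 1), m (x))$ lie on a decreasing straight line.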

Example 2.25

Let X be distributed as a mixture of reversed geometric forms

$F (x) = α F_{1} (x) + (1 - α) F_{2} (x),$

where

$F_{i} (x) = \begin{cases} c_{i}^{- 1} {(1 + c_{i})}^{1 - b}, & x = 0 \\ {(1 + c_{i})}^{x - b}, & x = 1, 2, \dots . \end{cases}$

Inserting

$r_{i} (x) = \frac{1 + c_{i}}{c_{i}} \quad \text{and} \quad {\bar{λ}}_{i} (x) = \frac{c_{i}}{1 + c_{i}}$

in (2.113), we obtain

$r (x + 1) = \frac{c_{1} + c_{2} + 2 c_{1} c_{2}}{c_{1} c_{2}} - \frac{(1 + c_{1}) (1 + c_{2})}{c_{1} c_{2}} λ (x), \quad x = 1, 2, \dots$

(2.115)

The relationship given above provides a plot of $(λ (x), r (x + 1))$ as a decreasing straight line which is easy to verify. Notice that (2.115) is a characteristic property of this distribution.

2.12 Weighted Distributions

Recalling the definition of the weighted distribution in (1.20) that

$f_{W} (x) = \frac{W (x) f (x)}{E [W (X)]}$

and the hazard rate that follows from the survival function in (1.21), namely

$h_{W} (x) = \frac{W (x)}{E [W (X) | X > x]} h_{X} (x),$

the mean residual lives of X and $X_{W}$ satisfy the relationship

$\frac{m_{W} (x - 1) - 1}{m_{W} (x)} = \frac{p (x + 1)}{p (x)} \frac{m (x - 1) - 1}{m (x)}, \quad p (x) = E [W (X) | X > x] .$

As special cases, we have

$\begin{matrix} S_{L} (x) & = \frac{(m (x) + x) S (x)}{μ} = \frac{m^{⁎} (x) S (x)}{μ}, \\ h_{L} (x) & = \frac{x h (x)}{(m (x) + x)} = \frac{x h (x)}{m^{⁎} (x)}, \\ \frac{m_{L} (x - 1) - 1}{m_{L} (x)} & = \frac{m (x + 1) + (x + 1)}{m (x) + x} \frac{m (x - 1) - 1}{m (x)} . \end{matrix}$

Denoting by $h_{n} (x)$ and $m_{n} (x)$ the hazard rate and the mean residual life of the equilibrium random variable $Y_{n}$ (Section 1.4), we have from Nair et al. (2012b) that

$\begin{matrix} h_{n} (x) = \frac{1}{m_{n - 1} (x)}, \\ m_{n - 1} (x + 1) = \frac{m_{n} (x + 1)}{1 + m_{n} (x + 1) - m_{n} (x)} \end{matrix}$

and

$h_{n} (x + 1) = 1 + h_{n + 1} (x + 1) (1 - h_{n + 1}^{- 1} (x)) .$

In particular, for the first-order equilibrium random variable $X_{E}$ (Nair and Hitha, 1989), we have

$h_{E} (x) = \frac{1}{m (x)},$

so that the hazard rate of $X_{E}$ is the reciprocal of the mean residual life of X. Furthermore, we have

$\begin{matrix} h (x) = 1 + h_{E} (x) (1 - h_{E}^{- 1} (x - 1)), \\ m (x) = m_{E} (x) {[1 + m_{E} (x) - m_{E} (x - 1)]}^{- 1} . \end{matrix}$
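These first-order relations can be verified exactly on a small example. The sketch below is a check under assumptions, since Section 1.4 is not reproduced here: the equilibrium probability mass function is taken as $f_{E} (x) = P (X > x) / μ$, the hazard rate as $f (x) / P (X ⩾ x)$, and the mean residual life as $E [X - x | X > x]$. The identities tested are $h_{E} (x) = 1 / m (x)$, $h (x) = 1 + h_{E} (x) (1 - h_{E}^{- 1} (x - 1))$ and $m (x) = m_{E} (x) / (1 + m_{E} (x) - m_{E} (x - 1))$, that is, the $n = 0$ and $n = 1$ cases of the general identities for $h_{n}$ and $m_{n}$. All function names are hypothetical.

```python
from fractions import Fraction


def tail(pmf):
    """G[x] = P(X > x) for x = 0, ..., len(pmf) - 1."""
    return [sum(pmf[x + 1:]) for x in range(len(pmf))]


def hazard(pmf, x):
    """h(x) = f(x) / P(X >= x) (assumed convention)."""
    return pmf[x] / sum(pmf[x:])


def mrl(pmf, x):
    """m(x) = E[X - x | X > x] = sum_{t >= x} P(X > t) / P(X > x)."""
    G = tail(pmf)
    return sum(G[x:]) / G[x]


def equilibrium_pmf(pmf):
    """Assumed definition: f_E(x) = P(X > x) / E[X]."""
    G = tail(pmf)
    mu = sum(G)  # E[X] equals the sum of P(X > x) over x >= 0
    return [g / mu for g in G[:-1]]  # the last tail probability is zero


def check_equilibrium_identities(pmf):
    """Return three booleans, one per identity, for a finite-support pmf."""
    n = len(pmf)
    pmf_e = equilibrium_pmf(pmf)
    ok_recip = all(hazard(pmf_e, x) == 1 / mrl(pmf, x) for x in range(n - 1))
    ok_hazard = all(
        hazard(pmf, x)
        == 1 + hazard(pmf_e, x) * (1 - 1 / hazard(pmf_e, x - 1))
        for x in range(1, n - 1)
    )
    ok_mrl = all(
        mrl(pmf, x)
        == mrl(pmf_e, x) / (1 + mrl(pmf_e, x) - mrl(pmf_e, x - 1))
        for x in range(1, n - 2)
    )
    return ok_recip, ok_hazard, ok_mrl
```

Exact rational arithmetic is used so that the equalities can be tested without rounding tolerance; the argument must be a list of `Fraction` values with a positive entry at every point of the support.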

From the above identities, Nair and Hitha (1989) have also shown the following:

(a) a mean residual life of the form $A + B x$ characterizes the geometric, Waring and negative hyper-geometric distributions according as $B = 0$ , $B > 0$ and $B < 0$ , respectively;

(b) the random variables X and $X_{E}$ are such that $h (x) = C_{1} h_{E} (x)$ ( $m (x) = C_{2} m_{E} (x)$ ) if and only if X is geometric for $C_{1} = 1$ ( $C_{2} = 1$ ), Waring for $C_{1} > 1$ ( $C_{2} < 1$ ) and negative hyper-geometric for $C_{1} < 1$ ( $C_{2} > 1$ ).

More general characterizations covering the original and the nth-order equilibrium distributions have been provided by Nair et al. (2012b) as follows:

(a) $h_{n} (x) = (1 + C n) h (x)$ for all x and $n = 0, 1, 2, \dots$ ;
(b) $m_{n} (x) = {(1 + B n)}^{- 1} m (x)$ , for all x and $n = 0, 1, 2, \dots$ ;
(c) the variance residual life of $Y_{n}$ ,

$σ_{n}^{2} (x) = C_{n} m_{n} (x) (m_{n} (x) - 1),$

holds if and only if X has one of the above three distributions, the geometric ( $C = 0$ , $B = 0$ , $C_{n} = 1$ ), Waring ( $C < 0$ , $B > 0$ , $C_{n} > 1$ ), and negative hyper-geometric ( $C > 0$ , $B < 0$ , $C_{n} < 1$ ).

In particular, $σ_{n}^{2} (x) = σ^{2} (x)$ characterizes the geometric distribution, as does the property

$E [{(X - x - 1)}^{(n)} | X > x + n] = c,$

a constant, for all x and $n = 1, 2, \dots$ .

Li (2011) obtained formulas for the equilibrium distribution of the n-fold convolution of $f (x)$ and mixtures. If $f^{n ⁎}$ is the n-fold convolution of f (see Section 1.5) and $f_{m}^{n ⁎}$ is the m-th order equilibrium distribution of $f^{n ⁎}$ , then

$f_{m}^{n ⁎} (x) = \frac{μ_{(m)}}{μ_{(m)}^{n ⁎}} f_{m}^{(n - 1) ⁎} (x) + \frac{1}{μ_{(m)}^{n ⁎}} \sum_{i = 1}^{m} \binom{m}{i} μ_{(i)} μ_{(m - i)}^{(n - 1) ⁎} f_{i} (x),$

where $μ_{(r)}^{n ⁎}$ is the rth factorial moment of $f^{n ⁎}$ , $m = 1, 2, \dots$ , $n = 2, 3, \dots$ .

When the baseline distribution of X is $f (x | θ)$ , where θ is the realization of a continuous random variable Θ, the mixture (Section 1.3)

$g (x) = \int_{Θ} f (x | θ) d C (θ)$

has nth order equilibrium distribution

$g_{n} (x) = \int_{Θ} f_{n} (x | θ) d C_{n} (θ)$

where

$d C_{n} (θ) = \frac{E (X_{(n)} | θ) d C (θ)}{E (X_{(n)})} .$

Example 2.26

It has been shown in Example 1.2 that the nth order equilibrium distribution of a geometric variable X is the same as the distribution of X. Now, in this case, we have

$\begin{matrix} d C_{n} (q) & = \frac{E (X_{(n)} | q) d C (q)}{E (X_{(n)})} \\ & = n ! {(\frac{q}{p})}^{n} \frac{d C (q)}{E (X_{(n)})}, \end{matrix}$

showing that $g_{n} (x)$ is also a mixture of geometric laws.

Finally, we look at the relationships between various reliability measures in reversed time in the original and weighted versions. For this purpose, we note that the distribution function of the weighted law is given by

$\begin{matrix} F_{W} (x) & = \sum_{t = 0}^{x} W (t) f (t) / E [W (X)], \quad E [W (X)] < \infty \\ & = \frac{E [W (X) | X ⩽ x] F (x)}{E [W (X)]} . \end{matrix}$

Accordingly, the reversed failure rate of $X_{W}$ becomes

$λ_{W} (x) = \frac{f_{W} (x)}{F_{W} (x)} = \frac{W (x)}{E [W (X) | X ⩽ x]} λ (x) .$

Similarly, from (2.64),

$r_{W} (x) = \frac{1}{F_{W} (x - 1)} \sum_{t = 1}^{x} F_{W} (t - 1)$

gives

$\frac{r_{W} (x + 1) - 1}{r_{W} (x)} = \frac{p_{1} (x - 1)}{p_{1} (x)} \frac{r (x + 1) - 1}{r (x)}, \quad p_{1} (x) = E (W (X) | X ⩽ x) .$
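Both reversed-time relations can be confirmed on a finite example. The following sketch is illustrative: the probability mass function and the weight $W (t) = t + 1$ are arbitrary choices, the function names are hypothetical, and the conditional expectation is taken as $E [W (X) | X ⩽ x]$ in both identities. The reversed mean residual life is computed from $r (x) = \sum_{t = 1}^{x} F (t - 1) / F (x - 1)$.

```python
from fractions import Fraction


def cdf(pmf):
    """Cumulative sums F(0), F(1), ... of a pmf given as a list."""
    out, total = [], 0
    for p in pmf:
        total += p
        out.append(total)
    return out


def reversed_mrl(F, x):
    """r(x) = sum_{t=1}^{x} F(t - 1) / F(x - 1), defined for x >= 1."""
    return sum(F[t - 1] for t in range(1, x + 1)) / F[x - 1]


def check_weighted_reversed(pmf, W):
    """Return (hazard identity holds, mean residual life identity holds)."""
    n = len(pmf)
    ew = sum(W(t) * pmf[t] for t in range(n))          # E[W(X)]
    pmf_w = [W(t) * pmf[t] / ew for t in range(n)]     # weighted pmf
    F, Fw = cdf(pmf), cdf(pmf_w)
    # cond[x] = E[W(X) | X <= x]
    cond = [sum(W(t) * pmf[t] for t in range(x + 1)) / F[x] for x in range(n)]

    hazard_ok = all(
        pmf_w[x] / Fw[x] == W(x) / cond[x] * (pmf[x] / F[x]) for x in range(n)
    )
    mrl_ok = all(
        (reversed_mrl(Fw, x + 1) - 1) / reversed_mrl(Fw, x)
        == cond[x - 1] / cond[x] * (reversed_mrl(F, x + 1) - 1) / reversed_mrl(F, x)
        for x in range(1, n)
    )
    return hazard_ok, mrl_ok
```

For instance, `check_weighted_reversed([Fraction(1, 10), Fraction(2, 10), Fraction(3, 10), Fraction(4, 10)], lambda t: t + 1)` evaluates both identities in exact rational arithmetic.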

These identities, relationships and formulas are helpful while discussing properties of ageing concepts of weighted distributions and their relationships with those of the baseline distribution.
