Appendix C

Quasi-likelihood functions and properties

The term quasi-likelihood function was initially introduced by Wedderburn (1974). While providing a way of allowing for overdispersion in statistical analysis, quasi-likelihood estimating equations are mostly applied for grouped binary or count data. In theories underlying this model, the quasi-likelihood function is shown to have similar properties to the log-likelihood function. Briefly, in this perspective, only relationship between the mean and the variance is specified in the form of a variance function.

Suppose we have independent observations Yi (i = 1, …, N) with expectation μi and variances V(μi), where V(·) is some known function. Let each observation μi be some known function of parameters β=(β1,...,βM)image. Then for each observation the quasi-likelihood function, denoted by Q(Yi,μi)image, is defined by the relation

Q(Yi,μi)μi=YiμiV(μi),

image(C.1)
or equivalently

μiQ(Yi,μi)=.μiYiμiV(μi)dμi+function of Yi.

image

For analytic convenience, the subscript i will be dropped in the succeeding texts, so that Y and μ will refer to an observation and its expectation, respectively.

Following immediately from the definition of Qimage, then Qimage has the following first characteristic:

EQμ=0.

image(C.2)

By applying the chain rule, we have

Qβm=Qμμβm,

image
where m = 1, …, M. Thus, we obtain another quasi-likelihood equation:

EQβm=0.

image(C.3)

Given V(μ) = var(Y), we have

EQQβmβm=EQμ2μβmμβm=E(Yμ)2[V(μ)]2μβmμβm=1V(μ)μβmμβm.

image(C.4)

Also, the second partial derivative of the quasi-likelihood function with respect to β can be written by

E2Qβm βm=EβmYμV(μ)μβm=E(Yμ)βm1V(μ)μβm1V(μ)μβmμβm=1V(μ)μβmμβm.

image(C.5)

Let l denote the log-likelihood and the distribution of Y be specified in terms of μ. Then, from the Cramér–Rao inequality (Stuart and Ord, 1994), we have

var(Y)1E2 lμ2.

image(C.6)

Summarizing the above inferences, the quasi-likelihood Qimage has the following properties:

Property (i):EQμ=0,Property (ii):EQβm=0,Property (iii):EQμ2=E2Qμ2=1V(μ),Property (iv):EQ Qβmβm=E2Qβm βm=1V(μ)μβmμβm,Property (v):E2Qμ2E2lμ2,

image
where Property (iii) is a special case of (iv), and for a one-parameter exponential family, the inequality in Property (v) becomes equality. Wedderburn (1974) mathematically proves that the precision of maximum quasi-likelihood estimates can be estimated from the expected second derivatives of Qimage in the same fashion as the precision of maximum likelihood estimates is estimated from the expected second derivatives of the log likelihood. In practice, using the Newton–Raphson method with the expected second derivatives of Qimage to calculate β^image is equivalent to iteratively calculating a weighted linear regression of the residuals on the quantities of the scores by linear least squares, with the residuals and the score being calculated from the current estimate of β^image.
..................Content has been hidden....................

You can't read the all page of ebook, please click here login for view all page.
Reset
52.15.74.25