5.3. A Model for Three Categories

First, some notation. Define

pi1 = the probability that WALLET=1 for person i,

pi2 = the probability that WALLET=2 for person i,

pi3 = the probability that WALLET=3 for person i.

Let xi be a column vector of explanatory variables for person i:

xi = [1 xi1 xi2 xi3 xi4]’

If this is unfamiliar, you can just think of xi as a single explanatory variable. In order to generalize the logit model to this three-category case, it’s tempting to consider writing three binary logit models, one for each outcome:


where the β’s are row vectors of coefficients. This turns out to be an unworkable approach, however. Because pi1+pi2+pi3=1, these three equations are inconsistent. If the first two equations are true, for example, the third cannot be true. Instead, we formulate the model as follows:


These equations are mutually consistent and one is redundant. For example, the third equation can be obtained from the first two. Using properties of logarithms, we have


which implies that β3 = β1β2. Solving for the three probabilities, we get


Because the three numerators sum to the common denominator, we immediately verify that the three probabilities sum to 1.

As with the binary logit model, the most general approach to estimation is maximum likelihood. I won’t go through the derivation, but it’s very similar to the binary case. Again, the Newton-Raphson algorithm is widely used to get the maximum likelihood estimates. The only SAS procedure that will do this is CATMOD.

..................Content has been hidden....................

You can't read the all page of ebook, please click here login for view all page.
Reset
18.224.62.105