There is a special type of data known as dichotomous or binomial, which means that only two categorical options are being considered.
Some data is naturally binomial. Consider employee turnover, which is sometimes analyzed
as “leaver vs. stayer.” Consider business failure data, which is sometimes analyzed
as “failed vs. did not fail.” In marketing, we may have “bought vs. did not buy.”
In numerical terms, you can think of such variables as 0/1 data (coded as 0 or 1).
You can, however, look at any given category – no matter how many other categories
there are in the variable – in binomial terms. To do this, we would ”zoom in” on the
category and analyze it on its own, so that the categories would implicitly become
“belongs to this category” versus “does not belong to this category.” In the Size
data, although there are three categories, we could zoom in on (say) Big companies
only. Our focus would become the probability of belonging to this category, versus
other sizes.
Usually, we are interested in comparing the actual percentage of observations in the
focus category to a benchmark percentage. In the previous section we ascertained that
the Chi-square test suggested that the distribution of sizes differed significantly
from a benchmark set of sizes. Now, we may wish to focus on the Big companies. The
benchmark percentage for these was 50%, while in our sample there are 120 big companies
totaling 43% of the population.
The resulting analysis – seen in
Figure 15.7 Results of standard binomial test for one category below - gives several confidence intervals and p-values. In this case all agree that – at least with 95% confidence – the company’s 43% proportion
of big customers is statistically significantly lower than the industry benchmark
of 50%, because the confidence intervals exclude 50%, and because both p-values are
low suggesting a rejection of the null hypothesis that the company and industry figures
are equal.
Note again that the one-sided p-value supports a specific directional hypothesis,
for example a test of whether the company specifically has lower numbers of big customers
than the industry.
There are more tests that can be achieved with binomial proportions. One possibility
is to test for equivalence, which is a specific test for whether a categorical proportion in your data is statistically
equivalent to a benchmark value (whereas the usual binomial test looks at difference).
The interested reader can find out how to test for these specific issues in the SAS
9 helpfiles (the SAS/STAT 13.2 User’s Guide or the like). The binomial test example in the PROC FREQ section
includes an example of equivalence testing.