When linking categorical to continuous variables, the basic process is to estimate
summary statistics such as the average and standard deviation of the continuous variables
for each category.
In “Data03_Aggregated” I have created aggregate trust and satisfaction variables (Chapter
5 discusses the process to do this). To see if trust, satisfaction, enquiries, and
sales differ by customer size and license type, we need to estimate the means and
standard deviations (possibly also medians and IQRs) for each continuous variable
within each category and combinations of categories.
Figure 8.9 Summary table relating size & license to continuous variables shows an example of the final analysis, in which we see each continuous variable
analyzed separately by each size and license combination.
You can also test statistically whether these means differ substantially. For instance,
in
Figure 8.9 Summary table relating size & license to continuous variables medium-sized firms actually have the highest sales. These sales levels across sizes
seem different, but are they really different enough to say that they are
statistically different? A set of statistics called “Comparison of Means,” introduced in Chapter
14, can further test such differences.
To get the means and standard deviation of continuous variables split by categories,
you can use various modules in SAS. I propose the PROC MEANS module. Open and run the file “Code08d Categorical Continuous“ to see how SAS presents
the raw output that you can format into a table such as the above. In that file, I
also provide code that analyzes each continuous variable across combinations of size and license.