Chapter 9. Multivariate Statistics

In this chapter, we will cover the following recipes:

  • Finding the principal components of a set of data
  • Using factor analysis to identify the underlying factors
  • Analyzing the consistency of a test paper using item analysis
  • Finding similarity in results by rows using cluster observations
  • Finding similarity across columns using cluster variables
  • Identifying groups in data using cluster K-means
  • The discriminant analysis
  • Analyzing two-way contingency tables with a simple correspondence analysis
  • Studying complex contingency tables with a multiple correspondence analysis

Introduction

Multivariate tools can be useful in exploring large datasets. They help us find patterns and correlations in the data; or, try to identify groups from within a larger dataset.

Tools such as principal components analysis and factor analysis are used as a way to identify underlying correlations or factors that are hidden in the data. The clustering tools try to find a similarity between observations or columns; for example, finding the similarity between how close the rows and variables are to each other.

Correspondence analysis helps us investigate relationships between two-way tables and even more complex tabular data.

We may find the use of multivariate tools as a precursor to modeling data in regression or ANOVA as these techniques can often lead to an understanding of the relationships between variables and the dimensionality of our results.

The data files used in the recipes are available for download on the Packt website.

The Multivariate tools are found under the Stat menu as shown in the following screenshot:

Introduction
..................Content has been hidden....................

You can't read the all page of ebook, please click here login for view all page.
Reset
18.191.14.93