Professional Documents
Culture Documents
KrIsHna
Multivariate Analysis
Multivariate Analysis is a study of several
dependent random variables simultaneously.
These analysis are straight generalization of
univariate analysis.
Certain distributional assumptions are
required for proper analysis.
The mathematical framework is relatively
complex as compared with the univariate
analysis.
These analysis are being used widely around
the world.
Multivariate Analysis
Methods
Two general types of MVA technique
Analysis of dependence
Where one (or more) variables are dependent
variables, to be explained or predicted by others
E.g. Multiple regression, PLS
Analysis of interdependence
No variables thought of as “dependent”
Look at the relationships among variables, objects or
cases
E.g. cluster analysis, factor analysis
Some Multivariate Measures
The Mean Vector
Collection of the means of the variables under
study
The Covariance Matrix
Collection of the Variances and Covariances of
the variables under study
The Correlation Matrix
Collection of Correlation Coefficients of the
variables involved under study
The Generalized Variance
Determinant of the Covariance Matrix
Some Multivariate Tests of
Significance
Testing significance of a single mean vector
Testing equality of two mean vectors
Testing equality of several mean vectors
Testing significance of a single covariance
matrix
Testing equality of two covariance matrices
Testing equality of several covariance
matrices
Testing independence of sets of variates
The Factor Analysis
Deals with the grouping of like variables in
sets.
Sets are formed in decreasing order of
importance.
Sets are relatively independent from each
other.
Two types are commonly used:
The Exploratory Factor Analysis
The Confirmatory Factor Analysis
ij
The quantity is loading of i–th variable on
j–th factor and measures the degree of
dependence of a variable on a factor.
The i–th communality; that measures the
portion of variation of i–th variable explained by
j–th factor; is givenmas
ij
2
j 1
Factor Rotation
Rotation is done to simplify the solution of
factor analysis.
Interpretations can be easily done from rotated
solution.
Two types of rotations are available:
Orthogonal Rotation; factors formed are
orthogonal
Oblique Rotation; factors formed are correlated
Cluster Analysis
Techniques for identifying separate groups of
similar cases
Similarity of cases is either specified directly in
a distance matrix, or defined in terms of some
distance function
Also used to summarise data by defining
segments of similar cases in the data
This use of cluster analysis is known as
“dissection”
Clustering Techniques
Two main types of cluster analysis methods
Hierarchical cluster analysis
Each cluster (starting with the whole dataset) is divided
into two, then divided again, and so on
Iterative methods
k-means clustering (PROC FASTCLUS)