01 Multivariate Analysis
01 Multivariate Analysis
DEPENDENCY INTERDEPENDENCY
Data Reduction technique
Customer’s ratings of a fast food restaurant.
Food taste
Food temperature Food quality
Freshness
Waiting time
Cleanliness Service quality
Friendliness of employees
To predict the changes in the dependent variable
in response to changes in independent variables
Monthly expenditure on dining out(dependent
variable) might be predicted from regarding a
family’s income, size, age of
households(independent variables)
Company’s sales from information on its
expenditure for advertising, the number of sales
people, number of stores carrying its product
Single dependent variable is nonmetric i.e.
dichotomous(e.g., male-female) or
multichotomous(e.g., high- medium- low).
Total population can be divided in to groups
based on non-metric dependent variable classes
It is used to understand group differences and to
predict the likelihood that an entity will belong
to particular class or group based on several
metric independent variables
Used to distinguish innovators from non-
innovators according to demographic and
psychographic profiles.
It is a combination of multiple discriminant
analysis and multiple regression.
Main difference b/w logistic regression and
multiple regression is dependent variable is non-
metric as discriminant analysis.
Does not require assumption of multivariate
normality.
Firms Successful over five years and
unsuccessful after five years
LR is used to select best firms for investment in
future for investment
It can be viewed as logical extension of
multiple regression analysis.
Several metric dependent variables and
several metric independent variables
The most direct application is in new product
or service development
Product
3 attribute (price, quality and color)
3 level(red, blue, green)
Instead of evaluating 27 combinations, a
subset of 9 or more can be evaluated for their
attractiveness to consumer
Same as factor…
But factor analysis is used for variables,
Cluster analysis is used for respondents
Highly motivated by low prices v/s who are
less motivated by price.
Objective is to transform consumer judgment
of similarity 0r preference
Facilitates the perceptual mapping of objects
on a set of non-metric attributes
Cross tabulation table
Demographic variables(such as age, gender,
income, occupation etc) by how many people
preferring each brand or preference for online
or offline..
Association or correspondence of brands
Supervision
Job search
Job
Work
satisfaction
Environment
Cowoerkers
Establish practical significance as well as statistical significance
Recognize that sample size affects all results
Know your data
• Influence of outliers
• Violation of assumptions
• Missing data
Strive for model parsimony
• Specification error
• Multicollinearity
Look at your errors
Validate your results
• Splitting samples
• Gathering a separate sample
• Resampling and Bootstrapping
STAGE- 1
Define the research problem, Objectives and
Multivariate techniques to be used
A conceptual model need not be complex and
detailed; it can be just a simple representation
of the relationship to be studied.
Model specified means the researcher has to
choose an appropriate multivariate technique
based on measurement characteristics of the
dependent and independent variables.
Stage – 2
Develop the analysis plan
At this stage attention turns to the
implementation issues.
Minimum or required sample size,
Allowable or required type of variables,
Data collection method etc.
Stage – 3
Evaluate the assumptions underlying the multivariate
technique
With the data collected, the first task is to evaluate its
underlying assumptions, both statistical and conceptual.
For the techniques based on statistical inference the
assumptions of multivariate normality, linearity,
independence of error terms, and equality of variance
must all be met.
Each technique also involves a series of conceptual
assumptions dealing with such issues as model
formulation and the types of relationships to be
represented.
Stage – 4
Estimate the Multivariate Model and assess overall
model fit
Actual estimation of the multivariate model and
assessment of overall model fit.
After the model estimation, the researcher needs to
ascertain whether it achieves acceptable levels on
statistical criteria( i.e. level of significance), identified the
proposed relationships and achieves practical significance.
Many times the model will be respecified in an attempt to
achieve better levels of overall fit and/ or explanations.
Overall fitting may be identified as outliers, influential
observations or the other disparate results(e.g., single
member clustering)
Stage- 5
Interpret the variates
Interpreting the variates revels the nature of
multivariate relationships.
The objective is to identify empirical evidence
multivariate relationship in sample data that
can be generalized to the total population.
Stage-6
Validate the multivariate model
i.e. final approval to generalization to the
total population