01 - Multivariate - Introduction To Multivariate Analysis
01 - Multivariate - Introduction To Multivariate Analysis
MULTIVARIATE ANALYSIS
MULTIVARIATE for BUSINESS
MANAJEMEN
Semester Ganjil 2022/2023
What is Multivariate Analysis?
• Multivariate data analysis is a powerful tool for researchers.
• Multivariate analysis techniques are popular because they enable
organizations to create knowledge and thereby improve their decision-
making.
• Multivariate analysis refers to all statistical techniques that
simultaneously analyze multiple measurements on individuals or objects
under investigation.
• Thus, any simultaneous analysis of more than two variables can be
considered multivariate analysis.
The Implications of Big Data, The Emergence
of Algorithmic Models and Causal Inference on
Multivariate Analysis
• Three emerging factors in the domain of analytics have created a very different
and constantly changing environment for researchers over the past decade and
will continue in the future.
• The emergence of Big Data has fundamentally influenced aspects include the
abundance of data (both variables and observations), changes in the
fundamental characteristics of data now available, and the increased desire for
data-driven decisions within all types of organizations.
• Along with the emergence of Big Data there is more widespread use of
algorithmic models where the emphasis is on prediction rather than explanation.
• Finally, the abundance of data and the addition of entirely new sets of
techniques has compelled researchers to strive for more causal inference in their
analyses in order to avoid capturing spurious correlations that can result in
invalid conclusions.
The Nature of Measurement Scales and Their
Relationship to Multivariate Techniques
• Data analysis involves the identification and measurement of
variation in a set of variables, either among themselves or between a
dependent variable and one or more independent variables.
• The key word here is measurement because the researcher cannot
identify variation unless it can be measured.
• Measurement is important in accurately representing the research
concepts being studied and is instrumental in the selection of the
appropriate multivariate method of analysis.
• Data can be classified into one of two categories—nonmetric
(qualitative) and metric (quantitative)—based on the type of
attributes or characteristics they represent. The researcher must
define the measurement type for each variable.
Understand the Nature of Measurement
Error and Its Impact on Multivariate
Analysis
• Use of multiple variables and reliance on their combination (the
variate) in multivariate methods focuses attention on a
complementary issue: measurement error.
• Measurement error is the degree to which the observed values
are not representative of the “true” values.
• Measurement error has many sources, ranging from data entry
errors to the imprecision of the measurement and the inability of
respondents to accurately provide information.
• When variables with measurement error are used to compute
correlations or means, the “true” effect is partially masked by the
measurement error, causing the correlations to weaken and the
means to be less precise.
Determine Which Multivariate Technique
is Appropriate for a Specific Research
Problem
The multivariate techniques can be classified based on three judgments
the researcher must make about the research objective and nature of the
data:
1) Can the variables be divided into independent and dependent
classifications based on some theory?
2) If they can, how many variables are treated as dependent in a single
analysis?
3) How are the variables, both dependent and independent, measured?
Selection of the appropriate multivariate technique depends on the
answers to these three questions.
Types of Multivariate Techniques
• Interdependence Techniques
• Exploratory Factor Analysis: Principal components and common factor analysis
• Cluster analysis
• Dependence Techniques
• Multiple regression and multiple correlation
• Multivariate analysis of variance and covariance
• Multiple discriminant analysis
• Logistic regression
• Structural equation modeling and confirmatory factor analysis
• Partial least squares structural equation modeling and confirmatory composite analysis
• Canonical correlation analysis
• Conjoint analysis
• Perceptual mapping, also known as multidimensional scaling
• Correspondence analysis
What type of
relationship is
being
examined?
Dependence Interdependence
• Assume a product concept has three attributes (price, quality, and color),
each at three possible levels (e.g., red, yellow, and blue as the three
levels of color).
• Instead of having to evaluate all (3 x 3 x 3) possible combinations, a
subset (9 or more) can be evaluated for their attractiveness to
consumers, and the researcher knows not only how important each
attribute is but also the importance of each level (e.g., the attractiveness
of red versus yellow versus blue).
• Moreover, when the consumer evaluations are completed, the results of
conjoint analysis can also be used in product design simulators, which
show customer acceptance for any number of product formulations and
aid in the design of the optimal product.
CORRESPONDENCE ANALYSIS (1)
• Correspondence analysis is a recently developed interdependence
technique that facilitates the perceptual mapping of objects (e.g., products,
persons) on a set of nonmetric attributes.
• Researchers are constantly faced with the need to “quantify the qualitative
data” found in nominal variables.
• Correspondence analysis differs from the interdependence techniques
discussed earlier in its ability to accommodate both nonmetric data and
nonlinear relationships.
• In its most basic form, correspondence analysis employs a contingency
table, which is the cross-tabulation of two categorical variables.
• It then transforms the nonmetric data to a metric level and performs
dimensional reduction (similar to exploratory factor analysis) and
perceptual mapping.
• Correspondence analysis provides a multivariate representation of
interdependence for nonmetric data that is not possible with other
methods.
CORRESPONDENCE ANALYSIS (2)
• With data collected, the first task is not to estimate the multivariate
model but to evaluate its underlying assumptions, both statistical and
conceptual, that substantially affect their ability to represent
multivariate relationships.
• For the techniques based on statistical inference, the assumptions of
multivariate normality, linearity, independence of the error terms, and
equality of variances must all be met.
• Each technique also involves a series of conceptual assumptions dealing
with such issues as model formulation and the types of relationships
represented.
• Before any model estimation is attempted, the researcher must ensure
that both statistical and conceptual assumptions are met.
STAGE 4: Estimate the multivariate model
and evaluate fit
• With the assumptions satisfied, the analysis proceeds to the actual estimation of
the multivariate model and an assessment of overall model fit.
• In the estimation process, the researcher may choose among options to meet
specific characteristics of the data (e.g., use of covariates in MANOVA) or to
maximize the fit to the data (e.g., rotation of factors or discriminant functions).
• After the model is estimated, the overall model fit is evaluated to ascertain
whether it achieves acceptable levels on statistical criteria (e.g., level of
significance), identifies the proposed relationships, and achieves practical
significance.
• Many times, the model will be respecified in an attempt to achieve better levels
of overall fit and/or explanation.
• In all cases, however, an acceptable model must be obtained before proceeding.
STAGE 5: Interpret the variates