Assignment No 3
Assignment No 3
NO 3
The best solution for a set of components/factors
Exploratory Factor
Analysis
2021
Factor analysis is a technique that is used to reduce a large number of
variables into fewer numbers of factors. This technique extracts maximum
common variance from all variables and puts them into a common score.
1. Data screening: calculate the descriptive statistics and a correlation matrix of all
variables to be used in the analysis.
2. Extract factors
3. Rotate factors to create a more understandable factor structure
4. Interpret results
The first part of the analysis consists of determining the number of extracted factors.
1. Click Analyze, Click Data Reduction and Click Factor. You should see the Factor
Analysis dialogue box.
2. Holding down the control key, click the 58 SPSS variables (items a through j). The
click on > to move the items to the Variables box in the Factor Analysis dialog box.
3. At the bottom of this box, click on Descriptives. In the Descriptives dialog box make
sure that Univariate descriptives, Coefficients, Significance Levels, Determinant,
KMO, Barlett’s test of Sphericity and Reproduced are checked. Now click on
Continue.
4. Click Extraction. You will see the “Factor Analysis: Extraction” dialog box.
5. In the “Extraction” box you will notice that the default Method is Principal
Components. This is the most commonly used method for exploratory purposes.
6. Continuing in “Extraction” box, we also want to make sure that the boxes
Correlation Matrix, Unrotated Factor Solution, and Eignevalues over: 1 and
Scree plot are checked. Now click on Continue.
Rotating Factors
7. When back in the Factor Analysis dialogue box, click Rotation. You will see the
Rotation Dialog box.
8. In the Rotation dialog box make sure that you check Varimax. Varimax is a type of
orthogonal rotation method. Make sure that the Rotated Solution and Loading plots
boxes are checked. Change the maximum iterations to “30”. (Normally 25 is
sufficient but we have an unusually large dataset to work with in this example). Now
click on Continue
9. Now back in the Factor Analysis dialogue box, click Options.
Page | 3
EXPLORATORY FACTOR ANALYSIS
10. In the Factor Analysis: Options box under Missing Values select Exclude Cases
Pairwise. Now under Coefficient Display Format: Make sure that you check Sorted
by Size. Click on Continue.
11. From the Factor Analysis dialogue box click on Scores. In the Factor Analysis:
Scores dialogue box check Save as variables and under Save as Variables select
Anderson-Rubin. Next check Display Factor Score Coefficient Matrix. Click on
Continue.
STANDARD DEVIATION:
Page | 4
EXPLORATORY FACTOR ANALYSIS
Page | 5
EXPLORATORY FACTOR ANALYSIS
CORRELATION MATRIX
Page | 6
EXPLORATORY FACTOR ANALYSIS
Page | 7
EXPLORATORY FACTOR ANALYSIS
Page | 8
EXPLORATORY FACTOR ANALYSIS
OUTPUT 1:
Correlations
The first step is to examine the correlation matrix (refer to output) between variables (items) to
examine how well they relate to one another. If we find that there are variables that do not
correlate well with any other variables (or very few) then we should consider excluding these
variables before the factor analysis is conducted. We would like to see our correlation
coefficients exceed .30.
Multicollinearity (Singularity)
The opposite problem of low correlations is variable that correlate too highly. It is important to
avoid extreme multicollinearity (i.e. variable that are very highly correlated) or singularity
(variables that are perfectly correlated). As with regression, singularity causes problems in factor
analysis because it becomes impossible to determine the unique contribution of a variable that is
highly correlated with another variable.
Bottom Line for Treating Collinearity
At this early stage we look to eliminate any variables that show no relationship (do not correlate)
with any other variables or that correlate too highly with other variables (i.e. r > .90).
OUTPUT 2:
To evaluate the issues of low correlations and Singularity refer to the Correlation Matrix in
addition to KMO (Kaiser-Meyer-Olkin) and Barlett’s Test of Sphericity sections of the SPSS
output.
Page | 9
Communalities
Extractio
EXPLORATORY FACTOR ANALYSIS Initial n
TEC 1.000 .705
r > .8 and consider 1 eliminating them from the
analysis. TEC 1.000 .655
3. The Barlett’s test is 2 designed to determine if the
correlation matrix is an identity matrix (where all
TEC 1.000 .571
correlation coefficients are 0). A significant value (less
than .05) indicates that 3 the data do not produce an
identity matrix TEC 1.000 .625 indicating there are adequate
relationships between 4 variables to conduct the factor
analysis. Results from TEC 1.000 .536 this test also indicate that the
correlations among 5 variables overall are not so
strong suggesting multicollinearity.
TEC 1.000 .574
6
4. The KMO test is a TEC 1.000 .669 measure of whether the
distribution of values 7 based on the sample is
adequate for conducting TEC 1.000 .557 a factor analysis. This test
indicates the amount of overlap or shared variance
between pairs of 8 variables (remember we are
trying to identify items PA1 1.000 .699 that are related but yet provide
unique information to PA2 1.000 .626 the factors we are attempting to
identify). Values PA3 1.000 .627 should be greater than .5.The
value is .753 which is PA4 1.000 .565 good.
PA5 1.000 .494
COMMUNALITIES
PA7 1.000 .505
PA8 1.000 .500
PA9 1.000 .523
OUTPUT 3: PA10 1.000 .571
Refer to the output entitled PA11 1.000 .692 “Communalities”.
Communalities are estimates PA12 1.000 .572 of shared or common variance
PA14 1.000 .504
among the variables after extraction has taken place.
PA15 1.000 .555
Communalities for each SR2 1.000 .618 variable can also be interpreted
as the squared multiple SR3 1.000 .619 correlation (R2) of the variable
predicted from the combination SR4 1.000 .643 of extracted factors. The goal
SR5 1.000 .618
of factor analysis is to identify groups of variables (items in
SR7 1.000 .554
this case) that are related to one another and derive a
SR8 1.000 .631
description of the underlying SR9 1.000 .641 traits that best represent the
data structure SR10 1.000 .700
SR11 1.000 .605
TOTAL VARIANCE SR12 1.000 .685 EXPLAINED
FC3 1.000 .606
FC4 1.000 .502
FC6 1.000 .544 Page | 10
FC7 1.000 .553
CBI4 1.000 .606
CBI6 1.000 .663
EXPLORATORY FACTOR ANALYSIS
Page | 11
EXPLORATORY FACTOR ANALYSIS
17 . 1.890 72.69
945 7
18 . 1.827 74.52
913 4
19 . 1.655 76.17
827 9
20 . 1.536 77.71
768 4
21 . 1.523 79.23
762 8
22 . 1.487 80.72
743 5
23 . 1.397 82.12
699 2
24 . 1.285 83.40
642 6
25 . 1.201 84.60
600 7
26 . 1.143 85.75
572 1
27 . 1.060 86.81
530 1
28 . .986 87.79
493 7
29 . .951 88.74
475 8
30 . .885 89.63
443 3
31 . .835 90.46
418 8
32 . .808 91.27
404 6
33 . .760 92.03
380 6
34 . .729 92.76
364 5
35 . .710 93.47
355 5
Page | 12
EXPLORATORY FACTOR ANALYSIS
36 . .633 94.10
316 8
37 . .608 94.71
304 7
38 . .593 95.31
297 0
39 . .557 95.86
279 7
40 . .534 96.40
267 2
41 . .497 96.89
249 9
42 . .489 97.38
245 8
43 . .447 97.83
224 6
44 . .433 98.26
216 9
45 . .378 98.64
189 7
46 . .317 98.96
158 4
47 . .292 99.25
146 6
48 . .275 99.53
138 1
49 . .248 99.77
124 9
50 . .221 100.0
111 00
Page | 13
EXPLORATORY FACTOR ANALYSIS
OUTPUT 4:
The total amount of variance for a component or factor is represented as an
Eigenvalue. The eigenvalue for the first component is 9.584 and accounts for of
19.167the variability or variance of the total data structure .Components or
factors with eigenvalues of “1” or greater are considered to contribute
significantly to the data structure. SPSS by default extracts only components or
factors with eigenvalues of “1” or greater.
SCREE PLOT:
OUTPUT 5:
If there are less than 30 variables and communalities after extraction are
greater than 0.7 or if the sample size exceeds 250 and the average
communality is greater than 0.6 then retain all factors with Eigen values
Page | 14
EXPLORATORY FACTOR ANALYSIS
COMPONENT MATRIX
Component Matrixa
Component
1 2 3 4 5 6 7 8 9 10 11
PA .582
16
SR .572
10
SR .568
12
SR .566
7
PA .564
15
SR .558
2
SR .554
1
SR .553 .417
3
TE .543
C8
PA .543
4
SR .542
11
CB .532
I4
SR .532
6
TE .529
C3
CB .528
I8
Page | 15
EXPLORATORY FACTOR ANALYSIS
CB .524
I9
SR .521
8
PA .520
5
SR .519
5
SR .518
9
TE .510
C6
PA .507
2
SR .496
4
CB .495
I1
CB .495
I7
CB .491
I6
CB .490 -.405
I5
CB .484 -.475
I2
PA .480
3
TE .466
C4
TE .455
C5
TE .453
C7
PA .607 .424
11
FC .598
5
Page | 16
EXPLORATORY FACTOR ANALYSIS
FC .554
4
FC .516
6
PA .508
14
PA .506
9
FC .497
3
PA .470
6
PA .420
7
PA .413 .406
10
PA
8
PA .405 .512
12
PA
13
FC
8
FC
7
PA .522
1
TE .403 -.404 .421
C2
TE .423
C1
Extraction Method: Principal Component Analysis.
a. 11 components extracted.
Page | 17
EXPLORATORY FACTOR ANALYSIS
Page | 18
EXPLORATORY FACTOR ANALYSIS
CB .565
I4
CB .489
I9
PA .650
6
PA .642
7
PA .642
8
PA .590
9
PA .585
14
FC .743
3
FC .647
5
FC .632
6
FC .598
4
FC .533
8
FC .490
7
SR .616
4
SR .613
5
SR .545
3
SR .504
6
PA .456
16
PA .752
1
Page | 19
EXPLORATORY FACTOR ANALYSIS
SR .586
1
PA .542
2
PA .695
3
PA .488
4
PA .467
5
PA .440
15
CB .674
I2
CB .596
I1
CB .433
I5
PA .614
11
PA .542
12
PA .508
13
PA .503
10
TE .774
C1
TE .558
C2
TE .491
C3
Extraction Method: Principal Component Analysis.
Rotation Method: Varimax with Kaiser Normalization.a
a. Rotation converged in 32 iterations.
OUTPUT 6:
Page | 20
EXPLORATORY FACTOR ANALYSIS
The Rotated Component (Factor) Matrix table in SPSS provides the Factor Loadings for
each variable (in this case item) for each factor. A Factor Loading is the Pearson correlation (r)
coefficient between the original variable with a factor. For example, if we consider question 6,
we can see that it “loads on” or correlates .800 with Component 1 (Factor 1), -.010 with
Component 2 (Factor 2) and .097 with Component 3 (Factor 3) and -.072 with Component 4
(Factor4).
Page | 21
EXPLORATORY FACTOR ANALYSIS
REPRODUCED CORELATION:
OUTPUT 8
the residuals provided by the reproduced correlation matrix – That is consider the residuals
or difference between the actual correlations and reproduced correlations that stem from the
factor analysis model based on the data analyzed. When considering the “Reproduced
Correlation Matrix” retain the components generated by the model if only a few residuals (the
difference between the empirical and reproduced correlations represented in the lower portion
of the “Reproduced Correlation Matrix” output) exceed a difference of .05 between actual
correlations and reproduce correlations. If several reproduced correlations differ, you may want
to include more components.
A condition of the data of concern is when more that 50% of reproduced and actual
correlations differ by more than .05
Page | 22
EXPLORATORY FACTOR ANALYSIS
CONCLUSION:
The purpose of factor analysis is to reduce a large set of data into a smaller subset of
measurement variables. The factor scores tell us an individual’s score on this subset of measures.
Therefore, any further analysis can be done using factor scores rather than the original data.
Secondly, factor scores may be appropriate to use for Multiple Regression analysis because they
are produced from uncorrelated factors. Thus these scores reduce or eliminate multicollinearity
that can cause problems with multiple regression analysis.
Page | 23