GED102-1 Multiple and Nonlinear Regression Excel
SAMPLE PROBLEM
The survival rate of sperm (y, in %) from animal semen was measured at different concentrations using three
types of materials (x1, x2 & x3). The data obtained are given in the table below. Find the multiple linear
regression model for the given data. Determine also the coefficient of determination and interpret the
results.
y x1 x2 x3
26.7 1.51 4.5 12.05
32.4 6.09 4.62 10.65
27.1 5.99 7.61 8.45
39.6 10.29 3.83 9.75
19.6 0.96 10.8 10.65
27.9 0.99 5.05 11.15
27.6 3.87 5.82 9.25
26.4 6.09 7.92 10.35
32.5 3.85 3.62 9.95
25.7 3.92 6.8 10.45
Estimate the % survival rate if x1= 5.58%, x2= 8.5% & x3= 11.3%.
If values of x1, x2 & x3 are given, y can be obtained using the above equation.
x1 5.58
x2 8.5
x3 11.3
y 26.15733
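The worksheet does not list the fitted coefficients, so the following is only a minimal Python sketch of how the estimate could be reproduced outside Excel. It assumes the ten data rows of the sample problem, builds the normal equations, and solves them by Gaussian elimination with partial pivoting. The helper names solve and predict are illustrative and do not come from the worksheet.

```python
# Data from the sample problem: y (% survival) and concentrations x1, x2, x3.
y = [26.7, 32.4, 27.1, 39.6, 19.6, 27.9, 27.6, 26.4, 32.5, 25.7]
x1 = [1.51, 6.09, 5.99, 10.29, 0.96, 0.99, 3.87, 6.09, 3.85, 3.92]
x2 = [4.5, 4.62, 7.61, 3.83, 10.8, 5.05, 5.82, 7.92, 3.62, 6.8]
x3 = [12.05, 10.65, 8.45, 9.75, 10.65, 11.15, 9.25, 10.35, 9.95, 10.45]


def solve(a, b):
    """Solve a*x = b by Gaussian elimination with partial pivoting (hypothetical helper)."""
    n = len(b)
    m = [row[:] + [rhs] for row, rhs in zip(a, b)]  # augmented matrix [a | b]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    x = [0.0] * n
    for r in range(n - 1, -1, -1):  # back substitution
        s = sum(m[r][c] * x[c] for c in range(r + 1, n))
        x[r] = (m[r][n] - s) / m[r][r]
    return x


# Design columns: a column of ones for b0, then x1, x2, x3.
cols = [[1.0] * len(y), x1, x2, x3]
# Normal equations A*b = B with A[i][j] = sum(col_i*col_j) and B[i] = sum(col_i*y).
A = [[sum(p * q for p, q in zip(ci, cj)) for cj in cols] for ci in cols]
B = [sum(p * q for p, q in zip(ci, y)) for ci in cols]
b0, b1, b2, b3 = solve(A, B)


def predict(v1, v2, v3):
    """Evaluate the fitted model y = b0 + b1*x1 + b2*x2 + b3*x3."""
    return b0 + b1 * v1 + b2 * v2 + b3 * v3


y_new = predict(5.58, 8.5, 11.3)
print(round(y_new, 3))  # the worksheet reports y = 26.15733 for this point
```

Solving the system directly avoids forming the explicit inverse; the worksheet's inverse-then-multiply route should give the same coefficients up to rounding.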
R^2 Computation
SSR SST
x1*x3 x2*x3 x1*y x2*y x3*y yi^ (yi^-y_bar)^2 (yi-y_bar)^2
18.1955 54.225 40.317 120.15 321.735 28.04742 0.252587778 3.4225
64.8585 49.203 197.316 149.688 345.06 32.54448 15.95590482 14.8225
50.6155 64.3045 162.329 206.231 228.995 27.80149 0.560263387 2.1025
100.3275 37.3425 407.484 151.668 386.1 38.05215 90.29092789 122.1025
10.224 115.02 18.816 211.68 208.74 17.85145 114.4590206 80.1025
11.0385 56.3075 27.621 140.895 311.085 26.63261 3.676378006 0.4225
35.7975 53.835 106.812 160.632 255.3 28.35935 0.036347969 0.9025
63.0315 81.972 160.776 209.088 273.24 27.52209 1.056595584 4.6225
38.3075 36.019 125.125 117.65 323.375 31.70993 9.985151606 15.6025
40.964 71.06 100.744 174.76 268.565 26.97902 2.467972721 8.1225
SUM 433.36 619.2885 1347.34 1642.442 2922.195
SSR = 238.7411503 SST = 252.225
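As a cross-check of the R^2 computation, the sums of squares can be recomputed from the observed y values and the fitted values (the yi^ column of the table). This is a small Python sketch that copies those two columns as printed; the variable names are illustrative.

```python
# Observed y and fitted values yi^ copied from the R^2 computation table.
y = [26.7, 32.4, 27.1, 39.6, 19.6, 27.9, 27.6, 26.4, 32.5, 25.7]
y_hat = [28.04742, 32.54448, 27.80149, 38.05215, 17.85145,
         26.63261, 28.35935, 27.52209, 31.70993, 26.97902]

y_bar = sum(y) / len(y)

# SSR: variation explained by the regression (fitted values about the mean).
ssr = sum((f - y_bar) ** 2 for f in y_hat)
# SST: total variation of the observations about the mean.
sst = sum((v - y_bar) ** 2 for v in y)
# SSE: unexplained variation (residuals), computed directly.
sse = sum((v - f) ** 2 for v, f in zip(y, y_hat))

r2 = ssr / sst
print(round(ssr, 4), round(sst, 4), round(sse, 4), round(r2, 5))
```

Because the model includes an intercept, SSR + SSE should agree with SST up to the rounding of the fitted values.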
R^2 = SSR/SST = 0.94654 (good fit); it indicates the proportion of the variation in y explained by the three independent variables
SSE = SST - SSR = 13.48385
Check first whether the coefficient matrix A is nonsingular (use mdeterm):
det(A)= 173754.374 (nonzero, so A is nonsingular and the matrix inverse exists)
inv(A)=
-0.571619126 -0.48287658 -2.1797133
0.0230594769 0.011133964 0.0393119
0.011133964 0.027828364 0.0258832
0.03931194 0.025883216 0.1803014
ft then press enter
NORMAL EQUATIONS
b0 b1 b2 b3 B
n sumx1 sumx2 sumx3 sum(y)
sumx1 sum x1^2 sum x1*x2 sum x1*x3 sum(x1*y)
sumx2 sum x1*x2 sum x2^2 sum x2*x3 sum(x2*y)
sumx3 sum x1*x3 sum x2*x3 sum x3^2 sum(x3*y)
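The nonsingularity check can also be sketched in Python. The example below assumes the same ten data rows, assembles the 4x4 coefficient matrix of the normal equations, and computes its determinant by elimination, which is one common way to do it and not necessarily what Excel does internally. The function name determinant is illustrative.

```python
y = [26.7, 32.4, 27.1, 39.6, 19.6, 27.9, 27.6, 26.4, 32.5, 25.7]
x1 = [1.51, 6.09, 5.99, 10.29, 0.96, 0.99, 3.87, 6.09, 3.85, 3.92]
x2 = [4.5, 4.62, 7.61, 3.83, 10.8, 5.05, 5.82, 7.92, 3.62, 6.8]
x3 = [12.05, 10.65, 8.45, 9.75, 10.65, 11.15, 9.25, 10.35, 9.95, 10.45]


def determinant(a):
    """Determinant via Gaussian elimination with partial pivoting (hypothetical helper)."""
    m = [row[:] for row in a]
    n = len(m)
    det = 1.0
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        if m[pivot][col] == 0.0:
            return 0.0  # a zero pivot column means the matrix is singular
        if pivot != col:
            m[col], m[pivot] = m[pivot], m[col]
            det = -det  # each row swap flips the sign
        det *= m[col][col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n):
                m[r][c] -= factor * m[col][c]
    return det


# Coefficient matrix: rows/columns ordered as (1, x1, x2, x3).
cols = [[1.0] * len(y), x1, x2, x3]
A = [[sum(p * q for p, q in zip(ci, cj)) for cj in cols] for ci in cols]

det_A = determinant(A)
is_nonsingular = abs(det_A) > 1e-9
print(round(det_A, 3), is_nonsingular)  # the worksheet reports det(A) = 173754.374
```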
NON LINEAR REGRESSION
This is another curve-fitting method that can be used when a nonlinear relationship exists between two variables.
y=?
x= 40
y= 27.34
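SUM (x1 = x, x2 = x^2)
sumx1 = 135
sum(y) = 182.1
sumx2 = sum x1^2 = 3475
sum(x1*y) = 4154.5
sum(x2*y) = 107512.5
sum x1*x2 = 97875
sum x2^2 = 2921875
r= 0.587634 nonlinear
R^2=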
det(A)= 61250000
non singular
matrix inverse exists
B
sum(y)
sum(x1*y)
sum(x2*y)
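The second-order fit can be reproduced from the column sums alone. The Python sketch below makes two assumptions that the worksheet does not state explicitly: that there are n = 6 data points (this value reproduces det(A) = 61250000), and that the listed sums are, in order, sum x, sum y, sum x^2, sum x*y, sum x^2*y, sum x^3 and sum x^4. It solves the 3x3 normal equations by Cramer's rule, a swapped-in technique chosen for brevity in place of the worksheet's inverse-and-multiply route.

```python
# Assumed: n = 6 observations; sums taken from the SUM column of the worksheet.
n = 6
sx, sy = 135.0, 182.1
sx2, sxy = 3475.0, 4154.5
sx2y, sx3, sx4 = 107512.5, 97875.0, 2921875.0


def det3(m):
    """Determinant of a 3x3 matrix by cofactor expansion along the first row."""
    return (m[0][0] * (m[1][1] * m[2][2] - m[1][2] * m[2][1])
            - m[0][1] * (m[1][0] * m[2][2] - m[1][2] * m[2][0])
            + m[0][2] * (m[1][0] * m[2][1] - m[1][1] * m[2][0]))


def replace_column(m, j, col):
    """Return a copy of m with column j replaced by col (hypothetical helper)."""
    return [[col[i] if c == j else m[i][c] for c in range(3)] for i in range(3)]


# Normal equations for y = b0 + b1*x + b2*x^2.
A = [[n, sx, sx2],
     [sx, sx2, sx3],
     [sx2, sx3, sx4]]
B = [sy, sxy, sx2y]

det_A = det3(A)
b0, b1, b2 = (det3(replace_column(A, j, B)) / det_A for j in range(3))

x_new = 40
y_40 = b0 + b1 * x_new + b2 * x_new ** 2
print(round(b0, 2), round(b1, 3), round(b2, 4), round(y_40, 2))
```

Cramer's rule is practical only for very small systems like this one; for larger models, elimination or a linear-algebra library is the usual choice.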
SUMMARY OUTPUT
Regression Statistics
Multiple R 0.97290307
R Square 0.94654039
Adjusted R Square 0.91981059
Standard Error 1.4991
Observations 10
ANOVA
df SS MS F Significance F
Regression 3 238.74115 79.5803834 35.4114227 0.00032744
Residual 6 13.4838497 2.24730828
Total 9 252.225
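The entries of the summary output are linked by a few standard formulas, which the following Python sketch recomputes from SS Regression and SS Total as printed in the ANOVA table. The variable names are illustrative; n = 10 observations and k = 3 predictors are taken from the table.

```python
import math

n, k = 10, 3                   # observations and number of predictors
ssr, sst = 238.74115, 252.225  # SS Regression and SS Total from the ANOVA table
sse = sst - ssr                # SS Residual

df_reg, df_res = k, n - k - 1  # degrees of freedom: 3 and 6
msr = ssr / df_reg             # mean square, regression
mse = sse / df_res             # mean square, residual

f_stat = msr / mse             # F statistic
r2 = ssr / sst                 # R Square
multiple_r = math.sqrt(r2)     # Multiple R
adj_r2 = 1 - (1 - r2) * (n - 1) / (n - k - 1)  # Adjusted R Square
std_error = math.sqrt(mse)     # standard error of the regression

print(round(f_stat, 4), round(r2, 8), round(adj_r2, 8), round(multiple_r, 8))
```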
RESIDUAL OUTPUT
NON LINEAR REGRESSION
NORMAL EQUATIONS (x1 = x, x2 = x^2)
     b0     b1         b2
A =  n      sumx1      sumx2
     sumx1  sum x1^2   sum x1*x2
     sumx2  sum x1*x2  sum x2^2
Solve with mmult: X = inv(A)*B, where X = [b0, b1, b2]
ANS: y = 17.56 + 1.153x - 0.0227x^2