Assignment On Regression
Assignment On Regression
Submitted To:
Dr.Amanpreet Singh
Submitted By:
Sanjana Chhabra
Roll no. 19421198
Section:- D
Let’s say it turned out that the regression equation was estimated as
follows:
Y = 42 + 2.3*X1 + 11*X2
Interpreting the Intercept
B0, the Y-intercept, can be interpreted as the value you would predict for
Y if both X1 = 0 and X2 = 0.
We would expect an average height of 42 cm for shrubs in partial sun
with no bacteria in the soil. However, this is only a meaningful
interpretation if it is reasonable that both X1 and X2 can be 0, and if the
data set actually included values for X1 and X2 that were near 0.
If neither of these conditions are true, then B0 really has no meaningful
interpretation. It just anchors the regression line in the right place. In our
case, it is easy to see that X2 sometimes is 0, but if X1, our bacteria level,
never comes close to 0, then our intercept has no real interpretation.
(Don’t forget that since the bacteria count was measured in 1000 per ml
of soil, 1000 bacteria represent one unit of X1).
Therefore, each coefficient does not measure the total effect on Y of its
corresponding variable, as it would if it were the only variable in the
model.
Definition: The Regression Line is the line that best fits the data, such
that the overall distance from the line to the points (variable values)
plotted on a graph is the smallest. In other words, a line used to
minimize the squared deviations of predictions is called as the regression
line.
There are as many numbers of regression lines as variables.
Suppose we take two variables, say X and Y, then there will be
two regression lines:
R-squared evaluates the scatter of the data points around the fitted
regression line. It is also called the coefficient of determination, or the
coefficient of multiple determination for multiple regression. For the
same data set, higher R-squared values represent smaller differences
between the observed data and the fitted values.
Usually, the larger the R2, the better the regression model fits your
observations.