logistic_regression
logistic_regression
a choice model
variation of multiple regression
allows for prediction of an event
can utilize non-metric (typically binary) dependent variables, as the objective is to
arrive at a probabilistic assessment of a binary choice
independent variables can be either discrete or continuous
a contingency table is produced, which shows the classification of
observations as to whether observed and predicted events match
measure of the effectiveness of the model:
the sum of events that were predicted to occur which actually did occur and the
events that were predicted not to occur which actually did not occur, divided by
the total number of events
Linear regression and the
logistic regression model
Y = + X
Y = dependent variable
X= predictor
= intercept, i.e. value of Y when X is zero
= slope, i.e. change of Y associated with one-unit increase of X
Linear regression and the
logistic regression model
Y = + 1X1,+2X2,+...+,kXk
Y = dependent variable
X= predictor
= intercept
i = partial slope coefficients
Linear regression and the
logistic regression model
multiple regression analysis:
Y = + 1X1,+2X2,+...+,kXk ,+
Y = dependent variable
X= predictor
= intercept
i = partial slope coefficients
= error term, a random variable
Linear regression and the
logistic regression model
Y = + 1X1,+2X2,+...+,kXk ,+
estimates of intercept and regression coefficients ( , ) are
obtained mathematically using the method of
ordinary least squaes (OLS)
(cf. practically every intro to statistics text)
Linear regression and the
logistic regression model
logarithmic transformation
( Y+1) = e X
or
Y = e X
-1
____________________________________
note: Adding 1 avoids taking the nat. log. of zero (undefined)
e = 2.72
probabilities, odds and odds ratios
__________________________________
Odds(Y=0) is the ratio of the probability that Y=1 to the probability that Y≠1 or
Odds(Y=1)= P(Y=1)/[1-P(Y=1)]
probabilities, odds and odds ratios