Chapter 7
Chapter 7
Chapter 7
Traffic Engineering
Chapter objectives: By the end of these chapters the student will be
able to (We spend 3 lecture periods for this chapter. We do skip
simple descriptive stats because they were covered in CE361.):
Apply the basic principles of statistics contained in section 7.1 to traffic data
analyses
Explain the characteristics of the normal distribution and read the normal
distribution table correctly
Explain the meaning of confidence bounds and determine the confidence
interval of the mean
Determine sample sizes of traffic data collection
Explain how random variables are added
Explain the implication of the central limit theorem
Explain the characteristics of various probabilistic distributions useful for traffic
engineering studies and choose a correct distribution for the study
Explain the special characteristics of the Poisson distribution and its
usefulness to traffic engineering studies
Conduct a hypothesis test correctly (two-sided, one-sided, paired test, F-test)
Conduct a Chi-square test to test hypotheses on an underlying distribution f(x)
Introduction
Traffic engineering studies: Infer the characteristics
in a population (typically infinite) by observing the
characteristics of a finite sample.
Statistical analysts is used to address the
following questions:
How many samples are required?
What confidence should I have in this estimate?
What statistical distribution best describes the observed
data mathematically?
Has a traffic engineering design resulted in a change in
characteristics of the population (hypothesis tests)?
7.1 An Overview of Probability Functions and Statistics
Most of the topics in this section are reviews of what we have learned
in CEEn 361. (Review 7.1.1, 7.1.3 and 7.1.4 by yourself.)
(Sample) x i s2 i
N i 1 N 1
= 1.43
Z = 1.43
X
Point estimate of X from a
sample
X
Two-sided interval
X – tas/sqrt(n) X + tas/sqrt(n) estimate
7.3 (cont)
Y ai X i
Expected value (or mean) of the random variable Y:
Y ai xi
f (X )
F(x)
approaches
X
x
µ µ X
X distribution X distribution
X ~ any (µ, 2) X ~ N ( , X 2 )
7.6 The Binomial Distribution Related to the
Bernoulli and Normal Distributions
7.6.1 Bernoulli and the Binomial distribution (discrete
probability functions))
P(type II error ) =
Dependence between , , and sample
size n
There is a distinct relationship between the two probability values
and and the sample size n for any hypothesis. The value of any one
is found by using the test statistic and set values of the other two.
One-sided upper
Two-sided or one-sided test
12 22
82 82
Y 1.53
n1 n2 55 55 |µ1 - µ2| = |60-55| = 5 > zc
z / 2 1.96 z 1.65
By either test, H0 is
At significance level = 0.05 (See rejected.
Table 7-3.)
7.8.3 Other useful statistical tests
The t-test (for small samples, n<=30) – Table 7.6:
When the t-test and other similar means tests are conducted,
there is an implicit assumption made that s1=s2. The F-test can
test this hypothesis.
s12
F 2 The numerator variance > The denominator
variance when you compute a F-value.
s2
If Fcomputed ≥ Ftable (n1-1, n2-1, a), then s1≠s2 at a asignificance
level.
You perform a paired difference test only when you have a control
over the sequence of data collection.
e.g. Simulation You control parameters. You have two different
signal timing schemes. Only the timing parameters are changed. Use
the same random number seeds. Then you can pair. If you cannot
control random number seeds in simulation, you are not able to do a
paired test.
Table 7-8 shows an example showing the benefits of paired testing
The only thing changed is the method to collect speed data. The
same vehicle’s speed was measure by the two methods.
Paired or not-paired example (table 7.8)
The authors intentionally use the uniform distribution to make the computation simpler. We will
test a normal distribution I class using Excel.
Steps of Chi-square (2-) test
i 1 fi
Steps of Chi-square (2-) test (cont)
2 is chi-square distributed (see Table 5-8). If this
value is low if our hypothesis is correct. Usually
we use = 0.05 (5% significance level or 95%
confidence level). When you look up the table,
the degree of freedom is f = N – 1 – g where g is
the number of parameters we use in the
assumed distribution. For normal distribution g =
2 because we use µ and to describe the shape
of normal distribution.
If the computed 2 value is smaller than the
critical c2 value, we accept H0.
What’s the Chi-square (2-) test testing?
Chi-square (2-)
Expected test Actual
distribution (or histogram
histogram)