Chow - Tests of Equality Between Sets of Coefficients in Two Linear Regressions
Chow - Tests of Equality Between Sets of Coefficients in Two Linear Regressions
Chow - Tests of Equality Between Sets of Coefficients in Two Linear Regressions
JSTOR is a not-for-profit service that helps scholars, researchers, and students discover, use, and build upon a wide
range of content in a trusted digital archive. We use information technology and tools to increase productivity and
facilitate new forms of scholarship. For more information about JSTOR, please contact support@jstor.org.
Your use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at
https://about.jstor.org/terms
The Econometric Society is collaborating with JSTOR to digitize, preserve and extend access
to Econometrica
This content downloaded from 213.219.153.168 on Thu, 07 Mar 2019 07:20:22 UTC
All use subject to https://about.jstor.org/terms
Econometrica, Vol. 28, 3 (July 1960)
BY GREGORY C. CHOW
1. INTRODUCTION
THE MODEL of normal linear regression has often been widely applied to the
measurement of economic relationships. In studies of the consumption
function, the mean of consumption is assumed to be a linear function of
income and other variables. In studies of consumer demand, the quantity
of a commodity is regressed linearly on its price, income, and perhaps the
price of an important complement or substitute. In studies of business
investment, linear regressions on profits, sales, liquid asset holdings, and
the interest rate, have been estimated. Other notable examples include
empirical studies of dividend policy, of prices of corporate stocks, and
of cost and supply functions.
When a linear regression is used to represent an economic relationship,
the question often arises as to whether the relationship remains stable in
two periods of time, or whether the same relationship holds for two different
groups of economic units. Is the consumption pattern of the American
people today the same as it was before World War II? Do the firms in the
steel industry and the firms in the chemical industry have similar dividend
policies? Statistically these questions can be answered by testing whether
two sets of observations can be regarded as belonging to the same regression
model.
Often there is no economic rationale in assuming that two relationships
are completely the same. It may be more reasonable to suppose that only
parts of the relationships are identical in two periods, or for two groups.
Maybe the price elasticity of demand for a certain food product has not
changed since World War II, while the income elasticity has changed.
Maybe the investments of two groups of firms are affected in the same
manner by profits, but not by liquid assets. Statistically, we are asking
whether subsets of coefficients in two regressions are equal.
1 An early draft of this paper has been revised after helpful comments from William
Kruskal, Edwin Kuh, and David L. Wallace, to all of whom I am grateful.
591
This content downloaded from 213.219.153.168 on Thu, 07 Mar 2019 07:20:22 UTC
All use subject to https://about.jstor.org/terms
592 GREGORY C. CHOW
(1) y Xli + l
This content downloaded from 213.219.153.168 on Thu, 07 Mar 2019 07:20:22 UTC
All use subject to https://about.jstor.org/terms
TESTS OF EQUALITY 593
= [I +X2(XiX )1X2]a2.
(8) ~~~~~~~d2
(8) [1 ? X2(X'Xl)-1 X2] s1
will be distributed as F(1, n - p). This test, which is based on the prediction
interval for one new observation, can be found in [8].
This content downloaded from 213.219.153.168 on Thu, 07 Mar 2019 07:20:22 UTC
All use subject to https://about.jstor.org/terms
594 GREGORY C. CHOW
When we have m new observations and thus m differences dl, d2, ..., dm,
we may consider the average.
- I m
(9) d - d
i=1
K]dI22
{ 1 1] [I + X2(Xl"Xl)-' X2"] []}
2 I am indebted to Robert Solow for pointing out this reference and the errors
therein.
This content downloaded from 213.219.153.168 on Thu, 07 Mar 2019 07:20:22 UTC
All use subject to https://about.jstor.org/terms
TESTS OF EQUALITY 595
It is well known that the square of the standard error from the first
2
regression, SI, times (n - p)/a2, follows X2(- p). This X2(n p) is inde-
2
will follow F(m, n -). Since the numerator of (13) will have a non-central
x2 distribution when P2 # fl, the upper-tail F test can appropriately be
used. Clearly the test (13) reduces to the prediction interval (8) when m = 1.
This content downloaded from 213.219.153.168 on Thu, 07 Mar 2019 07:20:22 UTC
All use subject to https://about.jstor.org/terms
596 GREGORY C. CHOW
This content downloaded from 213.219.153.168 on Thu, 07 Mar 2019 07:20:22 UTC
All use subject to https://about.jstor.org/terms
TESTS OF EQUALITY 597
Now the sum of squares (18) under Ho will be decomposed into the sum
of squares (21) under Ha plus the sum of squares of the differences
We will proceed to show that the rank of the quadratic form Q3 can at
most be p. From (16) and (19), it follows that
(27)QX(bb)2
(27) Q3 X2(X'2X2) -1 XX
This content downloaded from 213.219.153.168 on Thu, 07 Mar 2019 07:20:22 UTC
All use subject to https://about.jstor.org/terms
598 GREGORY C. CHOW
y1-Xbo 2
(30) Yl-X = IIy-XibiII2 +? IXibi -XiboII2 + IIY2
Y2 -X2b50
+ IIX2b2 X2boII2
Our models for Ho and Ha are (15) and (14), as before. The sum of squares
under HO is clearly Qi whether m > p or m <? p. The sum of squares under
Ha will become yi - X1b1 l2 when m < p-this can be seen either by
evaluating the sum of squares of the residuals from regression (14) or by
noting that the residuals from the second sample will simply be zero.
IIXibi-XiboII2 + IIY2-X2boII2
5 Additional references on the analysis of covariance include [1], [5], [9], and [10].
[1] is a special issue devoted to the analysis of covariance mainly for the design of
experiments.
This content downloaded from 213.219.153.168 on Thu, 07 Mar 2019 07:20:22 UTC
All use subject to https://about.jstor.org/terms
TESTS OF EQUALITY 599
(35) d'[I + X2(X'X1) -' X']-' d [y2- boX2] [I + X2(X'Xl) -' X2] [y2- X2
The results given so far, as summarized- by (29) and (31), will now be
extended to testing the equality between subsets of coefficients in two
regressions. As before, we will first examine the case m > /.
Under the alternative hypothesis, our model is
[][2]
= ?[?Wi 0 Y2 +r:8i
Z2 0 W2 [i 82
(32
where the coefficients flu are divided into y, and b(i, the matrix X1 is cor-
respondingly divided into Z, and W1, and similarly for f2 and X2. Let yi
and y2 be column vectors of q elements each; and &1 and 62 be column
vectors of q q elements each. The subsets of coefficients to be tested
are y, and y2-
The null hypothesis is y, = y2 = y, implying the model
This content downloaded from 213.219.153.168 on Thu, 07 Mar 2019 07:20:22 UTC
All use subject to https://about.jstor.org/terms
600 GREGORY C. CHOW
-l Zi'zizl 0 ZWi 0 -i Z, -
(41) C2 _ Z2Z2 ? Z2'W2 ? Z2
( ) di Wi'Zi ? W1iF ? Wi 0 L21
_d2_ L O W2Z2 0 W2'W2_ _LO W2'_
The sum of squares of the residuals under Ha, which is identical with (21),
will have m + n - 2p degrees of freedom.
As before, the sum of squares under H, can be broken up into the sum
of squares under Ha plus the sum of squares of the differences between the
two sets of estimates of y, namely,
or
Q1 Q2 + Q3*
(43) [Z Wi w Lo 0 Wi 1 I
( ) 22 ? W2] 0 Z2 ? W2] 0 I O
LO O Ii
This content downloaded from 213.219.153.168 on Thu, 07 Mar 2019 07:20:22 UTC
All use subject to https://about.jstor.org/terms
TESTS OF EQUALITY 601
Rather than digressing to complete the proofs of (44) and (45), we simply
indicate that essentially they involve partitioning the matrix of the cross-
products of the explanatory variables in (41) into four blocks and then
inverting the partitioned matrix. The same method will also prove
and
0 W, 0 pC2 1 Wi 0 CO
(48) [dZ 0 Wi ] [il] - 0 wW2 CO
0 220 W2 di 0 W2 d20
I I-O W2('2W2)-1W2[
wff'wi)-l W 0 -Z
Z2(C2 -CO))
The developments from now on will correspond to the developments of (25),
(26), (27), and (28) in Section 4. From (44) and (46), it follows that
(49)
(49 [Z1. 1. [Z'. Z= Z1
2 22.1 C0 +z yl. zl
4 Z2. 1
Y2= 'Z'ZC1
+ '?l Z1
Z 2 Z'
22. C2
which corresponds to (25). (49) can be used to express C2 - cO as a linear
tranformation of cl - cO, a transformation similar to the one in (26). Replacing
C2 - CO in (48) by this transformation of cl - cO, we will observe that Q3 is a
quadratic form in cl -cO, with maximum rank q. A further step analogous
to (28) will show that Q* tends to be larger when y, # y2. We thereforehav
This content downloaded from 213.219.153.168 on Thu, 07 Mar 2019 07:20:22 UTC
All use subject to https://about.jstor.org/terms
602 GREGORY C. CHOW
6. EXAMPLES
To illustrate how some of the tests given above are applied, one
example for each of (29) and (31) will now be provided. These
originated in a study of the demand for automobiles in the Un
[2]. We will not here go into the economic justifications of them
contained in the reference cited. The study utilizes annual ob
on the following variables:
This content downloaded from 213.219.153.168 on Thu, 07 Mar 2019 07:20:22 UTC
All use subject to https://about.jstor.org/terms
TESTS OF EQUALITY 603
The residuals of the observed values from the estimated values are very
small, as compared with the standard error of .618. They do not indicate
any shifts in the pattern of demand for automobile ownership during the
four years 1954 to 1957.
We will now proceed with the analysis of covariance (29). The method
involved can be described very simply. Suppose that n observations are
used to estimate a regression with P parameters (p - 1 coefficients plus
one intercept). Suppose also that there are m additional observations, and
we are interested in deciding whether they are generated by the same
regression model as the first n observations. To perform the analysis of
covariance, we need the following sums of squares:
This content downloaded from 213.219.153.168 on Thu, 07 Mar 2019 07:20:22 UTC
All use subject to https://about.jstor.org/terms
604 GREGORY C. CHOW
Again, inspection of the residuals reveals that they are not large relative to
0.308, the standard error of estimate for (4s). The year 1955 is an exception,
where we find the residual to be twice as large as the standard error. To
apply test (31), we compute the sum of squares A of the 32 deviations from
the regression including the four new observations, with 32-4 or 28 degrees
of freedom. A turns out to be 2.6444 numerically. The sum of squares B
of the 28 deviations from the regression of the original set of observations
turns out to be 2.2818, with 24 degrees of freedom. The sum of squares C
vanishes as long as the number m of new observations does not exceed the
number of parameters p. According to (31), the F ratio is the ratio of
(A - B)/4 to B/24, or 0.95 numerically. Therefore, we accept the null
hypothesis that automobile purchases in the years 1954 to 1957 were govern-
ed by the same relationship as before.
Cornell University
This content downloaded from 213.219.153.168 on Thu, 07 Mar 2019 07:20:22 UTC
All use subject to https://about.jstor.org/terms
TESTS OF EQUALITY 605
REFERENCES
This content downloaded from 213.219.153.168 on Thu, 07 Mar 2019 07:20:22 UTC
All use subject to https://about.jstor.org/terms