NBER WORKING PAPER SERIES
CALLING RECESSIONS IN REAL TIME
James D. Hamilton
Working Paper 16162
http://www.nber.org/papers/w16162
NATIONAL BUREAU OF ECONOMIC RESEARCH
1050 Massachusetts Avenue
Cambridge, MA 02138
July 2010
I am grateful for assistance and suggestions provided by Maximo Camacho, Marcelle Chauvet, Jeremy
Nalewaik, Gabriel Perez-Quiros, Jeremy Piger, James Stock, Mark Watson, and Jonathan Wright.
This research is the work of the author alone and should not be interpreted as the opinions, findings,
or procedures of the National Bureau of Economic Research's Business Cycle Dating Committee,
or the NBER more generally.
NBER working papers are circulated for discussion and comment purposes. They have not been peer-reviewed or been subject to the review by the NBER Board of Directors that accompanies official
NBER publications.
© 2010 by James D. Hamilton. All rights reserved. Short sections of text, not to exceed two paragraphs,
may be quoted without explicit permission provided that full credit, including © notice, is given to
the source.
Calling Recessions in Real Time
James D. Hamilton
NBER Working Paper No. 16162
July 2010
JEL No. E32
ABSTRACT
This paper surveys efforts to automate the dating of business cycle turning points. Doing this on a
real-time, out-of-sample basis is a bigger challenge than many academics might presume due to factors
such as data revisions and changes in economic relationships over time. The paper stresses the value
of both simulated real-time analysis (looking at what the inference of a proposed model would have
been using data as they were actually released at the time) and actual real-time analysis, in which
a researcher stakes his or her reputation on publicly using the model to generate out-of-sample, real-time
predictions. The immediate publication capabilities of the internet make the latter a realistic option
for researchers today, and many are taking advantage of it. The paper reviews a number of approaches
to dating business cycle turning points and emphasizes the fundamental trade-off between parsimony
(trying to keep the model as simple and robust as possible) and making full use of available information.
Different approaches have different advantages, and the paper concludes that there may be gains from
combining the best features of several different approaches.
James D. Hamilton
Department of Economics, 0508
University of California, San Diego
9500 Gilman Drive
La Jolla, CA 92093-0508
and NBER
jhamilton@ucsd.edu
1 Introduction
U.S. real GDP has grown significantly over time, today standing at a level more than seven times
that seen in 1947 (see Figure 1). Economists broadly agree on the three factors responsible
for that long-run growth. First, the population has increased over time, and more people can
produce a greater quantity of goods and services. Second, the stock of equipment and facilities
that people have to work with has also increased more than six-fold over this period. And third,
the techniques of production, such as higher-yielding crops, faster computers, and more efficient
management, have all produced tremendous improvements in productivity. These three factors—
population, capital stock, and technology— are widely regarded to be the main drivers of long-run
growth.
But GDP does not increase every single year, and it is of substantial interest to understand
why. None of these three factors offers a very appealing explanation for downturns. In a recession,
we do not lose population; rather, an ever-growing number of people report they are unable to
find jobs. The capital stock is not destroyed, but instead sits idle. And although the hypothesis
that recessions may be caused by an exogenous decline in productivity has been popular with
real business cycle theorists, many of us do not find that account compelling. Something other
than these three factors seems to be governing aggregate economic behavior during particular
identifiable episodes. These episodes are referred to as economic recessions, indicated by the
shaded regions in Figure 1.
The importance of recessions is even more dramatic when we consider a series such as the
U.S. unemployment rate in Figure 2. There is no clear trend in this series over the last half
century. But there is a quite dramatic tendency for unemployment to rise sharply during periods
characterized as economic recession. The habit of accompanying graphs like Figures 1 and 2 with
recessions indicated as shaded regions is quite ingrained in the economics profession, and with
good reason. One of the first things we want to know about any series is how its fluctuations are
related to movements in and out of economic recession.
The question discussed in this paper is, where should those shaded regions be drawn, that
is, when did each particular recession begin and end? The conventional answer for U.S. data is
to use the dates determined by the Business Cycle Dating Committee of the National Bureau of
Economic Research. This is a group of distinguished scholars who meet periodically to discuss
recent data, and from time to time issue a proclamation that a recession began or ended at a
particular date in the past. Typically, once the Committee makes a pronouncement, that date is
not subsequently revised.
That is a fine approach to use, and this paper does not advocate abandoning it. I nevertheless
investigate whether it is possible to supplement the deliberations of the Committee with mechanical algorithms that could process incoming data in real time and issue a judgment on a purely
objective basis.
There are three potential benefits to doing so.
The first is timeliness.
The
bottom panel of Figure 3 notes the particular dates at which the Business Cycle Dating Committee issued its most recent announcements. These announcements often came long after the fact.
For example, the NBER dated the 1990-91 recession as beginning in August 1990 and ending in
March 1991. It made the announcement that the recession had begun in April 1991— one month
after the NBER later decided that the recession was already over. The end of the 2001 recession
was announced in July 2003, which is 28 months after the recession is now deemed to have ended.
A second benefit of a purely objective algorithm for making these determinations in real time
is that it would ensure that the process is completely apolitical. Although no one has accused
the NBER of altering its announcements on the basis of political considerations, the pressure
is undeniably present to delay the announcement that a recession has begun or accelerate the
announcement that a recovery has begun if one’s goal were to help the incumbent.
In the
October 13, 1992 debates, then vice-presidential-candidate Al Gore referred to the “worst economic
performance since the Great Depression.” Although the recession of 1990-91 is now seen as one
of the shortest and mildest of the postwar recessions, it was not until December 22, 1992— after
the presidential elections— that the NBER announced that the recession had actually ended in
March of 1991.
If it had been the case that the recession that began in August 1990 was still
ongoing as of October 1992, Gore’s statement would have factual support in the sense that the
ongoing recession would have been the longest on record since 1933.
Being able to issue these
announcements on the basis of a pre-determined, purely mechanical algorithm would insulate the
procedure from any possible charges of partisanship.
Third, coming up with a mechanical means to recognize business cycle turning points offers
further elucidation of what we actually mean when we say that the economy is in a recession. If
the dates assigned by the NBER represent the answer, what is the question? The whole process
seems to presuppose that there are some very different factors operating on the economy at some
times relative to others, and that these changes have observable implications. Mechanization of
the dating procedure can help clarify exactly how and why we assign the dates that we do.
In this paper I survey some of the approaches to dating business cycle turning points, with
a focus on algorithms that various analysts have publicly relied on with real-time data. I begin
in Section 2 with a review of some early efforts and why the task might be harder than it looks.
Section 3 describes in detail a procedure that I have been relying on for the last five years. A
number of alternative approaches are discussed in Section 4.
2 What's so hard?
One’s first thought might be that it shouldn’t be too hard to do better than the multi-year lags
sometimes associated with NBER announcements, and indeed the really exciting thing would be
to predict business cycle turning points before they occur. What’s so hard about that?
The first answer is that if people could have predicted the recession, it probably wouldn’t have
happened.
Firms would not have been stuck with inventories, labor, and capital they turned
out not to need, and the Federal Reserve probably would have chosen to ease its policy stance
sooner. Economists are used to viewing magnitudes such as stock prices as difficult or impossible
to predict if the market is functioning properly.
It may be that economic recessions by their
nature imply similar fundamental limitations for forecasting.
Second, the data available at the time are often sending a different signal from what one sees
once the data are subsequently revised. For example, the top panel of Figure 4 displays real GDP
growth rates for each quarter of 2001 as they were reported at the end of 2002.
This vintage
of data shows 3 successive quarters of declining real GDP, which certainly sounds unambiguously
like a recession. On the other hand, the bottom panel shows data for the same quarters as they
were actually reported on January 30, 2002.
At that date the recession was already over, but
recognizing those GDP numbers available at the time as signalling a recession is obviously a much
bigger challenge.
A third factor making it difficult to recognize business cycle turning points in real time is the
fact that key economic relations are continually changing over time. I illustrate these difficulties
by reviewing the real-time track record of two prominent efforts to predict business cycle turning
points.
2.1 Stock and Watson's original business cycle index model.
One of the impressive early efforts along these lines came from the business cycle indicators developed by James Stock and Mark Watson (1989, 1991). Their coincident index model postulated
that an observed vector of monthly indicators yt was related to an unobserved scalar ct thought
to represent the business cycle according to
$$y_t = k + \gamma(L)\, c_t + u_t$$

$$D(L)\, u_t = \varepsilon_t$$

$$\phi(L)\, c_t = \delta + \eta_t.$$
The observed variables in y_t consisted of growth rates of industrial production, personal income,
sales, and employment. The white-noise disturbances ε_{rt} and η_t were taken to be mutually
uncorrelated, and D(L) is a diagonal matrix in the lag operator L. Thus the model postulated that the
observed dynamics of each y_{rt} could be explained in terms of idiosyncratic autoregressive terms
u_{rt} and common dependence on the business cycle c_t. This will be recognized as a state-space
model with state vector (u_t′, c_t)′ and observation vector y_t, for which algorithms for estimating
the unknown parameters by maximum likelihood and forming an optimal inference about the
unobserved state of the business cycle c_t are well known. Stock and Watson further defined an
unobserved variable St to be unity if the economy was in recession at date t. They interpreted a
recession to be a particular pattern followed by ct ,
$$S_t = 1 \text{ if } \{c_{t-j}\}_{j=0}^{8} \in B_t,$$
with the set Bt designed so as to best mimic historically-assigned NBER recession dates. The
basic Kalman filter algorithms allow one to form an inference about current or past recessions
$$P(S_{t-h} = 1 \mid y_t, y_{t-1}, \ldots, y_1; \hat{\theta}) \qquad (1)$$
for h = 0, 1, 2, ..., or forecast of future recessions
$$P(S_{t+h} = 1 \mid y_t, y_{t-1}, \ldots, y_1; \hat{\theta}). \qquad (2)$$
For the latter purpose Stock and Watson also developed a leading-indicator generalization of
the model of the form
$$y_t = k + \gamma(L)\, c_{t-1} + \Gamma(L)\, y_{t-1} + \varepsilon_t$$

$$c_t = \delta + \alpha(L)\, c_{t-1} + \beta(L)' y_{t-1} + \eta_t$$
for yt now a vector based on levels or growth rates of some conventional leading indicators along
with a few new indicators proposed by Stock and Watson.
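The "well known" algorithms referenced above are the Kalman filter and the associated maximum-likelihood machinery. As a minimal sketch, the code below casts a stripped-down version of the coincident model in state-space form, with γ(L), D(L), and φ(L) each truncated to a single lag and the drift δ omitted. All numerical values are hypothetical placeholders rather than Stock and Watson's estimates; because the observation equation contains no separate measurement error, the measurement variance is zero in the update step.

```python
import numpy as np

N = 4                                     # four monthly indicators
gamma = np.array([0.5, 0.3, 0.4, 0.6])    # hypothetical factor loadings
d = np.array([0.2, 0.3, 0.1, 0.4])        # hypothetical idiosyncratic AR(1) terms
phi = 0.8                                 # hypothetical factor AR(1) coefficient

# State vector s_t = (u_1t, ..., u_4t, c_t)'.
F = np.diag(np.append(d, phi))                     # state transition matrix
H = np.hstack([np.eye(N), gamma.reshape(-1, 1)])   # observation: y_t = k + H s_t
Q = np.diag([0.1, 0.1, 0.1, 0.1, 0.2])             # state innovation variances

def kalman_filter(y, k=np.zeros(N)):
    """Return filtered state estimates; the last column tracks c_t."""
    s, P = np.zeros(N + 1), np.eye(N + 1)
    out = []
    for yt in y:
        s, P = F @ s, F @ P @ F.T + Q              # prediction step
        S = H @ P @ H.T                            # innovation variance (R = 0)
        K = P @ H.T @ np.linalg.inv(S)             # Kalman gain
        s = s + K @ (yt - k - H @ s)               # update with month t data
        P = (np.eye(N + 1) - K @ H) @ P
        out.append(s.copy())
    return np.array(out)

# Example on data simulated from the same stripped-down model:
rng = np.random.default_rng(1)
c, u, ys = 0.0, np.zeros(N), []
for _ in range(120):
    c = phi * c + rng.normal(0, np.sqrt(0.2))
    u = d * u + rng.normal(0, np.sqrt(0.1), N)
    ys.append(gamma * c + u)
states = kalman_filter(np.array(ys))
```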
The in-sample performance of their model is summarized in Figure 5. The top panel gives the
contemporaneous inferences (h = 0 in equation (2)), which looked quite good. The model might
seem to have produced a “false positive” in 1967, though this was a widely recognized economic
slowdown which perhaps could reasonably have been categorized as an economic recession. The
3- and 6-month ahead forecasts (lower panels in Figure 5) also looked extremely promising.
Encouraged by these in-sample results, Stock and Watson began in 1988 to report updated
probabilities of recession each month using the latest data.
Initially these were distributed by
FAX and mail to assorted researchers and members of the press, and later were posted on the
web.[1]
Figure 6 displays the model’s real-time track record over the 1990-91 recession as described
by Stock and Watson (1993). The leading index proved to be a disappointment— the recession of
1990 came and went, with the model always predicting that no recession was coming (see bottom
panels of Figure 6). The contemporaneous index in fact did by November of 1990 signal that a
recession had started (top panel), but the model thought it was going to be sufficiently short-lived
that the 3- and 6-month-ahead probabilities always remained below 50% even though ex-post
probabilities eventually recognized that a recession had arrived at some earlier date.
What went wrong?
One of the intriguing new leading indicators that Stock and Watson
discovered was the spread between the yield on commercial paper and Treasury bills. Figure 7
shows that this spread had spiked up dramatically prior to each of the recessions in their original
sample, but did very little out of the ordinary in the 1990-91 recession for which their model was
on real-time display.[2]

[1] An archive of releases for 1999 to 2003 is available at http://www.economics.harvard.edu/faculty/stock/files/xri.zip.

[2] Notwithstanding, credit spread indicators remain a very promising variable for forecasting real economic activity, as demonstrated by the recent analysis by King, Levin, and Perli (2007) and Gilchrist, Yankov, and Zakrajšek (2009).
Stock and Watson subsequently released probabilities from both their original leading index
and an alternative leading index that did not make use of interest rates or interest rate spreads.
The models’ performance over the 2001 recession was similar to that for the 1990-91 downturn.
The alternative leading index calculated a 6-month-ahead probability of recession P (t + 6|t) that
remained below 20% throughout 2001, while P (t + 6|t) as calculated from the original leading
index peaked at 35% for t = October 2001. Nonetheless, inferences from the coincident index
again did reasonably well. On July 3, 2001, Stock and Watson reported P (t|t) = 76% for t =
May 2001, and the value reached 97% for t = October 2001. In retrospect, if Stock and Watson
had emphasized using their approach to recognize a turning point some time after it had been
crossed— that is, having most faith in the historical inferences (1)— its record would have looked
pretty good. But the goal of forecasting the moves in advance proved too ambitious.
2.2 Using the yield curve to predict turning points.
Many of the subsequent academic papers in this area have focused on procedures that might be
simpler and more robust.
Quite a few academic studies have suggested that the slope of the
yield curve— for example, the spread between yields on a 10-year Treasury bond and a 3-month
Treasury bill shown in the top panel of Figure 8— seems to be extremely promising as a predictor
of recessions; see among others Estrella and Mishkin (1998), Chauvet and Potter (2005), Kauppi
and Saikkonen (2008), and Katayama (2009). As the yield curve again became inverted in August
of 2006, many observers began to wonder whether this meant another recession would soon arrive.
One aspect of the situation that made this a difficult call at the time was the fact that although
the 3-month rate was above the 10-year rate, the overall level of the 3-month rate was lower than
it had been prior to any recession since 1960 (see the bottom panel of Figure 8). Many observers
argued that the low overall level of interest rates mitigated somewhat the recessionary signal given
by the inverted yield curve.
One specification that came to be used in ongoing real-time announcements was one developed
by Jonathan Wright (2006). Wright defined a historical indicator Ht to be unity if it subsequently
proved to be the case that NBER would declare any of the following four quarters to have been
characterized by an economic recession:
$$H_t = \begin{cases} 1 & \text{if } S_{t+1} = 1,\ S_{t+2} = 1,\ S_{t+3} = 1, \text{ or } S_{t+4} = 1 \\ 0 & \text{otherwise.} \end{cases}$$
His model then calculated the probability of a recession occurring some time over the next year
from
$$P(H_t = 1 \mid i_{10y,t}, i_{3m,t}, i_{f,t}) = \Phi\bigl(-2.17 - 0.76\,(i_{10y,t} - i_{3m,t}) + 0.35\, i_{f,t}\bigr)$$

for Φ(·) the N(0, 1) cumulative distribution function, i_{10y,t} the 10-year Treasury yield, i_{3m,t} the
3-month Treasury yield, and i_{f,t} the fed funds rate.
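As a concrete illustration, the snippet below evaluates this fitted probit using the coefficients quoted above. The input values are made up for the example, and scipy's normal CDF stands in for Φ(·).

```python
from scipy.stats import norm

def wright_recession_probability(i_10y, i_3m, i_ff):
    """P(H_t = 1): chance the NBER declares a recession within four quarters,
    given the 10-year yield, 3-month yield, and fed funds rate (percent)."""
    return norm.cdf(-2.17 - 0.76 * (i_10y - i_3m) + 0.35 * i_ff)

# An inverted curve signals more strongly when the funds rate is high, which
# is precisely the nuance observers debated in 2006 (inputs illustrative):
print(wright_recession_probability(4.7, 5.0, 5.25))  # roughly 0.46
print(wright_recession_probability(4.7, 5.0, 2.00))  # roughly 0.11
```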
The website Political Calculations reported weekly updates of the predictions of this model
between April 2006 and August 2008 (see Figure 9). The highest this probability ever reached
was 50% on April 4, 2007, after which the probability monotonically declined. It stood at only
10% on August 20, 2008, right before one of the worst 6 months that the economy has experienced
over the last half century.
These two examples illustrate that a good in-sample fit is no guarantee of out-of-sample real-time performance. As Yogi Berra observed, it's tough to make predictions, especially about the
future. In this paper I instead pursue the more modest goal of trying to recognize a turning point
soon after it occurred using algorithms that would be robust with respect to data revisions and
out-of-sample structural change. The next section describes one very simple approach that so far
seems to have a reasonable track record.
3 Real-time inference based on GDP alone
Consider the following formulation of the fundamental question: what is different about the behavior of GDP during quarters that the NBER characterized as recession compared with those
characterized as expansion? Chauvet and Hamilton (2006) answered this by collecting U.S. GDP
growth rates from the 45 quarters between 1947:Q2 and 2004:Q2 that the NBER ended up describing as part of an economic recession. This subsample of 45 observations has a mean growth
rate of -1.2% (expressed at an annual rate) and standard deviation of 3.5. The top panel of Figure 10 plots a nonparametric estimate of the density of this subsample.[3]
The remaining 184
expansion quarters had a mean of 4.5 and standard deviation of 3.2, with density given by the
middle panel. The observed sample of 225 observations can be viewed as a mixture of these two
distributions, with 20% coming from the recession distribution and the remaining 80% from the
expansion distribution, as shown in the bottom panel of Figure 10.
Suppose we are given only the value for a given quarter’s GDP growth, and aren’t told how
the NBER is eventually going to characterize that quarter. Can we infer from which of the two
distributions it is most likely to have been drawn? Let St = 1 if quarter t is eventually declared by
the NBER to be part of a recession, St = 2 if it is eventually declared to be part of an expansion,
and let yt denote the observed GDP growth. Let P (St = 1, yt ) denote the joint probability that
quarter t is a recession and has GDP growth given by yt ,
$$P(S_t = 1, y_t) = f(y_t \mid S_t = 1)\, P(S_t = 1),$$
which is found by multiplying the height of the curve in the top panel of Figure 10 by 0.2.
Likewise P (St = 2, yt ) is obtained by multiplying the height of the middle curve by 0.8.
The optimal inference,

$$P(S_t = 1 \mid y_t) = \frac{P(S_t = 1, y_t)}{P(S_t = 1, y_t) + P(S_t = 2, y_t)},$$
will then be recognized as simply the ratio of the height of the green curve to the height of the
black curve in the bottom panel of Figure 10. For example, if we observe GDP growth of yt = −6
we could be quite confident this would be characterized as a recession, whereas if we observe
yt = +6, it will almost surely be classified as an expansion.

[3] This was calculated using the DENSITY command in RATS with a Gaussian kernel and bandwidth of 3.
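This height-ratio calculation is easy to mechanize. The sketch below is a minimal illustration: the two subsamples are synthetic stand-ins drawn to match the reported means and standard deviations (the actual calculation would use the NBER-classified GDP observations), and the estimator mirrors the Gaussian kernel with bandwidth 3 mentioned in the footnote.

```python
import numpy as np

def kernel_density(sample, bandwidth=3.0):
    """Gaussian kernel density estimator f(y) for a one-dimensional sample."""
    sample = np.asarray(sample, dtype=float)
    def f(y):
        z = (y - sample) / bandwidth
        return np.mean(np.exp(-0.5 * z ** 2)) / (bandwidth * np.sqrt(2 * np.pi))
    return f

# Synthetic stand-ins for the 45 recession and 184 expansion quarters.
rng = np.random.default_rng(0)
f_rec = kernel_density(rng.normal(-1.2, 3.5, 45))
f_exp = kernel_density(rng.normal(4.5, 3.2, 184))

def prob_recession(y, pi=0.2):
    """P(S_t = 1 | y_t): weighted recession density over the mixture density."""
    num = pi * f_rec(y)
    return num / (num + (1 - pi) * f_exp(y))

print(prob_recession(-6.0))  # near 1: almost surely a recession quarter
print(prob_recession(6.0))   # near 0: almost surely expansion
print(prob_recession(1.0))   # the uninformative middle ground
```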
Such a rule seems to satisfy the requirements of being extremely simple and robust.
But
unfortunately, it is not terribly helpful, because the vast majority of observations will fall in a
range where they don’t send a very clear signal. But there is a second feature of the NBER dates
that can be quite helpful— the value of St is pretty likely to be the same as St−1. For ninety-five
percent of the observations for which St−1 = 2, it was the case that St also equaled 2, whereas
78% of the observations for which St−1 = 1 were followed again by St = 1. Thus, even if yt alone
does not give us much of a useful signal, the value of yt−1 can help us refine it.
For example,
an observed value of yt = 1.0 tells us very little, implying (by calculating the ratio of heights just
described) a probability of recession P (St = 1|yt = 1.0) = 0.21 that is little different from the
unconditional odds. But suppose this follows after a quarter of strong growth, yt−1 = 6.0, from
which the height ratio calculation yields a very low probability of having been in recession the
previous quarter: P (St−1 = 1|yt−1 = 6.0) = 0.074. Before seeing the quarter t GDP numbers, we
would accordingly have known that the probability of a recession observation in period t is only
10% rather than the unconditional probability of 20%:
P (St = 1|yt−1 = 6.0) = 0.074(0.78) + (1 − 0.074)(1 − 0.95) = 0.10.
Having observed yt−1 = 6.0, to use the period t value for yt we’d then want to weight the blue curve
in the top panel by 0.1 rather than by 0.2 to arrive at the appropriate conditional probability:
$$P(S_t = 1 \mid y_t, y_{t-1}) = \frac{f(y_t \mid S_t = 1, y_{t-1})\, P(S_t = 1 \mid y_{t-1})}{f(y_t \mid S_t = 1, y_{t-1})\, P(S_t = 1 \mid y_{t-1}) + f(y_t \mid S_t = 2, y_{t-1})\, P(S_t = 2 \mid y_{t-1})}.$$
From this calculation it turns out P (St = 1|yt = 1.0, yt−1 = 6.0) = 0.11; having seen strong
growth the preceding quarter, we’d be reasonably confident that the expansion is continuing this
quarter despite the weak GDP report. We can of course extend such calculations, doing better
yet using the whole history of observations up to the present to calculate P(St = 1|yt, yt−1, ..., y1).
For that matter, the same principles allow us to use observations on yt+1, yt+2, ..., yT to refine our
assessment of where the economy was at some date t, which we refer to as a full-sample smoothed
probability:

$$P(S_t = 1 \mid y_T, y_{T-1}, \ldots, y_1).$$
I described above how such an inference could be implemented nonparametrically with little
assumed structure other than some very broad features of the NBER dating record. But there’s
little reason we need to be nonparametric, since the distribution in the top panel of Figure 10
could clearly be well approximated with a Gaussian density. Also, little is lost by assuming that
the variance of the distributions in the top two panels is the same.
In other words, we could
parameterize the top two panels of Figure 10 as
$$y_t \mid S_t = 1 \sim N(\mu_1, \sigma^2)$$

$$y_t \mid S_t = 2 \sim N(\mu_2, \sigma^2),$$
expecting to use µ1 = −1.2, µ2 = 4.5, and σ = 3.4. If we let π denote the unconditional probability
that a given quarter will be characterized by recession (expecting π = 0.2), we can characterize
the height of the black curve in the bottom panel of Figure 10 for observation t = 1 as
$$f(y_1; \mu_1, \mu_2, \sigma, \pi) = \pi\, \phi(y_1; \mu_1, \sigma) + (1 - \pi)\, \phi(y_1; \mu_2, \sigma)$$
for φ(.) the Normal density, with the proposed inference given by
$$P(S_1 = 1 \mid y_1) = \frac{\pi\, \phi(y_1; \mu_1, \sigma)}{\pi\, \phi(y_1; \mu_1, \sigma) + (1 - \pi)\, \phi(y_1; \mu_2, \sigma)}.$$
For observation t = 2 we were proposing to calculate
$$P(S_2 = 1 \mid y_2, y_1) = \frac{\xi_2\, \phi(y_2; \mu_1, \sigma)}{\xi_2\, \phi(y_2; \mu_1, \sigma) + (1 - \xi_2)\, \phi(y_2; \mu_2, \sigma)}$$

for

$$\xi_2 = p_{11} P(S_1 = 1 \mid y_1) + (1 - p_{22}) P(S_1 = 2 \mid y_1)$$
with p11 denoting the probability of a recession continuing, p11 = P (St = 1|St−1 = 1), and p22
the probability of an expansion continuing. Iterating for t = 3, 4, ..., we have
$$P(S_t = 1 \mid y_t, y_{t-1}, \ldots, y_1) = \frac{\xi_t\, \phi(y_t; \mu_1, \sigma)}{\xi_t\, \phi(y_t; \mu_1, \sigma) + (1 - \xi_t)\, \phi(y_t; \mu_2, \sigma)} \qquad (3)$$

for

$$\xi_t = p_{11} P(S_{t-1} = 1 \mid y_{t-1}, y_{t-2}, \ldots, y_1) + (1 - p_{22}) P(S_{t-1} = 2 \mid y_{t-1}, y_{t-2}, \ldots, y_1).$$
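Equation (3) and the ξt update amount to a few lines of code. The following sketch fixes the parameters at the values discussed above (µ1 = −1.2, µ2 = 4.5, σ = 3.4, p11 = 0.78, p22 = 0.95, with state 1 the recession state); because the Gaussian densities here stand in for the kernel estimates of Figure 10, the numbers it produces differ slightly from the worked example in the text.

```python
import numpy as np
from scipy.stats import norm

def filter_probabilities(y, mu1=-1.2, mu2=4.5, sigma=3.4, p11=0.78, p22=0.95):
    """Return P(S_t = 1 | y_t, ..., y_1) for each t, with S_t = 1 = recession."""
    xi = (1 - p22) / ((1 - p22) + (1 - p11))  # unconditional probability pi
    probs = []
    for yt in y:
        num = xi * norm.pdf(yt, mu1, sigma)
        den = num + (1 - xi) * norm.pdf(yt, mu2, sigma)
        p_rec = num / den                               # equation (3)
        probs.append(p_rec)
        xi = p11 * p_rec + (1 - p22) * (1 - p_rec)      # propagate to xi_{t+1}
    return np.array(probs)

# Strong growth followed by a weak quarter: the weak reading barely moves the
# inference, echoing the y_{t-1} = 6.0, y_t = 1.0 example in the text.
print(filter_probabilities([6.0, 1.0]))
```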
If we interpreted the persistence of recessions and expansions as the sole source of serial dependence in observed GDP growth rates, we could also use the above recursion to calculate a
conditional likelihood function for the tth observation:
$$f(y_t \mid y_{t-1}, y_{t-2}, \ldots, y_1) = \sum_{j=1}^{2} f(y_t \mid S_t = j)\, P(S_t = j \mid y_{t-1}, y_{t-2}, \ldots, y_1) = \xi_t\, \phi(y_t; \mu_1, \sigma) + (1 - \xi_t)\, \phi(y_t; \mu_2, \sigma).$$
Using the unconditional probability given by π = (1 − p22 )/(1 − p22 + 1 − p11 ), we could then
calculate the sample log likelihood as
$$\sum_{t=1}^{T} \log f(y_t \mid y_{t-1}, y_{t-2}, \ldots, y_1; \theta) \qquad (4)$$

solely as a function of observed GDP growth rates and the assumed parameter vector θ = (µ1, µ2, σ,
p11, p22)′, and maximize this log likelihood with respect to θ.
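Maximizing (4) is likewise straightforward to sketch. The filter recursion doubles as the likelihood evaluator; here sigma and the transition probabilities are transformed so a generic optimizer can search without constraints. The data below are synthetic stand-ins, and the starting values mirror the NBER-based column of Table 1.

```python
import numpy as np
from scipy.optimize import minimize
from scipy.stats import norm

def negative_log_likelihood(params, y):
    mu1, mu2, log_sigma, a11, a22 = params
    sigma = np.exp(log_sigma)                 # keep sigma positive
    p11 = 1.0 / (1.0 + np.exp(-a11))          # keep probabilities in (0, 1)
    p22 = 1.0 / (1.0 + np.exp(-a22))
    xi = (1 - p22) / ((1 - p22) + (1 - p11))  # start from unconditional pi
    loglik = 0.0
    for yt in y:
        f1 = xi * norm.pdf(yt, mu1, sigma)
        f2 = (1 - xi) * norm.pdf(yt, mu2, sigma)
        loglik += np.log(f1 + f2)             # conditional density of y_t
        p_rec = f1 / (f1 + f2)
        xi = p11 * p_rec + (1 - p22) * (1 - p_rec)
    return -loglik

# Synthetic growth rates from the two-regime mixture, for illustration only.
rng = np.random.default_rng(0)
y = np.concatenate([rng.normal(4.5, 3.4, 180), rng.normal(-1.2, 3.4, 45)])

start = np.array([-1.2, 4.5, np.log(3.4),
                  np.log(0.78 / 0.22), np.log(0.95 / 0.05)])
result = minimize(negative_log_likelihood, start, args=(y,), method="Nelder-Mead")
print(result.x[:2])   # estimated regime means
```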
Chauvet and Hamilton (2006) used data on GDP growth rates over 1947:Q2 to 2004:Q2 to
maximize (4) with respect to θ.
The maximum likelihood estimates are reported in the last
column of Table 1.
Note that these estimates θ̂ are based solely on observed GDP and make
no use of the historical NBER classifications.
It is therefore of considerable interest that the
MLE θ̂ turns out to be quite close to what we would have expected based on the discussion above
of NBER classifications. The exercise suggests that the separation of the historical record into
periods of expansion and contraction is not just an artifact created by the NBER Business Cycle
Dating Committee, but instead is exactly the same kind of classification one would see in the GDP
data themselves if one took the perspective that they come from a mixture of two distributions
with different means and transitions between the two regimes governed by an unobserved Markov
chain.[4]

[4] One could of course propose a more elaborate parametric model, such as allowing further dynamics in GDP growth beyond those induced by the business cycle, as in the original specification of Hamilton (1989). However, even as fundamental an indicator as GDP has had changing business-cycle behavior over time (e.g., McConnell and Perez-Quiros, 2000). The extreme simplicity of the model described here may give it more robustness with respect to these changes than more detailed specifications.
The top panel of Figure 11 plots the filter inference (3) for each date t in the 1947:Q2-2004:Q2
sample, along with NBER-determined recession dates. The correspondence with the NBER dates
is quite close, suggesting that this algorithm offers a promising alternative for recognizing business
cycle turning points. However, although the formula (3) only makes use of data through date t,
it would be a mistake to suppose that it would do as well in a real-time setting, for two reasons.
First, the value used for the parameter vector θ̂ was in fact based on the full sample of observations
through 2004:Q2. Second and more importantly, the value one would be using for any historical
observation yt in that sample was the number reported in the 2004:Q2 vintage set, which could
differ substantially from the number as initially released. To try to gauge the usefulness as a real-time tool, for each date t, Chauvet and Hamilton (2006) used a data set for GDP as it would have
been reported at that date (obtained from the Croushore and Stark (2003) data base maintained
by the Federal Reserve Bank of Philadelphia) both to estimate the parameter vector θ as well
as to form an inference. Use of data as actually available at the time resulted in a considerable
deterioration of reliability. We therefore recommended waiting one quarter for data to be revised
and to be able to use the added precision afforded by one-quarter smoothing before making a
declaration, so that one only tries to form an inference about St when the GDP growth rate for
the next quarter, yt+1 , is first released. This real-time one-quarter-smoothed inference is plotted
in the lower panel of Figure 11. This exercise suggested that such an inference might indeed offer
a useful method for recognizing business cycle turning points.
In our 2006 paper we recommended the following decision rule.
When the one-quarter smoothed inference P(St = 1|yt+1, yt, ..., y1; θ̂t+1) first exceeds 0.65, declare that a recession has
started, and at that time assign a probable starting point for the recession as the beginning of
the most recent set of observations for which P(St−j = 1|yt+1, yt, ..., y1; θ̂t+1) exceeds 1/2. The
recession call would remain in effect until P(St = 1|yt+1, yt, ..., y1; θ̂t+1) falls below 0.35, and at
that time assign a probable ending point for the recession as the beginning of the most recent set
of observations for which P(St−j = 1|yt+1, yt, ..., y1; θ̂t+1) is less than 1/2.
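A minimal sketch of this rule is below. A backward smoothing pass is added to the filter so that, when the quarter t + 1 release arrives, P(St−j = 1 | yt+1, ..., y1) is available for every earlier quarter; the thresholds 0.65 and 0.35 and the 1/2 dating convention follow the text. For clarity the parameters are held fixed at the values discussed above, whereas the actual procedure re-estimates θ̂t+1 with each new data vintage.

```python
import numpy as np
from scipy.stats import norm

MU = np.array([-1.2, 4.5])            # state 0 = recession, state 1 = expansion
SIGMA = 3.4
P = np.array([[0.78, 0.22],           # transition matrix: rows = current state
              [0.05, 0.95]])
PI = np.array([0.05, 0.22]) / 0.27    # unconditional state probabilities

def filter_and_smooth(y):
    """Filtered probabilities plus a full backward smoothing pass."""
    T = len(y)
    filt, pred = np.zeros((T, 2)), np.zeros((T, 2))
    prior = PI
    for t in range(T):
        pred[t] = prior
        joint = prior * norm.pdf(y[t], MU, SIGMA)
        filt[t] = joint / joint.sum()
        prior = filt[t] @ P
    smooth = filt.copy()
    for t in range(T - 2, -1, -1):    # backward smoothing recursion
        smooth[t] = filt[t] * (P @ (smooth[t + 1] / pred[t + 1]))
    return smooth

def real_time_calls(y):
    """Each time y_{t+1} is released, judge quarter t with the smoothed
    probabilities, applying the 0.65 / 0.35 thresholds and 1/2 dating rule."""
    in_recession, calls = False, []
    for t in range(len(y) - 1):
        p = filter_and_smooth(y[: t + 2])[:, 0]   # data through quarter t+1
        if not in_recession and p[t] > 0.65:
            in_recession = True
            start = t
            while start > 0 and p[start - 1] > 0.5:
                start -= 1                         # back up through the >1/2 run
            calls.append((f"recession began in quarter {start}",
                          f"declared with release of quarter {t + 1}"))
        elif in_recession and p[t] < 0.35:
            in_recession = False
            end = t
            while end > 0 and p[end - 1] < 0.5:
                end -= 1                           # back up through the <1/2 run
            calls.append((f"recession ended as of quarter {end}",
                          f"declared with release of quarter {t + 1}"))
    return calls
```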
The starred entries in Table 2 summarize the simulated real-time performance of this algorithm.
The dates assigned to historical recessions are quite close to those determined by the NBER, and
would typically be announced at about the same time as the NBER announcements. On the basis
of these results, I was persuaded that this approach offered a promising alternative for categorizing
business-cycle turning points on a real-time basis.
Since July of 2005, I have been regularly
reporting at the website www.econbrowser.com the value of P(St = 1|yt+1, yt, ..., y1; θ̂t+1) each
quarter when the advance GDP numbers are first released.
Figure 12 plots these values as a
function of t. To the left of the vertical line are what I call “simulated real-time calculations”—
for each t, the value plotted is the number one would have calculated with the release of quarter
t + 1 GDP numbers as they were initially released at the time. To the right of the vertical line are
what I call “actual real time”, meaning that the number plotted for each quarter t is exactly the
value that was publicly reported on the day that the t + 1 numbers were first released. The index
has continued to perform well in terms of its true out-of-sample track record.
The algorithm
calculated that the 11th postwar U.S. recession began in the fourth quarter of 2007, agreeing
with the NBER’s determination that 2007:Q4 marked the business cycle peak. The algorithm’s
announcement came in January 2009, one month later than the NBER’s declaration in December
2008. The algorithm determined in April 2010 that the recession ended in 2009:Q2. As of the
time of this writing, the NBER has not yet declared an end to the most recent recession.
Although the algorithm has so far proved to be robust in terms of its out-of-sample performance, it was not particularly timely in making the call that the recession had started, coming
one month later than the NBER’s declaration, and much later than all observers understood that
a powerful economic downturn was underway.
Why was the signal from GDP indecisive until
January of 2009? Figure 13 shows that the GDP numbers available at the time showed slightly
negative growth for 2007:Q4 and 2008:Q3, +0.9% growth for 2008:Q1, and +2.8% for 2008:Q2.
Is that convincing evidence that a recession had begun? The probability generated in October
2008 (following release of the 2008:Q3 advance estimate) was 46.9%, consistent with a suspicion
that a recession could well have begun, but not definitive. It was not until the 3.9% GDP drop
in 2008:Q4 that the index climbed to 88.4%, at which time the benefit of the smoothed inference
raised the likelihood above 50% for observations going back to 2007:Q4. Hence the algorithm was
unable to declare until January 2009 that a recession had likely started in 2007:Q4.
Given what we were attempting to do with this procedure— form an inference based solely on
GDP growth— it is hard to dispute that this was the correct call given the data at the time, and I
feel that the procedure performed well in terms of that objective. But it is clearly worth taking a
look at the real-time performance of various other procedures that were making use of alternative
economic indicators.
4 Real-time track record of models based on other indicators
I motivated the calculations above as an essentially model-free robust inference, while noting
that the calculations could alternatively be derived as the optimal statistical inference about an
unobserved recession variable St that takes on the value of 1 or 2 according to the outcome of
a Markov chain and with GDP growth yt conditional on St coming from one of two Normal
distributions. As described in Hamilton (1994, Chapter 22), one can also conduct similar inference on much more elaborate parametric formulations by postulating that the history of being
in recessions affects the conditional density of some observed vector of indicators yt according to
f(yt | yt−1, yt−2, ..., y1, St, St−1, ..., S1; θ) for f(·) an arbitrary density governed by unknown parameters θ. This section reviews the real-time experience with inferring business cycle turning
points based on different indicators and alternative Markov-switching time series models.
4.1 Gross domestic income.
The national income accounting convention is that every dollar of value added by definition generates a dollar in income for somebody. We could measure GDP either by looking at expenditures
or by looking at income, and should in principle get exactly the same number either way. The Bureau of Economic Analysis does indeed make calculations based on both methods, but in practice
the number obtained is not exactly the same, and the difference between gross domestic product
and gross domestic income is officially reported by the BEA as a “statistical discrepancy.” If the
BEA measurements were all correct, that discrepancy would be zero.
Jeremy Nalewaik (2007, 2010) has argued that GDI may provide helpful information in addition
to GDP, particularly for purposes of recognizing business-cycle turning points. The top panel of
Figure 14 plots the growth rates of GDP and GDI; note that GDI was sending a significantly more
pessimistic signal than GDP during 2007 and 2008. Nalewaik (2007) developed a generalization of
the model discussed in the previous section in which GDI and GDP are treated jointly as a bivariate
vector indicator. He has been distributing the probabilities calculated by that model in the third
month of each quarter since 2006:Q3 as part of internal Federal Reserve Board communications.
The bottom panel of Figure 14 plots his simulated and actual real-time experience with this
method.
Nalewaik provided me with these values in September of 2008, at which point his
algorithm had generated a strong recession signal for 2007:Q4 through 2008:Q2.
Does this mean that we should base an inference on GDI rather than GDP? By definition,
we do not have a good understanding of what the statistical discrepancy may represent, and the
conventional view is that GDP may be the slightly more reliable indicator. Certainly for the most
recent recession, GDI would have provided a more timely signal, but for some earlier episodes,
the GDI-based inference from the bottom panel of Figure 14 is less reliable than the GDP-based
inference in Figure 12. If we have to rely on just one for future unknown incoming data, I am
more comfortable sticking with GDP, even though GDI did better last time. It might be possible
to try to combine the information from the two series in a more parsimonious way, for example,
using a weighted average of the two measures.
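One minimal way to sketch that suggestion: blend the two growth series and feed the result through the Section 3 filter. The equal weights and the stand-in data below are hypothetical choices for illustration only, and the snippet assumes the filter_probabilities helper from the earlier sketch is in scope.

```python
import numpy as np

rng = np.random.default_rng(3)
gdp_growth = rng.normal(3.0, 3.5, 40)                # stand-in series, percent
gdi_growth = gdp_growth + rng.normal(0.0, 1.0, 40)   # differs by a "discrepancy"

blended = 0.5 * gdp_growth + 0.5 * gdi_growth        # equal weights, illustrative
probs = filter_probabilities(blended)                # per-quarter recession odds
```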
4.2 Unemployment rate.
The dramatic cyclical behavior of unemployment seen in Figure 2 suggests we might have better
success with a measure based on the unemployment rate rather than GDP, with sharp and rapid
increases in unemployment being perhaps a defining characteristic of an economic recession. My
2005 paper proposed the following model for yt, the monthly unemployment rate:

$$y_t = c_{S_t} + \phi_1 y_{t-1} + \phi_2 y_{t-2} + u_t \qquad (5)$$
where ut is distributed Student t with estimated degrees of freedom and St is assumed to follow
an unobserved 3-state Markov chain. This specification will be recognized as a generalization of
the parametric interpretation of the approach to GDP above, in allowing for additional dynamics
beyond shifts in the business-cycle phases (as represented by nonzero values for φ1 and φ2 ) and
allowing for three rather than two business cycle states (as represented by the three possible values
for the intercept, c1 , c2 , and c3 ). The three business cycle phases might be interpreted as normal
growth (St = 1), mild recession (St = 2), and severe recession (St = 3).
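To see what specification (5) implies, the sketch below simulates it. Every numerical value (intercepts, AR coefficients, degrees of freedom, scale, transition matrix) is a hypothetical placeholder, chosen only so the three regimes produce unemployment means of roughly 5, 6, and 8 percent; none are the Hamilton (2005) estimates.

```python
import numpy as np

rng = np.random.default_rng(42)

C = np.array([0.25, 0.30, 0.40])     # intercepts: normal, mild, severe (hypothetical)
PHI = np.array([1.10, -0.15])        # AR(2) coefficients (hypothetical, stationary)
DOF, SCALE = 5, 0.1                  # Student-t degrees of freedom and scale
TRANS = np.array([[0.97, 0.02, 0.01],   # hypothetical 3-state transition matrix
                  [0.10, 0.85, 0.05],
                  [0.05, 0.15, 0.80]])

def simulate(T=240):
    """Simulate T months of the unemployment rate under model (5)."""
    y, s, states = [5.0, 5.0], 0, []
    for _ in range(T):
        s = rng.choice(3, p=TRANS[s])            # Markov-chain regime update
        u = SCALE * rng.standard_t(DOF)          # fat-tailed disturbance
        y.append(C[s] + PHI[0] * y[-1] + PHI[1] * y[-2] + u)
        states.append(s)
    return np.array(y[2:]), np.array(states)

path, regimes = simulate()
```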
Figure 15 shows the smoothed probability that the U.S. economy was in either the mild or
severe recession phase calculated from this model as of early September of 2008. At that point the
model was declaring with 95% confidence that a recession had started. The parameter estimates
θ̂ were taken from Hamilton (2005) and I publicized this particular graph on September 5, 2008.
Does that qualify as out-of-sample, actual real time? Although I (and most other observers) were
persuaded that the U.S. had entered a recession at this time, I would be uncomfortable using
this model (even though it’s my own!) as a basis for issuing a fully automated announcement for
purposes of the next cycle. The reason is that although (5) seems to capture nicely the timing
of historical business cycles, it is harder to motivate as a primitive, first-order description of what
we mean by a recession in the same way as the GDP-based inference discussed in the previous
section. The unemployment rate exhibits changes in cyclical behavior over time, as demographic
and other variables evolve significantly. Hence, even though it’s worked historically, I’d have less
confidence that such a rule will work as well on the next out-of-sample recession. Even so, this
kind of inference, like the GDI inference just discussed, offers a very useful supplement to the
inference from GDP.
4.3 Multiple monthly indicators.
Chauvet (1998) and Kim and Nelson (1998) proposed incorporating business-cycle shifts directly
into a model like that in Stock and Watson (1989). The unobserved state of the business cycle is
represented by a scalar ct that is subject to regime shifts,
$$c_t = \alpha_{S_t} + \phi\, c_{t-1} + \eta_t \qquad (6)$$
where η_t ∼ N(0, σ²_η) and α_{S_t} = α_1 when the economy overall is in a recession (St = 1) and
α_{S_t} = α_2 in expansion, with transitions between St = 1 and 2 again governed by a Markov chain.
Each of 4 indicators is presumed related to the business cycle according to

$$y_{rt} = \lambda_r c_t + v_{rt}, \qquad r = 1, 2, 3, 4 \qquad (7)$$
with the idiosyncratic factor vrt itself exhibiting AR(1) dynamics:
$$v_{rt} = \theta_r v_{r,t-1} + \varepsilon_{rt}. \qquad (8)$$
Variants of the same four indicators used by Stock and Watson— growth rates of industrial production, personal income, sales, and employment— can also be used here. Chauvet and Hamilton
(2006) and Chauvet and Piger (2008) conducted simulated real-time exercises on these types of
models, suggesting that they could perform well in real time.
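The sketch below simulates the system (6)-(8) to show how the pieces fit together: a regime-dependent drift in the common factor, four indicators loading on that factor, and AR(1) idiosyncratic noise. All parameter values are hypothetical placeholders, not the estimates of Chauvet (1998) or Kim and Nelson (1998).

```python
import numpy as np

rng = np.random.default_rng(7)

ALPHA = np.array([-1.5, 0.5])           # factor drift: recession vs. expansion
PHI_C = 0.5                             # factor AR(1) coefficient
LAM = np.array([1.0, 0.6, 0.4, 0.8])    # loadings for the 4 indicators
THETA = np.array([0.3, 0.2, 0.4, 0.1])  # idiosyncratic AR(1) coefficients
P11, P22 = 0.80, 0.95                   # recession / expansion persistence

def simulate(T=300):
    s, c = 1, 0.0                       # start in expansion (state index 1)
    v = np.zeros(4)
    states, y = [], []
    for _ in range(T):
        stay = P11 if s == 0 else P22
        s = s if rng.random() < stay else 1 - s        # Markov regime switch
        c = ALPHA[s] + PHI_C * c + rng.normal(0, 0.5)  # equation (6)
        v = THETA * v + rng.normal(0, 0.3, 4)          # equation (8)
        y.append(LAM * c + v)                          # equation (7)
        states.append(s)
    return np.array(y), np.array(states)

indicators, regimes = simulate()
```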
Since August 2006, Piger has been posting the probabilities calculated by this model almost
every month on a publicly available web page.[5]
The actual real-time filter inferences for
each month, P(St = 1|yt, yt−1, ..., y1; θ̂t), are plotted in the top panel of Figure 16. This filter
probability first moved above 50% in Piger’s November 1, 2008 report which was based on data
describing August. The probability moved to 99.2% when the September data became available
but fell back to 16.1% with the next month’s data which showed industrial production and real
personal income less transfers to be growing again in October.
Chauvet and Piger (2008) had
recommended an inference rule of declaring a recession as soon as three consecutive values of
P (St−k = 1|yt , yt−1 , ..., y1 ; θ̂t ) were all above 80%. This threshold was passed with the release of
the October data, for although P (St = 1|yt , yt−1 , ..., y1 ; θ̂t ) was only 16.1% based on t = October
2008 data, P(St−k = 1|yt, yt−1, ..., y1; θ̂t) was above 80% for k = 1, 2, and 3 (see the bottom
panel of Figure 16).
Chauvet and Piger’s (2008) announced rule would then date the start of
the recession as the earliest k for which this probability was above 50%, which turned out to be
February 2008. Thus their approach would have announced on January 1, 2009 that a recession
had started in February 2008, although the filter probabilities available in January 2009 raised
the possibility that the recession could already have been over by October.
Subsequent data
confirmed the downturn was ongoing (panels 2 and 3 of Figure 16). The Chauvet-Piger rule of
waiting for a reading of 3 consecutive months below 20% would have resulted in a declaration in
January 2010 that a recovery likely began in July of 2009.
Marcelle Chauvet has periodically posted[6] real-time assessments from a closely related specification adapted from Chauvet and Hamilton (2006). One important difference from the specification of Chauvet and Piger is the reliance on the BLS household survey for the measure of employment rather than the establishment payroll data. Figure 17 shows the sequence of smoothed probabilities associated with different vintages of data. This model would have sent a signal in August 2008 that a recession had begun in December 2007. Like Piger's calculations, it showed a temporary hope of an end when only data through October 2008 were available. Chauvet used this model to announce on her website in October of 2009 that the recession ended in June or July of 2009.[7]

[5] Apparently the January 1, 2009 posting was missed due to a holiday data release and personal commitments. I am most grateful to Jeremy Piger for providing me with these data.

[6] See http://sites.google.com/site/marcellechauvet/probabilities-of-recession.

[7] Chauvet has also been calculating probabilities using the payroll employment figures instead of the household survey, and feels that the payroll numbers might better recognize a peak while the household numbers are better for the trough. Based on the payroll numbers, she announced in April 2008 that the recession had started in December.
This actual real-time experience confirms that there is useful additional information in the
monthly indicators, though some caution is necessary before trying to use this framework as an
actual real-time basis for recognizing business cycle turning points. Chauvet and Potter (2010)
also emphasize the need to allow for instability over time of monthly indicator models.
4.4 Inference from panel data.
Hamilton and Owyang (2009) proposed a method for forming an inference based on data on
employment growth for each of the separate 48 contiguous U.S. states. The assumption is that
each state n might be in recession or not for any given quarter t (S_{nt} = 1 or 2 for n = 1, ..., 48),
with the quarterly employment growth rate for state n and quarter t, y_{nt}, governed by y_{nt} | S_{nt} ∼
N(µ_{S_{nt}}, σ²_n). Our hypothesis was that certain clusters of states might be in recession together and
that the determination of which cluster is in recession at any given date was governed by a larger
Markov chain. The smoothed probability of each cluster being in recession at any given date is
reported in Figure 18. Of particular interest is the blue line, which tracks the probability that
we were at that date experiencing a national recession in which every state participated. These
track closely the NBER dates, though they assign a much longer duration to the recession of 2001 on
the basis of its “jobless recovery.”
What is perhaps remarkable is that the model concluded that, with virtual certainty, the U.S.
economy had entered a national recession as of 2007:Q2. I publicized this particular graph and its
implication that we were well into another recession on Econbrowser on April 3, 2008. Although
the inference proved to be prescient, it is again one I wouldn’t trust out of sample. For one thing,
the model attributed all the comovement between states to these recession clusters, an assumption
that was necessary given our estimation algorithm but that is clearly implausible. If you really
think you’re getting 48 independent observations on something, it’s correct that you could be
virtually certain of your inference, but this is an artifact of assuming independence rather than a
correct assessment of the uncertainty. The fact that employment growth behaved so differently
for the 2001 recession than in previous recessions is another reason to be queasy about relying on
such a model. It nevertheless again illustrates the potential gains from using multiple indicators,
in this case employment data at the level of individual states.
4.5 Mixed-frequency inference.
Camacho and Perez-Quiros (2010) showed how data of different frequencies can be combined in a
linear state-space framework. Their model uses observations on Euro-area GDP and employment
together with monthly observations on industrial production, retail sales, exports, industrial new
orders, Euro-zone Economic Sentiment Indicator, German business climate index, Belgian overall
business indicator, and both services and manufacturing purchasing managers indexes for the Euro
area. One of the nice contributions of their paper is a very convenient algorithm for processing
an unbalanced panel on a real-time daily basis as different data releases or revisions to past data
get announced on different days. The forecasts from this model are now being used inside the
Eurosystem and so far have an excellent actual real-time track record.
Camacho, Perez-Quiros, and Poncela (2010) adapted the same idea to a latent business-cycle
regime model based on a similar set of observed mixed-frequency indicators. Figure 19 reports
the probabilities of a Euro-area recession that would have been calculated by their approach based
on the actual data available as of each day during 2008 and 2009. The model yields a remarkably
sharp and stable inference over this period, with the probability jumping from 3.8% on June 24,
2008 to 98.1% by July 24, 2008. The probability remained above 69% until April 23, 2009, at
which point it fell to 6%.
Based on these results, the approach appears extremely promising.
There is again of course the concern that with so many estimated parameters, the subsequent
out-of-sample real-time performance may deteriorate.
5 Other approaches
I have focused in the last two sections exclusively on Markov-switching time series models, in
part because this is the approach with the most clearly established out-of-sample real-time track
record. But there are a number of promising ideas from alternative perspectives. Here I very
briefly mention a few.
One can think of any probit prediction of NBER classifications, such as that by Wright discussed in Section 2, as being driven by an unobserved latent Gaussian variable zt∗ governed by
$$z_t^* = \alpha + \beta' y_{t-1} + \varepsilon_t \qquad (9)$$
(9)
with NBER recession classifications St characterized by
$$S_t = 1 \text{ if } z_t^* > 0$$
and therefore optimally forecast using the formula
$$P(S_t = 1 \mid y_{t-1}) = \Phi(\alpha + \beta' y_{t-1}).$$
Dueker (2005) proposed to think of z_t^* as part of a VAR[8] including also the observed variables
yt and used Bayesian methods to infer parameters based on historical observations of y1, ..., yT
and NBER classifications S1, ..., ST, which he called a qual-VAR specification. Given the model
parameters, one can then calculate an optimal forecast of future business cycle phases conditional
on whatever is actually known as of some out-of-sample date τ:

$$P(S_{\tau+k} = 1 \mid y_\tau, y_{\tau-1}, \ldots, y_1, S_{\tau-r}, S_{\tau-r-1}, \ldots, S_1; \hat{\theta})$$

for S_{τ−r} the most recent NBER classification known with certainty as of date τ. Dueker (2008)
used this model on December 9, 2008 to predict (quite successfully, as it turned out) that the
recession would end in July or August of 2009 but that employment growth would not resume
until March of 2010.

[8] That is, (9) is generalized to

$$z_t^* = \alpha + \beta_1' y_{t-1} + \beta_2' y_{t-2} + \cdots + \beta_p' y_{t-p} + \gamma_1 z_{t-1}^* + \gamma_2 z_{t-2}^* + \cdots + \gamma_p z_{t-p}^* + \varepsilon_t$$

$$y_t = c + \Phi_1 y_{t-1} + \Phi_2 y_{t-2} + \cdots + \Phi_p y_{t-p} + \xi_1 z_{t-1}^* + \xi_2 z_{t-2}^* + \cdots + \xi_p z_{t-p}^* + u_t.$$
Leamer (2008) considered instead simple rules of the form
$$S_t = 1 \text{ if } y_{rt} < c_r$$

for y_{rt} some indicator and c_r a threshold chosen to capture historical NBER dates.
Leamer
proposed using 6-month changes in employment (from both the establishment and household
survey), industrial production, and the unemployment rate. In August of 2008 he concluded on
the basis of such calculations that:
this algorithm indicates that the data through June 2008 do not yet exceed the
recession thresholds, and will do so only if things get much worse.
Subsequent data unfortunately were indeed much worse, and Leamer’s indicators produced a
clear recession call by October 19 (Hamilton, 2008b).
Berge and Jordà (2009) proposed using the receiver operating characteristic curve as a basis
for systematically selecting such indicators yrt , and concluded that the national activity index
developed by Stock and Watson (1999) and regularly updated by the Federal Reserve Bank of
Chicago is a particularly promising indicator.
Harding and Pagan (2006) formalized the traditional Burns and Mitchell (1946) approach
of identifying local peaks and troughs in a series and then looking for maximal correspondence
between the inference from individual series to identify an aggregate recession. Chauvet and Piger
(2008) found this approach performed reasonably well with simulated real-time data, though it
appeared to be dominated by their 4-indicator Markov-switching specification.
6 Conclusion
This paper discussed approaches to recognizing business cycle turning points using a variety of
data sets and specifications. There is a trade-off between parsimony and making full use of all
available information. A very simple model based on GDP alone appears reasonably robust to
data revisions and structural change and so far has a good real-time out-of-sample track record, but
could clearly be improved upon. Averaging the inference from alternative specifications or using
Bayesian approaches to constrain more richly parameterized specifications are worth exploring.
A particularly promising approach is to combine data of different frequencies.
References
Berge, Travis J., and Òscar Jordà (2009), The classification of economic activity, working
paper, University of California, Davis.
Burns, Arthur F., and Wesley C. Mitchell (1946), Measuring Business Cycles, New York:
NBER.
Camacho, Maximo, and Gabriel Perez-Quiros (2010), Introducing the EURO-STING: Short
term indicator of Euro area growth, Journal of Applied Econometrics, 25: 663-694.
Camacho, Maximo, Gabriel Perez-Quiros, and Pilar Poncela (2010), Green shoots in the Euro
area: a real time measure, working paper, Bank of Spain, forthcoming.
Chauvet, Marcelle (1998), An economic characterization of business cycle dynamics with factor
structure and regime switches, International Economic Review, 39:969-996.
Chauvet, Marcelle (2010), The beginning and end of the 2007-2009 recession, http://sites.google.com/site/crefcus/probabilities-of-recession/The-beginning-and-end-of-the-2007-2009-recession1, April.
Chauvet, Marcelle, and James D. Hamilton (2006), Dating business cycle turning points, in
Costas Milas, Philip Rothman, and Dick van Dijk, eds., Nonlinear Time Series Analysis of Business
Cycles, pp. 1-54, Amsterdam: Elsevier.
Chauvet, Marcelle, and Jeremy Piger (2008), A comparison of the real-time performance of
business cycle dating methods, Journal of Business Economics and Statistics, 26: 42-49.
Chauvet, Marcelle, and Simon Potter (2005), Forecasting recessions using the yield curve,
Journal of Forecasting, 24: 77-103.
Chauvet, Marcelle, and Simon Potter (forthcoming), Monitoring business cycles with structural
changes, International Journal of Forecasting.
Croushore, Dean, and Tom Stark (2003), A real-time data set for macroeconomists: Does the
data vintage matter?, Review of Economics and Statistics 85:605-617.
Dueker, Michael (2005), Dynamic forecasts of qualitative variables: A qual VAR model of U.S.
recessions, Journal of Business and Economic Statistics, 23:96-104.
Dueker, Michael (2008), Predicting the trough and a jobless recovery, http://www.econbrowser.com/archives/2008/12/predicting_the_1.html, Dec. 9, 2008.
Estrella, Arturo, and Frederic S. Mishkin (1998), Predicting U.S. recessions: financial variables
as leading indicators, Review of Economics and Statistics 80: 45-61.
Gilchrist, Simon, Vladimir Yankov, and Egon Zakrajšek (2009), Credit market shocks and
economic fluctuations: evidence from corporate bond and stock markets, Journal of Monetary
Economics, 56:471-493.
Hamilton, James D. (1989), A new approach to the economic analysis of nonstationary time
series and the business cycle, Econometrica 57:357-384.
Hamilton, James D. (1990), Analysis of time series subject to changes in regime, Journal of
Econometrics 45:357-384.
Hamilton, James D. (1994), Time Series Analysis, (Princeton University Press, Princeton).
Hamilton, James D. (2005), What’s real about the business cycle?, Federal Reserve Bank of
St. Louis Review, July/August (87, no. 4): 435-452.
Hamilton, James D. (2008a), Regional propagation of business cycles, http://www.econbrowser.com/archives/2008/04/regional_propag.html, April 3.
Hamilton, James D. (2008b), More unhappy numbers, http://www.econbrowser.com/archives/2008/10/more_unhappy_nu.html, October 19.
Hamilton, James D., and Michael T. Owyang (2009), The propagation of regional recessions,
working paper, UCSD.
Harding, Donald, and Adrian R. Pagan (2006), Synchronization of cycles, Journal of Econometrics, 132:59-79.
Katayama, Munechika (2009), Improving recession probability forecasts in the U.S. economy,
working paper, Louisiana State University.
Kauppi, Heikki and Pentti Saikkonen (2008), Predicting U.S. recessions with dynamic binary
response models, Review of Economics and Statistics, 90:777-791.
Kim, Chang-Jin, and Charles R. Nelson (1998), Business cycle turning points, a new coincident
index, and tests of duration dependence based on a dynamic factor model with regime-switching,
Review of Economics and Statistics, 80:188-201.
King, Thomas B., Andrew T. Levin, and Roberto Perli (2007), Financial market perceptions
of recession risk, working paper, Federal Reserve Board.
Leamer, Edward E. (2008), What’s a recession anyway?, NBER working paper 14221.
McConnell, Margaret M., and Gabriel Perez-Quiros (2000), Output fluctuations in the United
States: what has changed since the early 1980’s?, American Economic Review, 90:1464-1476.
Nalewaik, Jeremy J. (2007), Estimating probabilities of recession in real time using GDP and
GDI, working paper, Federal Reserve Board.
Nalewaik, Jeremy J. (2010), The income- and expenditure-side estimates of U.S. output growth,
Brookings Papers on Economic Activity, forthcoming.
Stock, James H., and Mark W. Watson (1989), New indexes of coincident and leading economic
indicators, in Olivier Jean Blanchard and Stanley Fischer, eds., NBER Macroeconomics Annual
1989, 351-394, Cambridge MA, MIT Press.
Stock, James H. and Mark W. Watson (1991), A probability model of the coincident economic
indicators, in Kajal Lahiri and Geoffrey H. Moore, eds., Leading Economic Indicators: New
Approaches and Forecasting Records, (Cambridge University Press, Cambridge, U.K.).
Stock, James H., and Mark W. Watson (1993), A procedure for predicting recessions with
leading indicators: econometric issues and recent experience, in James H. Stock and Mark W.
Watson, eds., Business Cycles, Indicators, and Forecasting, 95-153, Chicago, University of Chicago
Press.
Stock, James H., and Mark W. Watson (1999), Forecasting inflation, Journal of Monetary
Economics, 44:293-335.
Wright, Jonathan H. (2006), The yield curve and predicting recessions, working paper, Federal
Reserve Board.
Table 1
Parameter estimates based on (1) characteristics of expansions and recessions as classified by
NBER, and (2) values that maximize the observed sample log likelihood of postwar GDP growth
rates over 1947:Q2 to 2004:Q2.
–––––––––––––––––––––––––––––––––––––––
Parameter   Interpretation                   Value from NBER    Value from
                                             classifications    GDP alone
–––––––––––––––––––––––––––––––––––––––
µ1          average growth in expansion           4.5               4.62
µ2          average growth in recession          −1.2              −0.48
σ           standard deviation of growth          3.4               3.34
p11         prob. expansion continues             0.95              0.92
p22         prob. recession continues             0.78              0.74
–––––––––––––––––––––––––––––––––––––––
Notes to Table 1: Reproduced from Chauvet and Hamilton (2006).
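These transition probabilities carry direct implications worth noting (a quick derivation, not part of the original table): the expected duration of a recession is $1/(1 - p_{22})$ quarters, and the unconditional fraction of quarters spent in recession is
$$\pi_{\text{rec}} = \frac{1 - p_{11}}{(1 - p_{11}) + (1 - p_{22})}.$$
With the NBER-based values these come to $1/0.22 \approx 4.5$ quarters and $0.05/0.27 \approx 0.19$; with the GDP-alone values, $1/0.26 \approx 3.8$ quarters and $0.08/0.34 \approx 0.24$, consistent with the roughly 0.2 recession weight used in the mixture of Figure 10.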
Table 2
Business cycle turning points and dates at which announcements were issued by NBER and the
GDP-based algorithm.
–––––––––––––––––––––––––––––––––––––––
Start of recessions

Peak as        Date NBER      Recession start   Date algorithm   Algorithm announcement
determined     made           as determined     made             lead (-) or lag (+)
by NBER        declaration    by algorithm      declaration      in months

1969:Q4        N.A.           1969:Q2           May 1970*        N.A.
1973:Q4        N.A.           1973:Q4           May 1974*        N.A.
1980:Q1        Jun 1980       1979:Q2           Nov 1979*        -7
1981:Q3        Jan 1982       1981:Q2           Feb 1982*        +1
1990:Q3        Apr 1991       1989:Q4           Feb 1991*        -2
2001:Q1        Nov 2001       2001:Q1           Feb 2002*        +3
2007:Q4        Dec 2008       2007:Q4           Jan 2009         +1

Start of expansions

Trough as      Date NBER      Recession end     Date algorithm   Algorithm announcement
determined     made           as determined     made             lead (-) or lag (+)
by NBER        declaration    by algorithm      declaration      in months

1970:Q4        N.A.           1970:Q4           Aug 1971*        N.A.
1975:Q1        N.A.           1975:Q1           Feb 1976*        N.A.
1980:Q3        Jul 1981       1980:Q2           May 1981*        -2
1982:Q4        Jul 1983       1982:Q4           Aug 1983*        +1
1991:Q1        Dec 1992       1991:Q4           Feb 1993*        +2
2001:Q4        Jul 2003       2001:Q3           Aug 2002*        -12
N.A.           N.A.           2009:Q2           Apr 2010         N.A.
–––––––––––––––––––––––––––––––––––––––
Notes to Table 2. Starred entries denote simulated real-time declarations; unstarred entries are
actual real-time declarations. N.A. indicates information is not available.
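The lead/lag columns are simply signed month counts between the two declaration dates. A minimal sketch of that arithmetic (a hypothetical helper for illustration, not part of the original analysis), in Python:

```python
def lead_lag_months(nber, algorithm):
    """Signed months from the NBER declaration to the algorithm's
    declaration; negative values mean the algorithm led NBER."""
    (n_year, n_month), (a_year, a_month) = nber, algorithm
    return (a_year - n_year) * 12 + (a_month - n_month)

# NBER declared the 1980:Q1 peak in Jun 1980; the algorithm's
# declaration came in Nov 1979, a 7-month lead.
print(lead_lag_months((1980, 6), (1979, 11)))  # prints -7
```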
[Figure 1 here: line plot titled "US real GDP, 1947-2010:Q1", vertical axis 700 to 1000,
horizontal axis 1950-2010.]
Figure 1. One hundred times the natural logarithm of U.S. real GDP, 1947:Q1-2010:Q1.
Last shaded region covers 2007:Q4-2009:Q2; other shaded regions correspond to NBER
recession dates.
[Figure 2 here: line plot titled "Unemployment", vertical axis 2 to 11 percent, horizontal
axis 1950-2010.]
Figure 2. Civilian unemployment rate, 1948:M1 to 2010:M3. Seasonally adjusted, from
Bureau of Labor Statistics. Last shaded region covers 2007:M12 to 2009:M6; other
shaded regions correspond to NBER recession dates.
[Figure 3 here: two panels titled "Dates of recessions" and "Dates of announcements",
each plotting the unemployment rate (3 to 11 percent) over 1981-2009.]
Figure 3. Unemployment, recessions, and dates of NBER announcements. Top panel:
same as Figure 2 for 1981:M8 to 2010:M3. Bottom panel: dates at which the NBER
announced the business cycle turning points indicated in the top panel.
[Figure 4 here: two bar-chart panels of GDP growth for 2001:Q1-2001:Q4, vertical axes
-3.0 to 3.0.]
Figure 4. Quarterly growth of real GDP for 2001, quoted at an annual rate. Top panel: as
reported on October 31, 2002. Bottom panel: as reported on January 30, 2002. Data source:
historical data archive of the Federal Reserve Bank of Philadelphia.
[Figure 5 here: three panels labeled P(t|t), P(t+3|t), and P(t+6|t), probabilities 0 to 1,
plotted over 1962-1988.]
Figure 5. In-sample values for probability of recession from Stock-Watson business cycle
model. Top panel: contemporaneous probability. Middle panel: 3-month-ahead forecast.
Bottom panel: 6-month-ahead forecast. Adapted from Figure 2.1 in Stock and Watson
(1993), using code provided by Mark Watson
(http://www.princeton.edu/~mwatson/ddisk/pr.zip).
[Figure 6 here: three panels labeled P(t|t), P(t+3|t), and P(t+6|t), probabilities 0 to 1,
plotted over 1988-1991.]
Figure 6. Out-of-sample values for probability of recession from Stock-Watson business
cycle model. Vertical line drawn for July 1990, which NBER subsequently declared to
have been the business cycle peak. Top panel: contemporaneous probability. Middle
panel: 3-month-ahead forecast. Bottom panel: 6-month-ahead forecast. Adapted from
Figure 2.1 in Stock and Watson (1993), using code provided by Mark Watson
(http://www.princeton.edu/~mwatson/ddisk/pr.zip).
[Figure 7 here: line plot titled "6-month CP-Tbill spread", vertical axis -0.5 to 4.0
percentage points, horizontal axis 1970-1997.]
Figure 7. 6-month commercial paper rate minus 6-month Treasury bill rate, 1970:M1 to
1997:M8.
[Figure 8 here: two panels titled "10 year - 3 month spread" (-3 to 5 percentage points)
and "3 month rate" (0 to 17.5 percent), each plotted over 1955-2010.]
Figure 8. Yield spread and short rate. Top panel: Yield on 10-year Treasury bonds minus
that on 3-month T-bills. Bottom panel: level of 3-month T-bill rate.
Figure 9. Real-time implications of Wright's probit model. Horizontal axis: average fed
funds rate over the preceding 90 days. Vertical axis: average 10-year minus 3-month
yield over the preceding 90 days. Iso-probability implications of the model are shown as
upward-sloping dashed lines, with circles denoting observed values for each week
between August 2004 and August 2008. Source: politicalcalculations.blogspot.com.
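For readers who want the mechanics: a probit in these two inputs makes each iso-probability line a locus of constant linear index. A sketch of how such lines arise is below; the coefficients are hypothetical placeholders for illustration, not Wright's (2006) estimates.

```python
from math import erf, sqrt

# Hypothetical coefficients for illustration only -- NOT Wright's
# (2006) estimates. b_spread < 0: a flatter or inverted curve
# raises the recession index.
B0, B_FFR, B_SPREAD = -0.5, 0.3, -0.8

def recession_probability(ffr, spread):
    """Probit probability of recession given the average fed funds
    rate and the 10-year minus 3-month spread (both in percent)."""
    z = B0 + B_FFR * ffr + B_SPREAD * spread
    return 0.5 * (1.0 + erf(z / sqrt(2.0)))  # standard normal CDF

# Holding the probability fixed pins down z, so each iso-probability
# line is spread = (z - B0 - B_FFR*ffr) / B_SPREAD, which slopes
# upward in ffr whenever B_FFR and B_SPREAD have opposite signs.
print(round(recession_probability(ffr=5.25, spread=-0.5), 2))
```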
[Figure 10 here: three panels titled "Density of contractions", "Density of expansions",
and "Density of mixture" (curves labeled MIXTURE, EXPANSION, RECESSION),
horizontal axes GDP growth from -15 to 15.]
Figure 10. Density of GDP growth in contractions, expansions, and overall. Top panel:
density of GDP growth for quarters characterized by NBER as part of a recession. Middle
panel: density for quarters characterized as expansion. Bottom panel: 0.2 times entry in
top panel (green), 0.8 times entry in middle panel (blue), and sum of the two products
(black). Adapted from Chauvet and Hamilton (2006, Figures 2 and 3).
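In code, the bottom panel is just a weighted sum of two normal densities. A minimal sketch, assuming the GDP-alone parameter estimates of Table 1 and the 0.2/0.8 weights from the figure:

```python
import numpy as np

def normal_pdf(y, mu, sigma):
    """Gaussian density evaluated at y."""
    return np.exp(-0.5 * ((y - mu) / sigma) ** 2) / (sigma * np.sqrt(2.0 * np.pi))

y = np.linspace(-15.0, 15.0, 301)                # GDP growth, annual rate
recession = normal_pdf(y, mu=-0.48, sigma=3.34)  # top panel
expansion = normal_pdf(y, mu=4.62, sigma=3.34)   # middle panel
mixture = 0.2 * recession + 0.8 * expansion      # bottom panel (black line)
```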
[Figure 11 here: two panels titled "Current filter probabilities based on data vintage
April 2004" and "1-qtr smoothed probabilities with real-time vintage data", probabilities
0 to 1, plotted over 1950-2000.]
Figure 11. Model-derived recession probabilities. Top panel: filter probability based on
revised data; plotted series for date $t$ is $P(S_t = 1 \mid y_{t,T}, y_{t-1,T}, \ldots, y_{1,T}; \theta_T)$, where $y_{t,T}$
denotes GDP growth for quarter $t$ as reported in the vintage released at $T$ = 2004:Q2, and
$\theta_T$ is the MLE based on $(y_{1,T}, y_{2,T}, \ldots, y_{T,T})$. Bottom panel: one-quarter smoothed
probability based on real-time data; plotted series for date $t$ is
$P(S_t = 1 \mid y_{t+1,t+1}, y_{t,t+1}, \ldots, y_{1,t+1}; \theta_{t+1})$, where $\theta_{t+1}$ is the MLE based on
$(y_{1,t+1}, y_{2,t+1}, \ldots, y_{t+1,t+1})$. NBER-determined recession dates (which were not used in any
way to calculate either of the plotted series) are indicated as shaded regions on both
graphs. Adapted from Figures 6 and 7 in Chauvet and Hamilton (2006).
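The filter probabilities in the top panel follow the standard two-state recursion of Hamilton (1989, 1994). A minimal sketch, assuming the fixed GDP-alone parameters of Table 1 rather than the vintage-by-vintage re-estimated MLEs used in the figure:

```python
import numpy as np

def filter_recession_probs(y, mu=(4.62, -0.48), sigma=3.34, p11=0.92, p22=0.74):
    """Filtered P(S_t = recession | y_1, ..., y_t) for a two-state
    Markov-switching model of GDP growth. States: 0 = expansion,
    1 = recession; defaults are Table 1's GDP-alone estimates."""
    P = np.array([[p11, 1.0 - p11],
                  [1.0 - p22, p22]])       # P[i, j] = Pr(next state j | state i)
    xi = np.array([1.0 - p22, 1.0 - p11]) / (2.0 - p11 - p22)  # ergodic start
    probs = []
    for yt in y:
        dens = np.exp(-0.5 * ((yt - np.asarray(mu)) / sigma) ** 2)
        joint = (xi @ P) * dens            # predict, then weight by likelihood
        xi = joint / joint.sum()           # normalize to get filter probabilities
        probs.append(xi[1])                # probability of the recession state
    return np.array(probs)
```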
[Figure 12 here: plot titled "GDP-based Recession indicator index", vertical axis -25 to
125, horizontal axis 1970-2005.]
Figure 12. GDP-based recession indicator index, 1967:Q4-2009:Q4. Last shaded region
covers 2007:Q4-2009:Q2; other shaded regions correspond to NBER recession dates.
Prior to 2005, each point on the graph corresponds to a simulated real-time inference that
was constructed from a data set as it would have been available four months after the
indicated date. After 2005, points on the graph correspond to actual announcements that
were publicly released four months after the indicated date.
[Figure 13 here: bar chart of quarterly GDP growth, 2007:Q1-2008:Q4, vertical axis -5.0
to 6.0.]
Figure 13. Quarterly real GDP growth rates for 2007-2008. Quoted at an annual rate, as
reported in January 2009.
[Figure 14 here: two panels titled "GDP and GDI" (-10 to 20) and "GDP- and GDI-based
probabilities" (0 to 1), each plotted over 1978-2008.]
Figure 14. Real gross domestic product, gross domestic income, and probabilities inferred
from the joint behavior of GDP and GDI, 1978:Q2 to 2008:Q2. Real GDI calculated as
nominal GDP minus statistical discrepancy (BEA Table 1.7.5), divided by the implicit GDP
deflator. Top panel: black line is quarterly growth rate of real GDP (annual rate), blue
line is quarterly growth rate of real GDI. Bottom panel: vertical line at 2006:Q2; values
for 2006:Q2 and earlier correspond to simulated real-time probabilities; values for
2006:Q3 and later represent actual real-time probabilities. Source: Jeremy Nalewaik
(personal correspondence).
[Figure 15 here: plot titled "Smoothed probabilities", 0 to 1, plotted over 1950-2005.]
Figure 15. Inference from 3-phase model of unemployment as published on September 5,
2008. Graph shows $P(S_t > 1 \mid y_T, y_{T-1}, \ldots, y_1; \theta_T)$ for each $t$ and is reproduced from
www.econbrowser.com/archives/2008/09/rising_unemploy.html.
[Figure 16 here: four panels titled "current filter", "smoothed 1 lag", "smoothed 2 lag",
and "full set of smoothed probabilities as of Jan 2009", probabilities 0 to 1, each plotted
over 2006-2009.]
Figure 16. Actual real-time probabilities from the Chauvet and Piger (2008) 4-indicator
model as reported each month at www.uoregon.edu/~jpiger/us_recession_probs.htm.
Top panel: filter probability that month t (plotted on horizontal axis) is characterized by
recession based on initial release of data describing month t; value shown would have
been actually reported two months after the indicated date. Middle panel: one-month
smoothed probability that month t (plotted on horizontal axis) is characterized by
recession allowing for 1-month revisions and smoothing; value shown would have been
reported three months after the indicated date. Third panel: two-month smoothed
probabilities; value shown would have been reported four months after the indicated date.
Bottom panel: full sample of smoothed probabilities as of January 2009; values shown
would have all been reported on January 1, 2009. Vertical lines drawn at 2007:M12,
which NBER declared on December 1, 2008 to have been the business cycle peak.
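For reference, the h-month smoothed probabilities in the lower panels can be obtained from the filter output by the standard backward recursion for Markov-switching models (exact for pure Markov-switching specifications; see Hamilton, 1994). Writing $\Omega_t$ for data through month $t$ and $p_{ij}$ for the transition probabilities,
$$P(S_t = i \mid \Omega_{t+1}) = \sum_j \frac{p_{ij}\, P(S_t = i \mid \Omega_t)}{P(S_{t+1} = j \mid \Omega_t)}\, P(S_{t+1} = j \mid \Omega_{t+1}),$$
applied once for the one-month smoother and iterated for longer smoothing horizons.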
Figure 17. Full sample of smoothed probabilities available for indicated vintage of data as
calculated by Chauvet’s 4-indicator model. Source: Chauvet (2010).
[Figure 18 here: plot titled "Regime Probabilities" with series labeled Group 1 Recession,
Group 2 Recession, Group 3 Recession, and Recession, probabilities 0 to 1, plotted over
1956:Q2-2004:Q2.]
Figure 18. Full-sample smoothed probabilities from the Owyang-Hamilton model of
different clusters being in recession, 1956:Q2 to 2007:Q4. Blue lines indicate the
probability that all states are in recession together; other colors denote episodes in which
only a subset of U.S. states was in recession. Reproduced from Hamilton (2008a).
[Figure 19 here: daily probability plot, 0 to 1, plotted over 2008-2010.]
Figure 19. Simulated real-time probabilities of Euro-area recession, Jan 1, 2008 to Jan 15,
2010. Value shown for each day is based solely on data actually available as of that day.
Source: Camacho, Perez-Quiros, and Poncela (2010).