Studies in Nonlinear Dynamics & Econometrics: Markov-Switching GARCH Modelling of Value-at-Risk
Studies in Nonlinear Dynamics & Econometrics: Markov-Switching GARCH Modelling of Value-at-Risk
Studies in Nonlinear Dynamics & Econometrics: Markov-Switching GARCH Modelling of Value-at-Risk
Econometrics
Volume 12, Issue 3 2008 Article 7
R EGIME -S WITCHING M ODELS IN E CONOMICS AND F INANCE
∗
University of Essex, rsajja@essex.ac.uk
†
University of Essex, jcoakley@essex.ac.uk
‡
University of Essex, jcnank@essex.ac.uk
Abstract
This paper proposes an asymmetric Markov regime-switching (MS) GARCH model to esti-
mate value-at-risk (VaR) for both long and short positions. This model improves on existing VaR
methods by taking into account both regime change and skewness or leverage effects. The perfor-
mance of our MS model and single-regime models is compared through an innovative backtesting
procedure using daily data for UK and US market stock indices. The findings from exceptions and
regulatory-based tests indicate the MS-GARCH specifications clearly outperform other models in
estimating the VaR for both long and short FTSE positions and also do quite well for S&P posi-
tions. We conclude that ignoring skewness and regime changes has the effect of imposing larger
than necessary conservative capital requirements.
1 Introduction
Value-at-Risk (VaR) is one of the most popular approaches to quantifying market
risk. It yields an estimate of the likely losses which could rise from price changes
over a pre-determined horizon at a given confidence level. It is usual that VaR is
separately computed for the left and right tails of the returns distribution
depending on the position of the risk managers or traders. Traders with long
positions are exposed to the risk of price falls (left tail VaR) while those with
short positions stand to lose when prices increase (right tail VaR). Symmetric
VaR models of the Generalised Autoregressive Conditional Heteroskedasticity
(GARCH) class have difficulties in correctly modeling the tails of the returns
distribution (Giot and Laurent, 2003) due to leverage effects.1
There are two main approaches that allow for the leverage effect in volatility
forecasting. The first is the use of conditional asymmetric models which extend
ARCH models by imposing an asymmetry parameter in the conditional variance
equation. The second approach is based on the use of asymmetric density
functions for the error term or an asymmetric confidence interval around the
predicted volatility.2 Although such approaches provide an improvement in fit
compared with symmetric models, the empirical evidence suggest that the
persistence in the conditional variance is likely to exhibit substantial upward bias.
One potential source of bias is that the means and variances are assumed fixed
rather than varying over the entire sample period (Lamoureux and Lastrapes,
1990, Timmermann, 2000, and Mikosch and Starica, 2004).
A Markov Regime-Switching (MS) approach can resolve this by
endogenising changes in the data generating process. Gray (1996) extended the
Hamilton (1994) MS model to the MS-GARCH framework by allowing within-
regime, GARCH type heteroskedasticity. This was subsequently modified by
Klaassen (2002). Marcucci (2005) compares a set of GARCH, EGARCH and
GJR-GARCH models within an MS-GARCH framework (Gaussian, Student’s t
and Generalized Error Distribution for innovations) in terms of their ability to
forecast S&P100 volatilities. Ane and Ureche-Rangau (2006) extend the regime-
switching model developed by Gray (1996) to an Asymmetric Power (AP)
GARCH model to analyze empirically Asian stock indices returns. Their
empirical results indicate that all the generalizations introduced by the MS-
APGARCH model are statistically and economically significant.
In this paper we introduce a MS-GARCH framework to take account of both
asymmetry and regime changes in returns data in forecasting VaR. Our study
1
This means that a negative shock leads to a higher conditional variance (volatility) in the
subsequent period than a positive shock would.
2
See Bond (2000) for a survey on early asymmetric conditional density functions.
builds upon the previous literature in several ways. First, we focus on VaR for
long and short positions allowing for asymmetries in both conditional variance
and distribution of error terms. This is crucially important in taking account of the
leverage effect in stock markets. Existing studies such as Marcucci (2005) assume
symmetric distributions and focus on long VaR only. Second, while Giot and
Laurent (2003) model the long and short VaR using a single-regime APARCH
model combined with the skewed Student’s t distribution, we extend the analysis
to the MS context since our results indicate that regime change matters. Third, we
evaluate out-of-sample model performance by using a novel combination of
exceptions and regulatory-based backtesting procedures. This provides a more
robust evaluation of model performance than the in-sample analysis found in
existing studies such as that of Ane and Ureche-Rangau (2006).
The rest of the paper is organized in the following way. Section 2 presents
the selected volatility models implemented to model VaR for the long and short
trading positions. The empirical results for model specification and diagnostic
tests are presented in Section 3. In Section 4, the performance of competing
models in forecasting VaR is examined. Finally, Section 5 concludes the paper.
2 VaR Models
This section presents the VaR models that are used to model the long and short
sides of daily trading positions. The GARCH model, originally introduced by
Bollerslev (1986), is the most popular model in volatility forecasting and financial
risk management. We go beyond the single-regime GARCH framework and
consider MS-GARCH models for modeling VaR for long and short trading
positions.
φ s ( L)(rt − γ s ) = θ s ( L)ε t ,
t t t
(2.2)
ε t = h Zt 1/ 2
t , st
where the innovations Z t are i.i.d. with zero mean, unit variance and marginal
density function f z (Z ) and L is the lag operator. The conditional variance ht can be
defined either by:
η st / 2 η
β s ( L)ht ,s
t t
= ω st + [ β st ( L) − ϕ st ( L)](1 + λst S t ) | ε t | st , (2.3)
giving the EGARCH class, where η > 0 denotes the power parameter, st = 1 if
ε t < 0 and 0 otherwise, and λ is the leverage or asymmetry parameter.
The usual ARCH (Engle, 1982) and GARCH (Bollerslev, 1986) models are
obtained if in equation (2.3) only the parameters ω , β (L) and ϕ (L) are included
and the power parameter η is fixed at 2. Alternatively, the power parameter η
can be estimated rather than imposed as 2, yielding the Power ARCH (or
PARCH) class proposed by Ding et al. (1993). When η is fixed at 2, equation
(2.3) gives a variant of the Threshold GARCH or GJR-GARCH as introduced
independently by Zakoïan (1994) and Glosten et al. (1993). Finally, equation (2.4)
represents the Exponential GARCH model (Nelson, 1991).
Since st is an unobserved variable, the conditional variance in (2.3) and (2.4)
depends on the entire sequence of regimes up to time t. This means that, for a
sample of length T, the likelihood function requires integrating over M T
sequences of (unobserved) regime paths rendering the model essentially
L(θ ) = ∑t =1 f (rt | ψ t −1 ; Λ ) ,
T
where
The key probability in (2.5) and (2.6) has a first-order recursive structure
which can be written as
where F −1 is the so-called quantile function defined as the inverse of the CDF.
The VaR for a short position is similarly computed where the same definition is
used for the right tail of the distribution function, i.e. 1-p substitutes for p.4
Since quantiles are direct functions of the variance in parametric models, the
ARCH class models present a dynamic measure of VaR. More precisely, the VaR
for time T+1 based on the ARCH family models can be defined as
4
See, for example, Dowd (2005) for a comprehensive survey of VaR methods.
⎧ 1 p 1
⎪⎪ ζ St p ,ν [ 2 (1 + ζ )] p<
2
if
1+ ζ 2
SkSt *p ,ζ ,ν =⎨ (2.11)
1− p 1
⎪− ζ St p ,ν [ (1 + ζ −2 )] if p≥
⎪⎩ 2 1+ ζ 2
where ζ is the asymmetry coefficient and St p ,ν is the quantile function of the (unit
variance) Student’s t density with ν degrees of freedom. Then the associated
quantile function is obtained from
SkSt *p ,ζ ,ν − m
SkSt p ,ζ ,ν = , (2.12)
s
where parameters m and s 2 are the mean and the variance of the non-standardized
skewed Student’s t, respectively:
ν −1
Γ( ) ν −2
2 1 1
m= (ζ − ) , s 2 = (ζ 2 + − 1) − m 2 .
π Γ(ν / 2) ζ ζ 2
3 Empirical Findings
The class of model developed in the previous section enables us to consider
excess kurtosis and skewness as well as possible structural changes exhibited by
most financial time series data. In the following subsections we implement this
model to examine stock market behavior.
3.1 Data
The data set analyzed in this paper comprises daily observations on two major
stock market indices returns, namely, the FTSE100 and the S&P 500 (hereafter
the FTSE and S&P). The sample covers the time interval from 1 January 1991 to
31 December 2004, resulting in 3599 daily observations.5 The first 2599
observations are used for in-sample estimation while the remaining 1000
observations are taken as the out-of- sample for forecast evaluation process in
which a sliding window (rolling) method of 1000 days is implemented. The usual
descriptive statistics of the data are given in Table 1. The moments of the stock
index returns are shown along with the results of an aggregate autocorrelation
(Ljung-Box) test for returns and their squares.
As can be seen, for both indices, the mean return is quite small, the
skewness is significant and negative, implying a possible leverage effect in data,
and the kurtosis is significantly higher than that of a Gaussian distribution (excess
kurtosis) indicating fat-tailed returns. This suggests the need for a fat-tailed or
skewed fat-tailed distribution, for example Student’s t or skewed Student’s t, to
describe the returns’ conditional distribution. In addition, the large Q-statistics up
to 12, 24 and 36 orders strongly reject the null hypothesis of no serial correlation
in both returns and squared returns for the FTSE but only in squared returns for
the S&P index.6
We also present, in Figure 1, squared returns, rt 2 , for the last four years in
order to give an indication of high and low volatility periods. At first glance, plots
demonstrate substantial volatility clustering as periods of low volatility mix with
periods of high volatility and large positive and negative returns. This indicates
5
All data have been obtained from DataStream.
6
Other test statistics like: the BDS, McLeod–Li, Engle LM, Tsay and Bicovariance tests can be
used to examine whether the residuals are i.i.d. (see for example Panagiotidis, 2005).
S&P500 FTSE100
6 6
5 5
4 4
3 3
2 2
1 1
0 0
2002 2003 2004 2005 2002 2003 2004 2005
Figure 1: Squared returns of the S&P500 and FTSE100 rates over the sample
period January 2001 to December 2004.
It seems that the standard single-regime ARCH models cannot capture the
pressure relieving effect, since they typically imply large persistence for
individual shocks. However, it is possible that regime-switching models allowing
for a switch from a high to a low volatility regime can explain both the large
volatility persistence of individual shocks and the pressure relieving effect.
Furthermore, imposing an asymmetric parameter in the conditional variance
equation and/or distributional form of error terms enables us to capture possible
leverage effects in the data. The following subsection examines these issues.
7
The formal test results for conditional heteroskedasticity are also available upon request.
Regime-switching models
The ML estimation results assuming normal and Student’s t innovations for the
FTSE and the S&P returns are respectively shown in Tables 3-6. In order to find
out the appropriate MS structure for the returns series, three different MS models
are analyzed: a partial MS model with no dynamics in the mean equation, a partial
MS and a full MS model with appropriate dynamics (preferred dynamics as
indicated by the single-regime models) in the mean equation. In the full MS
model all parameters in the mean and variance equations are allowed to switch
between regimes while in the partial MS model only the variance equation
parameters differ across regimes.
According to the information criterion and the LR test, the null of the partial
MS structure can be rejected in favor of the full MS structure for the S&P returns
with Student’s t innovations. However, allowing the mean equation’s parameters
to differ across regimes (full MS model) results in some insignificant parameters
in the mean or variance equations. Consequently, it seems the partial MS model in
conjunction with the ARMA (1, 1) structure for the mean equation, with either
8
All estimations are performed in the TSMod package developed by James Davidson (see
http://www.timeseriesmodelling.com/)
9
See Arghyrou and Gregoriuo (2007) for a very recent application of the wild bootstrap technique.
10
We use the Schwarz Criterion, which provides consistent order-estimation in the context of
linear ARMA models (see Hannan, 1980).
normal or Student’s t, can be considered as the appropriate dynamics for the S&P
returns, while no dynamics is preferred for the FTSE returns. We note that
Marcucci (2005) and Ane and Ureche-Rangau (2006) do not compare the partial
and full MS and instead use the full MS structure in their studies. By contrast,
Haas et al. (2005) implement the partial MS structure in analyzing foreign
exchange data.
Table 3: ML estimation results from the two-regime GARCH models with normal
innovations for the FTSE100 returns.
Mean parameters:
Intercept: γ s 0.020 0.020 -0.035 0.027
{0.006} {0.010} {0.457} {0.003}
AR1: θ s 0.045 0.084 0.041
{0.015} {0.601} {0.063}
Variance parameters:
GARCH Intercept: ω 1s / 2 0.048 0.000 0.049 0.000 0.048 0.000
[0.009] [0] [0.009] [0] [0.009] [0]
ARCH term: α s 0.029 0.083 0.029 0.096 0.030 0.100
{0} {0} {0} {0} {0} {0}
GARCH term: β s 0.939 0.971 0.939 0.967 0.939 0.966
{0} {0} {0} {0} {0} {0}
Stay probabilities: ( p11 , p22 ) 0.817 0.001 0.823 0.001 0.827 0.001
Test summary:
Log Likelihood -1375.44 -1373.25 -1372.29
Schwarz Criterion -1406.86 -1408.60 -1419.42
Ljung-Box Q(12) 16.673 15.879 16.986
{0.162} {0.197} {0.15}
Standard errors are given in square brackets and p-values in curly braces.
The upper and middle parts of the table report the estimated coefficients for the mean and variance
equations. In each pair of columns, left and right columns report parameter estimates for regime 1
and 2, respectively. Non-switching parameters are reported under regime 1 (left columns). The
lower part reports diagnostic test results, where the second row shows the increase in the Log-
likelihood value compared to the one for the corresponding single-regime model. For each of the
criteria, boldface entries indicate the best regime switching model for the particular criterion. Q
(12) indicates the aggregate autocorrelation test for the squared normalized residuals up to lag 12.
Tables 3-6 also present the results of the Q-test to test for autocorrelation in
the standardized residuals. The p-values are high for both series, indicating that
the models are rich enough to remove all traces of autocorrelation in the
normalized residuals up to lag 12. Finally, our results (available upon request)
indicate that the skewness parameter in the skewed Student’s t distribution is
insignificant for both series, similar to the single-regime GARCH models. These
results contrast with those of Ane and Ureche-Rangau (2006) who find significant
asymmetry in an MS-APARCH specification for different (Asian) stock market
indices.
Table 5: ML estimation results from the two-regime GARCH models with normal
innovations for the S&P500 returns.
Mean parameters:
Intercept: γ s 0.031 0.032 -0.381 0.044
{0} {0} {0.546} {0}
AR1: θ s 0.795 0.752 0.757
{0} {0.058} {0}
MA1: φ s 0.839 0.320 0.822
{0} {0} {0}
Variance parameters:
GARCH Intercept: ω 1s / 2 0.019 0.078 0.016 0.096 0.000 0.021
[0.011] [0.070] [0.012] [0.065] [0] [0.007]
ARCH term: α s 0.027 0.271 0.027 0.330 0.130 0.034
{0.024} {0} {0.006} {0} {0.506} {0}
GARCH term: β s 0.953 0.896 0.954 0.870 0.942 0.946
{0} {0} {0} {0} {0} {0}
Stay probabilities: ( p11 , p22 ) 0.775 0.117 0.771 0.103 0.049 0.837
Test summary:
Log Likelihood -1404.84 -1396.51 -1387.38
Schwarz Criterion -1440.23 -1439.76 -1442.43
Ljung-Box Q(12) 8.96 9.567 10.675
{0.706} {0.654} {0.557}
See the legend of Table 3 for explanations.
process endogenously collapses back from its explosive state en route to a stable
regime and is stationary in the long run.
The second column of Table 7 for the Student’s t case indicates persistence
in the two regimes with a staying probability, p11 and p22 , both exceeding 0.99.
The regimes are also characterized by different unconditional variances, Eσ 2jt : the
unconditional variances in the high-volatility regime are about twice as large as
those in the low-volatility regime. Consequently, the degree of persistence due to
the Markov effects is close to one, i.e. p11 + p22 − 1 ≈ 1 . As noted by Timmermann
(2000) and Morana (2002), these values of the staying probabilities representing
infrequent mixing of regimes may be interpreted as closely resembling structural
break models. In this case, estimates of the GARCH parameters from models
ignoring the switching may be overwhelmed by substantial upward bias.
P11 , P22 0.817 0.001 0.998 0.999 0.771 0.103 0.998 0.999
Eσ 1t2 , Eσ 2t2 0.115 0.503 0.176 0.262 0.301 1.371 0.156 0.288
The table shows the unconditional properties of estimated MS-GARCH models with normal and
Student’s t innovations. Pjj , j=1, 2, are the staying probabilities and give the probability that state j
will be followed by state j . π j , j=1, 2, are the unconditional probabilities of being in regime j, that
is π j = (1 − Pii ) /(2 − Pii − Pjj ) , j=1, 2 and i ≠ j . δ j , j=1 2, are the expected duration times for
regime j, that is δ j = 1 /(1 − Pjj ) . Eσ 2jt , j=1, 2, denotes the unconditional expectation of the variance.
11
It is also demonstrated by Dacco and Satchell (1999) that the evaluation of forecasts from non-
linear models like regime-switching models based on statistical measures might be misleading.
12
See Dowd (2005) for a survey of backtesting VaR models.
test, LRind , and a conditional coverage test, LRcc . It is obvious that there are some
big discrepancies between the number of failures for the long and short positions
(hereafter f l and f s , respectively) obtained from the symmetric models (normal
and Student’s t innovations). For the FTSE returns f l is higher than f s at all
levels of VaR, and vice-versa for the S&P returns. This can be considered as clear
evidence of the asymmetry in our returns data.
The specification results showed it was not necessary to impose a skewness
parameter in the Student’s t distribution in modeling the distribution of our returns
series. We check whether the skewed Student’s t may be able to improve the out-
of-sample results for both the negative and positive returns. In fact, it is expected
that skewed Student’s t innovations with ζ < 1 can decrease f l and increase f s ,
compared to f l and f s obtained from a symmetric Student’s t. The opposite result
holds when ζ > 1 .
Tables 8 and 9 indicate that, in general, the skewed Student’s t improves on
the out-of-sample performance of the corresponding GARCH and MS-GARCH
models with Student’s t innovations. The overall improvement for both negative
and positive returns is 50% and 42% for the FTSE and S&P returns,
respectively.13
On the other hand, the EGARCH specification with symmetric innovations
generally leads to an acceptable performance for out-of-sample VaR prediction.
Consequently, the EGARCH model with skewed Student’s t innovations
generally fails to improve on the number of failures compared to those obtained
with symmetric Student’s t innovations.
The results unsurprisingly show that the VaR models based on normal
innovations have difficulties in modeling large returns. In particular, the normal
MS-GARCH model consistently underestimates the return (risk) of both series at
different tails, specifically at the 0.5% and 99.5% tails. In other words, failure
numbers are much greater than the expected one at a given quantile, in the case of
normal innovations. This leads to low p-values for the LRuc test, indicating an
insignificant model for volatility forecasting and the VaR estimation. On the other
hand, the models with Student’s t innovations perform very well, irrespective of
the model and the tail one takes into account. Thus, the LR test results show that a
switch from normal to Student’s t innovations yields a significant improvement in
the VaR performance.
13
“Improvement” implies to the situation where the number of failures obtained from an
asymmetric model is closer to the expected one, compared with the corresponding symmetric
model.
Table 8: LR test results for different confidence level of VaR for the FTSE100
returns.
p-values NF p-values
VaR models NF
LRcc LRind LRcc LRcc LRind LRcc
Table 9: LR test results for different confidence level of VaR for the S&P500
returns.
p-values p-values
VaR models NF NF
LRcc LRind LRcc LRcc LRind LRcc
We also examine the statistical accuracy of the VaR models using Hurlin
and Tokpavi’s (2006) MP test. The results for the FTSE and S&P return are
reported in Tables 10 and 11, respectively. Following their suggestion concerning
the choice of lag order K and number of coverage rates m, the following sets are
considered for each VaR model, k ∈ {1,3,5} , Θ ={5%, 1%} and Θ ={5%, 0.5%}
for m = 2 and Θ = {5%, 1%, 0.5%} for m = 3. The p-values corresponding to
Qm (k ) for long and short positions are reported in the left and right panel of the
tables, respectively. Consider the total number of violations of the no
autocorrelation null. The MS-GARCH models with a total of just 6 (out of a total
of 54 possible) violations outperform the single-regime GARCH and EGARCH
models with 13 and 16 violations, respectively, for the FTSE 100. However, the
EGARCH model (8 violations) outperforms the MS-GARCH (14 violations) and
single-regime GARCH (19 violations) for the S&P500.
Comparing the MP test results with those of the LR test, we conclude that
the MS-GARCH models outperform single-regime models as there is no case in
which the p-values for all lag orders are less than 5%. The only small exception
are the results on the left tail of the S&P returns with Θ = {5%, 1%, 0.5%} where
the MS-GARCH model with skewed Student’s t innovations is rejected at all lag
orders. On the other hand, the single-regime models are more likely to be rejected
by the MP test, especially those with large lag orders and high coverage rates. For
instance, the null of no autocorrelation in the VaR violation sequences left by
GARCH and EGARCH models is rejected when the MP test with the coverage set
Θ = {95%, 99%, 99.5%} and lags orders k=3,5 is implemented for the FTSE
returns. The same result is achieved for the GARCH model in forecasting VaR for
the S&P returns with the coverage set Θ = {5%, 1%, 0.5%} at all lag orders, k=1,
3, 5.
Overall, the MS-GARCH-skt model is favored by our exceptions-based tests
in forecasting of both the long and short VaR for FTSE returns. The same results
hold for the MS-GARCH-t model in the case of S&P returns. Our findings are an
improvement on those in Marcucci (2005) in which no model clearly outperforms
the others in forecasting the long VaR of the S&P100 returns.
Table 10: MP test results for different multivariate coverage rates of VaR for the
FTSE100 returns.
Θ ={5%, 1%} Θ ={95%, 99%}
VaR models
K=1 K=3 K=5 K=1 K=3 K=5
GARCH-N 0.61 0.08 0.05 0.70 0.11 0.32
GARCH-t 0.47 0.19 0.11 0.64 0.00 0.01
GARCH-skt 0.38 0.13 0.06 0.88 0.10 0.35
EGARCH-N 0.38 0.12 0.04 0.91 0.14 0.38
EGARCH-t 0.77 0.16 0.14 0.89 0.17 0.42
EGARCH-skt 0.77 0.11 0.10 0.89 0.13 0.38
MS-GARCH-N 0.80 0.64 0.37 0.54 0.35 0.70
MS-GARCH-t 0.58 0.29 0.06 0.83 0.22 0.46
MS-GARCH-skt 0.79 0.07 0.04 0.85 0.92 0.98
Table 11: MP test results for different multivariate coverage rates of VaR for the
S&P500 returns.
Θ ={5%, 1%} Θ ={95%, 99%}
VaR models
K=1 K=3 K=5 K=1 K=3 K=5
GARCH-N 0.00 0.01 0.01 0.83 0.16 0.45
GARCH-t 0.05 0.11 0.20 0.91 0.09 0.28
GARCH-skt 0.03 0.06 0.13 0.93 0.38 0.75
EGARCH-N 0.04 0.33 0.68 0.76 0.04 0.15
EGARCH-t 0.55 0.87 0.91 0.78 0.01 0.08
EGARCH-skt 0.55 0.92 0.97 0.78 0.12 0.45
MS-GARCH-N 0.09 0.27 0.24 0.79 0.55 0.78
MS-GARCH-t 0.03 0.34 0.35 0.73 0.23 0.56
MS-GARCH-skt 0.01 0.05 0.00 0.90 0.47 0.80
Figure 2 shows the capital requirement for the FTSE and SP imposed by
three selected GARCH, EGARCH-t and MS-GARCH models with Student’s t
innovations. One striking feature is that the capital requirements of these three
VaR models display a very similar pattern. However, the MS-GARCH models
usually impose lower daily capital requirements, compared to the single-regime
models. This helps risk managers avoid over-conservative estimation of VaR and
so save on their minimum capital requirements.
FTSE100
25 25
20 20
15 15
10 10
Daily c apital requirem ents
5 5
0 200 400 600 0 200 400 600
S&P500
25 25
20 20
15 15
10 10
5 5
0 200 400 600 0 200 400 600
GARCH-t. The ‘best’ model is that which has the most significant performance
relative to the benchmark model. The other four pair-wise comparisons are those
models with a performance that corresponded to the 75% (second best), 50%
(median), 25% (second worst) and 0% (worst) quantile of model performance.
Table 12: Results of regulatory-based backtesting over 751 days for the FTSE100.
Areas Capital Requirements
VaR Models
Green Yellow Red Average Variance SPA Rank
Long position
GARCH-N 660 91 0 11.97 14.06 Median
GARCH-t 751 0 0 12.30 14.32 Benchmark
GARCH-skt 751 0 0 12.50 14.01
EGARCH-N 452 299 0 12.57 25.97 Worst
EGARCH-t 733 18 0 11.69 15.34 Second best
EGARCH-skt 751 0 0 12.01 16.08 Second worst
MS-GARCH-N 480 271 0 11.84 19.81
MS-GARCH-t 751 0 0 11.49 14.59 Best
MS-GARCH-skt 699 52 0 11.62 14.42
Short position
GARCH-N 619 132 0 11.72 13.02 Worst
GARCH-t 751 0 0 11.93 14.17 Benchmark
GARCH-skt 751 0 0 11.70 14.73 Second worst
EGARCH-N 741 10 0 11.12 12.33 Best
EGARCH-t 741 10 0 11.47 14.99
EGARCH-skt 741 10 0 11.17 14.27
MS-GARCH-N 452 299 0 11.36 11.99
MS-GARCH-t 751 0 0 11.18 14.19 Second best
MS-GARCH-skt 741 10 0 11.31 15.20 Median
This table summarizes the capital requirements imposed by different VaR models. For each
model, the numbers under the green, yellow and red columns indicate how many times during
the 751 days the model has been placed in that particular zone by the Basle traffic light. Average
daily capital requirement is also reported along with its variance over the sample period. The last
column reports the ranking between comparable models based on the Superior Predictive Ability
(SPA) approach, where the benchmark model is GARCH-t. The ‘best’ model is that model had
the most significant performance relative to the benchmark model. The other four pair-wise
comparisons are those models with a performance that corresponded to the 75% (second best),
50% (median), 25% (second worst) and 0% (worst) quantile of model performance. There is no
ranking for the models which have some red zone record, as a placing in the red zone implies a
problem within the VaR model. This is also the case for those models which have weak
performance in the exception-based testing.
Table 13: Results of regulatory-based backtesting over 751 days for the
S&P500.
Areas Capital Requirements
VaR Models
Green Yellow Red Average Variance SPA Rank
Long position
GARCH-N 749 2 0 11.23 10.24
GARCH-t 751 0 0 11.92 12.38 Benchmark
GARCH-skt 751 0 0 12.25 13.13 Worst
EGARCH-N 749 2 0 10.81 10.34 Second best
EGARCH-t 751 0 0 11.35 12.56 Median
EGARCH-skt 751 0 0 11.55 14.89 Second worst
MS-GARCH-N 352 396 3 11.73 20.83
MS-GARCH-t 734 17 0 10.76 12.71 Best
MS-GARCH-skt 746 5 0 10.95 11.62
Short position
GARCH-N 504 247 0 11.59 16.42 Worst
GARCH-t 720 31 0 11.63 12.45 Benchmark
GARCH-skt 720 31 0 11.26 12.00 Second worst
EGARCH-N 699 52 0 10.57 10.81 Best
EGARCH-t 751 0 0 11.01 12.54
EGARCH-skt 751 0 0 10.80 12.81 Second best
MS-GARCH-N 435 287 29 11.10 19.09
MS-GARCH-t 504 247 0 11.17 19.63 Median
MS-GARCH-skt 435 287 29 10.87 17.90
See the legend of Table 13 for explanations.
Overall, the MS-GARCH-t model is ranked best for long positions in both
indices whereas EGARCH specifications are best for short positions. However,
the actual difference in performance between both sets of models is not large in
economic terms. This means that the null hypothesis of the SPA test (the
benchmark model imposes minimum capital requirements) is strictly rejected
when the benchmark is one of the standard GARCH models. This result support
those found in earlier studies. Marcucci (2005) obtains similar results with the
S&P100 data, finding that MS-GARCH and EGARCH specifications perform
better than the GARCH specification.
5 Conclusions
It is usually found that the GARCH family of models is a good candidate for
estimating conditional VaR over short-term time horizons. In this paper, we
extend this analysis to take account of both skewness and regime changes in
forecasting VaR for long and short positions. The empirical study shows the MS-
GARCH models lead to considerable improvements in correctly forecasting one-
day-ahead VaR for long and short positions of the FTSE100 and S&P500 indices.
We use a novel combination of exceptions and regulatory-based backtesting
procedures to evaluate out-of-sample model performance. This indicates that the
MS-GARCH-t clearly outperforms other models in estimating the VaR for both
long and short positions of the FTSE returns data. The exceptions-based (LR and
MP) tests indicate that it is an acceptable VaR model and it imposes lower capital
requirements according to the regulatory-based test. By contrast, the MS-
GARCH-t and EGARCH-t models outperform others in the case of the S&P
returns data. Imposing lower capital requirements, on either short or long position,
they are acceptable VaR models for both long and short positions. Furthermore,
analogous to the findings of Giot and Laurent (2003), the LR test confirms that
assuming skewed Student’s t innovations improves on the out-of-sample
performance of the corresponding GARCH and MS-GARCH models with
Student’s t innovations.
Finally, we believe further research could endeavour to replicate these
results for different firm sizes (e. g. the FTSE 250, FTSE Small Cap and FTSE
All Share indices). This would be to investigate if smaller firms have greater
leverage effects due to the additional volatility imposed upon them as a
consequence of lower trading volume. This may shed further light on quantifying
the effects of skewness and regime change in forecasting VaR for long and short
positions.
References
Arghyrou, M., Gregoriou, A., 2007. Testing for Purchasing Power Parity
correcting for non normality using the wild bootstrap. Economics Letters,
95(2), 285-290.
Bond, S. A., 2000. Asymmetry and downside risk in foreign exchange markets.
Mimeo, Department of Land Economy, University of Cambridge,
Cambridge, UK.
Box, G. E. P., Jenkins, G. M., Reinsel, G. C., 1994. Time Series Analysis:
Forecasting and Control. Third edition, Prentice Hall.
Ding, Zhuanxin, Granger, C. W. J., Engle, R. F., 1993. A long memory property
of stock market returns and a new model. Journal of Empirical Finance, 1,
83-106.
Dowd, K., 2005. Measuring Market Risk. John Wiley & Sons Ltd.
Giot, P., Laurent, S., 2003. Value-at-Risk for long and short positions. Journal of
Applied Econometrics, 18, 641-664.
Glosten, L. R., Jaganathan, R., Runkle, D., 1993. On the relation between the
expected value and the volatility of the normal excess return on stocks.
Journal of Finance, 48, 1779-1801.
Haas, M., Mittnik, S., Paollela, M. S., 2004. A new approach in Markov-
switching GARCH models. Journal of Financial Econometrics, 2(4), 493-
530.
Hamilton, J. D., 1994. Time series analysis. Princeton University Press, Princeton,
New Jersey.
Hansen, B. E., 1996. Erratum: the Likelihood Ratio test under nonstandard
conditions: Testing the Markov switching model of GNP. Journal of
Applied Econometrics, 11, 195-198.
Hansen, P. R., 2005. A test for superior predictive ability. Journal of Business and
Economic Statistics, 23, 365-380.
Hannan, E. J., 1980. The estimation of the order of an ARMA process. The
Annals of Statistics, 8(5), 1071-1081.
Hurlin, C., Tokpavi, S., 2006. Backtesting VaR accuracy: A new simple test.
University of Orleans, Working Paper.
Lambert, P., Laurent, S., 2001. Modelling financial time series using GARCH-
type models and a skewed Student density. Mimeo, Universite de Liege.
Laurent, P., Peter, J-P., 2002. G@RCH 2.2: An Ox package for estimating and
forecasting various ARCH models, Journal of Economic Surreys, 16, 447-
485.
Lunde, A., Hansen, P. R., 2005. A forecast comparison of volatility models: does
anything beat a GARCH(1,1)?. Journal of Applied Econometrics, 20, 873-
889.
McLachlan, G. J., Peel, D., 2000. Finite Mixture Models. New York: John Wiley
& Sons.
Mikosch, T., Starica, C., 2004. Nonstationarities in financial time series, the long-
range dependence, and the IGARCH effects. Review of Econometrics and
Statistics, 86, 378-390.
Sarma, M., Thomas, S., Shah, A., 2003. Selection of Value-at-Risk models.
Journal of Forecasting, 22, 337-358.