Modeling and Volatility Analysis of Oil Companies in Pakistan
Firdos Khan
Scientific officer, Climate Section, Global Change Impact
Studies Centre (GCISC), National Centre for Physics (NCP) Complex,
Quaid-i-Azam University Campus Islamabad, Pakistan.
fkyousafzai@gmail.com
Zahid Asghar
Assistant Professor at Quaid-i-Azam University, Islamabad.
g.zahid@gmail.com
ABSTRACT
We focus on modeling of conditional mean and conditional variance of the share prices of two
stocks, i.e. Shell Gas (LPG) Pakistan and Attock Petroleum (AP) from Karachi Stock Exchange by using
ARIMA and ARCH models. We have daily data for shell gas (LPG) Pakistan and Attock Petroleum (AP)
from 3rd of August 1998 to 31st of January 2007.
Firstly we have constructed ARIMA modeling. Unit root testing with different lag length criteria
has been performed. We find that there are more than one potential models which fit to this data. The
performances of these models are not different from each other. The best model for AP series is
ARIMA(1,1,0) and for LPG ARIMA((1,2),1,0) is the best model among the competing models.
Secondly we have measured the volatility of the stocks by ARCH family of models. Maximum
likelihood method of estimation is used to estimate the unknown parameters of the models. Different lag
length criteria like AIC, BIC etc are used to find the best possible model. The potential models are then
analyzed on the basis of their forecast performance. GARCH model is the best model for AP series and
EGARCH model is the best model for LPG series among the competing models. This study is an important
application of all the modern time series econometric tools. Moreover effort is made to remove a misnomer
(which our old econometricians and books show that true model is known) that there is only one model
which fits the data.
Keywords: ARIMA Models, ARCH Models, Unit Root test, AIC, BIC, ARCH-LM test, Volatility, MSE,
RMSE, MAE, TIC, TGARCH Model, LM-Test, GARCH model, EGARCH model, JGR-GARCH model,
P-Value, White noise, Stationarity.
1
Firdos Khan and Zahid Asghar
1. Introduction.
Recent development in financial time series requires the use of models and
techniques that are able to model the attitude of the investors not only toward expected
returns, but also towards risk or uncertainty. Therefore the modeling of uncertainty is
more important. To model the uncertainty of a series it requires the models to be capable
of dealing with the uncertainty or variance of the series. The most suitable models that
deal with such phenomena are the ARCH class of models. An important assumption of
the conventional econometrics is that the disturbance terms appearing in a regression
equation are homoskedastic i.e. the variance of the disturbance terms remains constant
over time. But in practical analysis this assumption dose not holds. In case of
heteroskedasticity the OLS estimators are unbiased, consistence but no longer efficient
and therefore they are not BLUE.
Before the introduction of the ARCH models the researchers thought that
heteroskedasticity is the problem of cross-sectional data. However, some researchers tried
to forecast financial time series like stocks prices, inflation rates, foreign exchange rates
etc, they observed that their forecast variable varies from one time period to another time
period. The forecast errors are small for some period and large for other period. This
variability could be due to volatility in financial market. This would suggest that the
variance of the forecast error is not constant, but varies from period to period. It means
that there is some kind of autocorrelation in the variance of forecast error. Since the
behavior of the forecast error can be assumed to depend on the behavior of the
(regression) disturbance. It can be make a case of autocorrelation in the variance of
(regression) disturbance. To capture this autocorrelation Engle (1982) introduced a model
called ARCH model.
Prior to the introduction of the ARCH model the researchers used different types
of model, which are based on the sample standard deviation to model the current
uncertainty and forecast the future uncertainty of the series. For example Fama (1965)
used RW model for this purpose.
According to random walk model the standard
deviation of a series for today is the estimated forecast for tomorrow and so on. The other
models, which were used by researchers, include the models like HISVOL models,
moving average models, EWMA models etc.
2
Modeling and Volatility Analysis of Oil Companies in Pakistan
There are several reasons that we may wish to model and forecast volatility of
the data. First we may need to analyze the risk of holding an asset. Second forecast
confidence interval may be time varying, so that more accurate interval can be obtained
by modeling the variance of the error terms. The third reason is that more efficient
estimates can be obtained if heteroskedasticity in the error terms is handle properly.
ARCH models are especially designed to model and forecast conditional variance. The
variance of the dependent variable is modeled as a function of the past value of the
dependent variable and independent or exogenous variables. ARCH model introduced by
Engle’s (1982) and generalized as GARCH by Bollerslev (1986) and Taylor (1986)
independently, are popular in financial time series and are widely used in econometrics
analysis.
The ARCH and GARCH model perform very well but these models only
consider the magnitude of the random shocks and do not utilize the direction of the
shocks. The non-negativity constraints on the parameters create difficulties in estimation,
and in some studies they are found negative. Another problem with the ARCH and
GARCH models is, whether a shock persists to the conditional variance, if yes then how
long it well last?
Keeping in view the drawbacks of the ARCH and GARCH models, Nelson
(1990) proposed a new model called EGARCH model. EGARCH model overcomes the
drawbacks of ARCH and GARCH models. EGARCH model captures the asymmetry of
the data. Asymmetry means positive and negative shocks of the same magnitude have not
the same effect on the conditional variance. Glosten et al (1993) proposed a model for
modeling the second moment of the financial time series, called GJR-GARCH model due
to Glosten, Jagannathan, and Runkle. GJR-GARCH model is also called TARCH model.
Engle and Ng (1993) introduced PNP model and made comparison of PNP and other
ARCH class models like GARCH, EGARCH, and GJR-GARCH. Except the GARCH
model the remaining models gives similar results. The Violation of constrain of nonnegativity of the parameters in ARCH and GARCH models were frequently found. To
overcome this problem
Geweke (1986) and Milhoj (1987 a) suggested log ARCH
model. The log-ARCH models have no need of non-negativity constraint. Bera and
3
Firdos Khan and Zahid Asghar
Higgins (1992) proposed non-linear ARCH model. This model includes linear ARCH
model as a special case and log ARCH as a limiting case.
A number of studies in the literature can be found in which the researchers
made comparison among different volatility models. Akgiray (1989) have found evidence
in favour of GARCH (1,1) model over other model like ARCH (2), EWAM and HISVOL
model. He found that GJR-GARCH is the best model in all the employed models.
Figlewski (1997) compared HISVOL model and GARCH (1,1) model using the data on
three stocks and one exchange rate. For some past data set GARCH model gives best
result and for some data set other models were superior. Lee (1991) compared different
types of ARCH models by using weekly data on different five exchange rates. He found
evidence in favour of non-linear ARCH models. McMillan et al (2000) used daily,
weekly and monthly data for two stocks of UK and compared different Volatility models.
They concluded that RW model is the best model if the crash of 1987 is included and all
models gives similar results if 1987 crash is excluded. Brooks (1998) used daily data and
made an interesting comparison among different volatility models. Brooks also
concluded that the performance of all of the employed models were similar if the 1987
crash is not included in the analysis.
The main objective of the study is to provide suitable model to the investors that
can be used by investors to predict or forecast the future expected share prices and make
investment accordingly. For this purpose we first do modeling of the chosen series by
assuming homoscedasticity i.e. ARIMA modeling. We also relaxed the assumption of
homoscedasticity and do ARCH modeling of the selected series. By using ARCH model
the investors can forecast expected future return as well as the uncertainty of the series,
so it is very helpful for investors to do investment or not. The rest of the paper is
distributed as, in section two we present methodology adopted in the study as well as
some models, which are employed in the study. In section three data analysis and results
are given, and section four is reserved for conclusion. At the end a list of references is
given.
4
Modeling and Volatility Analysis of Oil Companies in Pakistan
2. Methodology and models used in the study.
This section consists on methodology and models used in the study. In the first
step we performed unit root testing, because stationarity is one of the assumption of Box
and Jenkins (1976) methodology of ARIMA modeling. After achieving stationarity we
build ARIMA models to each series. ARIMA modeling consists of three steps. The first
step is to identify the order of the model, ACF’s and PACF’s. The next step is the
estimation of the unknown parameters of the identified models using OLS method if the
model is linear, otherwise non-linear methods can be used to estimate the unknown
parameters. Diagnostic checking of the modeling completes the ARIMA modeling.
Various diagnostic checks are available in the literature to check the adequacy of the
model. In current study we have applied two diagnostic checks, one is the serial
correlation LM test, and the other is Ljung and Box Q-Statistics. For both series we have
more than one models that perform similarly, therefore they are compared on the basis of
statistics like AIC, BIC, TIC, ME, MAE, RMSE etc.
In the second phase of the study we model the conditional mean as well as the conditional
variance of the given series simultaneously, which is called ARCH modeling. The
analysis is completely based on the residuals obtained from the fitted ARIMA model
built in the first phase. ARCH modeling consists of four steps. Identification of the order
of the model, testing for ARCH error, estimation of the model, and diagnostic checking
of the model. For identification purpose we have used ACF’s to identify the order of the
model. Tests are also available in the literature to test for ARCH error. We have used
ARCH-LM test of Engle (1982) at various lag length to test for ARCH error. ML method
is used to estimate the parameters of the model. It is assumed that the residuals from
ARIMA models are normally distributed. We can use OLS method to estimate the
parameters of the ARCH models, but the ML method is efficient than OLS method in
case of ARCH modeling (Engle, 1982). In the last step diagnostic (ARCH-LM) test is
used to test the adequacy of the estimated models. We also used Q-statistics to check the
adequacy of the conditional mean model, and Q-statistics of square residuals to check the
adequacy of the conditional variance model. Next we present the introduction of the
models used in the study. There are a number of ARCH models in literature, but we
confine ourselves to four main ARCH models, which are widely used in ARCH modeling
5
Firdos Khan and Zahid Asghar
literature. In this Study the parameters of ARIMA model is represented by English letters
and parameters of ARCH model is represented by Greek letters.
2.1 ARIMA Model
ARIMA model is the combination of AR process and MA process when the given
series is non stationary. Stationary is the main difference between ARMA model and
ARIMA model. In the former the order of integration is equal to zero, while in the later it
is greater than zero. Mathematically the general form of an ARIMA process can be
represented by the following equation
p
q
i =1
j =1
Δ d Yt = a0 + ∑ ai Δ d Yt −i + ∑ b j ε t − j + ε t
(2.1.1)
Where Δ d Yt show that original series is integrated of order d time. The Δ d Yt −i are the
previous values of the series of order p, and ε t − j are the previous random shocks of order
q. Where p is the order of AR terms and q is the order of MA process, and ai , for
i = 1, 2,3,..., p are the AR parameters and b j for j = 1, 2,3,..., q are the MA parameters
which are to be estimated. The error terms are assumed to be white noise. The estimation
of the identified ARIMA model is not straightforward, because the ordinary least square
method cannot be used to estimate an ARIMA model. Non-linear estimation algorithm
can be used to estimate an ARIMA model. An ARIMA model of order of integration d,
order of AR terms p, and order MA terms q, can be simply represented as
ARIMA ( p,d,q )
2.2 ARCH Models
Engle (1982) introduced ARCH model. ARCH model suggests that the variance
of residuals at time t depend on the square errors terms from the past. The ARCH model
can be defined as follow. Let Yt is the return of the stock, then the following model is
called ARCH (p) model.
p
q
i =1
j =1
Y t = a0 + ∑ aiY t −i + ∑ b j ε t − j + ε t
(2.2.1)
ε t = υt ht
(2.2.2)
6
Modeling and Volatility Analysis of Oil Companies in Pakistan
s
ht = α 0 + ∑ α iε 2t −i
(2.2.3)
i =1
Where p and q are the order of AR and MA process to yield an ARMA process. It is
assumed that the error terms have mean zero and no autocorrelation at any lag. To specify
an ARCH process it is assumed that the equation (2.2.2) can be decomposed in to two
parts, υt which is homoskedastic, with mean zero and σ υ2 = 1 , and ht is heteroskedastic
with variance given by equation (2.2.3)
The generalization of ARCH model is called GARCH model. Bollerslov (986)
generalized ARCH model as GARCH model by allowing the past conditional variances
in the conditional variance equation. The generalization was done in the similar way as
AR model was generalized to ARIMA model.
The GARCH model can be defined as follow,
Let Yt be the return of a particular stock, and suppose Yt follow an ARIMA process
p
q
i =1
j =1
Y t = a0 + ∑ aiY t −i + ∑ b j ε t − j + ε t
(2.2.4)
ε t = υt ht
(2.2.5)
s
r
i =1
j =1
ht = α 0 + ∑ α iε 2t −i + ∑ β j ht − j
(2.2.6)
Where p and q are the order of AR and MA process to yield an ARIMA model. It is
assumed that the error terms have mean zero and no autocorrelation. In equation (2.2.6) s
is the order of ARCH terms and r is the order of GARCH process. Also α i of order s are
the coefficients of ARCH terms and β j of order r are the coefficients of GARCH terms.
The above Model is denoted by GARCH (s, r) model. It is important to remember that the
GARCH model consist of two parts. One is an ARIMA equation and other is GARCH
equation. If r = 0, the GARCH model reduced to ARCH model.
Black (1976) has noted a negative correlation between current returns and future
volatility that tends to rise in response to bad NEWS and fall in response to good NEWS.
It means that the impact of good NEWS is not the same as that of bad NEWS. GARCH
model is symmetric and gives same weight to good and bad NEWS of the same
magnitude and ignore the sign of the NEWS. The GARCH model failed to interpret the
7
Firdos Khan and Zahid Asghar
persistence of the shock to the conditional variance. In GARCH model some restrictions
are imposed on the parameters to insure the positiveness of the conditional variance.
Nelson (1991) introduced a model known as EGARCH model that can be defined as
follow
Let Yt be the return of a particular stock, and suppose Yt follow an ARIMA process
p
q
i =1
j =1
Y t = a0 + ∑ aiY t −i + ∑ b j ε t − j + ε t
(2.2.7)
ε t = υt ht
(2.2.8)
s
ln ( ht ) = α 0 + ∑ α i
i =1
ε t −i
ht −i
w
+ ∑ λj
j =1
εt− j
ht − j
r
+ ∑ β k ln ( ht − k )
(2.2.9)
k =1
Where p and q are the order of AR and MA process respectively to yield an
ARIMA (p,d,q) model. Where ai are AR parameters of order p and b j are MA
parameters of order q which are to be estimated. It is assumed that the error terms have
mean zero and no autocorrelation at any lag. To specify an ARCH process it is assumed
that equation (2.2.8) can be decomposed in to two parts, υt which is homoskedastic, with
mean zero and σ υ2 = 1 , and ht is heteroskedastic with variance given by equation (2.2.9).
In equation (2.2.9) s, r, and w are the order of ARCH terms, GARCH terms and
asymmetric terms respectively. Where a0 , ai b j , i = 1, 2,3,... p and j = 1, 2,3,...q are the
parameters of the mean equation. and α 0 , α i , β j , and λk for i = 1, 2,3,...s ,
j = 1, 2,3,...w , and k = 1, 2,3,...r are the parameters for the variance equation which are to
be estimated.
The GJR-GARCH model is a member of ARCH family. This model is called
GJR-GARCH due to the Lawrence R. Glosten, Ravi Jagannathan and David E. Rankle
proposed it in (1993) is also celled TGARCH model. This model was independently
introduced by Zakoin in (1990). The major drawback of ARCH and GARCH models is
the fact that they are symmetric. These models only consider the magnitude of the error
term and ignore the sign. Therefore in ARCH and GARCH models a big negative shock
will have exactly the same effect on volatility of the series as a big positive shock of the
same magnitude and its looking plausible due to their symmetry. Therefore, there was
8
Modeling and Volatility Analysis of Oil Companies in Pakistan
need of models which take in to account the asymmetry of the time series data especially
financial time series data. Because the sign play important role in volatility analysis.
Despite the advantages EGARCH model the empirical estimation of the model is
technically difficult as it involves highly non-linear algorithms. In contrast the GJRGARCH model is much simpler than EGARCH, though it not as elegant as EGARCH
model. The general GJR-GARCH model is specified as follow
p
q
i =1
j =1
Y t = a0 + ∑ aiY t −i + ∑ b j ε t − j + ε t
(2.2.10)
ε t = υt ht
(2.2.11)
s
r
w
i =1
j =1
i =1
ht = α 0 + ∑ α iε 2t −i + ∑ β j ht − j + ∑ γ iδ iε 2t −i
(2.2.12)
⎧1 if ε t −i < 0
⎩0 if ε t −i ≥ 0
δt = ⎨
Where Yt the returns, p and q are the order of AR and MA process respectively to yield an
ARIMA process. It is assumed that the error terms have average value equal to zero and
no autocorrelation at any lag. To specify an ARCH process it is assumed that equation
(2.2.11) can be decomposed in to two parts, υt which is homoskedastic, with zero mean
and σ υ2 = 1 , and ht is heteroskedastic with variance given by equation (2.2.12). So, γ i
catches the asymmetry in response of volatility to the shocks in a way that impose a prior
belief that for a positive shock and a negative shock of the same magnitude, future
volatility is always higher, or at least the same, when the sign of the shock is negative.
3. Data analysis and results
The first step in time series analysis is to check the series for unit root. So in the
first step we have used Augmented Dickey – Fuller (ADF) (1979) test to examine the
possibility of unit root in the series. We run the ADF regression. Estimation of ADF
equation would be quite simple by using standard OLS method.
p
Δ ln yt = ρ ln yt −1 + ∑ φ j Δ ln yt − j + υt
(3.1)
j =1
In ADF test our null hypothesis is “there is a unit root”. The null hypothesis is tested and
the results are given in the following table for the selected series.
9
Firdos Khan and Zahid Asghar
Table 3.1: Results of ADF test.
Test statistic
Variables
5% critical value
First
Levels
difference
First
Levels
difference
AP
1.701370(37)
-2.632981(37)
-1.9403
-1.9403
LPG
0.241245(66)
-4.353701(66)
-1.9395
-1.9395
Note: Figures in parenthesis represent the number of lags that are included in ADF test.
At level we accept the null hypothesis of unit root at 5% level of significance in both time
series. At first difference we reject the null hypothesis of unit root at 5% level of
significance, and conclude that the given series are stationary at first difference. The
selected series are integrated of order one denoted as I(1). In further analysis we will use
these integrated series.
To identify the order of the ARIMA model we used ACF,s and PACF,s of the
given time series. We already determined the order of d in ARIMA model, which is equal
to one. Now we want to determine the order of AR terms and MA terms. From the
correlogram we observed that the tentative ARIMA model will be any combination of
AR (1) AR(2) AR(3) MA (1) MA (2) MA (3) MA (4) for AP series. However, the final
three competing models are ARI (1), MA (1), and AR (1) AR (3) MA (3) MA (4) for AP
series. For LPG series from the correlogram it is infered that the final model will be any
combination of the AR(1) AR(2) MA(1) MA(2) terms. Four models i.e. AR(1) AR(2),
AR(1) MA(1), AR(1) MA(2) and MA(1) MA(2) are the final competing models for LPG
series. The final estimation results for competing models for both series are given in the
following table.
10
Modeling and Volatility Analysis of Oil Companies in Pakistan
Table: 3.2 Estimation results of ARIMA modelling for AP and LPG
AP
LPG
S.No
Model
Coefficients
Std. Error
P-Values
Coefficients
Std. Error
P-Values
0.2548
0.0453
0.0000
0.0801
0.0241
0.0009
AR(2)
-------
-------
-------
0.0241
0.0003
AR(1)
0.2897
0.0450
0.0000
0.0352
0.0000
MA(1)
-------
-------
-------
0.0396
0.0000
AR(1)
0.1882
0.0223
0.0000
0.0242
0.0010
-0.8671
0.0236
0.0000
-------
-------
-------
-------
-------
0.0242
0.0002
0.8701
0.0210
0.0000
-------
-------
-------
0.1604
0.0287
0.0000
-------
-------
-------
-------
-------
-------
0.0777
0.0241
0.0013
-------
-------
-------
0.0904
0.0241
0.0002
AR(1)
1
2
AR(3)
3
MA(2)
MA(3)
MA(4)
MA(1)
4
MA(2)
0.0882
0.8462
-0.8288
0.0794
------0.0888
From the table it is clear that all the parameters are significant at 5% and 1%
significance level. Next we checked the adequacy of the model by using the diagnostic
checking test like LM-test and Ljung-Box Q-statistics. All the models mentioned in the
table qualify both tests. So, these all models i.e. four for LPG series and three for AP
series are the suitable models. However, these competing models are compared to judge
the performance of each of these competing models. The forecast performance is also
evaluated in each case. The statistics used for this purpose are, AIC, BIC, SSR, LL, SER,
ME, MAE, RMSE and TIC. These statistics are useful to choose the best possible model
among the competing models that is widely used by researchers in time series analysis
and econometrics analysis. The following table gives summary of the mentioned statistics
for the competing models in each case.
11
Firdos Khan and Zahid Asghar
Table: 3.3 Comparison and forecast evaluation of competing models for AP and LPG
AP
LPG
Statistic
AR (1)
MA (1)
AR (1) AR
(3) MA (3)
MA (4)
AR(1)
AR(2)
AR(1)
MA(1)
AR(1)
MA(2)
MA(1)
MA(2)
AIC
-4.5001
-4.4936
-4.5203*
-5.1338
-5.1751*
-5.1344
-5.1344
BIC
-4.4911*
-4.4845
-4.4836
-5.1275*
-5.1687
-5.1280
-5.1280
SSR
0.2914
0.2939
0.2805*
0.5894
0.5659*
0.5894
0.5898
1021.20*
1014.314
1016.531
4432.554
4434.46*
4399.604
4402.174
SER
0.0255
0.0256
0.0251*
0.0187*
0.0187*
0.0187*
0.0187*
ME
0.0006
0.0003*
0.00032
0.0002*
0.0003
0.00032
0.00046
RMSE
0.0263
0.0265
0.0291*
0.0053*
0.0053*
0.0053*
0.0053*
MAE
0.0201*
0.0203
0.0216
16.4136
16.4040
16.4040
16.3944*
TIC
0.5146*
0.8900
0.7518
0.8500*
0.8900
0.9100
0.8600
LL
Note: “*” indicate that the statistic choose the model among the competing models.
From the above table we can conclude that AIC choose AR(1) AR(3) MA(3) MA(4)
model for AP series, but BIC which choose more parsimonious model than AIC chooses
AR(1) model for AP series. Log likelihood also going in favour of AR(1) model. MAE
and TIC also choose AR(1) model as the best model for AP series. Therefore, the final
ARIMA model for AP series ARIMA(1,1,0). On the other hand for LPG series among 9
statistics in the above table 5 choose ARIMA((1,2),1,0) as the best model for LPG series.
After ARIMA modeling we used ARCH model to model the conditional
mean and conditional variance of the series simultaneously. ARCH-LM test is used to
test for ARCH error at different lag length. The number of lags included in the ARCHLM test is the order of the ARCH model. It is necessary to apply the formal ARCH test to
confirm the order of the ARCH process, for this purpose we have used the Lagrange
multiplier test of Engle (1982), which is called ARCH-LM test, especially design to test
for ARCH error. ARCH-LM test is distributed as χ 2 with p degree of freedom under the
12
Modeling and Volatility Analysis of Oil Companies in Pakistan
null hypothesis to be true, where p is the number of lags, which are included in the test
for testing ARCH error. Our null hypothesis is
H 0 = There is no ARCH error in the series. Vs H A = There is ARCH error in the series.
In our analysis the value of the ARCH-LM test is 58.27565; the lags included in the test
are only 2. The corresponding P-Value is 0.00000, which is very low. So we have no
difficulty to reject the null hypothesis of no ARCH error, and conclude that there is an
ARCH error in the series. This confirms that the order of the ARCH error is two for AP
series. We have also estimated other ARCH models i.e. GARCH model, EGARCH
model and GJR-GARCH model for both series. The estimation results are given in the
following table.
Table: 3.4 Estimation Results of various ARCH models for AP.
Coefficient
ARCH
GARCH
EGARCH
GJR-GARCH
Mean Equation
Constant
a1
______
______
______
______
0.210767
0.207782
0.227461
0.217172
(0.0000)
(0.0000)
(0.0000)
(0.0000)
Variance Equation
α0
0.000339
0.000108
-1.979132
0.000110
(0.0000)
(0.0075)
(0.0009)
(0.0057)
α1
0.276070
0.222069
0.409465
0.178956
(0.0048)
(0.0059)
(0.0001)
(0.0386)
α2
0.212106
______
______
______
(0.0067)
______
______
0.606761
______
0.777937
______
0.590732
______
(0.0000)
(0.0000)
(0.0000)
______
______
-0.064140
-0.134311
______
______
(0.3007)
(0.2466)
β1
λ1
Note: Numbers in the parenthesis are the P-Values.
Table 3.4 is the estimation results of various ARCH models. AR(1) is the mean equation
in all ARCH models. ARCH (1), GARCH(1,1), EGARCH(1,1) and GJR-GARCH(1,1)
are the variance equation in the four estimated ARCH models.
13
Firdos Khan and Zahid Asghar
Table: 3.5 Estimation Results of various ARCH models for LPG.
Coefficient
ARCH
GARCH
EGARCH
GJR-GARCH
Mean Equation
Constant
______
______
Trend
a1
a2
a3
b1
b3
-0.004238
______
______
(-11.0970**)
______
______
______
0.00000329
______
______
______
(8.898950**)
______
______
______
0.74339
______
______
______
(16.84051**)
0.100110
(3.433833**)
______
______
______
0.110905
______
(3.319231**)
-0.321423
______
-0.349591
______
-0.361832
(-2.256154*)
(-1.925883)
(-2.42033*)
______
______
______
-0.68886
______
0.098
(-15.07004**)
______
______
______
______
0.392585
(2.894435**)
______
______
0.413
______
(2.884995**)
(2.437575*)
______
0.42416
(2.761010**)
______
b2
______
______
______
Variance Equation
α0
α1
α2
β1
λ1
0.000204
0.000115
-4.028927
0.00012
(188.2402**)
(14.97888**)
(-18.11205**)
(13.60119**)
0.221432
0.257928
0.569548
0.424892
(7.230734**)
(7.967976**)
(15.32145**)
(6.366758**)
0.224575
______
______
______
(10.32825**)
______
______
0.423238
______
0.541754
______
0.04312
______
(11.57114**)
(21.23085**)
(9.321583**)
______
______
0.235790
-0.30854
______
______
(7.696042**)
(-4.682034**)
Note: “**” and “* “denote that the parameters are significant at 1%and 5% significance level respectively.
Table 3.5 is the estimation results of four employed ARCH model for LPG series.
The situation is quite different here as compare to the estimation results of AP series.
Mean equation in ARCH models is not same in all estimated ARCH models. AR(2)
14
Modeling and Volatility Analysis of Oil Companies in Pakistan
AR(3), AR(3) MA(2) MA(3), AR(1) MA(1) and AR(2) AR(3) MA(3) are the mean
equations for ARCH, GARCH, EGARCH and GJR-GARCH models respectively. The
order of ARCH model is 2 for LPG. The asymmetric coefficient is significant in both
asymmetric models, which show that the response of good and bad NEWS is not
symmetric. After estimating ARCH models we have used ARCH-LM test to see whether
there is any further ARCH error in the series? We have found that there is no evidence of
any further ARCH in both AP and LPG series. The ARCH-LM values are 0.3358,
0.1736, 0.1980 and 0.1067 for ARCH, GARCH, EGARCH and GJR-GARCH models
respectively and P-value is greater than 0.85 in each case for AP series. This proves that
there is no ARCH error remains in AP series. Similar results are obtained for LPG series.
As we have estimated four ARCH models, but we want to know that, which model
performs better. To judge the performance of these models we compared these models
and also evaluate their forecast performance by using some statistics. The comparison
and forecast evaluation results are given in the following table for the estimated models.
Table: 3.6 Comparison and forecast evaluation of various ARCH models for AP and LPG
AP
Statistic
AIC
BIC
SSR
LL
SER
ME
RMSE
MAE
TIC
LPG
AR CH
GAR CH
EGARCH
GJRGARCH
AR CH
GAR CH
EGARCH
GJRGARCH
-4.58352
-4.59270
-4.58888
-4.59319*
-5.436465
-5.438305
-5.459839*
-5.446338
-4.54699
-4.55618*
-4.54322
-4.54753
-5.417371
-5.419212
-5.434405*
-5.424063
0.291984
0.292070
0.291606*
0.291818
0.556017
0.553432*
0.579262
0.553863
1035.293
1037.358
1037.500*
1037.469
4656.895
4658.470
4684.352*
4666.343
0.02558*
0.02559
0.02559
0.02560
0.018059
0.018016*
0.018432
0.018029
0.002299
0.002300
0.002295*
0.002298
0.004299
0.002500
0.002255*
0.002278
0.026277
0.026270*
0.026271
0.026275
0.018352
0.018440
0.018857
0.018348*
0.020144
0.020135*
0.020140
0.020142
0.005522
0.005224*
0.006812
0.005533
0.770943
0.731367*
0.768558
0.760030
0.937972
0.940851
0.904861*
0.929894
Note: “*” indicate that the statistic choose the model among the competing models.
15
Firdos Khan and Zahid Asghar
Table 3.6 is the comparison and forecast evaluation results for four estimated ARCH
models for AP and LPG series. From table we conclude that GARCH model perform
better than any other model for AP. Four out of nine statistics choose GARCH model
among four estimated ARCH models. For LPG, EGARCH model is the best model. It is
obvious from the table that out of nine statistics five choose EGARCH model. The
smaller values of AIC and BIC indicate best model among the competing models. These
both statistics choose EGARCH model for LPG. Log likelihood is maximum for
EGARCH model and show that it is the best model. ME and TIC are minimum for
EGARCH model showing that EGARCH model is the best model among the four
competing ARCH model used for LPG data set.
4. Summary and conclusion
After achieving stationarity by using unit root testing, we have used ARIMA
modeling to model the conditional mean of both AP and LPG series. Various models
have been used for both series to choose the best model. Three different ARIMA models
are the final competing models for AP series and four different ARIMA models are the
competing models for LPG series. These all models are good and can be used for
forecasting, however, we made comparison and evaluate the forecast performance of
these models to choose a single model among the competing models. ARIMA(1,1,0) is
the final model for AP series and ARIMA((1,2),1,0) is the final model for LPG series.
After ARIMA modeling we used ARCH models to model the conditional mean as well as
the conditional variance simultaneously. For this purpose first we used ARCH-LM test to
test whether there is any ARCH error. The test results show that there is an ARCH error
in both series. Four different ARCH models viz., ARCH, GARCH, EGARCH and GJRGARCH models are used for both series. After estimation of ARCH models we again use
ARCH-LM test to test whether there is any further ARCH error in both series. The test
results show that there is no further ARCH error in both series after estimation of ARCH
models. We also compared the estimated ARCH models for both series and check which
model is the best model among the competing models. After comparison we conclude
that GARCH model is the best model for AP series and EGARCH model is the best
model for LPG series. The asymmetric parameter is insignificant in AP series and
significant in LPG series. It is also important to note that ARCH models performed better
16
Modeling and Volatility Analysis of Oil Companies in Pakistan
than ARIMA model in each case. For ARIMA model in case of LPG series minimum
value of AIC is -5.1751 and BIC is -5.1275, while for ARCH models minimum value of
AIC is -5.4598 and BIC is -5.4344. These statistics clearly support ARCH models instead
of ARIMA models. It is easy to look at table 3.3 and table 3.6 to judge the performance
of ARCH and ARIMA models. Almost all statistics support ARCH models. Similar
situation is observed for AP series.
References
Akgiray, V. (1989): “Conditional Heteroskedasticity in Time Series of Stock Returns.
Evidence and Forecast,” Journal of Business. 69, 55-80.
Bera, A. K. and Higgins, M. L. (1992): “A Test for Conditional Heteroskedasticity in
Time Series Model,” Journal of Time Series Analysis, 13, 501-519.
Black, F. (1976): “Studies in Stock Price Volatility Changes, Proceeding of the 1976
Business Meeting of the Business and Economic Section,” American Statistical
Association, 177-181.
Bollerslev, T. (1986): "Generalized Autoregressive Conditional Heteroskedasticity,”
Journal of Econometrics, 31, 307-327.
Box, G. E. P., and Jenkins, G. C. (1976): “Time series analysis, forecasting and
control,” San Francisco: Holden day.
Brooks, C. (1998): “Predicting Stock Market Volatility,” Journal of Forecasting 17: 1,
59-80.
Dickey, D. A., and Fuller, A. W. (1979): “Distribution of the Estimates for
Autoregressive Time Series With Unit Root,” Journal of the American Statistical
Association 74, 427-431.
17
Firdos Khan and Zahid Asghar
Engle, R. F. (1982): "Autoregressive Conditional Heteroskedasticity with Estimates of
the Variance of U.K. Inflation," Econometrica, 50, 987-1008.
Engle, R. F., and Ng, V. K. (1993): “Measuring and Testing the Impact of News on
Volatility,” The Journal of Finance. Vol. 48, 5, 1749-1778.
Fama, E. F., (1965): “The Behavior of Stock Market Prices,” Journal of Business 38, 34-
105.
Figlewski, S. (1997): “Forecasting Volatility in Finan. Markets,” Inst. Instruments, NYU,
Salomon Center, 6:1 1-88.
Geweke, J. (1986): “Modelling the persistence of conditional variance. Comments,”
Econometric Reviews, 5 57-61.
Glosten, L. R., Jagannathan, R., and Runkle, D. E. (1993): “On the Relation Between
the Expected Value and the Volatility of the Nominal excess Returns on Stocks,” Journal
of Finance 48, 1779-1801.
Lee, K. Y. (1991): “Are the GARCH Models Best in out-of-sample Performance,”
Economic Letters 37:3, 305-308.
McMillan, D., Speight, A H., and Owain, A. G. (2000): “Forecasting UK Stock Market
Volatility,” Journal of Applied Econometrics, 10, 435-448.
Milhoj, A. (1987 a): “A Multiplicative Parameterization of ARCH Model,” Research
Report, No 101, Institute of Statistics, University of Copenhagen.
Nelson, D. B. (1991): “Conditional Heteroskedasticity in Asset Returns. A new
Approach,” Econometrica 59, 347-370.
Taylor, S. J. (1986): “Modelling Financial Time Series,” (Wiley, New York, NY).
18
Modeling and Volatility Analysis of Oil Companies in Pakistan
Zakoian, J-M. (1990): “Threshold Heteroskedastic Model,” Unpublished manuscript
(INSEE, Paris).
19
Firdos Khan and Zahid Asghar
GLOSSARY
Asymmetric autoregressive conditional heteroskedastic
AARCH
Augmented Dickey-Fuller
ADF
AIC
Akaike information criteria
AR
Autoregressive
ARCH
Autoregressive conditional heteroskedasticity
ARCH-LM
Autoregressive conditional heteroskedastic langrage
multiplier (Test)
ARCH-M
Autoregressive conditional heteroskedastic in mean
ARIMA
Autoregressive integrated moving average
ARMA
Autoregressive moving average
BIC
Bayesian information criteria
Best Linear Unbiased Estimate
BLUE
DF
Dickey-Fuller
DGP
Data generating process
EGARCH
Exponential Generalize autoregressive conditional
heteroskedastic
ES
Exponential smoothing
EWMA
Exponential weighted moving average
FARIMA
Fractional autoregressive integrated moving average
FTSE
Financial Times Stock Exchange
GARCH
Generalize autoregressive conditional heteroskedastic
GARCH-M
Generalize autoregressive conditional heteroskedastic in
mean
20
Modeling and Volatility Analysis of Oil Companies in Pakistan
GJR-GARCH
Glosten, Jaghanathan and Runkle-Generalize
autoregressive conditional heteroskedastic
HISVOL
Historical volatility model
IGARCH
Integrated generalize autoregressive conditional
heteroskedastic
ISD
Implied standard deviation
LL
Log Likelihood
LM
Langrage multiplier
MA
Moving average
MAE
Mean absolute error
MAPE
Mean absolute percent error
ME
Mean error
MSE
Mean square error
NARCH
Non-linear autoregressive conditional heteroskedastic
NEWS
North East West South
PNP
Partially non-parametric
QGARCH
Quadratic generalize autoregressive conditional
heteroskedastic
RMSE
Root mean square error
RSS
Residuals sum of square
RW
Random walk
SER
Standard Error of Regression
SV
Stochastic volatility
TARCH
Threshold autoregressive conditional heteroskedastic
TIC
Theil’s Inequality Coefficient
21