The Risk Analysis and Modeling of Byco Petroleum in Pakistan Using Extreme Value Theory

The extreme value theory (EVT) has been used to model and measure the distribution of extreme minima of Byco Petroleum in the Pakistan stock market over the period from 2005 to 2012. This paper covers the investigation of distributions that are mostly used in ﬁnance including the generalized extreme value (GEV), generalized logistics (GL), and generalized Pareto (GPA) distribution. L-moment ratio diagram is being used to ﬁnd the appropriate distributions among the distributions. L-moment diagram depicts that GEV and GL distributions are suitable to represent the extremes of Byco Petroleum Pakistan Limited. Thereafter, the probability weighted moment (PWM) method has been used in order to estimate the parameters of probability distributions. Furthermore, Anderson–Darling (AD) goodness-of-ﬁt test is employed to test the goodness of ﬁt among GEV and GL distributions, and it is clear from the results that the GL distribution is more reliable and applicable for extreme minima of Byco Petroleum Company in the Pakistan stock exchange market. EVTand traditional methods are used for value-at-risk (VaR) analysis. The analysis indicates that EVT methods are more suitable for risk measurement in comparison with traditional methods.


Introduction
e insufficiency of risk management was very much obvious after the collapse of the stock market which occurred during the period of 1970s. ese crashes affected all the stock markets around the world, especially the oil price shock and fixed exchange rate system. Oil price shock negatively influenced the United States economy. e changes of these prices led to competitive position change in many industries, such as the automobile industry. ese events left no room for doubt that effective management of financial risk is inevitable. e crisis did not stop there, but these catastrophes continued in the form of worldwide crisis in 1987, the Mexican crisis in 1995, and the financial crisis of Asia in 1997 which increased the threat of worldwide economic collapse. Orange County, Barings Bank, and Long-Term Capital Management were also some of the notable scenarios of the crisis. By this time, financial crisis, its risks, and the poor financial management were evident. All these issues forced the regulators to address these problems at the earliest. In 1996, the Bank for International Settlements (BIS) amended the Capital Accord of 1988 to save from drastic price risk. is amendment also forced the qualitative standard to improve risk management techniques for all the financial markets. It presented the model of value at risk (VaR) for estimating the capital adequacy requirements (CARs). e basic purpose of the CAR was to cover the possible loss that results from market position and help to fight financial crisis (Aboura [1]). VaR helps to examine the loss at different confidence levels covering a specific time period. It can be termed as the lower quantile of distribution that does not cross the certain limit. According to Ahmed Ghorbel [2], there are three traditional methods that are used to calculate VaR: the variance-covariance (VC), the historical simulation (HS), and the Monte Carlo simulation (MCS). e VC method depends on the thought that financial returns have normal distribution, but other research studies set forth that they have fatter tails than normal distribution. e presumption is that the probability of loss is more than suggested by the normal distribution. Conversely, the HS method does not need to infer returns of distributions in order to find VaR. It only delineates that historical returns can be good to predicate future returns, but this method lags behind because of the insufficiency of data which can lead to unreliable results. MCS can be used to calculate VaR, but it needs more computing skills to bring about results.
Even though VaR is being widely used, still there is no concordance among professionals and scholars for which method can be best in calculating VaR. e common flaw of the traditional method is that it does not rightly manipulate lower tails of distribution. It is unable to give right results when we need them the most (Evis Kellezi [3]).
On the contrary, EVT is the branch of the statistics which deals with the extremes to asses a given random variable to find the probability in events which are more extreme than the last observation. EVT-based models are specially designed for extreme events. It provides the models with which we can represent the tails of distribution. e theory has been applied in many areas, for instance, engineering and hydrology, and after that, areas of finance and insurance showed increased interest in it. e application of EVT is not an old development but flourished instantaneously (Davison and Smith [4]). e pioneer to use EVT in finance was Parkinson [5]. He used extreme returns' data that were available to compare the variance of estimated returns. He postulated that the extreme value method can render much better evaluation of variance than the old methods. Another leading name in putting forward the EVT in finance was no doubt Longin. Longin examined the distribution of extremes in the stock market of USA [6]. He studied the extreme movement of the US stock market containing the daily observations of over hundred years . ese extreme variables depend on the returns distributed and the selected time period. Empirically, Longin and Felipe [6] posed that minima and maxima could be accurately characterized by the Frechet distribution. Frechet distribution is an important occurrence in GEV distribution. Michael Rockinger [7] gave the international extreme value theory application. ey used the data of stock markets for about 20 countries. ey evaluate congruence in right and left tails of returns, all across countries. ey used EVT to show that the left tail returns are the same as the right tail returns and concluded that this phenomenon is not the result of clustering of extremes. ey also established the point that GP gave good fit to the experimental dataset. Withal, they rationalized extreme value theory (EVT) for the estimation of VaR, but it is not as good as it could not figure out the S&P 500 index large negative daily returns of − 22.83% in October. e methods that are being used in this paper were also implied to European stock markets. Lux [8] manipulated the data sample of the DAX index covering the period from 1988 to 1994. He posed the thought that the tails of distribution are thinner using Pareto distribution. He evaluated the German stock market data by fitting the GEV distribution and concluded that Frechet distribution is good to fit in maximum occurrences. He also found the same results on DAX index returns implying intraday data.
Another important step was the application of EVT in estimating VaR. is has been tested and put forward by Pownall and Koedijk [9]. ey collected the dataset of Asian stock markets and made the comparison of VaR computation by normal distributions. ey also used the Risk-Metrics model of Morgan et al. [10] with evaluation originated by EVT. ey concluded that VaR estimated apparently better than other models and techniques.
In EVT, GEV and GP are thought to be the only techniques which can fit extremes (minima and maxima). In recent studies, Gettinby et al. [11] delineated the distribution of extremes. For this, they used the data of shares of an index of UK consisting of the time period from 1975 to 2000. ey examined the number of distributions: GEV, Frechet, GP, Weibull, and GL. ey examined and compared each distribution to find the best fit. ey evaluated the extremes of the UK index with each of the distributions separately. ey concluded that, empirically, GL fits the minima and maxima. GL gives better outcomes as compared to GEV.
Another work that seconds Gettinby et al.'s study was recently put forward by Tolikas and Brown [12]. ey also presented generalized logistics (which is not given attention as much as GEV and GP) when it comes to the EVT application. Despite, it gives more stable results for Athens Stock Exchange. ey gathered the data of ASE of the period from 1986 to 2001 and found the minimum daily returns applying the EVT method. Hence, empirically, they concluded GL distribution to be far better than GEV and GP, and GL has fatter tails than GEV. Similarly, the application of EVT in the field of biostatistics is given in [13][14][15].
In this paper, we mainly focused on the measurement of the extreme risk of Byco Petroleum Company in the Pakistan Stock Market. We collected the dataset of Byco Petroleum Pakistan covering the time span of 8 years from 2005 to 2012. We found the daily, weekly, and monthly logarithmic returns of chosen samples of that dataset. After finding logarithmic returns, two distributions were fitted to these returns. L-moment ratio diagram was the technique to identify the distribution of the extreme minimum daily returns. ese distributions were GEV and GL. AD goodness-of-fit test was used to test the goodness of fit. Findings of this paper indicate that GL is a better fit for these returns.

Extreme Value eory.
EVT is a branch of statistics which deals with the extreme observation. e main role of EVT is to empirically estimate and determine the extreme behavior of random variables. According to Longin and Felipe [6], extreme observations are either minimum or maximum of returns of certain intervals such as daily, weekly, and monthly. For a more clear understanding, let us assume variables Y 1 , Y 2 , . . ., Y n as the time series of the daily index and denote the extreme minima with . Now, the point here is to acquire the adequate probability distribution which will assess and explain the behavior of minimum X 1 , X 2 , . . ., X n/m . According to presupposition, returns are independent and identically distributed (iid). Gnedenko [16] based his study on this thought: he explained that the generalized extreme value distribution is a limiting distribution of extremes. GEV holds huge support theoretically, but there is also great backing to the thought that financial returns show serial correlation and heteroscedasticity. Kearns and Pagan [17] also underlay their study on iid assumption. ey used the Monte Carlo simulation (MCS) technique to present that violating the iid assumption exaggerates the shape parameter estimates. Leadbetter et al. [18] examined the extreme value theory, and findings of their paper showed that it can deal with weak data structures. Jondeau and Rockinger [19] gathered the data of DAX index daily returns covering the period of 1969 to 1998. ey analyzed the extremes of German stock market. ey also examined time series returns for autocorrelation and volatility. e finding of the paper showed no significant change in results, so the analysis can be performed without the iid assumption. GL and other distributions were also considered for empirical applications.
Logarithmic returns are divided into daily, weekly, and monthly intervals. Daily extremes are divided into 10 and 30 subperiods, weekly into 2, 4, and 10 subperiods, and monthly into 2 subperiods.
After dividing the extremes into subperiods of daily, weekly, and monthly, further job is to look for the suitable distributions for the data. ose distributions will be considered for analysis which can represent fat tail. GEV, GL, and GP were selected because of the influence of flood frequency analysis. ese distributions deal with the three parameters. ese three parameters are location (α), scale (ß), and shape (c). e first parameter location represents mean; the higher value of mean signifies the larger extremes. e second parameter scale corresponds to standard deviation. High value of standard deviation shows that the distribution of extremes is largely spread. e third parameter shape is the most important parameter of all because a higher value of the shape parameter is analogous to fatter tails. In the frame reference of finance, it means that high values of the shape parameter depict large negative returns or heightened probability of crash (Aparicio and Estrada [20]). L-moments are used to identify the suitable distribution of all the three chosen distributions. L-moment estimation technique was developed by Hosking [21]. is technique is widely used because one can identify the best fitted distribution and obtain more accurate parametric estimates as compared to conventional moment estimators.
In the next step, probability weighted moment method is used to estimate the parameter of distributions. In this approach, sample moments of GL and GEV are equated to estimate parameters. Hosking and Wallis [22] gave an easy description of PWM. Let X be the random variable having finite mean and F be the distribution function; then, PWM is useful while dealing with small samples of data because extremes are not high even in large samples. Hosking established the view that robust L-moment and PWM have a linear relationship, which is shown in four equations of L-moments.
e next step after estimating parameters is to know which distribution is the best fit. Anderson-Darling test is used for this purpose. Anderson-Darling test statistics are defined by where F(x) is the distribution function of X and n is the number of observations. At last, after properly modeling the distributions of extremes, lower quantiles have been put to use in order to calculate VaR estimates. As the length of minima is specified of certain time period, confidence level is adjusted to obtain the desired frequency of EVT-based VaR. Probability of extreme returns is denoted by p ext . It is presumed that collected daily returns will not exceed VaR. e probability of extreme returns is VaR is thought to be effective only when it provides accurate results, and to know its efficiency, these models are needed to be validated. Backtesting can be used for the validation of VaR models. VaR is rightly specified when the confidence level is equal to the violations of returns. For VaR validation, we employed Christoffersen [23] test.

Data Description
In this paper, we used the dataset consisting of 1920 daily logarithmic returns collected from the data stream. e data Mathematical Problems in Engineering were taken from the State Bank of Pakistan and cover the 8year period from January 3, 2005, to December 21, 2012.
In Table 1, the Byco Petroleum daily returns, negative skewness of − 1.07, and kurtosis 11.22 can be seen. It clearly demonstrates that a large number of daily returns deviate from normality.
is deviation from normality was also assessed by the Shapiro-Wilk test. It is clear that both subperiods deviated from normality, but the second subperiod was more volatile in comparison with the first subperiod.
In Table 2, μ is denoted as mean and σ is the standard deviation. Expected frequency represents large negative daily returns which exceeded the defined threshold. Total values are divided into two subperiods. It also contains the values expected on the normality assumption. In cluster row includes the values of daily returns preceding another daily return within two trading weeks.
Daily returns of Byco Petroleum are examined as shown in Table 2 in which the main focus is on the left tail distribution. In the table, μ is denoted as the overall mean, and σ is denoted as the overall standard deviation. Frequency of the negative daily returns exceeded all three thresholds: μ − 2σ, μ − 3σ, and μ − 4σ. e result of the table showed that all the daily returns which were above the limit of the threshold were also higher than assumed normality.
After finding values above the threshold, the next job was to look for the tendency to cluster of these returns with the passage of time. For this purpose, returns were computed with their lower and higher values of a certain time period of two weeks. It can be noticed that clustering heightened under the first threshold μ − 2σ. e table showed that the investors who will take this normal distribution as a base while investing in Pakistan Stock Market will not be able to accurately foresee the risk.
On the whole, descriptive statistics demonstrate that the distribution of daily returns is different in two subperiods examined.

Analysis of the Extremes in the Pakistan Stock Market
Byco Petroleum daily index covering the period of 8 years is collected to find the daily, weekly, and monthly minima. Table 1 shows the descriptive statistics of minima. It clearly illustrated that mean minima increased as we increased the interval, and all subperiods deviated from normality.

Identification of the Distribution of the Extreme Minimum
Daily Returns Using the L-Moment Diagram. L-moment diagram is used to determine the suitable distribution for extreme daily returns. Using the data of daily minima, L-skewness and L-kurtosis were calculated for 30 subperiods of daily returns. en, these calculations of L-skewness and L-kurtosis were plotted on the L-moment diagram. In the diagram, it can be seen that Ʈ3 and Ʈ4 were divided into the series of 30 subperiods for daily minima. One can clearly infer in the first glance that GL and GEV are only suitable distributions excluding other distributions because all the points of Ʈ3 and Ʈ4 lie around the curves of GL and GEV. Despite all, L-moment does not clearly determine which distribution is to be focused among GL and GEV. For this purpose, these two distributions need to be analyzed by the goodness-of-fit test. L-moment ratio diagram for the daily returns of Byco Petroleum Pakistan Limited indicates that the L-skewness and L-kurtosis are mainly concentrated around the theoretical curves of the generalized logistics and generalized extreme value distribution indicating that these two distributions are likely to adequately fit the empirical data.

Parameter Estimates and Goodness-of-Fit
Test. GL and GEV distributions were used to fit the whole sample and subperiods of daily, weekly, and monthly minima of Byco Petroleum Pakistan Limited. After that, using probability weighted moment (PWM), parameters of GL and GEV distributions were calculated for all the samples and subperiods of daily, weekly, and monthly minima. en, p values of AD goodness-of-fit test were found as shown in Tables 3-5. GL and GEV distributions were now fitted to the whole sample, in which the p value indicated that both distributions do not give adequate description, but when we divided the data into subperiods, both GEV and GL provided an adequate description. For example, when we used 10 subperiods, 7 subperiods showed adequate results; however, GL performed better than GEV. When we used 30 subperiods, GL provided better results in 27 subperiods, and the GEV provided adequate results in 24 subperiods. e GL gave better results as compared to GEV in 23 subperiods and GEV in 6 of them. e AD p value for the GEV distribution ranges from 0.000 to 0.774, and the p value for the GL distribution ranges from 0.002 to 0.940.
In the GEV distribution, the shape parameter sometimes takes positive values which signify Weibull distribution, whereas negative values go for Frechet. Previous studies [6] in the USA and [11] in the UK showed no change of sign in the shape parameter. In Figure 1, it can be noticed that the shape parameter fitted with GL and GEV was stable in the first half and greatly variable in the second half. e variability of the shape parameter in the second half can affect VaR estimates significantly.
In Table 3, the probability weighted moment estimates and Anderson-Darling p value of GEV and GL distributions for the daily minima returns of Byco Petroleum Pakistan Limited were calculated. N denotes the number of extreme observations in each period. Alpha denotes the location parameter, beta denotes the scale parameter, and gamma denotes the shape parameter. Figure 2 illustrates the behavior of shape parameter estimates of GL and GEV distributions for the daily minima divided over 30 subperiods.
In Figure 2, it can be noticed that the shape parameter fitted with GL and GEV was stable in the first half and greatly variable in the second half. e variability of the shape parameter in the second half can affect VaR estimates significantly. Higher values of the shape parameter can lead to      Mathematical Problems in Engineering higher estimation of the value at risk. e whole series of extremes and the series of subperiods were fitted to GL and GEV in order to evaluate the variability of extremes. Table 4 illustrates the findings of weekly minima of the whole period. It showed that GEV gave the adequate fit with the p value of 0.136. On the contrary, GL could not prove to be a good fit for the series of weekly minima of the whole period, but when the subperiods of weekly minima were fitted to GL and GEV, GL proved to be a better fit.
In Table 5, the PWM estimates and AD p value for GEV and GL distributions for the weekly minima returns of Byco Petroleum Pakistan were calculated. N denotes the number of extreme observations in each period. Alpha denotes the location parameter, beta denotes the scale parameter, and gamma denotes the shape parameter.
We examined the behavior of stock exchange extremes by fitting the whole data and subperiods of Byco Petroleum Pakistan Limited. e results of both GEV and GL distributions were presented in the tables with the AD p value. When fitted to the daily extremes, the GL distribution shows the better fit, while for the subperiods, it showed a mixed behavior; sometimes, it is GEV which showed the better fit, and sometimes, it is GL, but overall, the GL showed the better fit for the daily returns as well as for the weekly returns.
In a nutshell, all the examinations of extremes proved that GL is a better fit for the extremes of Byco Petroleum  daily returns. GL performed better than GEV in more subperiods, but in few cases, GEV went beyond GL in performance. It is also important to note that the behavior of extremes can change as the nature of the shape parameter changed with subperiods.

Estimating and Comparing VaR
EVT methods are evaluated in order to determine the efficiency of these methods to calculate VaR. Lower quantiles of extremes were used to produce VaR estimates of daily returns. To that end, parameter estimates of both GL and GEV distributions for daily extremes were put to use. Moving window method is employed on the grounds that the distribution of extremes has time-varying behavior. Traditional methods including HS, MCS, and VC were used to generate VaR estimates. For this purpose, 250 past daily returns were examined but for HS VaR estimates, 1000 historical returns were used because 250 returns are fewer to calculate VaR at higher confidence levels. e Christofferson p value is presented for all methods in parenthesis. e results mentioned in Table 6 throw spotlight on different facts. At first, it can be noticed that the VC method overestimated the risk at lower confidence levels, but at 99.75% and 99.90%, it predicted the VaR rightly where the number of violations is 20 and 16. Like the VC method, HS method also performed well at high confidence levels. However, with 1000 past daily returns, the HS method seemed to be forecasting accurately even at lower confidence levels. MCS, on the contrary, worked unsatisfying at all confidence levels. All traditional methods did not accurately measure the risk of extremes daily negative returns.
Furthermore, it can be noticed from the results of Table 5 that EVT methods based on the static approach performed badly in forecasting the risk. Only GL with the static approach worked well at the 99.90% confidence level. e reason behind this poor performance is the time-varying behavior of the distribution of extremes. So, we can see that the moving window approach eliminated this problem up to some extent. EVT methods with moving window produced accurate results at higher confidence levels. For instance, expected violations are 9, 4, and 2 at 99.50%, 99.75%, and 99.90% confidence levels, respectively. However, generalized logistics distribution with the static approach resulted in 29, 20, and 2 violations, and the outcome of GL with the MW approach is 13, 6, and 1 violations, respectively. e findings are summarized, which showed that GL and GEV performed better than HS, VC, and MCS. GL even surpasses GEV in some cases because GL has fatter tails and can evaluate larger extremes efficiently.

Conclusions
In this paper, daily returns of Byco Petroleum Pakistan Limited were investigated to determine which distribution is better able to model the risk. e whole sample and different subperiods of the daily minima were fitted by the GL and GEV distributions, and the PWM method was implied to estimate the parameters of these distributions. AD goodness-of-fit test p value and parameter estimates of both distributions were presented in the tables. e findings of this paper clearly indicate that GL can adequately explain the extreme minima of daily returns. is result is important in changing the perspective regarding the poor application of GL. In EVT applications, GP and GEV are thought to be only applicable distributions, but it is evident in the conclusion that the GL distribution is a good fit for lower tail data. It can also be seen that the division of extremes into subperiods changes the nature of extremes with the passage of time.
EVT-based VaR analysis indicated that EVT methods were adequate in the risk measurement of extreme returns with lower probabilities. Among traditional methods, HS was the only method that gave accurate estimates at higher confidence levels, but the HS method needs more data points for accurate estimation as compared to other methods employed in the study. It can be noticed that GL and GEV

Conflicts of Interest
e authors declare that they have no conflicts of interest.