Simulation Analysis of Threshold Autoregressive Unit Root Tests

Using numerical simulation, the finite-sample properties of threshold autoregressive TAR and momentum-threshold MTAR autoregressive-based unit root tests under both deterministic and consistent methods of threshold estimation are examined in the presence of generalised autoregressive conditional heteroskedasticity GARCH . Previous research is extended by considering both the impact of alternative robust methods of covariance matrix estimation and the behaviour of the secondary tests of asymmetry associated with the TAR and MTAR models. The results obtained reveal many interesting features, in particular the distortionary effects of consistent-threshold estimation. In summary, the findings presented indicate that caution should be exercised when interpreting the results of these frequently employed threshold-based testing methods.


Introduction
Following the work of Engle 1 , Bollerslev 2 , and Taylor 3 , the analysis of volatility has become a central feature of both empirical and theoretical finance.This in turn has led to the emergence of a large literature examining the properties and behaviour of unit root tests in the presence of volatility in its commonly considered form of generalised autoregressive conditional heteroskedasticity GARCH .The GARCH model has become a cornerstone of empirical research in economics and finance, receiving widespread application and proving to provide accurate forecasts see 4 .While some authors have considered joint estimation of unit root testing equations and GARCH 1,1 processes see, inter alia, 5-7 and references contained therein , others have examined the size properties of unit root tests in the presence of neglected GARCH behaviour see 8-10 .Following the emergence of a number of studies considering the relevance and implications of heavy-tailed distributions in economic and financial time series data see, inter alia, 11-14 , a further related literature has emerged examining the properties of unit root tests when applied to series with heavy-tailed disturbances see 15, 16 .In this paper, this latter literature is extended by considering the properties of the threshold autoregressive TAR and momentum-threshold autoregressive MTAR unit root tests of Enders and Granger 17 when applied to time series with GARCH 1,1 disturbances.This issue clearly merits attention given the noted prevalence of GARCH behaviour in economic and financial data.While Cook 9 provides an initial examination of the properties of the TAR and MTAR unit root tests in the presence of GARCH, the current study develops this work in two crucial ways.First, the impact of alternative methods of variance-covariance estimation is considered.An analysis of this is of clear importance as while the covariance matrix estimators of White 18 and Newey and West 19 are routinely adopted by econometricians and time series analysts as a general correction for problems encountered with heteroskedasticity, their impact in the presently considered circumstances is unknown.Second, the subsequent tests of the hypothesis of symmetry associated with the TAR and MTAR tests are examined.Again, this is an important issue as the results of these subsequent tests of symmetry are often reported by investigators using Fisher's F-distribution to reinforce their empirical analyses, despite the individual properties of the tests in these circumstances, and the applicability of the F-distribution, being unknown.The use of the F-distribution to provide critical values for the symmetry tests is apparent in the empirical analysis conducted in the seminal work of Enders and Granger 17 .More recent studies employing the F-distribution include, inter alia, Payne and Mohammadi 20 and Thompson 21 .Indeed, it will be seen that the use of the Fdistribution can generate misleading results for the symmetry tests, irrespective of whether GARCH is present.The present work addresses these issues using Monte Carlo simulation, considering the TAR and MTAR tests under both deterministic and consistent methods of threshold estimation.
This paper proceeds as follows.In the following section the TAR and MTAR tests are outlined.Using Monte Carlo methods, Section 3 considers the finite-sample distributions of the TAR and MTAR unit root tests under alternative methods of covariance matrix estimation and the empirical sizes of the symmetry tests under use of the F-distribution.The simulation analysis is extended in Section 4 to consider the behaviour of the TAR and MTAR unit root tests and the subsequent tests of symmetry, in the presence of GARCH.Section 5 concludes.

TAR and MTAR Unit Root Tests
Consider a time series process {y t } T t 0 .Given this series, the familiar Dickey-Fuller 22 DF test in its simplest form can be expressed as the t-ratio of φ in the following regression: 2.1 Comparison of the calculated test statistic with specifically derived nonstandard critical values allows examination of the unit root hypothesis H 0 : φ 0 against an alternative of asymptotic stationarity H 0 : φ < 0 .However it is apparent that 2.1 is an implicitly symmetric specification.To allow for the possibility of stationary asymmetric adjustment about an underlying attractor, Enders and Granger 17 draw upon the threshold autoregressive methods of Tong 23,24 .Adopting this approach, 2.1 is generalised via the introduction of the Heaviside indicator function to partition the lagged level term y t−1 with a resulting testing equation given as where the zero-one Heaviside indicator function I t can be specified as either or: The combination of 2.2 and 2.3 results in a TAR model, while combination of 2.2 and 2.4 leads to an MTAR model.The testing equation 2.2 therefore provides the basis of the asymmetric unit root tests of Enders and Granger 17 , with the speed of adjustment about the stationary attractor given by ρ 1 when above equilibrium and ρ 2 when below.The unit root hypothesis is then tested via the null H 0 : ρ 1 ρ 2 0, with the resulting test statistic denoted as Φ when using 2.3 and Φ * when using 2.4 .Due to the nonstandard distributions of the Φ and Φ * statistics, their use requires the derivation of appropriate critical values via Monte Carlo experimentation.To further examine potential asymmetric stationarity, investigators often conduct a further test of the symmetry hypothesis via the null ρ 1 ρ 2 using a conventional F-statistic.The appropriateness of the F-distribution when conducting this subsequent test of symmetry will be questioned later in this paper.
The above analysis adopts an implicit assumption of zero attractor y t 0 .However, in practice it is more realistic to consider a nonzero attractor.In these circumstances, it is possible to derive the required threshold either deterministically or via consistent-threshold estimation.Under the first option, a deterministic threshold is imposed via regressing the original series {y t } upon a constant to derive a new series { y t }.In this analysis, the interceptonly model is considered for the TAR and MTAR tests as it is this form alone which is available for the tests under consistent-threshold estimation.The derived series { y t } is then employed in a modified version of 2.2 as given below: In a similar manner, the specification of the Heaviside indicator function in 2.3 or 2.4 is modified by using the appropriate revised series { y t } rather than {y t }.The resulting TAR and MTAR tests are denoted as Φ μ and Φ * μ , respectively.Under consistent-threshold estimation, the above two-step procedure is replaced by an optimisation procedure to select the threshold.To illustrate this, consider the Φ μ and Φ * μ tests above.To construct these tests statistics, prior to regression upon a constant term effectively, this demeans the series of interest, implying that the version of 2.2 actually estimated is given as However, as noted by Enders 25 , in the presence of asymmetric adjustment ρ 1 and ρ 2 differ, and consequently the mean is a biased estimator of the threshold.To overcome this and obtain a consistent value of the threshold, the approach of Chan 26 is followed with a grid search procedure employed to select the threshold.The previously defined Heaviside indicator functions of 2.3 and 2.4 are then revised as follows: with the thresholds τ and τ Δ selected by grid search over a range of values of {y t } and {Δy t } respectively.The appropriate asymmetric unit root testing equations are therefore given as Application of the consistent-threshold versions of the TAR and MTAR tests therefore requires selection of the appropriate threshold value ψ .Considering TAR adjustment, the series of interest {y t } is reordered as

with the central 70% of observations y o
i , i 0.15T, . . ., 0.85T within this range of values considered as potential thresholds.Each of these values y o i τ is employed in turn in the indicator function of 2.7 with 2.9 then estimated.The value y o i delivering the minimum residual sum of squares η 2 t for 2.9 is then deemed to be the consistent estimate of the threshold τ with the resulting TAR asymmetric unit root test labelled Φ c μ .Under MTAR adjustment, a similar approach is followed with the central 70% of observations from the reordered sequence {Δy o 1 , Δy o 2 , . . ., Δy o T } now considered as potential thresholds.Again the selected threshold τ Δ is that value minimising the residual sum of squares of 2.9 with 2.8 now employed as the appropriate indicator function.The resultant MTAR test under consistent-threshold estimation is labelled as Φ * ,c μ .In the subsequent research of Cook and Manning 27 , the use of consistent-threshold estimation was shown to substantially increase the power of both TAR and MTAR asymmetric unit root tests in the presence of asymmetrically stationary processes.In the subsequent sections of this paper, the behaviour of the above tests will be examined in the presence of GARCH, with particular attention paid to the role of alternative variancecovariance matrix estimators and the secondary test of symmetry.

Derivation of Critical Values for the Asymmetric Unit Root Tests
Before considering the properties of the Φ μ , Φ * μ , Φ c μ , and Φ * ,c μ tests and their associated subsequent tests of symmetry in the presence of GARCH, critical values for these tests are derived under the use of alternative variance-covariance estimators.In addition to considering the standard OLS covariance estimator, the covariance matrix estimators of White 18 and Newey-West 19 are also employed for the four tests.The use of White and Newey-West covariance matrix estimators is denoted by the addition of the subscripts w and The reported results represent critical values for the TAR and MTAR tests under deterministic and consistent threshold estimation using alternative covariance matrix estimators.The results were derived over 50,000 simulations.
n, respectively, to the test statistic, while no additional subscript is employed when using the standard OLS covariance estimator.To obtain the required critical values, the following data generation process DGP is employed: where the innovation series {ε t } is generated using pseudo i.i.d.N 0, 1 random numbers from the NRND procedure in the EViews 6.The experiments are performed over 50,000 simulations for three sample sizes: T {200, 400, 800}.Critical values are reported in Table 1 for three levels of significance α 0.10, 0.05, 0.01 .The consideration of four tests, three sample sizes, three covariance matrix estimators, and three levels of significance results in the calculation of 4 × 3 × 3 × 3 108 critical values.For all of the four tests examined, the results presented in Table 1 show that finite-sample critical values are clearly dependent upon the covariance matrix estimator employed.Although the impact of alternative covariance estimators is negligible asymptotically, its finite-sample impact is apparent and is noticeable for even the relatively large sample of 800 observations considered in the present experiments.In short, it can be seen that movement from the OLS covariance matrix estimator to the White estimator and then Newey-West estimator leads to inflation of the critical values of all tests.

Empirical Sizes of the Symmetry Tests
To consider the finite-sample sizes of the subsequent associated tests of symmetry H 0 : ρ 1 ρ 2 associated with the Φ μ , Φ * μ , Φ c μ and Φ * ,c μ tests under alternative methods of covariance matrix estimation, the DGP of the previous subsection is employed but with 3.1 respecified as follows: This modification is introduced to allow consideration of stationary processes via |ρ| < 1, as the symmetry test is valid for stationary processes only.To examine the finite-sample empirical sizes of the symmetry tests associated with the Φ μ , Φ * μ , Φ c μ and Φ * ,c μ statistics, ρ 0.85 is employed in the following Monte Carlo analysis.The empirical rejection frequencies for the symmetry tests are derived at the 5% level of significance α 0.05 over 50,000 replications with the innovation series {ε t } again generated as a standard normal variable using the NRND procedure.
The empirical rejection frequencies for the symmetry tests are reported in Table 2. From inspection of this table, it is clear that there are a number of interesting findings to consider.First, under deterministic estimation of the threshold, the symmetry test for the TAR model is substantially undersized irrespective of the covariance matrix estimator employed.The reported results represent empirical rejection frequencies at the 5% level of significance of symmetry tests for the TAR and MTAR models under deterministic and consistent threshold estimation using alternative covariance matrix estimators.The results were derived over 50,000 simulations.
In contrast to this, the symmetry test for the MTAR model with a deterministic threshold has near nominal size.Second, under consistent-threshold estimation, the symmetry tests for both the TAR and MTAR models are dramatically oversized, with the former exhibiting greater size distortion than the latter.Third, movement from the standard OLS covariance matrix estimator to the White and the Newey-West estimator results in an increase in empirical size irrespective of the test and method of threshold estimation employed.

Empirical Sizes of Asymmetric Unit Root Tests in the Presence of GARCH
To examine the properties of the above-mentioned asymmetric unit root tests in the presence of GARCH 1,1 errors, the following DGP is employed:

4.1
This DGP closely follows that of Kim and Schmidt 8 and considers GARCH processes generated for a range of realistic values of {φ 1 , φ 2 } corresponding to near integration, with φ 0 1 − φ 1 − φ 2 in all cases.In this paper the less empirically realistic, or relevant, cases of degenerate GARCH φ 0 0 and integrated GARCH φ 1 φ 2 1 are not considered.The precise values of the volatility parameter φ 1 and the degree of persistence φ 1 φ 2 employed are based upon estimated values observed in empirical research, with The reported results represent empirical sizes of the test of the unit root hypothesis at the 5% level of significance for the TAR model under deterministic and consistent threshold estimation using alternative covariance matrix estimators.The results were derived over 20,000 simulations.
the distinction between typical values of {φ 1 , φ 2 } observed at differing data frequencies noted see 28, 29 .Throughout, the initial value of the conditional variance is set equal to one h 0 1 , with an additional, initial 400 observations of the generation of the GARCH process discarded prior to generation of the series of interest y t to remove the impact of the initial condition.The initial value of y t is set to zero y 0 0 without loss of generality.The innovation series {v t } is generated using pseudo i.i.d.N 0, 1 random numbers from the NRND procedure in EViews 6 with all experiments performed over 20,000 simulations for the previously considered sample sizes: T {200, 400, 800}.The empirical sizes of the alternative tests are derived as the rejection frequencies for the alternative tests at the 5% level of significance under the use of the OLS, White and Newey-West covariance estimators using the critical values derived in the previous section.
Considering the results presented in Table 3 for the TAR unit root test for the standard OLS covariance estimator, it can be seen that the presence of GARCH does result in size inflation, with this becoming more noticeable for larger values of the volatility parameter.From inspection of the results for {φ 1 , φ 2 } {0.15, 0.84} and {φ 1 , φ 2 } {0.25, 0.70} it is also apparent that the degree of near integration as well as the size of the volatility parameter influences size distortion, as empirical size is lower for the second pairing of parameters with a lower sum for the GARCH parameters, but a higher degree of volatility.While these results hold under both deterministic and consistent threshold estimation, it can be seen that greater size distortion results from the use of consistent-threshold estimation.Conversely, it is apparent that the use of either the White or Newey-West covariance covariance matrix estimators dramatically reduces oversizing for the tests to a near negligible level.The reported results represent empirical sizes of the test of the unit root hypothesis at the 5% level of significance for the MTAR model under deterministic and consistent threshold estimation using alternative covariance matrix estimators.The results were derived over 20,000 simulations.
From inspection of the results for the MTAR unit root test in Table 4, it can be seen that the behaviour of the MTAR test closely mimics that of the TAR test.However, it is also apparent that under consistent-threshold estimation and use of the standard OLS covariance matrix estimator, the oversizing of the MTAR test is less than that of TAR test for the identical values of the GARCH parameters.However, despite this, empirical sizes more than twice the nominal size are obtained.

Empirical Sizes of the Symmetry Tests in the Presence of GARCH
In Tables 5 and 6, the empirical sizes of the symmetry tests associated with the TAR and MTAR unit root tests are reported.These results are obtained using the following DGP: where ρ 0.85 in 4.2 .Aside from the consideration of stationary time series via the use of the AR 1 specification y t 0.85y t−1 w t , the Monte Carlo analysis conducted in this The reported results represent empirical sizes of the test of the symmetry hypothesis at the 5% level of significance for the TAR model under deterministic and consistent threshold estimation using alternative covariance matrix estimators.The results were derived over 20,000 simulations.
subsection follows that of the previous subsection in terms of generation and replication.While a range of results is reported in Tables 5 and 6, the findings can be summarised straightforwardly.
Considering the TAR model with a deterministically imposed threshold value, substantial undersizing is apparent.Conversely, when consistent-threshold estimation is employed, the resulting symmetry test is massively oversized, with the use of OLS covariance matrix inducing greater size inflation than the corrected covariance matrix estimators of White and Newey-West.Despite undersizing oversizing of the test with a deterministic consistent threshold being noted previously in the absence of GARCH, it is apparent that neglected GARCH behaviour does increase size distortion with size inflation being positively related to the degree of volatility as given by φ 1 .Turning to the results for the MTAR test, under the use of a deterministic threshold and corrected covariance matrix estimator, the test has relatively good size, particularly when the White covariance matrix is used.However, when the standard OLS covariance is employed, substantial size distortion is exhibited for volatile GARCH processes.Considering the symmetry test for the MTAR model under consistent threshold estimation, size distortion is apparent irrespective of the covariance matrix estimator employed, although the corrected covariance estimators, and the White covariance matrix estimator in particular, do reduce this.The results obtained therefore indicate that use of the preferred method of consistent-threshold estimation results in substantial oversizing of the symmetry test for both the TAR and MTAR models with all covariance matrix estimators, leading to a spurious detection of asymmetric behaviour.The reported results represent empirical sizes of the test of the symmetry hypothesis at the 5% level of significance for the MTAR model under deterministic and consistent threshold estimation using alternative covariance matrix estimators.The results were derived over 20,000 simulations.

Concluding Remarks
The popularity of threshold-based unit root tests as a means of detecting asymmetric stationarity in time series processes has increased in recent years, with their analysis and application now widespread.This paper has examined the practically important issue of the finite-sample size properties of these TAR and MTAR tests in the presence of GARCH behaviour, with particular attention paid to the use of alternative covariance matrix estimators and the subsequent test of the symmetry hypothesis.The results of the Monte Carlo analysis undertaken show the TAR and MTAR tests to be oversized when examining the unit root hypothesis in the presence of GARCH, unless a corrected covariance matrix estimator White or Newey-West is employed.This result is particularly apparent when considering more highly volatile GARCH processes.Interestingly, it was seen also that the use of consistent-threshold estimation increases the degree of size distortion exhibited.However, it was the examination of the symmetry hypothesis which provided the more dramatic and important results.The most crucial result concerned the substantial oversizing of the symmetry test for the TAR and MTAR models irrespective of both the covariance matrix estimator employed and the presence of GARCH.This finding indicated that a spurious inference of asymmetric adjustment could be noted with a high degree of probability when applying these tests and strongly rejected the use of the F-distribution for this test as routinely employed by practitioners see 17, 20, 21 .Under the use of a deterministic threshold it was found that the symmetry test under the TAR model was undersized in the absence of GARCH, while the test under the MTAR model had approximately correct size.
In the presence of GARCH behaviour, the most noticeable finding concerned the subsequent oversizing of the symmetry test for the MTAR model, although this was noted for the model using the standard OLS, rather than corrected, covariance matrix estimator.In summary, the results of the present paper indicate that the routine application of TAR and MTAR unit root tests to economic and financial time series can result in incorrect or spurious inferences.While the probability of a spurious inference concerning the unit root hypothesis is most likely to occur when a corrected covariance matrix estimator is employed in the examination of time series processes exhibiting GARCH behaviour, the symmetry test has been shown to exhibit spurious rejection under consistent-threshold estimation even in the absence of GARCH.The results obtained indicate that practitioners should exercise care when applying TAR and MTAR unit root tests to economic and financial time series, particularly when considering the secondary test of symmetry where use of the F-distribution is clearly inappropriate despite its routine use in empirical analyses.As result of the above findings, an obvious future avenue of research concerns the development of asymmetric unit root tests which capture and draw upon any GARCH behaviour exhibited by data.This work, which can be viewed as a development of the studies of Seo 5 and Cook 6 , is the subject of ongoing research.

Table 1 :
Critical values for asymmetric unit root tests.

Table 2 :
Empirical sizes of symmetry tests.

Table 3 :
TAR unit root testing in the presence of GARCH.

Table 4 :
MTAR unit root testing in the presence of GARCH.

Table 5 :
TAR symmetry testing in the presence of GARCH.

Table 6 :
MTAR symmetry testing in the presence of GARCH.