The spurious regression phenomenon in least squares occurs for a wide range
of data generating processes, such as driftless unit roots, unit roots with drift,
long memory, trend and broken-trend stationarity. Indeed, spurious regressions
have played a fundamental role in the building of modern time series econometrics
and have revolutionized many of the procedures used in applied macroeconomics. Spin-offs from this research range from unit-root tests to cointegration and error-correction models. This paper provides an overview of results about spurious
regression, pulled from disperse sources, and explains their implications.
1. Introduction
During the last 30 years econometric theory has undergone a revolution. In the late seventies, economists and econometricians recognized that insufficient attention was being paid to trending mechanisms and that, in fact, most macroeconomic variables were probably nonstationary. Such an appraisal gave rise to an extraordinary development that substantially modified the way empirical studies in time-series econometrics are carried out. Research in nonstationarity has advanced significantly since the early important papers, such as Granger and Newbold [1], Davidson et al. [2], Hendry and Mizon [3], Plosser and Schwert [4], Bhattacharya et al. [5], and Phillips [6]. Nelson and Plosser [7] asserted that many relevant U.S. macroeconomic time series were governed by a unit root (a random trending mechanism), based on Dickey and Fuller’s [8] Unit-root test. Several years later, Perron [9] argued that the trending mechanism in macro variables was deterministic in nature (with some transcendent structural breaks). The debate continues between “unit rooters” and “deterministic trenders”, though there is very general consensus as to the presence of a trending mechanism in the levels of most macroeconomic series. In the words of Durlauf and Phillips [10]: “Traditional Analyses of Economic time series frequently rely on the assumption that the time series in question are stationary, ergodic processes […]. However, the assumptions of the traditional theory do not provide much solace to the empirical worker. Even casual examination of such time series as GNP reveals that the series do not possess constant means.”
Econometrics should work hand-in-hand with economic theory by providing it with the tools it requires to understand economic activity. The modeling of such mechanisms is thus a major goal of time series econometrics (that said, it should be acknowledged that neither unit roots nor deterministic trends are able to model satisfactorily most series; they are what Phillips [11] labeled “heroically naive” concepts). Spurious regression can be considered as having played a fundamental role in this development. To understand it, we paraphrase Granger et al. [12], who provide an illuminating definition (Phillips [6] showed analytically that, when regressing two independent stochastic trends, the estimates of the regression coefficient do not converge to their real value of zero (particularly noteworthy among the various spin-offs of the aforementioned studies are the Error Correction Model (first proposed by Sargan [13] as a link between static equilibrium economic theory and dynamic empirical models, and further developed by Hendry and Anderson [14], Davidson et al. [2], Hendry and Mizon [3], and Hendry et al. [15]) and the entire theory of cointegration (first proposed by Granger [16], Granger and Weiss [17], and Engle and Granger [18]. The linking of Cointegration with Error Correction Models can also be found in these articles). Spurious regression has contributed to the general improvement in the level of empirical work). Phillips’s [6] results are detailed in a simple case in Appendix A): “A spurious regression occurs when a pair of independent series, but with strong temporal properties, is found apparently to be related according to standard inference in a Least Squares regression.”
Phillips [19] presented a counterargument on the usefulness of spurious regression: trend specifications are just coordinate systems for representing behavior over time. Phillips argues that even if the series are statistically independent, when they include a trending mechanism in their DGP, they admit a regression representation (even in the absence of cointegration). This is in sharp contrast to the usual concept of spurious regression. We usually conceive this phenomenon as the statistical identification of a commonality of trending mechanisms. Phillips ventures that such results—the “spurious”—constitute an adequate representation of the data. His main result applies for regressions among stochastically trended series on time polynomials as well as regressions among independent random walks. Phillips [19] proves that Brownian motions can be represented by deterministic functions of time with random coefficients. Given that standardized discrete time series with a unit root (hereinafter UR) converge weakly to Brownian motion processes, it is argued that deterministic time functions may be used to model them. Such representations include polynomial trends, trend breaks as well as sinusoidal trends; it is also proved that a stochastic process can represent an arbitrary deterministic function on a particular interval, so a regression of a UR process on an independent UR process is thus also a valid representation of the data. In both cases, the t-statistics diverge at rate T1/2, which is consistent since such parameterization reflects a partial—though correct—specification of the DGP. One of the most significant conclusions of Phillips concerns the long-standing debate of UR versus Trend Stationarity. To quote Phillips: “[…] Our results show that such specifications (Trend stationary processes) are not, in fact, really alternatives to a UR model at all. Since the UR processes have limiting representations entirely in terms of these functions (deterministic), it is apparent that we can mistakenly “reject” a UR model in favor of a trend “alternative” when in fact that alternative model is nothing other than an alternate representation of the UR process itself.”
This perspective has the virtue of allowing variables with different trending mechanism (deterministic or stochastic) to be related without being limited to the somewhat restrictive case of cointegration. Phillips advances this as an appropriate approach to study stochastically unbalanced relationships such as the ones that may arise between variables such as interest rates, inflation, money stock and GDP (for further detail, see Phillips [20]).
To the best of our knowledge, little has been done in the way of bringing the most important works in this field together, treating them in any kind of standardized way, making connections between them, and of any real study of the profound implications for economics they might indicate. This article aims to rectify this situation.
2. Appraisal of the Spurious Phenomenon
Much progress has been made with Least Squares statistical inference since it was first proposed more than two centuries ago as a means of estimating the course of comets [21]. Theoretical developments in econometrics address the nonexperimental nature of economic data sets. Least Squares (LS) offers a trade-off between simplicity and powerful inference. Nevertheless, LS has certain limitations, such as potential confusion between correlation and causality, and used unwisely may produce misleading evidence. Statisticians and econometricians had been aware of the "spurious phenomenon" since Yule [22], and Pearson [23], (for excellent reviews of these works, see Hendry and Morgan [24], and Aldrich [25]). These results led to the common expertise in the time-series field that indicated the need to differentiate potentially nonstationary series when using these to run regressions or detrending these by fitting trend lines estimated with LS. See Morgan [26].
There are many examples of spurious regression. Some of these are commented on in Phillips [19], where we discover the implausible relationship between “the number of ordained ministers and the rate of alcoholism in Great Britain in the nineteenth century”; the equally “remarkable relationship” presented in Yule [27] concerning the “proportion of Church of England marriages to all marriages and the mortality rate over the period 1866–1911”; the “strange relationship” between price level and cumulative rainfall in the UK, which was advanced as a curious alternative version of quantitative theory by Hendry [28]. Plosser and Schwert [4] presented another example of nonsense correlation when they proposed their quantity theory of sunspots. The main argument is that the log of nominal income can be explained by way of the log of accumulative sunspots. Not only did they find statistically significant estimates, but the goodness of fit, measured with the R2, is quite high: 0.82 (a variant of this example, used by the authors to demonstrate the danger of nonsense correlation was taken seriously 100 years ago, by Jevons [29]). Granger and Newbold [1] computed a Monte Carlo Experiment where a number of regressions, specified as (1), were run using simulated variables, each perfectly independent of the others:
yt=α+βxt+ut,
where t=1,…,T, being T the sample size. The variables xt and yt are independent. Under standard regularity conditions, LS delivers no evidence of a linear relationship between y and x. In particular, β̂ should be statistically equal to zero. Nevertheless, Granger and Newbold’s [1], they were generated as I(1) processes, (I(d) notation, for d an integer, refers to the number of differences to be performed so the variable becomes stationary). usually referred to as random walks, that is, UR processes, but found estimated parameters statistically different from zero, with its associated t-ratio tβ̂=β̂/σ̂β̂, unusually high (we could technically refer to this as Type II spurious regression. This distinction was first proposed by Chiarella and Gao (2002) as an analogy of the classical type I and type II errors in statistics: (1) Type I spurious regression: rejection of a true relationship, and, (2) Type II spurious regression: acceptance of a false relationship. Most of the literature refers exclusively to Type II spurious regression, which we will simply refer to as spurious regression). Phillips [6] provided a theoretical framework that explained the causes of the phenomenon of spurious regression. In short, it is fair to say that standard LS inference can only be drawn when the variables are stationary. Even with stationary but highly persistent variables, spurious regression can occur when the standard errors used in the t-ratio are inconsistent. Ferson et al. [30] provide examples in financial economics. One extremely important exception to this is the case of cointegration. Even if the series are stochastically trending, when the trend is common to both series the LS regression then works particularly well in the sense that estimates converge in probability to their true value at a rate faster than T, but have a nonstandard distribution [31].
3. Data Generating Processes
Research on spurious regression has been making use of increasingly complex Data Generating Processes (DGPs). Table 1 provides a summary of those appearing in this survey: where uwt are independent innovations obeying in Phillips [6, Assumption 1], ϵwt is an i.i.d white noise with mean zero and variance σϵ2, and DUiwt, DTiwt are dummy variables allowing changes in the trend's level and slope respectively, that is, DUiwt=1(t>Tbiw) and DTiwt=(t-Tbiw)1(t>Tbiw), where 1(·) is the indicator function, and Tbiw is the unknown date of the ith break in w. We denote the break fraction as λiw=(Tbiw/T)∈(0,1), where T is the sample size; d∈(-1/2,3/2). Only DGPs 1, 2 and 10 (for d<0.5) satisfy the weak stationarity definition. The remaining DGPs generate nonstationary series (notice that DGP 6 is a special case of DGP 10; nevertheless, both will be treated as if they were different, given that each has been considered independently in the literature). FI processes deserve further discussion; contrary to regular ARMA(p,q) processes—made popular by Box and Jenkins in the 1970s—such as DGP 1, whose autocorrelation function decays at an exponential rate (short memory), FI(d) processes have an autocorrelation function that decays at a hyperbolic rate (long memory).
The DGPs for wt=xt,yt,zt. Note: TS,br and dr stand for trend-stationarity, breaks, and drift, respectively.
#
Name
Model
1
MA(qw) or
wt=∑i=1qwΘiwϵwt-i or
AR(pw)
wt=∑i=1pwϕwiwt-i+ϵwt (stationary)
2
I(0)
wt=μw+uwt
3
I(0)+br
wt=μw+∑i=1NwθiwDUiwt+uwt
4
TS
wt=μw+βwt+uwt
5
TS+br
wt=μw+∑i=1NwθiwDUiwt+βwt+∑i=1MwγiwDTiwt+uwt
6
I(1)
Δwt=uwt
7
I(1)+dr
Δwt=μw+uwt
8
I(1)+dr+br
Δwt=μw+∑i=1NwθiwDUiwt+uwt
9
I(k)
Δkwt=uwt for k=2,3,…
10
FI(d)
(1-L)dwt=uwt for d∈(0,1/2)
11
FI(1+d)
Δwt=uwt with (1-L)duwt=ϵwt for d∈(0,3/2)
There are a number of empirical examples in time series in which dependence falls slowly across time. This phenomenon, known as Long Memory or Long-Range dependence, was observed in geophysical data, such as river flow data [32] and in climatological series [33], as well as in economic time series [34]. In three important papers [35–37], the authors extended these processes to provide more flexible low-frequency or Long Memory behavior by considering I(d) processes with noninteger values of d. As pointed out by Granger and Joyeux [36], “It was standard procedure to consider differencing time series to achieve stationarity”—thus obtaining a form of the series that can be identified as an ARMA model—however, “Some econometricians were reluctant to this technique, believing that they may be losing information, by zapping out the low frequency components.” But using infinite variance series without differencing them was also a source of difficulties at that time. Fractional integration encompasses ARMA models for d=0, and ARIMA models for d=1. The process is stationary and ergodic when d∈(-1/2,1/2); nonstationary but mean-reverting (that is, it returns to its equilibrium or long-run behavior after any random shock). when 1/2<d<1, and nonstationary and mean averting when d≥1.
Mean stationary processes (DGPs 2 and 3) have been used to model the behavior of real exchange rates, unemployment rates, current account, and several great ratios, such as the output-capital ratio and the consumption-income ratio. Unemployment has also been conceived as a nonstationary fractionally integrated process [38]. Some examples can be found in Perron and Vogelsang [39], Wu [40], Wu et al. [41], and D’Adda and Scorcu [42]. Trend Stationarity and I(2) processes (DGPs 4 and 9, resp.) have been used to model growing variables, real and nominal, such as output, consumption and prices; several macro variables have been conceived as DGPs 5, 6, and 7 [9, 43–45]. Variables identified as I(2) processes can also be found in Juselius [46, 47], Haldrup [48], Muscatelli and Spinelli [49], Coenen and Vega [50], and Nielsen [51]. In Table 11 of Appendix B, a few more examples are provided, in which the link between time-series econometrics and important economic issues is acknowledged.
4. Spurious Regression Since the Roaring Twenties
We now begin our survey of the development of the theory of spurious regression. The related literature is vast, for which reason we focus mainly on a limited selection of articles which, in our view, are particularly representative (as noted earlier, Pearson [23], and Yule [22] became aware of several problems concerning the interpretation of the correlation coefficient at the end of the nineteen century, forging by the way the concept of spurious correlation. The debate, in that time, was greatly related with the interpretation of correlation as causality although the seeds of the modern interpretation that prevails in econometrics are also present; further details can be obtained from Aldrich’s [25] historical review). Unless otherwise specified, the regression specification for which all asymptotics are presented is hereinafter expression (1). We focus mainly on on the rate of divergence of the relevant t-ratios and let aside in most cases the asymptotic distributions that would be obtained had the statistics being correctly normalized. This is so because such distributions do have nuisance parameters that prevent one from making use of these to do correct inference; moreover, practitioners cannot be aware a priori of the adequate normalization. However, in some cases [52–54] the asymptotic distributions are important because of the relevance for practitioners (Valkanov suggests a T1/2 normalization of the t-ratio and the simulation of the distribution in the context of Long-Horizon regressions; Sun proposes a consistent t-statistic with a nuisance-free asymptotic distribution, and; Kim, Lee and Newbold precisely emphasize the fact that, even if the t-ratios of their regression do not diverge, inference can not be drawn because of the nuisance parameters).
4.1. Yule's Experiment
Spurious correlation was evidenced by Yule [27] in a computerless Monte Carlo experiment. Shuffling decks of playing cards, Yule obtained independent series of random numbers. In fact, Yule generated independent I(0), I(1), and I(2) series and computed correlation coefficients amongst them. Such correlation coefficients provided correct inference when using I(0) series, but became nonsensical when the order of integration of the variables was higher. With independent I(0) variables the correlation coefficient remained close to zero. The same estimate achieved using I(1) independent variables no longer worked; many times it was close to unity, resulting in what we now refer to as a spurious correlation. If variables were I(2), the most probable outcome was actually a correlation coefficient of close to 1: the spurious phenomenon was even stronger.
4.2. Reappraisal in the Seventies: Spurious Least Squares
Throughout most of the past century, it was commonly recommended to first-difference the series if these seemed to have a trending mechanism. However, not every econometrician was in agreement with such a method because, it was argued, differencing causes losses in the information contained in the original series. A profound reappraisal of this issue began with Granger and Newbold’s [1] article, which allowed the spurious regression in Least squares estimators to be identified. The Monte Carlo experiment described above revealed, among other things, that high R2 and low Durbin-Watson statistics (hereinafter 𝒟𝒲) should be considered as a sign of misspecification. They also pointed out, comparing the outcome of simulation with many results in applied econometrics, that problems of misspecification seemed to be widespread. It was proposed that first-differenced series be used, although the authors warned about the risks of catch-all solutions; their results may be considered as the seed of many fruitful extensions in time series econometrics.
4.3. Theory at Last: Asymptotics in Nonsensical Regressions
Phillips [6] proposed the theoretical framework necessary to allow an understanding of Granger and Newbold's earlier results and provided a first insight into the phenomenon of spurious regression. His development set the groundwork for the spurious regression literature in econometrics. Whilst Granger and Newbold used an i.i.d. noise in their simulations, Phillips allowed a flexible autocorrelation structure, as well as some degree of heterogeneity. He then proved that, when specification (1) is estimated the following asymptotics in Table 2 are obtained: where Op(m) denotes the order of magnitude and →p means convergence in probability. The most relevant results are that the R2 does not collapse, and, the t-statistic associated to β̂ diverges at a rate T1/2. This means that the t-statistic will exceed the classical 5%-level critical values (-1.96 and 1.96) as the sample size grows and, with a sufficiently large sample, the null hypothesis of the t-statistic (ℋ0:β=0) will be rejected with certainty.
Orders in probability: variables yt and xt both independently generated by DGP 6.
β̂=Op(1)
α̂=Op(T1/2)
R2=Op(1)
tβ̂=Op(T1/2)
tα̂=Op(T1/2)
𝒟𝒲→p0
Park and Phillips [55] demonstrated the presence of spurious regression when the independent variables are I(2). Marmol [56] proved that the phenomenon of spurious regression occurs in a more general nonstationary framework; he demonstrated that spurious regression occurs when independent series xt,yt are integrated of order d for d∈𝒩={1,2,…} (see DGP 9). Empirically relevant DGPs, such as the I(2) process (identified, as mentioned earlier, with uncontrolled inflation) can produce spurious regression. The asymptotic results are somewhat similar to those of Phillips (see Table 3): Although the parameter estimate of the constant term, α̂, diverges at a different rate, the t-statistic associated with β̂ diverges at the same rate as in phillips seminal work, that is, Op(T1/2).
Orders in probability: variables yt and xt both independently generated by DGP 9. Both variables are integrated of the same order.
β̂=Op(1)
α̂=Op(T1/2-d)
R2=Op(1)
tβ̂=Op(T1/2)
tα̂=Op(T1/2)
𝒟𝒲=Op(T-2)
One limitation of Marmol's [56] results is that both the dependent and the independent variables share the same order of integration. Banerjee et al. [57], provided Monte Carlo evidence of spurious regression when the order of integration of each variable is different. Marmol [58], later allowed the relevant DGPs for both y and x to be integrated of different orders; yt~I(d1) and xt~I(d2) where d1 and d2∈N={1,2,…}. Marmol’s [58] results are twofold; either d1>d2 or vice versa:
when d1>d2 (Table 4):
when d1<d2 (Table 5):
in both cases R2=Op(1).
Orders in probability: variables yt and xt both independently generated by DGP 9. Variables are integrated of different orders d1>d2.
β̂=Op(Td1-d2)
α̂=Op(Td1-1/2)
tβ̂=Op(T1/2)
tα̂=Op(T1/2)
𝒟𝒲=Op(T-1) if d2=1
𝒟𝒲=Op(T-2) if d2=2,3,…
Orders in probability: variables yt and xt both independently generated by DGP 9. Variables are integrated of different orders. d1>d2.
β̂=Op(Td1-d2)
α̂=Op(Td1-1/2)
tβ̂=Op(T1/2)
tα̂=Op(T1/2)
𝒟𝒲=Op(T-1) if d1=1
𝒟𝒲=Op(T-2) if d1=2,3,…
Indeed, spurious regression persists in the presence of the so-called unbalanced regressions. The rate of divergence of tβ̂ remains T1/2. De Jong [59] extended the study of spurious regression using independent driftless unit root processes; he used DGP (6) to generate the series and ran the following specification, which operates with logarithmic transformation of both variables—a direction which has proved extremely relevant for empirical purposes: (applying the logarithmic transformation in econometrics is typical when the practitioner wants either to homogenize the variance or to directly obtain estimates of average elasticity amongst variables):
ln(yt)=α+βln(xt)+ut
de Jong discovered similar results to previous works, that is, β̂=Op(1) and tβ̂=Op(T1/2).
Entorf [60] made a slight-and yet, fundamental-modification to the Phillips' DGPs by adding a drift (DGP 7) (Entorf's results apply to fixed-effects models where N (the dimension of the cross section) is treated as fixed. The phenomenon of spurious regression in panel data under more general conditions has been further developed; Phillips and Moon [61] provided a regression limit theory for non stationary panel data with large numbers of cross section and time series. Kao [62] studied the Least-Squares Dummy Variable estimator (LSDV) where the spurious regression phenomenon is still present for independent nonstationary variables generated by DGPs (6) and (7). Amongst the most relevant consequences of such drift is the fact that there is not only a stochastic trend but also a deterministic one. In the long run, the deterministic trend dominates the stochastic (see Appendix C). The asymptotic results of estimating (1) using independent variables generated by DGP 7 are given in Table 6: Note that tβ̂ grows at rate T instead of T1/2, contrary to the results presented so far, due to the presence of a deterministic trend.
Orders in probability: variables yt and xt both independently generated by DGP 7.
tβ̂=Op(T)
tα̂=Op(T1/2)
(1-R2)=Op(T-1)
𝒟𝒲=Op(T-1)
4.4. Spurious Regression and Long Memory: An Unforgettable Extension
Among the first papers to deal with spurious regression in econometrics (Bhattacharya et al. [5] proved that the R/S test yields spurious inference concerning long memory when the data have a deterministic trend mechanism. See Section 5). using long memory processes are those of Cappuccio and Lubian [63], and Marmol [64]. The authors use the nonstationary fractionally-integrated processes specified in DGP 11. Under these conditions, the asymptotics of an LS regression as specified in expression (1) are given in Table 7 (here the underlying theory is an invariance principle (see Davydov, 1970, Theorem 2, and Sowell, 1990, Theorem 1).
Orders in Probability: variables yt and xt both independently generated by DGP 11. The variables are integrated of the same order.
β̂=Op(1)
tβ̂=Op(T1/2)
R2=Op(1)
𝒟𝒲=Op(T-(1+2d))
Note that as most of the previous cases the t-statistic of β̂ diverges at rate T1/2. The main difference lies in the 𝒟𝒲, the rate of divergence of which varies according to the degree of long memory, measured by d. These results can be understood as an argument against the usefulness of the “rule-of-thumb”; a regression was usually considered spurious when R2>DW. However, when long memory is present in the variables, the regression may well be spurious even if R2<DW. It is worth mentioning that the fractional integration parameter, d, is the same in both variables. This “shortcoming” is fixed by Tsay and Chung [65], who used a manifold approach. They used two DGPs (No 10 and 11) and then combined these in a simple regression. Thus, there are four fractionally-integrated processes, two stationary, [FI(d1) and FI(d2) with di∈(0,1/2)], and two nonstationary, [FI(di+1) for di∈(1/2,3/2)]. The authors then studied several specifications using six DGPs. The first four are used in the estimation of specification (1) whilst the remaining two estimate yt=α+δt+ut:
yt~FI(1+d1) and xt~FI(1+d2), (the variables may be integrated of order d∈(1/2,3/2) which generalizes Phillips’s [6] results),
yt~FI(d1) and xt~FI(d2) where d1+d2>1/2,
yt~FI(1+d1) and xt~FI(d2) where d2>1/2,
yt~FI(d1) and xt~FI(1+d2) where d1>1/2,
yt~FI(1+d1) where d1>0,
yt~FI(d1) where d1>0.
We can summarize the results by stating that tβ̂=Op(Tr) where 0<r<1 which means that the t-ratio always diverges; the divergence rate, r, is, T1/2, Td1+d2-1/2, Td2, Td1, T1/2, and Td1 for cases (1)–( 6) .
Special attention should be given to the Durbin-Watson statistic, which collapses to zero (as usual) except in the second case, where 𝒟𝒲=Op(1) and hence does not converge to zero as it does elsewhere. This also could be interpreted as an important indication of the need for caution when using the rule-of-thumb, which states that when R2>DW there may be spurious regression. As in Cappuccio and Lubian [63], there is evidence that the rule-of-thumb may be a dangerous tool. Another important result appears when we compare model 1 and model 2. The reader may notice that the variables used in the second model are merely the first differences of those used in the first. What is surprising is that the spurious regression persists after the variables have been differenced ( the same argument was advanced by Marmol [64]. It should be mentioned that the alternative strategy, “detrending” (removing the trend component in the data by running a regression on time) was also known to provide spurious results [10, 66]). This goes against fifty years of tradition (differencing is used to deal with spurious regression (see Dickey and Pantula [67], e.g.). Tsay and Chung [65] actually go further by suggesting that the spurious phenomenon is due to the long memory properties of the series and not to the presence of unit roots (see also Cappuccio and Lubian [63]). Sun [68] shows that spurious regression can occur between two stationary generalized fractional processes, as long as their generalized fractional differencing parameters sum up to a value greater than 1/2 and their spectral densities have poles at the same location.
4.5. Spurious Regression With Stationary Series: Size-Matters
Granger et al. [12] show, both through Monte Carlo experiments and theoretically that the spurious regression phenomenon may occur amongst independent stationary series (see also Mikosch and Stǎricǎ [69]). Two independent AR(1) series (see DGP 2 with Pw=1) were run together to estimate regression (1), resulting in a rejection rate of the t-statistic greater than the expected 5%. Granger et al. [12] provide a theoretical proof of why this happens (although the variance of the estimates does not diverge, it may not be unity, depending on the values of the DGPs parameters) (DGP innovations may be drawn from distributions other than normal (Cauchy, Exponential, Laplace and so on), although these remain i.i.d. and independent of each other).
These results are extended to long-span MA processes (see DGP 1). Curiously, the spurious regression does not depend on the sample size, but rather on the MA span parameter, qw. What is interesting about this result is the fact that it better explains the spurious regression phenomenon in small-sized samples. This would complete the theoretical framework necessary to allow an understanding of the Monte Carlo results of Granger and Newbold [1], Ferson et al. [30].
Granger et al. [12] results differ from all others, the effect being discussed is not an asymptotic phenomenon but rather a size distortion. Size distortions arise because standardizing a test statistic is difficult unless the exact form of the spectral density of the residuals is known. The intuition behind the spurious regression phenomenon is thus, different from the one that underlies all other results.
Mikosch and de Vries [70] provide an alternative theory to explain spurious-type behavior akin to the financial risk measurement literature (Furman and Zitikis [71, Page 7] suggest that “the validity of the CAPM (Capital Asset Price Model) is closely related to the linearity of the regression function of the return on asset i given the return on the market portfolio of all assets”. Nonlinearities such as mean or variance shifts may induce the spurious regression phenomenon; this implies that the CAPM beta regression estimations should be performed cautiously). In the words of Mikosch and de Vries [70]:
”Estimators of the coefficients in equations of regression type which involve financial data are often found to vary considerably across different samples. This observation pertains to finance models like the CAPM beta regression, the forward premium and the yield curve regression. In economics, macro models like the monetary model of the foreign exchange rate often yield regression coefficients which significantly deviate from the unitary coefficient on money which is based on the theoretical assumption that money is neutral.”
Mikosch and de Vries [70] prove that, when the distribution of the innovations is heavy tailed, that is, when there is a departure from the normality assumption usually made, using standard statistical tools (such as LS, e.g.) can be misleading (the non-normal behavior of financial data has been widely documented (see Fama [72]).
4.6. The Last Newcomer: Trend Stationarity
Hasseler [73] studied the spurious regression phenomenon from a different perspective. He considered the possibility of spurious regression when the variables do have a deterministic trend component (what we have defined earlier as trend stationarity [DGP 4]). Thus, there are two independent nonstationary variables. Hasseler’s [73] results, as with those of Kim et al. [52], who worked with the same DGPs, are as in Table 8.
Orders in Probability: variables yt and xt both independently generated by DGP 4.
tβ̂=Op(T3/2)
β=Op(1)
T2(1-R2)→p0
In addition, Kim et al. [52] proved that when one of the deterministic trend components is taken out (i.e., either βy=0 and βx≠0 or vice versa), the t-ratios converge to centered normal distribution, although its variance is not unity, which may provoke spurious rejection of the null hypothesis that the parameter is zero. That means that running a trend stationary variable (DGP 4) on a mean stationary variable (DGP 2) or vice versa lessens the "spuriosity" in the regression. The authors also generated independent series with the DGP (4), as did Entorf [60], although they ran a different specification:
yt=α+βxt+δt+ut.
The asymptotics provided in Kim et al. [74] only concern β̂ and tβ̂. We computed the orders in probability of the other two parameters, given their relevance. We included in Appendix D a guide on how to obtain these asymptotics (Table 9): It should be noted that the t-ratio converges to a normal distribution. However, since the variance is not unity, spurious regression may well be still present.
Orders in Probability: variables yt and xt both independently generated by DGP 4. The estimated specification is (2).
α̂=Op(1)
tα̂=Op(T1/2)
β̂=Op(T-1/2)
tβ̂=Op(1)
δ̂=Op(1)
tδ̂=Op(T1/2)
A relevant extension of Hasseler [73], and Kim et al. [52] was provided by Noriega and Ventosa-Santaulária [75] where (possibly multiple) structural breaks are added to the Trend Stationary DGP, that is, DGP 5. Adding breaks to the specification of the DGPs makes the divergence rate of the t-statistic associated with β̂ return to the usual T1/2 “norm”. They also proved that adding a deterministic trend into the regression specification (see (4.2) does not prevent the phenomenon of spurious regression; tδ̂ remains Op(T1/2). In an effort to unify the related literature, Noriega and Ventosa-Santaulária [76] filled a number of gaps until then unaddressed. They studied several previously overlooked combinations of DGPs and summarized most of the others' results. DGPs 2-9 may indistinctly generate x and/or y (obviously not every combination was carried out by Noriega and Ventosa-Santaulária [76]; besides those presented previously, the following combinations where made by Hasseler [77, 78]: DGP 6-DGP 2, DGP 2-DGP 3, DGP 3-DGP 2, and, DGP 3-DGP 3). In fact, amongst the new possible combinations, the divergence rate of tβ̂ usually remains T1/2. The combination of a Trend Stationary and an I(1) plus drift stands out since the divergence rate of the t-ratio is T rather than T1/2 as in Kim et al. [52], although this result should have been anticipated given the asymptotic dominance of the deterministic trend over the stochastic. This combination is thus clearly linked to Entorf’s [60] results.
Granger et al. [12] showed that the use of an Heteroskedasticity and Autocovariance covariance matrix diminishes size distortions in some cases but their Monte Carlo evidence also showed that this is true only when the sample size is large (greater than 500). The use of HAC standard errors (see Newey and West [79]). when the DGPs are stationary or have a deterministic trend is not so obvious. We performed a simple simulation to support this view (see Table 10). The variables x and y are independently generated either by DGP (4), (5), or (6). Lag selection was done as in Granger et al. [12], that is, using the formula: l=integer[4(T/100)1/4]. Simulation results show that size distortions are less severe when HAC is used, but remain extremely high (perhaps results would improve further if the data could follow a pre-whitening procedure).
t-ratio rejection rates using a 1.96 critical value (5% level) for a standard normal distribution: spurious regression using LS and HAC standard errors.
DGPs
LS standard errors
HAC standard errors
xi
yi
Sample size
50
100
200
500
50
100
200
500
4
4
0.21
0.99
1.00
1.00
0.27
0.99
1.00
1.00
5
5
0.20
0.99
1.00
1.00
0.27
0.99
1.00
1.00
6
4
0.45
0.80
0.90
0.95
0.46
0.71
0.79
0.86
6
5
0.65
0.84
0.91
0.95
0.62
0.74
0.80
0.86
6
6
0.68
0.76
0.83
0.90
0.57
0.61
0.67
0.77
Empirical applications in macroeconomics and finance.
Subject or field
Selection of articles
GDP and GNP trending, growth, convergence and Business cycles, Output-Capital Ratio, exports, unemployment,
[7, 9, 38, 42, 43, 45, 51, 118, 134–143]
Quantitative theory and Money neutrality, Demand of Money,
[4, 49, 50, 54, 95, 144–146]
Purchasing Power Parity, Exchange Rates, Real interest Rate, Taylor Rule, Inflation, Persistence, Stock Prices, CAPM, Forward premium , Yied curve, Financial risk, Premium calculation principles (pcp's),
4.7. Next of Kin: Statistical Tests, Long Horizon, and Instrumental Variables
Not all work on the spurious phenomenon has been monopolized by the LS simple estimator. Stochastic trending mechanisms have been used in most of the studies, although some exceptions are presented at the end of this section. Equally relevant are the Instrumental Variables (IV) estimates, which have also been analyzed [80].
Giles [81] studied two important residual tests: the Jarque-Bera test [hereinafter 𝒥ℬ; see Jarque and Bera [82]] and the Breusch-Pagan-Godfrey test [83, 84], used to test for normality and autocorrelation/heteroskedasticity evidence in the residuals of an LS regression, respectively; he used independent driftless random walks (DGP 6) and proved that both statistics diverge at rate T, that is:
Jarque-Bera test: 𝒥ℬ=Op(T)
Breusch-Pagan-Godfrey test: (T·R2)=Op(T).
The null hypothesis of normality or serial independence/homoskedasticity will be eventually rejected even if it is correct, given a large enough sample (these residuals are neither normal nor serially independent or homoskedastic; hence, the statistical tests provide appropriate inference by rejecting the null). Furthermore, Ventosa-Santaulária and Vera-Valdés [85] studied the behavior of the classical Granger-Causality test (see Granger [86]), hereinafter GC. It is proved that the classical GC test fails to accept the null hypothesis of no GC between independent broken-trend (DGP 5) or broken-mean (DGP 3) processes whether the former series are differenced or not.
Kim et al. [87] study several important nonlinearity tests: the RESET test [88], the McLeod and Li test [89], the Keenan test [90], the Neural Network test [91], White's Information test [92], and the Hamilton test [93]. All these tests were studied by way of finite-sample experiments using independent driftless random walks (see DGP 6). They also presented the asymptotics of the RESET and Keenan tests; both test-statistics are Op(T), so the null of linearity is eventually rejected as the sample size grows. In the same vein, Noriega and Ventosa-Santaulária [94] proved that, when the variables are generated independently by any combination of DGPs (3), (5), (7), and (8), the Engle and Granger cointegration-test spuriously rejects the null of no cointegration (of course, there is an important exception: if both variables are independent I(1) processes, the Engle and Granger test tends to accepts the null of no-cointegration). in an indeterminate number of cases since the relevant t-statistic is Op(T1/2) (the t-statistic may diverge toward +∞ or -∞; the sign depends on the unknown values of the DGPs; Monte Carlo evidence is presented to assert that in many cases, such sign is negative).
Spurious regression has also been identified in the context of Long-Horizon (henceforth LH) regressions, which are used in situations where previous “short-term” (that is, using the variables in levels, without doing any kind of temporal aggregation). studies have failed. In fact, Valkanov’s [95] insight is that rolling summations of I(0) variables, that is LH variables, behave asymptotically as I(1) series. Hence, the theory of spurious regression between independent I(1) variables, discussed earlier, should give readers the intuition. Late eighties' studies provided interesting results in economics and finance. As asserted by Valkanov [95]:
”The results […] are based on long-horizon variables, where a long-horizon variable is obtained as a rolling sum of the original series. It is heuristically argued that long-run regressions produce more accurate results by strengthening the signal coming from the data, while eliminating the noise.”
More precisely, if the specification to be estimated is (1), a Long-Horizon “reinterpretation” is carried out with the building of partially aggregated variables. As usual, let, w=x,y, then wtk=∑j=0k-1wt+j. The LH regression specification can usually be one of the following:
ytk=α+βxtk+ut,ytk=α+βxt-kk+ut,ytk=α+βxt+ut.
Such specifications are used mostly in the estimation of the equity/dividend relationship and to test both the neutrality of money or the Fisher effect. Based upon the previous specifications, Valkanov [95], and Moon et al. [54] proved that this regression strategy also presents the spurious regression phenomenon. To do this, they let time overlap in the summations as a fixed fraction of the sample size, k=[λ·T] where λ∈[0,1]. Valkanov [95] then defines the following DGP:
yt=βxt+uyt,
where (1-L)ϕ(L)xt=uxt. The variable xt follows an autoregressive process whose highest root is unity whilst the rest, represented in the polynomial ϕ(L), is invertible. Let ω=(uyt,uxt)′ be defined as a martingale difference sequence with E(ωtωt′∣ωt-1⋯)=[σ112σ12;σ21σ222] and finite fourth moment. We could argue that, when β=0, the t-ratio associated with its estimate should be small enough for the null hypothesis to be accepted. As in previous results, that does not happen; tβ̂=Op(T1/2). Valkanov [95], and Moon et al. [54] suggest the use of a rescaled t-ratio, tβ̂/T1/2. Although the limiting distribution of such t-statistic's is neither normal nor pivotal, it can be easily simulated and hence used. Lee [96] extends the results for fractionally integrated processes and finds that the t-ratio associated with β̂ in (3) diverges: tβ̂=Op(T1/2).
Ventosa-Santaulária [80] studies the asymptotics of Instrumental Variables (IV) using DGPs (5) and (7), not only for x and y, but also for a spurious and independent instrument. It is shown that the t-ratio still diverges at rate T1/2 which confirms the Monte Carlo simulations carried out by Leybourne and Newbold [97]. This result can be also seen as a complement to those presented by Phillips and Hansen [98], and Hansen and Phillips [99], in which the use of spurious instruments is proposed to improve the estimates whenever strong endogeneity is present between x and the residual term in cointegrated relationships.
Sun [53] developed a convergent t-statistic for correcting the phenomenon of spurious regression. The new t-ratio is based on an estimate of the parameter variance made in the same manner as HAC standard errors. This t-statistic converges to a non-degenerate limiting distribution for many cases of spurious regression. He considered the regression between two independent nonstationary I(d) processes with d>1/2 as well as the regression between an independent nonstationary I(d) process and a linear trend (see (4.5):
yt=α+δt+ut.
In previous studies, the presence of unit roots in the series generally produced a T1/2 divergent t-ratio. To avoid such divergence, Sun proposes a rescaling of the parameter estimate using a new standard error. The new Standard Error is computed in the same way HAC is. The main difference is that HAC estimates usually require a bandwidth or truncation lag. Sun suggests using the entire sample length:
σ̂β̂2=(∑t=1T(xt-x̅)2)-1·TΩ̂·(∑t=1T(xt-x̅)2)-1,
where
Ω̂=∑j=-T+1T-1κ(jT)Γ̂(j),Γ̂(j)={1T∑t=1T-j(xt+j-x̅)ût+jût(xt-x̅)forj≥0,1T∑t=-j+1T(xt+j-x̅)ût+jût(xt-x̅)forj<0,
where κ is a kernel function that belongs to a class that ensures positive definitiveness. Whether xt~I(d) or xt=t, well-defined asymptotic distributions for the t-ratio are provided.
5. What to do if One Fears Spurious Regression
The spurious regression phenomenon pervades many subfields in time series analysis. It might be controlled by using correctly scaled t-ratios, as suggested earlier, but having a clear idea of what DGP best emulates the properties of the series (this could be labeled as DGPi-fication) would therefore be necessary (examples of this can be: (i) evidence of UR must be obtained before a cointegration analysis is undertaken; (ii) the nature of the trending mechanism should be identified prior to the application of a transformation intended to render stationary the series, and; (iii) a test such as Robinson’s [100] could be undertaken in order to identify long-memory behavior). This can be achieved by means of applying a battery of tests to our series. Such an approach is not exempted of failures. Many statistical tests are known to yield spurious evidence under specific circumstances (see the previous subsection). Nevertheless, pretesting the series remains an adequate strategy and allows the practitioner to be aware of the potencial difficulties he could face. In this section we include a short—and incomplete—list of tests that are employed in the DGPi-fication of the series:
Drawing inference concerning the nonstationarity of the series can be done by means of Dickey-Fuller-type (DF) tests (see Dickey and Fuller [8, 101]). The original DF test distinguishes between the null hypotheses of UR (DGPs 6 and 7) and the alternatives of stationarity (DGPs 1, 2 and 4). Other well-known UR tests are: (i) the KPSS test [102]; (ii) the GLS-detrended DF test [103]; (iii) the Phillips-Perron test [104], and; (iv) the Ng and Perron test [105, 106].
The UR tests previously mentioned provide severely biased results under the hypothesis of trend-stationarity in the presence of structural breaks: DF-type tests (i) over-accept the null hypothesis of unit root when there is a trend/level break in the—trend—stationary process [9, 107–113], and; (ii) over-reject the null hypothesis when there is a trend/level/variance break in the unit root process [114–117]. Several alternatives are available. Perron [9] suggested the use of a DF-type test with (point,level and trend) breaks specified in the auxiliary regression (see DGP 5); the break dates must be decided by the practitioner. Zivot and Andrews [118] also proposed to modify the DF test in the same direction than Perron, only they allowed the break date to be endogenously specified; their test allows for a single break (see Lumsdaine and Papell [44] for an extension that allows for two breaks) under the alternative hypothesis (DGP 5) and rules out the possibility of a break under the null hypothesis of UR (DGP 8); Carrion-i-Silvestre and Sansó [119] proposed a test where a break under the null hypothesis is taken into account.
Bai and Persson [120] proposed a test to distinguish between DGPs 2 and 4 and DGPs 3 and 5. The test presupposes that the trending mechanism is exclusively deterministic.
Long-range dependence: many testing procedures are also available. R/S-type test (see Hurst [32], Mandelbrot and Taqqu [121], and Lo [122]) are commonly used to identify Long Memory (LM) against stationarity. However, Bhattacharya et al. [5] proved that the classical R/S test may provide evidence of LM even if the series is stationary when the latter contains a trending mechanism. Moreover, Mikosch and Stărică [123], and Mikosch and Stărică [69] proved that the sample autocorrelation function (sample ACF) can also be a misleading statistical tool when used to identify LM; stationary series that include a non-linear component might yield a sample ACF usually attributed to LM processes (see also Teverovsky and Taqqu [124]). Several Short memory processes may thus seem to behave as LM processes. This phenomenon can be labeled as spurious long memory [69].
Many other tests have been proposed to identify LM while they control for possible non-linearities (structural breaks in the mean or the variance of the series). See Liu et al. [125], Robinson [100], Lobato and Robinson [126], Giraitis et al. [127, 128], Berkes et al. [129], Zhang et al. [130], Aue et al. [131], and Jach and Kokoszka [132].
6. Concluding Remarks
Spurious regression can arise wherever a trending mechanism is present in the data. Even some stationary-autocorrelated processes cause spurious results.
Applied macroeconomists and financial experts have been steadily incorporating technical advances in the analysis of the spurious regression, a phenomenon identified for many empirically-relevant Data Generating Processes. These include stationary processes with AR (or long MA) structure and/or level breaks; random walks (with or without drifts), trend stationarity (with possible level and trend breaks), long-memory processes (whether stationary or not) and so on. These processes have been associated with unemployment rates, price levels, real exchange rates, monetary aggregates, gross domestic product and various financial variables. The use of Least Squares with such variables entails a high risk of obtaining a spurious relationship.
Differencing the series may not always prevent spurious estimates; nor should the R2>DW rule-of-thumb be seen as an adequate rule to identify a spurious regression. Cointegration analysis appears to better prevent non-sensical statistical relationships although, one should bear in mind Phillips’s [20] and further study the statistical relationship at hand. Out-of-sample forecasting evaluation could be an option. Most macroeconomic variables are either nonstationary or very persistent. Pre-testing the variables in order to identify the nature of the trending mechanism arises as the golden rule to avoid non-sense regression. Once the DGP is correctly identified, spurious regression is “easier” to deal with.
Attaining a clear understanding of any problem is the first step toward finding its solution.
AppendicesA. Spurious Regression Using Independent UR Processes
The theoretical explanation of the spurious regression phenomenon documented in Granger and Newbold [1] was provided by Phillips [6]. We present here a simple version of Phillips’s [6] results. Assume two independent nonstationary unit root processes:
xt=xt-1+uxt,yt=yt-1+uyt,
where, for simplicity, uzt~𝒩(0,σz2); uxt⊥uyt. By solving recursively both equations, we obtain:
xt=X0+∑i=1t-1uxt-i︸,ξxt-1yt=Y0+∑i=1t-1uyt-i︸,ξyt-1
where X0 and Y0 are initial conditions. It can be proved (see, e.g., Hamilton [133, Pages 486 and 548]) that:
T-3/2∑t=1Tξzt-1→dσz∫01ωz(r)dr,T-2∑t=1Tξzt-12→dσz2∫01[ωz(r)]2dr,T-2∑t=1Tξxt-1ξyt-1→dσxσy∫01ωx(r)ωy(r)dr,
where ωz(1) is a standard brownian motion and →d denotes convergence in law. Therefore (assume that uz0=0):
∑zt=Z0T+∑t=1Tξzt-1,︸Op(T3/2)∑zt2=Z02T+∑t=1Tξzt-12︸Op(T2)+∑t=1Tξzt-1,∑xtyt=X0Y0T+X0∑t=1Tξyt-1+Y0∑t=1Tξxt-1+∑t=1Tξxt-1ξyt-1︸.Op(T2)
The LS formula of β̂ is:
β̂=T∑xtyt-∑xt∑ytT∑xt2-(∑xt)2.
Replacing the sums that appear in this formula with the above asymptotic expressions and letting T→∞ yields:
β̂=σyσx∫01ωx(r)ωy(r)dr-∫01ωx(r)dr∫01ωy(r)dr∫01[ωx(r)]2dr-(∫01ωx(r)dr)2=Op(1).
The same can be done with the variance of the regression as well as with the t-ratio associated to β̂:
σ̂2→dT-1[∑yt2+α̂2T+β̂2∑xt2-2α̂∑yt-2β̂∑xtyt+2α̂β̂∑xt]=Op(T),tβ̂=Op(T1/2).
Note that the t-ratio diverges as the sample size grows. The usual critical values to test the null hypothesis β=0 are ±1.96 (level: 5%) so it is straight forward to see that the null hypothesis will always be rejected for a sample sufficiently large: this is a simple example of spurious regression.
B. Empirical Work and DGPs
Table 11 explains empirical applications in macroeconomics and finance.
C. Dominance of the Deterministic Trend Over the Stochastic Trend
In order to explain the dominance of the deterministic trend over the stochastic trend we may study the sums of a UR with drift. Assume xt is generated by DGP 7; xt=μx+xt-1+uxt. By solving recursively this we get:
xt=X0︸1+μxt︸2+ξxt-1︸3,
where the first term (1) is an initial condition, the second term (2) represents the deterministic trend, and the third term (3) accounts for the stochastic trend; note that ξxt=∑i=1tuxt-i. Then (all sums run from 1 to T),
∑xt-1=X0T︸Op(T)+μx∑(t-1)︸Op(T2)+∑ξxt-1︸Op(T3/2),∑xt-12=X02T+μx2∑(t-1)2︸Op(T3)+∑ξxt-12︸Op(T2)+2X0μx∑(t-1)+2X0∑ξxt-1+∑(t-1)ξxt-1︸Op(T5/2).
The orders in probability of the underbraced sums can be found in Hamilton [133, chapters 16 and 17]. Note that the leading term in expressions (C.2) and (C.3) is always the deterministic component (T2 in ∑xt-1 and T3 in ∑xt-12). This is why it is said that the time trend asymptotically dominates the stochastic components.
D. Asymptotics of LS Estimates of Specification 2
We present a guide as to how to obtain the order in probability of the estimates and their associated t-ratios using LS where the variables y and x are generated by DGP 4 and specification 2 is estimated. Proof of such was provided with the aid of Mathematica 4.1 software. We use the classical LS. Let Θ̂=(α̂,β̂,δ̂)′:
Θ̂=(X′X)-1X′Y,Var(Θ̂)=σ̂2(X′X)-1,tα̂=α̂σ̂α̂2,tβ̂=β̂σ̂β̂2,tδ̂=δ̂σ̂δ̂2,
where
X′X=[T∑xt∑t∑xt∑xt2∑xtt∑t∑xtt∑t2],Z′Y=[∑yt∑xtyt∑ytt],σ̂2=T-1[∑yt2+α̂2T+β̂2∑xt2+δ̂2∑t2-2α̂∑yt-2β̂∑xtyt-2δ̂∑ytt+2α̂β̂∑xt+2α̂δ̂∑t+2β̂δ̂∑xtt].
We will now describe the process involved in establishing the aforementioned proof. α̂, β̂, δ̂ and their corresponding t-ratios are functions of the following expressions (unless indicated otherwise, all sums run from t=1 to T). Let w=y,x:
∑wt=μwT+βw∑t+∑uwt︸Op(T1/2),∑wt2=μw2T+βw2∑t2+∑uwt2︸Op(T)+2μwβw∑t+2μw∑uwt+2βw∑uwtt︸Op(T3/2),∑wtt=μw∑t+βw2∑t2+∑uwtt,∑xtyt=μy∑xt+βy∑xtt+∑xtuyt,∑xtuyt=μx∑uyt+βx∑uytt+∑uxtuyt︸Op(T1/2),
where
∑t=12(T2+T),∑t2=16(2T3+3T2+T).
The orders in convergence of the underbraced expressions can be found in Hamilton [133, Chapter 16]. We can fill the previously-cited matrices and then compute the LS parameter estimates and the t-statistic associated with each one. The asymptotics are computed by the program (available at the following URL: http://www.ventosa-santaularia.com/JPSappD.pdf).
GrangerC. W. J.NewboldP.Spurious regressions in econometrics197422111120EID2-s2.0-3374806548410.1016/0304-4076(74)90034-7ZBL0319.62072DavidsonJ.HendryD.SrbaF.YeoS.Econometric modelling of the aggregate time-series relationship between consumers' expenditure and income in the United Kingdom19788866169210.2307/2231972HendryD.MizonG.Serial correlation as convenience simplification, not a nuissance: a comment on a study of the demand for money by the bank of England19788854956310.2307/2232053PlosserC. I.SchwertG. W.Money, income, and sunspots: measuring economic relationships and the effects of differencing197844637660EID2-s2.0-000136540810.1016/0304-3932(78)90021-1BhattacharyaR. N.GuptaV. K.WaymireEdThe Hurst effect under trends1983203649662MR71351310.2307/3213900ZBL0526.60027PhillipsP. C. B.Understanding spurious regressions in econometrics1986333311340MR86797910.1016/0304-4076(86)90001-1ZBL0602.62098NelsonC. R.PlosserC. R.Trends and random walks in macroeconmic time series: some evidence and implications1982102139162EID2-s2.0-4904914345510.1016/0304-3932(82)90012-5DickeyD. A.FullerW. A.Distribution of the estimators for autoregressive time series with a unit root197974366, part 1427431MR54803610.2307/2286348ZBL0413.62075PerronP.The great crash, the oil price shock and the unit root hypothesis1989571361140110.2307/1913712ZBL0683.62066DurlaufS. N.PhillipsP. C. B.Trends versus random walks in time series analysis198856613331354MR973983ZBL0653.62068PhillipsP. C. B.Trending time series and macroeconomic activity: some present and future challenges200110012127MR180778310.1016/S0304-4076(00)00048-8ZBL0961.62112GrangerC. W. J.HyungN.JeonY.Spurious regressions with stationary series2001337899904EID2-s2.0-000251546510.1080/00036840152022232SarganG.1964London, UKButtewothHendryD.AndersonG.Testing dynamic specification in small simultaneous systems: an application to a model of building society behaviour in the United Kingdom19773361383HendryD.PaganA.SarganJ.Dynamic specification198421023110010.1016/S1573-4412(84)02010-9ZBL0586.62191GrangerC. W. J.Some properties of time series data and their use in econometric model specification1981161121130EID2-s2.0-4914913683910.1016/0304-4076(81)90079-8GrangerC. W. J.WeissA. A.Time series analysis of error-correction models1983New York, NY, USAAcademic Press255278MR738656ZBL0547.62060EngleR. F.GrangerC. W. J.Co-integration and error correction: representation, estimation, and testing1987552251276MR882095ZBL0613.62140PhillipsP. C. B.New tools for understanding spurious regressions199866612991325MR1654311ZBL1056.62531PhillipsP. C. B.Laws and limits of econometrics2003113486C26C52EID2-s2.0-034547183010.1111/1468-0297.00114LegendreA.Nouvelles méthodes pour la détermination des orbites des comètesF. Didot, 1805YuleG.On the theory of correlation189760481285410.2307/2979746PearsonK.On a form of spurious correlation which may arise when indices are used in the measurement of organs18976048949810.1098/rspl.1896.0076HendryD. F.MorganM. S.1995Cambridge, UKCambridge University Pressxvi+558MR1395383AldrichJ.Correlations genuine and spurious in Pearson and Yule1995104364376MorganM. S.1990Cambridge, UKCambridge University Pressxvi+296Historical Perspectives on Modern EconomicsMR1052409YuleG.Why do we sometimes get nonsense-correlations between time-series? A study in sampling and the nature of time-series192689116310.2307/2341482HendryD.Econometrics-alchemy or science?19804718838740610.2307/2553385JevonsW.Commercial crises and sunspots1884London, UKMacMillan221243FersonW.SarkissianS.SiminT.Is stock return predictability spurious?200313110StockJ. H.Asymptotic properties of least squares estimators of cointegrating vectors198755510351056MR913360ZBL0651.62105HurstH.Long term storage capacity reservoirs1951116770799HipelK. W.McLeodA. I.Preservation of the rescaled adjusted range parts 1, 2 and 319781449151810.1029/WR014i003p00491AdelmanI.Long cycles: fact or artifact196555444463GrangerC. W. J.Long memory relationships and the aggregation of dynamic models1980142227238MR59725910.1016/0304-4076(80)90092-5ZBL0466.62108GrangerC. W. J.JoyeuxR.An introduction to long-memory time series models and fractional differencing1980111529MR60557210.1111/j.1467-9892.1980.tb00297.xZBL0503.62079HoskingJ. R. M.Fractional differencing1981681165176MR61495310.1093/biomet/68.1.165ZBL0464.62088AriñoM. A.MarmolF.A permanent-transitory decomposition for ARFIMA processes200412418797MR206622910.1016/S0378-3758(03)00190-3ZBL1094.62108PerronP.VogelsangT.Nonstationarity and level shifts with an application to purchasing power parity199210330132010.2307/1391544WuJ.-L.ecdjlw@ccunix.ccu.edu.twMean reversion of the current account: evidence from the panel data unit-root test2000662215222EID2-s2.0-003052227810.1016/S0165-1765(99)00198-6ZBL1013.91528WuJ.-L.ecdjlw@ccunix.ccu.edu.twChenS.-L.LeeH.-Y.Are current account deficits sustainable? Evidence from panel cointegration2001722219224EID2-s2.0-003438070810.1016/S0165-1765(01)00420-7ZBL1014.91509D'AddaC.ScorcuA. E.scorcu@spbo.unibo.itOn the time stability of the output-capital ratio200320611751189EID2-s2.0-003269612710.1016/S0264-9993(02)00081-0PerronP.Further evidence on breaking trend functions in macroeconomic variables1997802355385MR148318510.1016/S0304-4076(97)00049-3ZBL0965.62103LumsdaineR. L.PapellD. H.Multiple trend breaks and the unit-root hypothesis1997792212218EID2-s2.0-000289528810.1162/003465397556791MehlA.ajmehl@yahoo.comUnit root tests with double trend breaks and the 1990s recession in Japan2000124363379EID2-s2.0-000289528810.1016/S0922-1425(00)00046-3JuseliusK.A structured VAR under changing monetary policy19961640041210.2307/1392609JuseliusK.Price convergence in the medium and long run: an I (2) analysis of six price indices1999Oxford, UKOxford University PressHaldrupN.An econometric analysis of 1(2) variables1998125645650EID2-s2.0-003241802210.1111/1467-6419.00069MuscatelliV. A.v.a.muscatelli@socsci.gla.ac.ukSpinelliF.The long-run stability of the demand for money: Italy 1861–19962000453717739EID2-s2.0-000152776410.1016/S0304-3932(00)00011-8CoenenG.gunter.coenen@ecb.intVegaJ.-L.The demand for M3 in the Euro area2001166727748EID2-s2.0-4404911482510.1002/jae.629NielsenH. B.heino.bohn.nielsen@econ.ku.dkAn I(2) cointegration analysis of price and quantity formation in Danish manufactured exports2002645449472EID2-s2.0-000062781810.1111/1468-0084.00272KimT.-H.LeeY.-S.NewboldP.Spurious regressions with stationary processes around linear trends2004832257262MR207042910.1016/j.econlet.2003.10.020SunY.A convergent t-statistic in spurious regressions2004205943962MR2089149MoonR.RubiaA.ValkanovR.Long-horizon regressions when the predictor is slowly varying2006ParkJ. Y.PhillipsP. C. B.Statistical inference in regressions with integrated processes. II19895195131MR990790MarmolF.Spurious regressions between I(d) processes1995163313321MR134057210.1111/j.1467-9892.1995.tb00236.xZBL0819.62075BanerjeeA.DoladoJ.GalbraithJ.HendryD.1993Oxford, UKOxford University PressMarmolF.Nonsense regressions between integrated processes of different orders1996582525534EID2-s2.0-33745248740de JongR. M.Logarithmic spurious regressions20038111321MR201004910.1016/S0165-1765(03)00093-4EntorfH.Random walks with drifts: nonsense regression and spurious fixed-effect estimation1997802287296MR148318110.1016/S0304-4076(97)00041-9ZBL0887.62088PhillipsP. C. B.MoonH. R.Linear regression limit theory for nonstationary panel data199967510571111MR1707465ZBL1056.62532KaoC.Spurious regression and residual-based tests for cointegration in panel data1999901144MR168241310.1016/S0304-4076(98)00023-2ZBL1070.62527CappuccioN.LubianD.Spurious regressions between I(1) processes with long memory errors1997184341354MR146688110.1111/1467-9892.00054MarmolF.Spurious regression theory with nonstationary fractionally integrated processes1998842233250MR163021410.1016/S0304-4076(97)00085-7TsayW.-J.ChungC.-F.The spurious regression of fractionally integrated processes2000961155182MR176875310.1016/S0304-4076(99)00056-1ZBL1054.62586NelsonC. R.KangH.Spurious periodicity in inappropriately detrended time series1981493741751EID2-s2.0-001967564910.2307/1911520ZBL0468.62099DickeyD.PantulaS.Determining the order of differencing in autoregressive processes198754455461SunY.Spurious regressions between stationary generalized long memory processes2006903446454MR221218210.1016/j.econlet.2005.10.009MikoschT.StǎricǎC.Nonstationarities in financial time series, the long-range dependence, and the IGARCH effects2004861378390EID2-s2.0-001700538910.1162/003465304323023886MikoschT.de VriesC.2006Laboratory of Actuarial Mathematics, University of CopenhagenFurmanE.ZitikisR.General stein-type decompositions of covariances: revisiting the capital asset pricing model6Proceedings of the 58th Annual Meeting of the Midwest Finance Association2009Chicago, Ill, USA18FamaE.The behavior of stock-market prices19653813410.1086/294743HasselerU.Simple regressions with linear time trends20002112732MR176617210.1111/1467-9892.00171ZBL0974.62068KimT.-H.LeeS.NewboldP.Spurious regressions with processes around linear trends or drifts2003Discussion Papers in EconomicsNoriegaA. E.Ventosa-SantaulàriaD.Spurious regression under broken-trend stationarity2006275671684MR229120610.1111/j.1467-9892.2006.00482.xZBL1112.62096NoriegaA. E.anoriega@banxico.org.mxVentosa-SantaulàriaD.daniel@ventosa-santaularia.comSpurious regression and trending variables2007693439444EID2-s2.0-000024590910.1111/j.1468-0084.2007.00481.xHasslerU.Spurious regressions when stationary regressors are included19965012531MR137212910.1016/0165-1765(95)00728-8ZBL0875.62594HasslerU.Nonsense regressions due to neglected time-varying means2003442169182MR197737210.1007/s00362-003-0144-0ZBL1017.62077NeweyW.WestK.A simple, positive semi-definite, heteroskedasticity and autocorrelation consistent covariance matrix198770370810.2307/1913610ZBL0658.62139Ventosa-SantaulàriaD.20094thCommunications in Statistics. Theory and MethodsGilesD. E. A.Spurious regressions with time-series data: further asymptotic results2007365–8967979MR240525010.1080/03610920601041499JarqueC. M.BeraA. K.Efficient tests for normality, homoscedasticity and serial independence of regression residuals198063255259MR61532310.1016/0165-1765(80)90024-5BreuschT. S.PaganA. R.The Lagrange multiplier test and its applications to model specification in econometrics1980471239253MR61111810.2307/2297111ZBL0465.62107GodfreyL. G.1988Cambridge, UKCambridge University PressVentosa-SantaulàriaD.Vera-ValdésJ. E.Granger-causality in the presence of structural breaks2008361114EID2-s2.0-0000745315GrangerC.Investigating causal relations by econometric models and cross-spectral methods196937342443810.2307/1912791LeeY.-S.KimT.-H.NewboldP.Spurious nonlinear regressions in econometrics2005873301306MR214074010.1016/j.econlet.2004.10.016RamseyJ. B.Tests for specification errors in classical linear least-squares regression analysis196931350371MR0290502ZBL0179.48902McLeodA. I.LiW. K.Diagnostic checking ARMA time series models using squared-residual autocorrelations198344269273MR73858710.1111/j.1467-9892.1983.tb00373.xZBL0536.62067KeenanD. M.A Tukey nonadditivity-type test for time series nonlinearity19857213944MR79019910.1093/biomet/72.1.39ZBL0562.62077WhiteH.An additional hidden unit test for neglected nonlinearity in multilayer feedforward networks2Proceedings of the International Joint Conference on Neural Networks1989Washington, DC, USAIEEE Press451455WhiteH.1992New York, NY, USACambridge University PressHamiltonJ. D.A parametric approach to flexible nonlinear inference2001693537573MR1828535ZBL1017.62076NoriegaA.Ventosa-SantaulàriaD.Spurious cointegration: the Engle-Granger test in the presence of structural breaksWorking paper no. 2006-12, Banco de México, 2006ValkanovR.rossen.valkanov@anderson.ucla.eduLong-horizon regressions: theoretical results and applications2003682201232EID2-s2.0-000024637210.1016/S0304-405X(03)00065-5LeeJ.ecsleej@nus.edu.sgFractionally integrated long horizon regressions2007111EID2-s2.0-0037404245LeybourneS. J.NewboldP.paul.newbold@nottingham.ac.ukSpurious rejections by cointegration tests induced by structural breaks200335911171121EID2-s2.0-3374524874010.1080/0203684032000082068PhillipsP. C. B.HansenB. E.Statistical inference in instrumental variables regression with I (1) processes199057199125MR103465310.2307/2297545ZBL0703.62098HansenB.PhillipsP.Estimation and inference in models of cointegration: a simulation study19908225248RobinsonP. M.Efficient tests of nonstationary hypotheses19948942814201437MR131023210.2307/2291004ZBL0813.62016DickeyD. A.FullerW. A.Likelihood ratio statistics for autoregressive time series with a unit root198149410571072MR625773ZBL0471.62090KwiatkowskiD.PhillipsP. C. B.SchmidtP.ShinY.Testing the null hypothesis of stationarity against the alternative of a unit root. How sure are we that economic time series have a unit root?1992541–3159178EID2-s2.0-3424748017910.1016/0304-4076(92)90104-YZBL0871.62100ElliottG.RothenbergT. J.StockJ. H.Efficient tests for an autoregressive unit root1996644813836MR1399219ZBL0888.62088PhillipsP. C. B.PerronP.Testing for a unit root in time series regression1988752335346MR94605410.1093/biomet/75.2.335ZBL0644.62094NgS.PerronP.Lag length selection and the construction of unit root tests with good size and power200169615191554MR1865220ZBL1056.62529PerronP.NgS.Useful modifications to some unit root tests with dependent errors and their local asymptotic properties1996633435463EID2-s2.0-000149381710.2307/2297890ZBL0872.62085PerronP.Testing for a unit root in a time series with a changing mean153162MontañésA.ReyesM.Effect of a shift in the trend function on Dickey-Fuller unit root tests1998143355363MR162859810.1017/S0266466698143049MontañésA.ReyesM.The asymptotic behaviour of the Dickey-Fuller tests under the crash hypothesis19994218189MR1671793ZBL1087.62511LeybourneS. J.NewboldP.Behavior of Dickey-Fuller t-tests when there is a break under the alternative hypothesis2000165779789MR180380210.1017/S0266466600165077SenA.Behaviour of Dickey-Fuller F-tests under the trend-break stationary alternative2001553257268MR1867529SenA.On unit-root tests when the alternative is a trend-break stationary process2003211174184MR1973808KimT.-H.LeybourneS.NewboldPa.Behaviour of Dickey-Fuller unit-root tests under trend misspecification2004255755764MR208919310.1111/j.1467-9892.2004.02000.xZBL1062.62187HamoriS.TokihisaA.Testing for a unit root in the presence of a variance shift1997573245253MR161548510.1016/S0165-1765(97)00245-0ZBL0903.90022LeybourneS. J.MillsT. C.NewboldP.lezpn@lenl.nott.ac.ukSpurious rejections by Dickey-Fuller tests in the presence of a break under the null1998871191203EID2-s2.0-000289528810.1016/S0304-4076(98)00014-1ZBL0944.62083KimT.-H.LeybourneS.NewboldP.Unit root tests with a break in innovation variance20021092365387MR192106010.1016/S0304-4076(02)00118-5ZBL1043.62107SenA.sen@xavier.eduBehaviour of Dickey-Fuller tests when there is a break under the unit root null hypothesis2008786622628EID2-s2.0-4074913804710.1016/j.spl.2007.09.024ZivotE.AndrewsD. W. K.Further evidence on the great crash, the oil-price shock, and the unit-root hypothesis19921025127010.2307/1391541Carrion-i-SilvestreJ. L.SansóA.Joint hypothesis specification for unit root tests with a structural break200692196224MR232496710.1111/j.1368-423X.2006.00182.xZBL1096.62080BaiJ.PerronP.Estimating and testing linear models with multiple structural changes19986614778MR1616121ZBL1056.62523MandelbrotB. B.TaqquM. S.Robust R/S analysis of long-run serial correlation48Proceedings of the 42nd Session of the International Statistical Institute (ISI '79)19796999MR731558ZBL0518.62036LoA.Long-term memory in stock market prices19911279131310.2307/2938368ZBL0781.90023MikoschT.StăricăC.Is it really long memory we see in financial returns200012149168TeverovskyV.TaqquM.Testing for long-range dependence in the presence of shifting means or a slowly declining trend, using a variance-type estimator1997183279304MR145664110.1111/1467-9892.00050ZBL0934.62094LiuY.-PanM.HsuehL.A modified R/S analysis of long-term dependence in currency futures prices19933297113LobatoI.RobinsonP. M.Averaged periodogram estimation of long memory1996731303324MR141000910.1016/0304-4076(95)01742-9ZBL0854.62088GiraitisL.KokoszkaP.LeipusR.Testing for long memory in the presence of a general trend200138410331054MR187655710.1239/jap/1011994190ZBL1140.62341GiraitisL.KokoszkaP.LeipusR.TeyssièreG.Rescaled variance and related tests for long memory in volatility and levels20031122265294MR195114510.1016/S0304-4076(02)00197-5ZBL1027.62064BerkesI.HorváthL.KokoszkaP.ShaoQ.-M.On discriminating between long-range dependence and changes in mean200634311401165MR227835410.1214/009053606000000254ZBL1112.62085ZhangA.GabrysR.KokoszkaP.Discriminating between long memory and volatility shifts2007364253275AueA.alexaue@wald.ucdavis.eduHorváthL.horvath@math.utah.eduHuškováM.huskova@karlin.mff.cuni.czKokoszkaP.Piotr.Kokoszka@usu.eduTesting for changes in polynomial regression2008143637660EID2-s2.0-000188145810.3150/08-BEJ122ZBL1155.62027JachA.KokoszkaP.Wavelet-domain test for long-range dependence in the presence of a trend2008422101113MR2414352ZBL1148.62072HamiltonJ.1994Princeton, NJ, USAPrinceton University PressKingR.PlosserC.StockJ.WatsonM.Stochastic trends and economic fluctuations1991814819840CochraneJ.Permanent and transitory components of GNP and stock prices1994109124126510.2307/2118434BernardA.DurlaufS.Convergence in International output19951029710810.1002/jae.3950100202BernardA. B.DurlaufS. N.Interpreting tests of the convergence hypothesis1996711-2161173MR138108010.1016/0304-4076(94)01699-2ZBL0842.62094DurlaufS.On the convergence and divergence of growth rates19961064371016101810.2307/2235374KoustasZ.VeloceW.Unemployment hysteresis in Canada: an approach based on long-memory time series models1996287823831EID2-s2.0-000298038010.1080/000368496328263DurlaufS.QuahD.1998Cambridge, Mass, USANational Bureau of Economic Research CambridgeLiQ.Li@twsuvm.uc.twsu.eduPapellD.Convergence of international output time series evidence for 16 OECD countries199983267280EID2-s2.0-000289528810.1016/S1059-0560(99)00020-9DurlaufS. N.Manifesto for a growth econometrics200110016569MR180779010.1016/S0304-4076(00)00055-5ZBL1099.62547AghionP.DurlaufS. N.2005Amsterdam, The NetherlandsNorth-HollandCampbellJ.PerronP.BlanchardO.FisherS.What macroeconomist should know about unit roots1991Cambridge, Mass, USAMIT Press141201FaustJ.LeeperE. M.When do long-run identifying restrictions give reliable results?1997153345353MR1457932CoeP. J.pcoe@ucalgary.caNasonJ. M.Long-run monetary neutrality and long-horizon regressions2004193355373EID2-s2.0-003740424510.l002/jae.74910.1002/jae.749TaylorJ.Estimation and control of a macroeconomic model with rational expectations19794751267128610.2307/1911962ZBL0411.90032CorbaeD.OuliarisS.Cointegration and tests of purchasing power parity198870350851110.2307/1926790DibaB.GrossmanH.Explosive rational bubbles in stock prices?1988783520530CheungY.Long memory in foreign-exchange rates199393101CratoN.de LimaP. J. F.Long-range dependence in the conditional variance of stock returns1994453281285EID2-s2.0-4394915494610.1016/0165-1765(94)90024-8ZBL0800.62791EndersW.HurnS.Theory and tests of generalized purchasing-power parity: common trends and real exchange rates in the Pacific Rim19942217919010.1111/j.1467-9396.1994.tb00039.xGarciaR.PerronP.An analysis of the real interest rate under regime shifts1996781111124EID2-s2.0-4914913662910.2307/2109851CulverS. E.SCULVER@UAB.EDUPapellD. H.DPAPELL@UH.EDUIs there a unit root in the inflation rate? Evidence from sequential break and panel data models1997124435444EID2-s2.0-000289528810.1002/(SICI)1099-1255(199707)12:4<435::AID-JAE430>3.0.CO;2-1BreidtF. J.CratoN.de LimaP.The detection and estimation of long memory in stochastic volatility1998831-2325348MR161386710.1016/S0304-4076(97)00072-9ZBL0905.62116LobatoI. N.SavinN. E.Real and spurious long-memory properties of stock-market data1998163261283MR1648533ZhuZ.The random walk of stock prices: evidence from a panel of G-7 countries199857411413EID2-s2.0-034592058210.1080/135048598354528CatiR. C.GarciaM. G. P.PerronP.perron@bu.eduUnit roots in the presence of abrupt governmental interventions with an application to Brazilian data19991412756EID2-s2.0-0001199867CuddingtonJ. T.cuddingj@georgetown.eduLiangH.hliang@imf.orgPurchasing power parity over two centuries?2000195753757EID2-s2.0-000078647510.1016/S0261-5606(00)00029-2MikoschT.2003Boca Raton, Fla, USAChapman & Hall/CRCExtreme Values in Finance, Telecommunications, and the EnvironmentPedroniP.Panel cointegration: asymptotic and finite sample properties of pooled time series tests with an application to the PPP hypothesis2004203597625MR206172910.1017/S0266466604203073ZBL1059.62123TaylorA. M. R.r.taylor@bham.ac.ukFluctuation tests for a change in persistence2005672207230EID2-s2.0-000043346810.1111/j.1468-0084.2004.00117.xCavaliereG.cavaliere@stat.unibo.itTaylorA. M. R.robert.taylor@nottingham.ac.ukTesting for a change in persistence in the presence of a volatility shift200668supplement 1761781EID2-s2.0-000127316310.1111/j.1468-0084.2006.00455.xFurmanE.ZitikisR.Weighted risk capital allocations2008432263269MR2456620FurmanE.ZitikisR.Weighted pricing functionalsProceedings of the 43rd Actuarial Research Conference (ARC '09)2009Saskatchewan, CanadaUniversity of Regina131AlogoskoufisG. S.SmithR.The Phillips curve, the persistence of inflation, and the Lucas critique: evidence from exchange rate regimes199181512541275EvansM. D. D.LewisK. K.Do expected shifts in inflation affect estimates of the long-run Fisher relation?1995501225253FuhrerJ.MooreG.Inflation persistence1995110112715910.2307/2118513ZBL0834.90037CrowderW. J.The long-run Fisher relation in Canada1997304b11241142EID2-s2.0-000289528810.2307/136313DeLoachS. B.deloach@elon.eduMore evidence in favor of the Balassa-Samuelson hypothesis200192336342EID2-s2.0-000282599910.1111/1467-9396.00283DrineI.drine@univ-Paris1.frRaultC.chrault@hotmail.comDo panel data permit the rescue of the Balassa-Samuelson hypothesis for Latin American countries?2003353351359EID2-s2.0-2134448102910.1080/0003684022000015838Ricardo FariaJ.León-LedesmaM.mal@ukc.ac.ukTesting the Balassa-Samuelson effect: implications for growth and the PPP2003252241253EID2-s2.0-003233390310.1016/S0164-0704(03)00027-2BårdsenG.gunnar.bardsen@norges-bank.noJansenE. S.eilev.jansen@norges-bank.noNymoenR.ragnar.nymoen@econ.uio.noEconometric evaluation of the New Keynesian Phillips curve200466supplement671686EID2-s2.0-004036453110.1111/j.1468-0084.2004.00097.xSunY.yisun@ucsd.eduPhillipsP. C.B.Understanding the fisher equation2004197869886EID2-s2.0-000872502310.1002/jae.760GalíJ.GertlerM.López-SalidoJ.2005Banco de EspañaPhillipsP. C. B.Econometric analysis of Fisher's equation2005641125168EID2-s2.0-34250409798