Testing for serial independence of panel errors
Introduction
The serial independence of unobservable errors is a key assumption for the validity of many statistical inferences and asymptotic results. It is also a useful identification assumption in many econometric models for consumer choice, treatment effects and binary choice, to give a few examples. Brown and Wegkamp (2002) propose a minimum distance from independence estimator, where the independence between the unobserved errors and the exogenous variables is the crucial condition for identification. When one applies Efron’s (1979) bootstrap to residuals, one often assumes the independence of errors for the bootstrap to be valid, see e.g. Singh (1981). Peretti (2005) shows that testing the significance of the departures from utility maximization boils down to testing the independent and identically distributed (iid) assumption of some unobserved errors. For some other cases where the independence of errors is crucial, see e.g. Diebold et al. (1998), Clement and Smith (2000) and Brown et al. (2007).
Despite its importance, there are only a few studies on testing independence of errors. Box and Pierce (1970) and Ljung and Box (1978), BLP hereafter, propose a test for the autocorrelation of errors in ARMA models. Brock et al., 1991, Brock et al., 1996 construct a test, BDS henceforth, for the independence of errors using chaos theory. Their test is nuisance-parameter-free only under conditional mean models, and it has no power against certain types of dependence. Hong and Lee (2003) overcome the aforementioned problems of BDS test, but they use kernel smoothing techniques and their test is affected by the choice of bandwidth. Du and Escanciano (in press) propose a distribution-free test based on the Hoeffding–Blum–Kiefer–Rosenblatt-type empirical process (HBKR hereafter, see Hoeffding, 1948, Blum et al., 1961, Delgado, 1999) applied to residuals at different lags.
There are no existing tests for serial independence of panel errors, to the best of our knowledge. Instead, there are only some tests for serial correlation of panel errors. Many of them are just for first-order serial correlation, see Breusch and Pagan (1980) and Wooldridge (2002, p. 275) for example. Arellano and Bond (1991) propose a test for lack of second-order serial correlation in the first-differenced errors. There are also some portmanteau tests for panel errors, such as Hong and Kao’s (2004) wavelet-based test, Inoue and Solon’s (2006) Lagrange multiplier test and Okui (2009), which is an extension of BLP test to panels. All these tests assume the cross-sectional independence of the errors, which is problematic sometimes, see e.g. Ng (2006), Baltagi et al., 2009, Baltagi et al., 2012 and Shin et al. (2009).
In this paper we propose a test for serial independence of panel errors. Our test is nuisance-parameter-free and powerful against any type of pairwise dependence at all lags. It does not involve bandwidth selection. Moreover, unlike the previous tests, it is well defined even under the presence of cross-sectional dependence in the errors.
We consider a balanced linear panel data model with individual specific effects where is the dependent variable of interest, is a vector of covariates, is the unobserved individual effect that could be correlated with , is the error term, and is some unknown parameter in a compact set .
The null hypothesis that we test in this paper is where henceforth with The alternative hypothesis is the negation of the null (2).
Our test statistic is based on the difference between the joint characteristic function of and the product of their marginal characteristic functions. As is unknown, we replace it with a suitable estimate and construct residuals . In contrast to the general theory on empirical processes with estimated parameters, see e.g. Durbin (1973), our test enjoys the “nuisance-parameter-free” property. We actually show that our test statistic based on the residuals has the same asymptotic distribution as the corresponding statistic based on the true errors. By choosing a proper weighting function in our test statistic, we avoid the multivariate numerical integration in dimensions. The limit distribution of our test statistic is approximated directly by a simple random permutation procedure, see e.g.Delgado (1996).
Our test here extends (Du and Escanciano, in press) to panel data models. Their test based on the HBKR process suffers from the curse of dimensionality as the indicators are essentially all 0 when gets large in panels, where hereinafter the indicator function , if the statement in the parentheses is true, and otherwise. Our use of the characteristic function alleviates this problem.
To assess the finite sample performance of our proposed test, we do some Monte Carlo studies. We compare our test with Wooldridge (2002, p. 275) and Okui (2009). Generally, our test has good size and power performance. We also find that our test based on residuals has sizes and powers very close to its counterpart based on the true errors.
We illustrate our method under large small setup. We then discuss how one can modify our test to large large case if one is ready to assume iid across individuals.
The rest of the paper is organized as follows: in Section 2 we introduce our test statistic and derive its limit distribution. In Section 3 we do some Monte Carlo simulations to study the finite sample performance of the proposed test. In Section 4 we extend our test to large large case. In Section 5 we conclude and suggest some directions for future research. The proofs are gathered in the Appendix.
Section snippets
Test statistic
In the sequel, we simplify the notations as follows: , and , where is a -consistent estimator for . Our asymptotic results in this section are obtained as goes to infinity. In Section 4 we discuss the case where both and go to infinity. Furthermore, let and denote the joint and marginal characteristic functions of , respectively. Let be the
Monte Carlo simulations
To assess the finite sample performance of our proposed test in the previous section, especially the effect of replacing errors by residuals, we do some Monte Carlo studies. The software that we use is R version 2.13.0 for Windows.
We consider the following data-generating process in this section
where the individual effect iid , a chi-square distribution with 2 degrees of freedom, with
Extension to large large case
In this section we extend our test to large large case. In this setup, as Inoue and Solon (2006) and Okui (2009) did, we assume that is iid across , , and our asymptotic results are derived as both and go to infinity.
Our dependence measure in this case is whose sample counterpart is where and
Conclusions
In this paper, we propose a test for the serial independence of errors in panel data models. Our test is nuisance-parameter-free, powerful against any type of pairwise dependence at all lags, and does not involve bandwidth selection. We show that, provided some conditions are satisfied, our test statistic based on the estimated residuals has the same asymptotic distribution as the corresponding statistic based on the unobservable true errors. Some Monte Carlo simulations illustrate the good
Acknowledgments
I would like to thank Juan Carlos Escanciano, the Co-Editor, Erricos Kontoghiorghes, the associate editor and two anonymous referees for their helpful comments and suggestions. I acknowledge the computational resources provided by Bing Li, Yan Dong, Chunlin Chen, Pei Pei, Qing Lin and the Libra High Performance Cluster at Indiana University. The financial support from the Western and Border Areas Program, Humanities and Social Sciences, Ministry of Education of China, 11XJC790002 is greatly
References (43)
- et al.
Testing for heteroskedasticity and spatial correlation in a random effects panel data model
Computational Statistics and Data Analysis
(2009) - et al.
Forecasting with spatial panel data
Computational Statistics and Data Analysis
(2012) Weak convergence of non-stationary multivariate marked processes with applications to martingale testing
Journal of Multivariate Analysis
(2007)- et al.
Approximating the critical values of Cramér–von Mises tests in general parametric conditional specifications
Computational Statistics and Data Analysis
(2010) - et al.
Common risk factors in the returns on stocks and bonds
Journal of Financial Economics
(1993) Testing serial correlation in fixed effects regression models based on asymptotically unbiased autocorrelation estimators
Mathematics and Computers in Simulation
(2009)- et al.
A robust sign test for panel unit roots under cross sectional dependence
Computational Statistics and Data Analysis
(2009) - et al.
Some tests of specification for panel data: Monte Carlo evidence and an application to employment equations
Review of Economic Studies
(1991) Convergence of Probability Measures
(1968)- et al.
Distribution free tests of independence based on the sample distribution function
Annals of Mathematical Statistic
(1961)
Distribution of residual autocorrelations in autoregressive integrated moving average time series models
Journal of the American Statistical Association
The Lagrange multiplier test and its applications to model specification in econometrics
Review of Economic Studies
Nonlinear Dynamics, Chaos, and Instability: Statistical Theory and Economic Evidence
A test for independence based on the correlation dimension
Econometric Reviews
Weighted minimum mean-square distance from independence estimation
Econometrica
Nonparametric tests for serial dependence
Journal of Time Series Analysis
Adaptive permutation tests for serial dependence
Statistica Neerlandica
Evaluating density forecasts of linear and nonlinear models: applications to output growth and unemployment
Journal of Forecasting
Testing serial independence using the sample distribution function
Journal of Time Series Analysis
Cited by (2)
On some characterizations and multidimensional criteria for testing homogeneity, symmetry and independence
2019, Journal of Multivariate AnalysisAsymptotics, finite-sample comparisons and applications for two-sample tests with functional data
2019, Journal of Multivariate Analysis