Abstract
In this paper, the empirical likelihood inferences for varying-coefficient semiparametric mixed-effects errors-in-variables models with longitudinal data are investigated. We construct the empirical log-likelihood ratio function for the fixed-effects parameters and the mean parameters of random-effects. The empirical log-likelihood ratio at the true parameters is proven to be asymptotically \(\chi ^2_{q+r}\), where \(q\) and \(r\) are dimensions of the fixed and random effects respectively, and the corresponding confidence regions for them are then constructed. We also obtain the maximum empirical likelihood estimator of the parameters of interest, and prove it is the asymptotically normal under some suitable conditions. A simulation study and a real data application are undertaken to assess the finite sample performance of the proposed method.

Similar content being viewed by others
References
Bai Y, Fung WK, Zhu ZY (2010) Weighted empirical likelihood for generalized linear models with longitudinal data. J Stat Plan Inference 140:3446–3456
Chen QH, Zhong PS, Cui HJ (2009) Empirical likelihood for mixed-effects error-in-variables model. Acta Mathematicae Applicatae Sinica (English Series) 25:561–578
Cui HJ, Chen SX (2003) Empirical likelihood confidence region for parameter in the errors-in-variables models. J Multivar Anal 84:101–115
Cui HJ, Ng Kai W (2004) Estimation in mixed effects model with errors in variables. J Multivar Anal 91:53–73
Fan JQ, Huang T (2005) Profile likelihood inferences on semi-parametric varying-coefficient partially linear models. Bernoulli 11:1031–1057
Fan JQ, Li R (2004) New estimation and model selection procedures for semiparametric modeling in longitudinal data analysis. J Am Stat Assoc 99:710–723
Huang JZ, Wu CO, Zhou L (2002) Varying-coefficient models and basis function approximations for the analysis of repeated measurements. Biometrika 89:111–128
Huang JZ, Wu CO, Zhou L (2004) Polynomial spline estimation and inference for varying coefficient models with longitudinal data. Statistica Sinica 14:763–788
Huang ZS, Zhang RQ (2009) Empirical likelihood for nonparametric parts in semiparametric varying-coefficient partially linear models. Stat Probab Lett 79:1798–1808
Huang ZS, Zhou ZG, Jiang R, Qian WM, Zhang RQ (2010) Empirical likelihood based inference for semiparametric varying-coefficient partially linear models with error-prone linear covariates. Stat Probab Lett 80:497–504
Kaslow RA, Ostrow DG, Detels R, Phair JP, Polk BF, Rinaldo CJ (1987) The multicenter AIDS cohort study: rationale, organization and selected characteristics of the participants. Am J Epidemiol 126:310–318
Kolaczyk ED (1994) Empirical likelihood for generalized linear models. Statistica Sinica 4:199–218
Laird NM, Ware JH (1982) Random effects models for longitudinal data. Biometrics 38:963–974
Li L, Greene T (2008) Varying coefficients model with measurement error. Biometrics 64:519–526
Li YS, Lin XH, Muller P (2010) Bayesian inference in semiparametric mixed models for longitudinal data. Biometrics 66:70–78
Li GR, Xue LG (2008) Empirical likelihood confidence region for the parameter in a partially linear errors-in-variables model. Commun Stat Theory Methods 37:1552–1564
Liang H (2000) Asymptotic normality of parametric part in partially linear models with measurement error in the nonparametric part. J Stat Plan Inference 86:51–62
Liang H, Härdle W, Carroll RJ (1999) Estimation in a semiparametric partially linear errors-in-variables model. Ann Stat 27:1519–1535
Liang KY, Zeger SL (1986) Longitudinal data analysis using generalized linear model. Biometrika 73:13–22
Lin DY, Ying Z (2001) Semiparametric and nonparametric regression analysis of longitudinal data (with discussion). J Am Stat Assoc 96:103–126
McCullagh P, Nelder JA (1989) Generalized linear models, 2nd edn. Chapman & Hall, London
Nelder JA, Wedderburn RWM (1972) Generalized linear models. J R Stat Soc Ser A 135:370–384
Owen AB (1990) Empirical likelihood ratio confidence regions. Ann Stat 18:90–120
Owen AB (2001) Empirical likelihood. Chapman & Hall, New York
Qin J, Lawless J (1994) Empirical likelihood and general estimating equations. Ann Stat 22:300–325
Qin GY, Bai Y, Zhu ZY (2009) Robust empirical likelihood inference for longitudinal data. Stat Probab Lett 79:2101–2108
Ruppert D, Wand MP, Carroll RJ (2003) Semiparametric regression. Cambridge University Press, Cambridge
Vonesh EF, Chinchilli VM (1996) Linear and nonlinear models for the analysis of repeated measurements. Marcel Dekker, New York
Wang Y (1998) Mixed-effects smoothing spline ANOVA. J R Stat Soc Ser B 60:159–174
Wang XL, Li GR, Lin L (2011) Empirical likelihood inference for semi-parametric varying-coefficient partially linear EV models. Metrika 73:171–185
Wang SJ, Qian LF, Carroll RJ (2010) Generalized empirical likelihood methods for analyzing longitudinal data. Biometrika 97:79–93
Wei CH (2011) Estimation in varying-coefficient errors-in-variables models with missing response variables. Commun Stat Simul Comput 40:383–393
Wu H, Zhang JT (2006) Nonparametric regression methods for longitudinal data. Wiley, New Jersey
Xue LG, Zhu LX (2007a) Empirical likelihood for a varying coefficient model with longitudinal data. J Am Stat Assoc 102:642–654
Xue LG, Zhu LX (2007b) Empirical likelihood semiparametric regression analysis for longitudinal data. Biometrika 94:921–937
Xue LG, Zhu LX (2008) Empirical likelihood-based inference in a partially linear model for longitudinal data. Sci China Ser A 51:115–130
You JH, Chen GM, Zhou Y (2006) Block empirical likelihood for longitudinal partially linear regression models. Can J Stat 34:79–96
You JH, Chen GM (2006) Estimation of a semiparametric varying-coefficient partially linear errors-in-variables model. J Multivar Anal 97:324–341
Zeger SL, Diggle PJ (1994) Semiparametric models for longitudinal data with application to CD4 cell numbers in HIV seroconverters. Biometrics 50:689–699
Zhao PX, Xue LG (2009) Empirical likelihood inferences for semiparametric varying-coefficient partially linear error-in-variables models with longitudinal data. J Nonparametr Stat 21:907–923
Zhao PX, Xue LG (2010) Variable selection for semiparametric varying coefficient partially linear errors-in-variables models. J Multivar Anal 101:1872–1883
Zhang D, Lin X, Raz J, Sowers M (1998) Semiparametric stochastic mixed models for longitudinal data. J Am Stat Assoc 93:710–719
Zhong XP, Fung WK, Wei BC (2002) Estimation in linear models with random effects and errors-in-variables. Ann Inst Math Stat 54:595–606
Zhou Y, Liang H (2009) Statistical inference for semiparametric varying-coefficient partially linear models with error-prone linear covariates. Ann Stat 37:427–458
Zhu LX, Cui HJ (2003) Semiparametric regression model with errors in variables. Scan J Stat 30:429–444
Acknowledgments
The authors are grateful to the Editor and two anonymous referees for their constructive comments which have greatly improved this paper. This work is partially supported by Anhui Provincial Natural Science Foundation (No. 11040606M04), Key Natural Science Foundation of Higher Education Institutions of Anhui Province of China (No. KJ2012A270), NSFC (No. 11171065), NSFJS (No. BK2011058), Youth Foundation for Humanities and Social Sciences Project from Ministry of Education of China (No. 11YJC790311), Postdoctoral Research Program of Jiangsu Province of China (No. 1202013C) and Scientific Research Starting Foundation for Talents of Tongling University (No. 2012tlxyrc05).
Author information
Authors and Affiliations
Corresponding author
Appendix
Appendix
To illustrate our main results, the following assumptions are imposed. These assumptions are actually quite mild and can be easily satisfied.
-
(a)
The bandwidth satisfies \(h=h_0n^{-1/5}\) for some constant \(h_0>0\).
-
(b)
The matrices \(\Upsilon (t)\), \(\Phi (t)\) and \(\Psi (t)\) are twice continuously differentiable on \((0,1)\), and \(\Upsilon (t)\) is positive definite on \((0,1)\).
-
(c)
\(\{\alpha _l(u),l=1,\ldots ,p\}\) has continuous second derivatives on \((0,1)\).
-
(d)
The Kernel \(K(\cdot )\) is a symmetric density function with compact support.
-
(e)
The intensity function \(f(t)\) of the process \(N(t)\) is bounded away from 0 and infinity on \([0,1]\), and is continuously differentiable on \((0,1)\).
-
(f)
There is an \(s>2\) such that \(\sup _{0\le t\le 1}E\Vert X(t)\Vert ^{2s}<\infty \), \(\sup _{0\le t\le 1}E\Vert W(t)\Vert ^{2s}<\infty \), \(\sup _{0\le t\le 1}E\Vert Z(t)\Vert ^{4s}<\infty \), \(\sup _{0\le t\le 1}E\Vert \mu (t)\Vert ^{2s}<\infty \), \(\sup _{0\le t\le 1}E\Vert \nu (t)\Vert ^{2s}\!<\!\infty \), \(E\Vert \gamma \Vert ^{4s}<\infty \), \(\sup _{0\le t\le 1}E\Vert \epsilon (t)\Vert ^{2s}<\infty \), \(i=1,\ldots n\), and for some \(\delta <2-s^{-1}\) such that \(n^{2\delta -1}h\rightarrow \infty \).
-
(g)
\(\Gamma \) is a positive definite matrix, where \(\Gamma \) is defined in Theorem 2.
Let \(c_n=\left( \frac{\log (1/h)}{nh}\right) ^{1/2}+h^2\), \(\kappa _l=\int u^lK(u)du\), \(l=0,1,2\), and \(A\otimes B\) be the Kronecker product of matrix \(A\) and \(B\). To prove our main results, we first give several lemmas.
Lemma 1
Suppose that assumptions (a)–(f) hold. Then it holds uniformly for \(t\in \mathcal I \)
Here we omit the proof, which is similar to Lemma 2 in Fan and Huang (2005).
Lemma 2
Suppose that assumptions (a)–(f) hold. Then it holds uniformly for \(t\in \mathcal I \)
Proof
Noting that \(S(t)=[I_p\ \ 0] \left( B_t^TM_tB_t\right) ^{-1}B_t^TM_t\), (5.5)–(5.7) can be obtained directly by Lemma 1.
Lemma 3
Suppose that assumptions (a)–(f) hold. Then it holds uniformly for \(t\in \mathcal I \)
where \(\omega \) is \(\mu \) or \(\nu \).
Proof
With the similar proof of Lemmas 1 and 2, one can obtain (5.8) and (5.9).
Lemma 4
Let \(e_i, i=1,\ldots ,n\), be a sequence of multi-independent random variate with \(E(e_i)=0\) and \(E(e_i^2)<c<\infty \). Then
Further, let \((j_1,\ldots ,j_n)\) be a permutation of \((1,\ldots ,n)\). Then we have
Proof
The proof of Lemma 4 can be found in Zhao and Xue (2009).
Let \(e_{ij}=Z_{ij}^T(\gamma _i-\gamma _\mu )+\epsilon _{ij}-(\mu ^T_{ij},\nu ^T_{ij})\beta \),
and \(J_\kappa =\frac{1}{\sqrt{n}}\sum \limits _{i=1}^n J_{i\kappa }, \kappa =1,\ldots ,5\).
Lemma 5
Suppose that assumptions (a)–(f) hold, we have
Proof
First, we prove \(J_2=o_p(1)\). Let \(A_{ij}=(\Lambda (t_{ij})-S(t_{ij})W,\Pi (t_{ij})-S(t_{ij})Z)^T\) be \((q+r)\times p\) matrix, and \(b_{ij}=X_{ij}e_{ij}\) be \(p\times 1\) vector. Then
Further, let \(a_{ij,rs}\) be the \((r,s)\) component of \(A_{ij}\), \(b_{ij,s}\) be the \(s\)th component of \(b_{ij}\), \(a_{ij_0,rs_0}b_{ij_0,s_0}=\max _{j,s}\{a_{ij,rs}b_{ij,s}\}\), and \((a_{l_i,r},i=1,\ldots ,n)\) be a permutation of \((a_{ij_0,rs_0},i=1,\ldots ,n)\) such that \(a_{l_1,r}\ge \cdots \ge a_{l_n,r}\), corresponding permutation of \((b_{ij_0,s_0},i=1,\ldots ,n)\) are denoted by \((b_{l_i},i=1,\ldots ,n)\). Denote \(J_{21r}\) be the \(r\)th component of \(J_{21}\). By Abel’s inequality, Lemmas 2 and 4, we have
For \(J_{22}\), by (5.5), (5.6) and (5.9), we have \(\Vert J_{22}\Vert \le O_p(\sqrt{n}c_n^2)=o_p(1)\). Together it with (5.11), \(J_2=o_p(1)\) holds.
For \(J_3\) and \(J_4\), we denote \(A_{ij}=\alpha (t_{ij})-S(t_{ij})(Y-(W,Z)\beta )\). From (5.7) and (5.8), we obtain uniformly in \(t_{ij}\in \mathcal I \)
Together this with \(E\left\{ \left( \begin{array}{c} W_{ij}-\Lambda ^T(t_{ij})X_{ij} \\ Z_{ij}-\Pi ^T(t_{ij})X_{ij}\\ \end{array} \right) X_{ij}^T\right\} =0\), and \(E\left\{ \left( \begin{array}{c} \mu _{ij} \\ \nu _{ij}\\ \end{array} \right) X_{ij}^T\right\} =0\), and similar to the proof of \(J_{21}\), we obtain \(J_3=o_p(1)\) and \(J_4=o_p(1)\). For \(J_5\), by (5.5), (5.6), (5.9) and (5.12), we have \(\Vert J_5\Vert \le O_p(\sqrt{n}c_n^2)=o_p(1)\). So the proof of Lemma 5 is completed.
Lemma 6
Suppose that assumptions (a)–(f) hold, we have
where \(B\) is defined in Theorem 2.
Proof
From (2.7), we have
and
Therefore,
By directly calculating its expectation and variance, we have \(E(J_1)=0\) and \(\textit{Var}(J_1)=B+o(1)\). Then, by the central limit theorem, we have \(J_1{\stackrel{\mathfrak{D }}{\longrightarrow }}N(0, B)\). Further, by Lemma 5 and (5.13), we compete the proof of Lemma 6.
Lemma 7
Suppose that assumptions (a)–(f) hold, we have
We omit the proof of Lemma 7, which can be obtained based on Lemmas 5 and 6 by the same arguments as for the proof of Lemma 5.6 in Zhao and Xue (2009).
Lemma 8
Suppose that assumptions (a)–(f) hold, we have
Proof
Note that
By Lemma 3 of Owen (1990), we have \(\max _{1\le i\le n}\Vert J_{i1}\Vert =o_p(n^{1/2})\). From (5.5), (5.6), (5.9) and Lemma 3 of Owen (1990), we have
Similar to the arguments of \(J_{i2}\), we can obtain \(\max _{1\le i\le n}\Vert J_{il}\Vert =o_p(n^{1/2}) (l=3,4,5)\). This competes the proof of Lemma 8.
Proof of Theorem 1
From Lemmas 6–8, using the same arguments as were used in the proof of expression (2.14) of Owen (1990), we have
From (2.13), we have
By using (5.14) and Lemma 8, we obtain
Applying the Taylor expansion to (2.12), and using (5.14) and Lemma 8, it follows that
Then, by (5.15)–(5.17), we have
Together with Lemmas 6–8, This completes the proof of Theorem 1.
Proof of Theorem 2
Following the similar arguments as were used in the proof of Theorem 2 in Xue and Zhu (2008), we have
By Lemma 2, we can prove \(\hat{\Gamma }{\stackrel{\mathcal{P }}{\longrightarrow }}\Gamma \) by the law of large numbers. Together with Lemma 6 and the Slutsky Theorem, this proves Theorem 2.
Rights and permissions
About this article
Cite this article
Zhou, Xc., Lin, JG. Empirical likelihood for varying-coefficient semiparametric mixed-effects errors-in-variables models with longitudinal data. Stat Methods Appl 23, 51–69 (2014). https://doi.org/10.1007/s10260-013-0238-3
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10260-013-0238-3
Keywords
- Empirical likelihood
- Varying coefficient
- Mixed-effects
- Errors-in-variables
- Longitudinal data
- Confidence regions