Abstract
One or few observations can be highly influential on estimates of regression coefficients in the linear regression model. In this paper we derive influence diagnostics for the varying coefficients model with longitudinal data. We note that diagnostics in this context is quite different from the classical regression model in the sense that regression coefficients vary as time varies. A version of Cook’s distance is suggested to reflect this specific aspect of varying coefficient model. An algorithm to present some guidelines to determine influential observations deserving special attention is developed. An illustrative example based on the AIDS data is also given.
Similar content being viewed by others
References
Altman NS (1990) Kernel smoothing of data with correlated errors. J Am Stat Assoc 85:749–759
Atkinson AC (1981) A comparison of two criteria for the design of experiments for discriminating between models. Technometrics 23:301–305
Banerjee M, Frees EW (1997) Influence diagnostics for linear longitudinal models. J Am Stat Assoc 92:999–1005
Belsley DA, Kuh E, Welsch RE (1980) Regression diagnostics : identifying influential data and sources of collinearity. Wiley, New York
Chatterjee S, Hadi AS (1986) Influential observations, high leverage points, and outliers in linear regression, (with discussion). Stat Sci 1:379–416
Cook RD (1977) Detection of influential observation in linear regression. Technometrics 19:15–18
Cook RD, Weisberg S (1982) Characterizations of an empirical influence function for detecting influential cases in regression. Technometrics 22:495–508
Davidian M, Giltinan DM (1995) Nonlinear models for repeated measurement data. Chapman and Hall, London
Diggle PJ (1988) An approach to the analysis of repeated measurements. Biometrics 44:959–971
Diggle PJ, Liang KY, Zeger SL (1994) Analysis of longitudinal data. Oxford University Press, Oxford
Eubank RL (1985) Diagnostics for smoothing splines. J R Stat Soc Ser B 47:332–341
Eubank RL (1988) Spline smoothing and nonparametric regression. Chapman and Hall, New York
Fan J, Zhang W (1997) Statistical estimation in varying-coefficient models. Ann Stat 27:1491–1518
Fan J, Zhang W (2000) Two-step estimation of functional linear models with applications to longitudinal data. J R Stat Soc Ser B 62:303–322
Fung W-K, Zhu Z-Y, Wei B-C, He X (2002) Influence diagnostics and outlier tests for semiparametric mixed models. J R Stat Soc Ser B 64:565–580
Hart JD (1991) Kernel regression estimation with time series errors. J Am Stat Assoc 86:665–672
Hart JD, Wehrly TE (1986) Kernel regression estimation using repeated measurements data. J Am Stat Assoc 81:1080–1088
Hastie TJ, Tibshirani RJ (1993) Varying-coefficient models. J R Stat Soc Ser B 55:757–796
He X, Zhu ZY, Fung WK (2002) Estimation in semiparametric model for longitudinal data with unspecified dependence structure. Biometrika 89:579–590
He X, Fung WK, Zhu ZY (2005) Robust estimation in generalized partial linear models for clustered data. J Am Stat Assoc 100:1176–1184
Hoover DR, Rice JA, Wu CO, Yang LP (1998) Nonparametric smoothing estimates of time-varying coefficient models with longitudinal data. Biometrika 85:809–822
Kauermann G, Tutz G (1999) On model diagnostics using varying coefficient models. Biometrika 86:119–128
Kim C (1996) Local influence and replacement measure. Commun Stat Theory Methods 25:49–61
Kim C, Storer BE (1996) Reference values for Cook’s distance. Commun Stat Simul Comput 25:691–709
Kim C, Lee Y, Park B-U (2001) Cook’s distance in local polynomial regression. Stat Prob Lett 54:33–40
Kim C, Park B-U, Kim W (2002) Influence diagnostics in semiparametric regression models. Stat Prob Lett 60:49–58
Kim C, Jeong M, Kim W, Park B-U (2004) Quasi-likelihood regression for varying coefficient models with longitudinal data. J Korean Stat Soc 33:367–379
Liang KY, Zeger SL (1986) Longitudinal data analysis using generalized linear models. Biometrika 73:13–22
Lin DY, Ying Z (2001) Semiparametric and nonparametric regression analysis of longitudinal data. J Am Stat Assoc 96:103–126
Martinussen T, Scheike TH (1999) A semiparametric additive regression model fit longitudinal data. Biometrika 86:691–702
Pantula SG, Pollock KH (1995) Nested analysis of variance with autocorrelated errors. Biometrics 41:909–920
Silverman BW (1986) Density estimation for statistics and data analysis. Chapman and Hall, London
Ware JH (1985) Linear models for the analysis of longitudinal studies. Am Stat 39:95–101
Wu CO, Chiang CT, Hoover DR (1998) Asymptotic confidence regions for kernel smoothing of a varying-coefficient model with longitudinal data. J Am Stat Assoc 93:1388–1402
Yau KKW, Kuk AYC (2002) Robust estimation in generalized linear mixed models. J R Stat Soc Ser B 64:101–117
Zeger SL, Diggle PJ (1994) Semiparametric models for longitudinal data with application to CD4 cell numbers in HIV seroconverters. Biometrics 50:689–699
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Bae, W., Hwang, S. & Kim, C. Influence diagnostics in the varying coefficient model with longitudinal data. Computational Statistics 23, 185–196 (2008). https://doi.org/10.1007/s00180-007-0025-4
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00180-007-0025-4