Abstract
Competing risk data appear widely in modern biomedical research. In the past two decades, cause-specific hazard models are often used to deal with competing risk data. There is no current study on the kernel likelihood method for the cause-specific hazard model with time-varying coefficients. We propose to use the local partial log-likelihood approach for nonparametric time-varying coefficient estimation. Simulation studies demonstrate that our proposed nonparametric kernel estimator performs well under assumed finite sample settings. And we also compare the local kernel estimator with the penalized spline estimator. Finally, we apply the proposed method to analyze a diabetes dialysis study with competing death causes.






Similar content being viewed by others
References
Austin PC, Fine JP (2017) Accounting for competing risks in randomized controlled trials: a review and recommendations for improvement. Stat Med 36(8):1203–1209
Bender R, Augustin T, Blettner M (2005) Generating survival times to simulate cox proportional hazards models. Stat Med 24(11):1713–1723
Beyersmann J, Latouche A, Buchholz A, Schumacher M (2009) Simulating competing risks data in survival analysis. Stat Med 28(6):956–971
Beyersmann J, Allignol A, Schumacher M (2012) Competing risks and multistate models with R. Springer, New York
Breslow N (1974) Covariance analysis of censored survival data. Biometrics 30(1):89–99
Cai Z, Sun Y (2003) Local linear estimation for time-dependent coefficients in cox’s regression models. Scand J Stat 30(1):93–111
Cai Z, Fan J, Li R (2000) Efficient estimation and inferences for varying-coefficient models. J Am Stat Assoc 95(451):888–902
Cai J, Fan J, Jiang J, Zhou H (2007) Partially linear hazard regression for multivariate survival data. J Am Stat Assoc 102(478):538–551
Chen K, Guo S, Sun L, Wang JL (2010) Global partial likelihood for nonparametric proportional hazards models. J Am Stat Assoc 105(490):750–760
Fan J, Gijbels I (1992) Spatial and design adaptation: adaptive order polynomial approximation in function estimation. Institute of Statistics mimeo series 2080, North Carolina State University
Fan J, Gijbels I, King M (1997) Local likelihood and local partial likelihood in hazard regression. Ann Stat 25(4):1661–1690
Fine JP, Gray RJ (1999) A proportional hazards model for the subdistribution of a competing risk. J Am Stat Assoc 94(446):496–509
Gaynor JJ, Feuer EJ, Tan CC, Wu DH, Brennan MF (1993) On the use of cause-specific failure and conditional failure probabilities: examples from clinical oncology data. J Am Stat Assoc 88(422):400–409
Geskus RB (2011) Cause-specific cumulative incidence estimation and the fine and gray model under both left truncation and right censoring. Biometrics 67(1):39–49
Gray RJ (1988) A class of k-sample tests for comparing the cumulative incidence of a competing risk. Ann Stat 16(3):1141–1154
Gray RJ (1992) Flexible methods for analyzing survival data using splines, with applications to breast cancer prognosis. J Am Stat Assoc 87(420):942–951
Härdle W (1990) Applied nonparametric regression, vol 19. Cambridge University Press, Cambridge
Hastie T, Tibshirani R (1990) Generalized additive models. Chapman and Hall, London
Hastie T, Tibshirani R (1993) Varying coefficient models (with discussion). J R Stat Soc Ser B Methodol 55(4):757–796
Hoover DR, Rice JA, Wu CO, Yang LP (1998) Nonparametric smoothing estimates of time-varying coefficient models with longitudinal data. Biometrika 85(4):809–822
Kalbfleisch JD, Prentice RL (2011) The statistical analysis of failure time data, 2nd edn. Wiley, London
Lau B, Cole SR, Gange SJ (2009) Competing risk regression models for epidemiologic data. Am J Epidemiol 170(2):244–256
Lin H, Fei Z, Li Y (2016a) A semiparametrically efficient estimator of the time-varying effects for survival data with time-dependent treatment. Scand J Stat 43(3):649–663
Lin H, He Y, Huang J (2016b) A global partial likelihood estimation in the additive cox proportional hazards model. J Stat Plan Inference 169:71–87
Peng L, Huang Y (2007) Survival analysis with temporal covariate effects. Biometrika 94(3):719–733
Prentice RL, Kalbfleisch JD, Peterson AV, Flournoy NT, Breslow NE (1979) The analysis of failure times in the presence of competing risks. Biometrics 34(4):541–554
Putter H, Fiocco M, Geskus RB (2007) Competing risks and multi-state models. Stat Med 26(11):2389–2430
Ren X, Li S, Shen C, Yu Z (2018) Linear and nonlinear variable selection in competing risks data. Stat Med 37(13):2134–2147
Rice JA, Silverman BW (1991) Estimating the mean and covariance structure nonparametrically when the data are curves. J R Stat Soc Ser B Methodol 53(1):233–243
Schulgen G, Olschewski M, Krane V, Wanner C, Ruf G, Schumacher M (2005) Sample sizes for clinical trials with time-to-event endpoints and competing risks. Contemp Clin Trials 26(3):386–396
Sun L, Zhu L, Sun J (2009) Regression analysis of multivariate recurrent event data with time-varying covariate effects. J Multivar Anal 100(10):2214–2223
Sun L, Song X, Zhang Z (2011) Mean residual life models with time-dependent coefficients under right censoring. Biometrika 99(1):185–197
Tian L, Zucker D, Wei LJ (2005) On the cox model with time-varying regression coefficients. J Am Stat Assoc 100(469):172–183
Verweij PJM, Van Houwelingen HC (1993) Cross-validation in survival analysis. Stat Med 12(24):2305–2314
Wanner C, Krane V, März W, Olschewski M, Mann JF, Ruf G, Ritz E (2005) Atorvastatin in patients with type 2 diabetes mellitus undergoing hemodialysis. N Engl J Med 353(3):238–248
Yu Z, Lin X (2010) Seminparametric regression with time-dependent coefficents for failure time data analysis. Stat Sin 20(2):853–869
Yu Z, Liu L, Bravata DM, Williams LS (2014) Joint model of recurrent events and a terminal event with time-varying coefficients. Biom J 56(2):183–197
Zhao X, Zhou J, Sun L (2010) Semiparametric transformation models with time-varying coefficients for recurrent and terminal events. Biometrics 67(2):404–414
Zucker D, Karr A (1990) Nonparametric survival analysis with time-dependent covariate effects: a penalized partial likelihood approach. Ann Stat 18(1):329–353
Acknowledgements
This research was supported in part by the National Natural Science Foundation of China (12171318), by the Shanghai Commission of Science and Technology (21ZR1436300), by the 3-year plan of Shanghai public health system construction (GWV-10.1-XK05), and also by Shanghai Jiao Tong University STAR Grant (20190102).
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
A cross-validation score
A cross-validation score
In this appendix A, we first construct the log-likelihood of the ith subject. For failure type \(j=1,\dots ,m\), omitting the n in the log-likelihood formula (2), we can obtain the local partial log-likelihood as follows:
It is equivalent to the local partial log-likelihood defined in (2). Similarly, when the ith subject is left out, the local partial log-likelihood can be written as
From (10) and (11) yields the contribution of individual i to the likelihood
For the second term on the right-hand side, the term
Therefore, for failure type \(j=1,\dots ,m\), we derive an alternative expression for \(l_j^i(b_j)\) by
It is equivalent to the likelihood as follows:
Next, we give the derived process of the approximations for \(\widehat{\mathbf {b}}^{(-i)}_{j}\) and the cross-validated score \(CV_s(h)\). For failure type \(j=1,\dots ,m\), we apply a Taylor expansion to approximate \(\widehat{\mathbf {b}}^{(-i)}_{j}\). From (6), we have \(l^{(-i)}_j(b_j)=l_j(b_j)-l^i_j(b_j),\) taking the derivative of both sides of this equation with respect to \(b_j\)
at \(b_j=\widehat{\mathbf {b}}_j\), using first-order Taylor expansion to approximate
Combining \( \frac{\partial l^{(-i)}_j}{\partial b_j}(\widehat{\mathbf {b}}_{j})=\frac{\partial l_j}{\partial b_j}(\widehat{\mathbf {b}}_{j})-\frac{\partial l^i_j}{\partial b_j}(\widehat{\mathbf {b}}_{j})\), \(\frac{\partial l_j}{\partial b_j}(\widehat{\mathbf {b}}_{j})=0\), and substituting \(\widehat{\mathbf {b}}_{j}^{(-i)}\) for \(b_j\) in the above formula, we have
Solving the above equation with respect to \(\widehat{\mathbf {b}}^{(-i)}_{j}\), we infer
Substituting \(\widehat{\mathbf {b}}_{j}\) for \(b_j\) in (14), and combined with (15), for failure type \(j=1,\dots ,m\), we establish the approximations for \(\widehat{\mathbf {b}}^{(-i)}_{j}\) as
where we ignore the second derivation of \(l_j^i(b_j)\) because its calculation will consume much computer storage and time, and the approximation of the estimator \(\widehat{\mathbf {b}}^{(-i)}_{j}\) is the function of \(\widehat{\mathbf {b}}_{j}\).
Then we are going to approximate the cross-validated score \(CV_s(h)=\sum _{i=1}^n l^i_j\left( \widehat{\mathbf {b}}^{(-i)}_{j}\right) \) for failure type \(j=1,\dots ,m\). Associating with (12) and (16), for \(l_j^i(b_j)\) using a first-order Taylor approximation, we obtain
where for the first term of the right-hand side, using a Taylor expansion at \(\widehat{\mathbf {b}}_{j}^{(-i)}=\widehat{\mathbf {b}}_{j}\), and for the third term of the right-hand side, utilizing the trace’s basic properties. Then, from (17), we establish the approximation of \(CV_s\) for failure type \(j=1,\dots ,m\)
Rights and permissions
About this article
Cite this article
Qi, X., Yu, Z. Kernel regression for cause-specific hazard models with time-dependent coefficients. Comput Stat 38, 263–283 (2023). https://doi.org/10.1007/s00180-022-01227-2
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00180-022-01227-2