Skip to main content

Advertisement

Log in

Bandwidth selection for the estimation of transition probabilities in the location-scale progressive three-state model

  • Original Paper
  • Published:
Computational Statistics Aims and scope Submit manuscript

Abstract

Times between consecutive events are often of interest in medical studies. Usually the events represent different states of the disease process and are modeled using multi-state models. This paper introduces and studies a feasible estimation method for the transition probabilities in a progressive three-state model. We assume that the vector of gap times \((T_1,T_2)\) satisfies a nonparametric location-scale regression model \(T_2=m(T_1)+\sigma (T_1)\epsilon \), where the functions \(m\) and \(\sigma \) are ‘smooth’, and \(\epsilon \) is independent of \(T_1\). Under this model, Van Keilegom et al. (J Stat Plan Inference 141:1118–1131, 2011) proposed estimators of the transition probabilities. However, the important issue of automatic bandwidth choice in this setting has not been examined, making the analysis of real datasets rather difficult. In this paper, we study the performance of their estimator in practice, we propose some modifications and study practical issues related to the implementation of the estimator, which involves the choice of an appropriate bandwidth. In an extensive simulation study the good performance of the method is shown. Simulations also demonstrate that the proposed estimator compares favorably with alternative estimators. Furthermore, the proposed methodology is illustrated with a real database on breast cancer.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12

Similar content being viewed by others

References

  • Aalen OO, Johansen S (1978) An empirical transition matrix for nonhomogeneous Markov chains based on censored observations. Scand J Stat 5:141–150

    MathSciNet  MATH  Google Scholar 

  • Amorim AP, Uña-Álvarez J, Meira-Machado LF (2001) Presmoothing the transition probabilities in the illness-death model. Stat Probab 81:797–806

    Article  Google Scholar 

  • Andersen PK, Borgan O, Gill RD, Keiding N (1993) Statistical models based on counting processes. Springer, New York

    Book  MATH  Google Scholar 

  • Beran R (1981) Nonparametric regression with randomly censored survival data. Technical report. University of California, Berkeley

    Google Scholar 

  • Cadarso-Suárez C, Meira-Machado LF, Kneib T, Gude F (2010) Flexible hazard ratio curves for continuous predictors in multi-state models: a P-spline approach. Stat Model 10:291–314

    Article  MathSciNet  Google Scholar 

  • Chavez-Uribe E, Cameselle-Teijeiro J, Vinuela JE et al (2007) Hypoploidy defines patients with poor prognosis in breast cancer. Oncol Rep 17:1109–1114

    Google Scholar 

  • Dabrowska DM (1992) Variable bandwidth conditional Kaplan-Meier estimate. Scand J Stat 19:351–361

    MathSciNet  MATH  Google Scholar 

  • Hougaard P (2000) Analysis of multivariate survival data. Springer, New-York

    Book  MATH  Google Scholar 

  • Kaplan EL, Meier P (1958) Nonparametric estimation from incomplete observations. J Am Stat Assoc 53:457–481

    Article  MathSciNet  MATH  Google Scholar 

  • Li G, Datta G (2001) A bootstrap approach to nonparametric regression for right censored data. Ann Inst Stat Math 53:708–729

    Article  MathSciNet  MATH  Google Scholar 

  • Meira-Machado LF, de Uña-Álvarez J, Cadarso-Suárez C (2006) Nonparametric estimation of transition probabilities in a non-Markov illness-death model. Lifetime Data Anal 12:325–344

    Article  MathSciNet  Google Scholar 

  • Van Keilegom I, Akritas M (1999) Transfer of tail information in censored regression models. Ann Stat 27:1745–1784

    Article  MATH  Google Scholar 

  • Van Keilegom I, Akritas M, Veraverbeke N (2001) Estimation of the conditional distribution in regression with censored data: a comparative study. Comput Stat Data Anal 35:487–500

    Article  MATH  Google Scholar 

  • Van Keilegom I, de Uña-Álvarez J, Meira-Machado LF (2011) Nonparametric location-scale models for successive survival times under dependent censoring. J Stat Plan Inference 141:1118–1131

    Article  MATH  Google Scholar 

Download references

Acknowledgments

Luís Meira-Machado acknowledges financial support from FEDER Funds through “Programa Operacional Factores de Competitividade—COMPETE” and by Portuguese Funds through FCT—“Fundação para a Ciência e a Tecnologia”, in the form of grants PTDC/MAT/104879/2008 and Est-C/MAT/UI0013/2011. Luís Meira-Machado and Carmen Cadarso-Suárez acknowledge the support received by the Spanish Ministry of Industry and Innovation, Grant MTM2011-28285-C02-01. Roca-Pardiñas acknowledges financial support from research Grants MTM2011-23204 (FEDER support included) of the Spanish Ministry of Industry and Innovation, and Galician Regional Authority (Xunta de Galicia) grant 10PXIB300068PR. Van Keilegom acknowledges financial support from the European Research Council under the European Community’s Seventh Framework Programme (FP7/2007-2013) / ERC Grant agreement No. 203650, from IAP research network P7/06 of the Belgian Government (Belgian Science Policy), and from the contract ‘Projet d’Actions de Recherche Concertées’ (ARC) 11/16-039 of the ‘Communauté française de Belgique’, granted by the ‘Académie universitaire Louvain’. We are also grateful to both the associate editor and the two peer referees for their valuable comments and suggestions, which served to make a substantial improvement to this paper.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Luís Meira-Machado.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Meira-Machado, L., Roca-Pardiñas, J., Van Keilegom, I. et al. Bandwidth selection for the estimation of transition probabilities in the location-scale progressive three-state model. Comput Stat 28, 2185–2210 (2013). https://doi.org/10.1007/s00180-013-0402-0

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00180-013-0402-0

Keywords

Navigation