Abstract
Traditional survival analysis estimates the instantaneous failure rate of an event and predicts survival probabilities distributions. In fact, in a set of censored data there may exist several sub-populations with various risk profiles or survival distributions, for which regular survival analysis approaches do not take into consideration. Consequently, there is a need for discovering such sub-populations with unambiguous risk profiles and survival distributions. In this work, we propose a modified version of the K-Medoids algorithm which can be used to efficiently cluster censored data and identify diverse groups with distinct lifetime distributions.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Bair, E., Tibshirani, R., Golub, T.: Semi-supervised methods to predict patient survival from gene expression data. PLoS Biol. 2(4), e108 (2004)
Chapfuwa, P., Li, C., Mehta, N., Carin, L., Henao, R.: Survival cluster analysis. In: Proceedings of the ACM Conference on Health, Inference, and Learning, pp. 60–68 (2020)
Clark, T.G., Bradburn, M.J., Love, S.B., Altman, D.G.: Survival analysis part i: basic concepts and first analyses. Br. J. Cancer 89(2), 232–238 (2003)
Fox, J., Carvalho, M.S.: The rcmdrplugin. survival package: extending the r commander interface to survival analysis. J. Statist. Softw. 49, 1–32 (2012)
Henderson, R., Shimakura, S., Gorst, D.: Modeling spatial variation in leukemia survival data. J. Am. Stat. Assoc. 97(460), 965–972 (2002)
Ho-Kieu, D., Vo-Van, T., Nguyen-Trang, T.: Clustering for probability density functions by new-medoids method. Sci. Program. 2018, 1–7 (2018)
Huster, W.J., Brookmeyer, R., Self, S.G.: Modelling paired survival data with covariates. Biometrics 45(1), 145-56 (1989)
Klein, J.P., Moeschberger, M.L.: Survival Analysis: Techniques for Censored and Truncated Data. Springer, New York (2006). https://doi.org/10.1007/978-1-4757-2728-9
Kuiper, N.H.: Tests concerning random points on a circle. In: Nederlandse Akademie van Wetenschappen. roceedings. Series A.,vol. 63, pp. 38–47 (1960)
Lee, E.T., Wang, J.: Statistical Methods for Survival Data Analysis, vol. 476. John Wiley & Sons (2003)
Li, H., Gui, J.: Partial cox regression analysis for high-dimensional microarray gene expression data. Bioinformatics 20(suppl_1), i208–i215 (2004)
Manduchi, L., et al.: A deep variational approach to clustering survival data. arXiv preprint arXiv:2106.05763 (2021)
Marinos, G., Kyriazis, D.: A survey of survival analysis techniques. In: HEALTHINF, pp. 716–723 (2021)
Marinos, G., Symvoulidis, C., Kyriazis, D.: Micsurv: medical image clustering for survival risk group identification. In: 2021 4th International Conference on Bio-Engineering for Smart Technologies (BioSMART), pp. 1–4. IEEE (2021)
Matusita, K.: On the notion of affinity of several distributions and some of its applications. Ann. Inst. Stat. Math. 19(1), 181–192 (1967)
Mouli, S.C., Naik, A., Ribeiro, B., Neville, J.: Identifying user survival types via clustering of censored social network data. arXiv preprint arXiv:1703.03401 (2017)
Mouli, S.C., Teixeira, L., Neville, J., Ribeiro, B.: Deep lifetime clustering. arXiv preprint arXiv:1910.00547 (2019)
Nielsen, F., Nock, R., Amari, S.i.: On clustering histograms with k-means by using mixed \(\alpha \)-divergences. Entropy 16(6), 3273–3301 (2014)
Pham-Gia, T., Turkkan, N., Vovan, T.: Statistical discrimination analysis using the maximum function. Commun. Stati. Simul. Comput. ® 37(2), 320–336 (2008)
Prinja, S., Gupta, N., Verma, R.: Censoring in clinical trials: review of survival analysis techniques. Indian j. Commun. Med. 35(2), 217 (2010)
Vasn de Vijver, M.J., et al.: A gene-expression signature as a predictor of survival in breast cancer. N. Engl. J. Med. 347(25), 1999–2009 (2002)
Van Dongen, S.: Graph Clustering by Flow Simulation. 2000. University of Utrecht (2001)
Van Vo, T., Pham-Gia, T.: Clustering probability distributions. J. Appl. Stat. 37(11), 1891–1910 (2010)
Acknowledgements
The research leading to the results presented in this paper has received funding from the European Union’s funded Project iHelp under grant agreement no 101017441.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Marinos, G., Symvoulidis, C., Kyriazis, D. (2023). K-Medoids-Surv: A Patients Risk Stratification Algorithm Considering Censored Data. In: Rutkowski, L., Scherer, R., Korytkowski, M., Pedrycz, W., Tadeusiewicz, R., Zurada, J.M. (eds) Artificial Intelligence and Soft Computing. ICAISC 2022. Lecture Notes in Computer Science(), vol 13589. Springer, Cham. https://doi.org/10.1007/978-3-031-23480-4_11
Download citation
DOI: https://doi.org/10.1007/978-3-031-23480-4_11
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-23479-8
Online ISBN: 978-3-031-23480-4
eBook Packages: Computer ScienceComputer Science (R0)