Abstract
In this paper, we describe clinical outcomes analysis for data in Memorial Sloan-Kettering Cancer Center Sarcoma Database using relational data mining and propose an infrastructure for managing cancer data for Drexel University Cancer Epidemiology Server (DUCES). It is a network-based multi-institutional database that entails a practical research tool that conducts On-Line Analytic Mining (OLAM). We conducted data analysis using relational learning (or relational data mining) with cancer patients’ clinical records that have been collected prospectively for 20 years. We analyzed clinical data not only based on the static event, such as disease specific death for survival analysis, but also based on the temporal event with censored data for each death. Rules extracted using relational learning were compared to results from statistical analysis. The usefulness of rules is also assessed in the context of clinical medicine. The contribution of this paper is to show that rigorous data analysis using relational data mining provides valuable insights for clinical data assessment and complements traditional statistical analysis and to propose an infrastructure to manage and mine clinical outcomes used in multi-institutional organizations.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Altman, D.G.: Practical Statistics for Medical Research. Chapman & Hall, Boca Raton (1999)
Daniel, W.: Biostatistics: A Foundation for Analysis in the Health Sciences, 7th edn. John Wiley and Sons, Inc., New York (1999)
Pisters, P., Leung, D., Woodruff, J., Shi, W., Brennan, M.: Analysis of Prognostic Factors in 1,041 Patients with Localized Soft Tissue Sarcomas of the Extremities. Clinical Oncology 14, 1679–1689 (1996)
Stojadinovic, A., Leung, D.H.Y., Hoos, A., Jaques, D.P., Lewis, J.J., Brennan, M.F.: Analysis of the prognostic significance of microscopic margins in 2084 localized primary adult soft tissue sarcomas. Ann. Surg. 235, 424–434 (2002)
Kattan, M.W., Leung, D.H., Brennan, M.F.: Postoperative nomogram for 12-year sarcoma-specific death. J. Clin. Oncol. 20, 791–796 (2002)
Lewis, J.J., Leung, D., Woodruff, J.M., Brennan, M.F.: Retroperitoneal soft-tissue sarcoma: analysis of 500 patients treated and followed at a single institution. Ann. Surg. 222, 355–365 (1998)
Han, J., Kamber, M.: Data Mining: Concepts And Techniques. Morgan Kaufmann Publishers, San Francisco (2001)
Russell, S.J., Norvig, P.: Artificial Intelligence: A Modern Approach. Prentice Hall, Englewood Cliffs (1995)
Mitchell, T.M.: Machine Learning. WCB/McGraw-Hill (1997)
Muggleton, S.: Inverse entailment and Progol. New Generation Computing 13, 245–286 (1995)
Muggleton, S.: CProgol4.4: A Tutorial Introduction. Inductive Logic Programming and Knowledge Discovery in Databases (2001)
Cootes, A., Muggleton, S.H., Sternberg, M.J.E.: The automatic discovery of structural principles describing protein fold space. Journal of Molecular Biology (2003)
Han, H., Elamsri, R.: Learning Rules for Conceptual Structure on theWeb. Journal of Intelligent Information Systems, Kluwer Academic Publishers(2004)
Liu, H., Motoda, H.: Feature Selection For Knowledge Discovery and Data Mining. Kluwer Academic Publishers, Dordrecht (1998)
The Health Insurance Portability and Accountability Act of 1996, HIPAA (1996)
Zupan, B., Lavrac, N.: Data Mining Techniques and Applications in Medicine. Artificial Intelligence in Medicine 16, 1–2 (1999)
Mani, S., Pazzani, M.J., West, J.: Knowledge Discovery from a Breast Cancer Database. Proceedings on Artificial Intelligence in Medicine, Europe (1997)
Ong, L.S., Shepherd, B., Tong, L.C., Seow-Choen, F., Ho, Y.H., Tang, C.L., Ho, Y.S., Tan, K.: The Colorectal Cancer Recurrence Support (CARES) System. Artificial Intelligence in Medcine 11, 175–188 (1997)
Hankey, B., Ries, L., Edwards, B.: The surveillance, epidemiology, and end results program: a national resource. Cancer Epidemiol Biomarkers Prev. 8, 1117–1121 (1999)
Menck, H., Bland, K., Eyre, H., Cunningham, M., Fremgen, A., Murphy, M.: Clinical highlights from the National Cancer Data Base. Cancer J. Clin. 48, 134–145 (1998)
McGinnis, L., Menck, H., Eyre, H., Bland, K., Scott-Conner, C., Morrow, M.: National Cancer Data Base survey of breast cancer management for patients from low income zip codes. Cancer 88, 933–945 (2000)
Hindle, W.: Breast Cancer: Introduction. Diagnosis and treatment of nongynecologic cancer. Clin. Obstet. Gynecol. 45, 738–745 (2002)
Shinagawa, S.: The excess burden of breast carcinoma in minority and medically underserved communities: application research, and redressing institutional racism. Cancer 88, 1217–1223 (2000)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2004 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Han, H., Song, IY., Hu, X., Prestrud, A., Brennan, M.F., Brooks, A.D. (2004). Managing and Mining Clinical Outcomes. In: Lee, Y., Li, J., Whang, KY., Lee, D. (eds) Database Systems for Advanced Applications. DASFAA 2004. Lecture Notes in Computer Science, vol 2973. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-24571-1_37
Download citation
DOI: https://doi.org/10.1007/978-3-540-24571-1_37
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-21047-4
Online ISBN: 978-3-540-24571-1
eBook Packages: Springer Book Archive