skip to main content
10.1145/3373017.3373050acmotherconferencesArticle/Chapter ViewAbstractPublication PagesacswConference Proceedingsconference-collections
research-article

Effective Patient Similarity Computation for Clinical Decision Support Using Time Series and Static Data

Authors Info & Claims
Published:04 February 2020Publication History

ABSTRACT

This paper presents a technique for computing patient similarity using time series data effectively combined with static data. Time series data of inpatients, such as heart rate, blood pressure, Oxygen saturation, respiration are measured at regular intervals, especially for inpatients in intensive care unit (ICU). The static data are mainly patient background and demographic data, including age, weight, height and gender. The similarity computation is done in unsupervised way. It is therefore free from data labeling requirement. However, such patient similarity can be very useful in developing various clinical decision support systems including treatment, medication, hospital admission and diagnosis. Our proposed technique works in three main steps. First, patient similarity is computed for each individual time series. Second, patients are grouped by clustering the static data. Finally, similarities from individual time series are combined and effectively blended with the patient group information to create a nearest neighborhood model. This model consists of a collection of the nearest neighbors for a given patient. We encounter several challenges for this task, including dealing with multi-variate time series data, variable sampling quantities and rates, missing values, and combining time-series with static data. We evaluate the proposed technique on a real patient database on two target features, namely, ‘diagnosis’ and ‘admission type’. Notable performance is recorded for both targets, achieving f1-score as high as 0.8. We believe this technique can effectively combine different types of clinical data and develop an efficient unsupervised framework for computing patient similarity to be utilized for clinical decision support systems.

References

  1. Donald J. Berndt and James Clifford. 1994. Using Dynamic Time Warping to Find Patterns in Time Series. In Proceedings of the 3rd International Conference on Knowledge Discovery and Data Mining(AAAIWS’94). AAAI Press, 359–370.Google ScholarGoogle Scholar
  2. Andrei Z Broder, Moses Charikar, Alan M Frieze, and Michael Mitzenmacher. 2000. Min-Wise Independent Permutations. J. Comput. System Sci. 60, 3 (2000), 630 – 659. https://doi.org/10.1006/jcss.1999.1690Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Xiongcai Cai, Oscar Perez-Concha, Enrico Coiera, Fernando Martin-Sanchez, Richard Day, David Roffe, and Blanca Gallego. 2016. Real-time prediction of mortality, readmission, and length of stay using electronic health record data. Journal of the American Medical Informatics Association 23, 3(2016), 553–561. https://doi.org/10.1093/jamia/ocv110Google ScholarGoogle ScholarCross RefCross Ref
  4. Matthew Herland, Taghi M. Khoshgoftaar, and Randall Wald. 2013. Survey of Clinical Data Mining Applications on Big Data in Health Informatics. In Proc. ICMLA ’13 - Volume 02. 465–472.Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. A. K. Jain, M. N. Murty, and P. J. Flynn. 1999. Data Clustering: A Review. ACM Comput. Surv. 31, 3 (1999), 264–323.Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Tapas Kanungo, David Mount, Nathan Netanyahu, Christine Piatko, Ruth Silverman, and Angela Wu. 2002. An Efficient K-Means Clustering Algorithm Analysis and Implementation. IEEE TPMAI 24 (07 2002), 881–892.Google ScholarGoogle Scholar
  7. Li Li, Wei-Yi Cheng, Benjamin S Glicksberg, Omri Gottesman, Ronald Tamler, Rong Chen, Erwin P Bottinger, and Joel T Dudley. 2015. Identification of type 2 diabetes subgroups through topological analysis of patient similarity. Science translational medicine 7, 311 (2015), 311ra174–311ra174.Google ScholarGoogle Scholar
  8. Tianle Ma and Aidong Zhang. 2017. Integrate multi-omic data using affinity network fusion (ANF) for cancer patient clustering. In 2017 IEEE International Conference on Bioinformatics and Biomedicine (BIBM). IEEE, 7–10.Google ScholarGoogle ScholarCross RefCross Ref
  9. Yi Mao, Wenlin Chen, Yixin Chen, Chenyang Lu, Marin Kollef, and Thomas Bailey. [n.d.]. An Integrated Data Mining Approach to Real-time Clinical Monitoring and Deterioration Warning. In Proc. SIGKDD ’12. 1140–1148.Google ScholarGoogle Scholar
  10. Shraddha Pai and Gary D Bader. 2018. Patient similarity networks for precision medicine. J. of Mol. Biology (2018).Google ScholarGoogle Scholar
  11. Shraddha Pai, Shirley Hui, Ruth Isserlin, Muhammad A Shah, Hussam Kaka, and Gary D Bader. 2018. netDx: Interpretable patient classification using integrated patient similarity networks. bioRxiv (2018).Google ScholarGoogle Scholar
  12. [12] Physionet-MIMICIII.[n.d.]. https://archive.physionet.org/physiobank/database/mimic3cdb/.Google ScholarGoogle Scholar
  13. Sanjay Purushotham, Chuizheng Meng, Zhengping Che, and Yan Liu. 2017. Benchmark of Deep Learning Models on Large Healthcare MIMIC Datasets. CoRR abs/1710.08531(2017). arxiv:1710.08531http://arxiv.org/abs/1710.08531Google ScholarGoogle Scholar
  14. Anshumali Shrivastava and Ping Li. 2014. In Defense of MinHash Over SimHash. arxiv:stat.CO/1407.4416Google ScholarGoogle Scholar
  15. Bo Wang, Aziz M Mezlini, Feyyaz Demir, Marc Fiume, Zhuowen Tu, Michael Brudno, Benjamin Haibe-Kains, and Anna Goldenberg. 2014. Similarity network fusion for aggregating data types on a genomic scale. Nature methods 11, 3 (2014), 333.Google ScholarGoogle Scholar
  16. Yu and Shi. 2003. Multiclass spectral clustering. In Proceedings Ninth IEEE International Conference on Computer Vision. 313–319 vol.1. https://doi.org/10.1109/ICCV.2003.1238361Google ScholarGoogle Scholar
  1. Effective Patient Similarity Computation for Clinical Decision Support Using Time Series and Static Data

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Other conferences
      ACSW '20: Proceedings of the Australasian Computer Science Week Multiconference
      February 2020
      367 pages
      ISBN:9781450376976
      DOI:10.1145/3373017

      Copyright © 2020 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 4 February 2020

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article
      • Research
      • Refereed limited

      Acceptance Rates

      Overall Acceptance Rate61of141submissions,43%

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    HTML Format

    View this article in HTML Format .

    View HTML Format