skip to main content
10.1145/2339530.2339605acmconferencesArticle/Chapter ViewAbstractPublication PageskddConference Proceedingsconference-collections
research-article

Towards heterogeneous temporal clinical event pattern discovery: a convolutional approach

Published: 12 August 2012 Publication History

Abstract

Large collections of electronic clinical records today provide us with a vast source of information on medical practice. However, the utilization of those data for exploratory analysis to support clinical decisions is still limited. Extracting useful patterns from such data is particularly challenging because it is longitudinal, sparse and heterogeneous. In this paper, we propose a Nonnegative Matrix Factorization (NMF) based framework using a convolutional approach for open-ended temporal pattern discovery over large collections of clinical records. We call the method One-Sided Convolutional NMF (OSC-NMF). Our framework can mine common as well as individual shift-invariant temporal patterns from heterogeneous events over different patient groups, and handle sparsity as well as scalability problems well. Furthermore, we use an event matrix based representation that can encode quantitatively all key temporal concepts including order, concurrency and synchronicity. We derive efficient multiplicative update rules for OSC-NMF, and also prove theoretically its convergence. Finally, the experimental results on both synthetic and real world electronic patient data are presented to demonstrate the effectiveness of the proposed method.

Supplementary Material

JPG File (307_m_talk_11.jpg)
MP4 File (307_m_talk_11.mp4)

References

[1]
I. Batal, L. Sacchi, R. Bellazzi, and M. Hauskrecht. A temporal abstraction framework for classifying clinical temporal data. In AMIA, pages 29--33, 2009.
[2]
S. Bengio, F. Pereira, Y. Singer, and D. Strelow. Group sparse coding. In NIPS, 2009.
[3]
B. Cao, D. Shen, J.-T. Sun, X. Wang, Q. Yang, and Z. Chen. Detect and Track Latent Factors with Online Nonnegative Matrix Factorization. In Proceedings of the 20th IJCAI, pages 2689--2694, 2007.
[4]
M. Cooper, T. Liu, and E. Rieffel. Video Segmentation via Temporal Pattern Classification. IEEE TMM, 9(3):610--618, 2007.
[5]
S. de Amo and D. A. Furtado. First-order temporal pattern mining with regular expression constraints. Data & Knowledge Engineering, 62(3):401--420, 2007.
[6]
X. Du, R. Jin, L. Ding, V. E. Lee, and J. H. T. Jr. Migration motif: a spatial - temporal pattern mining approach for financial markets. In KDD, pages 1135--1144, 2009.
[7]
J. Eggert and E. Körner. Sparse coding and nmf. In IJCNN, pages 2529--2533, 2004.
[8]
J. Fails, A. Karlson, L. Shahamat, and B. Shneiderman. A Visual Interface for Multivariate Temporal Data: Finding Patterns of Events across Multiple Histories. In Visual Analytics Science And Technology, 2006 IEEE Symposium On, pages 167--174, 2006.
[9]
C. Févotte and J. Idier. Algorithms for nonnegative matrix factorization with the beta-divergence. arXiv:1010.1763, 2010.
[10]
R. Grosse, R. Raina, H. Kwong, and A. Y. Ng. Shift-invariant Sparse Coding for Audio Classification. In UAI, 2007.
[11]
J. Han, J. Pei, Y. Yin, and R. Mao. Mining Frequent Patterns without Candidate Generation: A Frequent-Pattern Tree Approach. DMKD, 8(1):53--87, 2004.
[12]
P. O. Hoyer. Non-negative matrix factorization with sparseness constraints. JMLR, 5:1457--1469, 2004.
[13]
D. D. Lee and H. S. Seung. Learning the parts of objects by non-negative matrix factorization. Nature, 401(6755):788--791, 1999.
[14]
D. D. Lee and H. S. Seung. Algorithms for Non-negative Matrix Factorization. In NIPS 13, pages 556--562, 2001.
[15]
J. Lin, E. Keogh, S. Lonardi, and B. Chiu. A symbolic representation of time series, with implications for streaming algorithms. In Proceedings of the 8th ACM SIGMOD workshop on Research issues in data mining and knowledge discovery, pages 2--11, 2003.
[16]
J. Mairal, F. Bach, J. Ponce, and G. Sapiro. Online Learning for Matrix Factorization and Sparse Coding. Journal of Machine Learning Research, 11:10--60, 2010.
[17]
F. Mörchen and A. Ultsch. Efficient mining of understandable patterns from multivariate interval time series. DMKD, 15(2):181--215, 2007.
[18]
M. Mørup, M. N. Schmidt, and L. K. Hansen. Shift invariant sparse coding of image and music data. Technical report, 2008.
[19]
P. N. E. Nohuddin, F. Coenen, R. Christley, and C. Setzkorn. Detecting temporal pattern and cluster changes in social networks: A study focusing uk cattle movement database. In IFIP Advances in Information and Communication Technology, pages 163--172, 2010.
[20]
G. Norén, J. Hopstadius, A. Bate, K. Star, and I. Edwards. Temporal pattern discovery in longitudinal electronic patient records. DMKD, 20(3):361--387, 2010.
[21]
P. D. O'Grady and B. A. Pearlmutter. Discovering convolutive speech phones using sparseness and non-negativity. In ICA, 2007.
[22]
J. Pei, J. Han, B. Mortazavi-Asl, J. Wang, H. Pinto, Q. Chen, U. Dayal, and M. Hsu. Mining sequential patterns by pattern-growth: The prefixspan approach. IEEE TKDE, 16(11):1424--1440, 2004.
[23]
C. Plaisant, S. Lam, B. Shneiderman, M. S. Smith, D. Roseman, G. Marchand, M. Gillam, C. Feied, J. Handler, and H. Rappaport. Searching electronic health records for temporal patterns in patient histories: a case study with microsoft amalga. In AMIA, pages 601--605, 2008.
[24]
M. Robert Moskovitch and Y. Shahar. Medical temporal-knowledge discovery via temporal abstraction. In AMIA, pages 452--456, 2009.
[25]
B. R. Shah, J. Drozda, and E. D. Peterson. Leveraging observational registries to inform comparative effectiveness research. American Heart Journal, 160(1):8--15, 2010.
[26]
P. Smaragdis. Non-negative matrix factor deconvolution; extraction of multiple sound sources from monophonic inputs. In ICA, pages 494--499. 2004.
[27]
H. A. Varian. Microeconomic Analysis Third Edition. W.W. Norton and Company, 1992.
[28]
F. Wang, C. Tan, P. Li, and C. König. Efficient document clustering via online nonnegative matrix factorization. In Proceedings of the 11th SDM, 2011.
[29]
T. Wang, C. Plaisant, A. Quinn, R. Stanchak, B. Shneiderman, and S. Murphy. Aligning temporal data by sentinel events: Discovering patterns in electronic health records. In Proc. of ACM Conference on Human Factors in Computing Systems, pages 457--466, 2008.
[30]
M. J. Zaki. Spade: An efficient algorithm for mining frequent sequences. Machine Learning, 42:31--60, 2001.

Cited By

View all
  • (2025)Exploring the Preprocessing of a Time-Series Healthcare Administrative Dataset on Deep Learning to Improve PredictionIEEE Access10.1109/ACCESS.2024.352042513(3485-3496)Online publication date: 2025
  • (2024)Breast Cancer Assessment Using a HybridANN-CNN Approach2024 International Conference on Intelligent and Innovative Technologies in Computing, Electrical and Electronics (IITCEE)10.1109/IITCEE59897.2024.10468038(1-5)Online publication date: 24-Jan-2024
  • (2024)Meta-learning in Healthcare: A SurveySN Computer Science10.1007/s42979-024-03166-95:6Online publication date: 12-Aug-2024
  • Show More Cited By

Index Terms

  1. Towards heterogeneous temporal clinical event pattern discovery: a convolutional approach

        Recommendations

        Comments

        Information & Contributors

        Information

        Published In

        cover image ACM Conferences
        KDD '12: Proceedings of the 18th ACM SIGKDD international conference on Knowledge discovery and data mining
        August 2012
        1616 pages
        ISBN:9781450314626
        DOI:10.1145/2339530
        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Sponsors

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        Published: 12 August 2012

        Permissions

        Request permissions for this article.

        Check for updates

        Author Tags

        1. convolution
        2. nmf
        3. pattern discovery

        Qualifiers

        • Research-article

        Conference

        KDD '12
        Sponsor:

        Acceptance Rates

        Overall Acceptance Rate 1,133 of 8,635 submissions, 13%

        Upcoming Conference

        KDD '25

        Contributors

        Other Metrics

        Bibliometrics & Citations

        Bibliometrics

        Article Metrics

        • Downloads (Last 12 months)14
        • Downloads (Last 6 weeks)4
        Reflects downloads up to 28 Feb 2025

        Other Metrics

        Citations

        Cited By

        View all
        • (2025)Exploring the Preprocessing of a Time-Series Healthcare Administrative Dataset on Deep Learning to Improve PredictionIEEE Access10.1109/ACCESS.2024.352042513(3485-3496)Online publication date: 2025
        • (2024)Breast Cancer Assessment Using a HybridANN-CNN Approach2024 International Conference on Intelligent and Innovative Technologies in Computing, Electrical and Electronics (IITCEE)10.1109/IITCEE59897.2024.10468038(1-5)Online publication date: 24-Jan-2024
        • (2024)Meta-learning in Healthcare: A SurveySN Computer Science10.1007/s42979-024-03166-95:6Online publication date: 12-Aug-2024
        • (2023)Voting Classifier on Ensemble Algorithms for Breast Cancer Prediction2023 International Conference on Intelligent Data Communication Technologies and Internet of Things (IDCIoT)10.1109/IDCIoT56793.2023.10053498(653-660)Online publication date: 5-Jan-2023
        • (2023)Automatic Scikit-learn based detection and classification of Breast Cancer using Machine./ Learning techniques2023 Third International Conference on Advances in Electrical, Computing, Communication and Sustainable Technologies (ICAECT)10.1109/ICAECT57570.2023.10117662(1-8)Online publication date: 5-Jan-2023
        • (2023)An EHR data quality evaluation approach based on medical knowledge and text matchingIRBM10.1016/j.irbm.2023.100782(100782)Online publication date: Apr-2023
        • (2022)A Survey of Deep Learning for Electronic Health RecordsApplied Sciences10.3390/app12221170912:22(11709)Online publication date: 17-Nov-2022
        • (2022)Novel Approach to Predict the Chances of Breast Cancer by Demographic Data using Machine Learning Algorithms2022 6th International Conference on Intelligent Computing and Control Systems (ICICCS)10.1109/ICICCS53718.2022.9788124(1360-1368)Online publication date: 25-May-2022
        • (2021)3D-CNN-SPP: A Patient Risk Prediction System From Electronic Health Records via 3D CNN and Spatial Pyramid PoolingIEEE Transactions on Emerging Topics in Computational Intelligence10.1109/TETCI.2019.29604745:2(247-261)Online publication date: Apr-2021
        • (2021)Multi-View Deep Learning Framework for Predicting Patient Expenditure in HealthcareIEEE Open Journal of the Computer Society10.1109/OJCS.2021.30525182(62-71)Online publication date: 2021
        • Show More Cited By

        View Options

        Login options

        View options

        PDF

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader

        Figures

        Tables

        Media

        Share

        Share

        Share this Publication link

        Share on social media