skip to main content
10.1145/1081870.1081897acmconferencesArticle/Chapter ViewAbstractPublication PageskddConference Proceedingsconference-collections
Article

Detection of emerging space-time clusters

Published: 21 August 2005 Publication History

Abstract

We propose a new class of spatio-temporal cluster detection methods designed for the rapid detection of emerging space-time clusters. We focus on the motivating application of prospective disease surveillance: detecting space-time clusters of disease cases resulting from an emerging disease outbreak. Automatic, real-time detection of outbreaks can enable rapid epidemiological response, potentially reducing rates of morbidity and mortality. Building on the prior work on spatial and space-time scan statistics, our methods combine time series analysis (to determine how many cases we expect to observe for a given spatial region in a given time interval) with new "emerging cluster" space-time scan statistics (to decide whether an observed increase in cases in a region is significant), enabling fast and accurate detection of emerging outbreaks. We evaluate these methods on two types of simulated outbreaks: aerosol release of inhalational anthrax (e.g. from a bioterrorist attack) and FLOO ("Fictional Linear Onset Outbreak"), injected into actual baseline data (Emergency Department records and over-the-counter drug sales data from Allegheny County). We demonstrate that our methods are successful in rapidly detecting both outbreak types while keeping the number of false positives low, and show that our new "emerging cluster" scan statistics consistently outperform the standard "persistent cluster" scan statistics approach.

References

[1]
R. Agrawal, J. Gehrke, D. Gunopulus, and P. Raghavan. Automatic subspace clustering of high dimensional data for data mining applications. In Proc. ACM-SIGMOD Intl. Conf. on Mgmt. of Data, pages 94--105, 1998.
[2]
R. Assuncao, A. Tavares, and M. Kulldorff. An early warning system for space-time cluster detection. Technical report, 2004.
[3]
J. Besag, J. York, and A. Mollie. Bayesian image restoration with two applications in spatial statistics. Ann. Inst. Statist. Math., 43:1--59, 1991.
[4]
D. Clayton and J. Kaldor. Empirical bayes estimates of age-standardized relative risks for use in disease mapping. Biometrics, 43:671--681, 1987.
[5]
T. Fawcett and F. Provost. Activity monitoring: noticing interesting changes in behavior. In Proc. 5th Intl. Conf. on Knowledge Discovery and Data Mining, 1999.
[6]
J. Friedman and N. Fisher. Bump hunting in high dimensional data. Statistics and Computing, 9(2):1--20, 1999.
[7]
W. Hogan, G. Cooper, M. Wagner, and G. Wallstrom. A bayesian anthrax aerosol release detector. Technical report, RODS Laboratory, 2004.
[8]
V. Iyengar. On detecting space-time clusters. In Proc. 10th ACM SIGKDD Conf. on Knowledge Discovery and Data Mining, pages 587--592, 2004.
[9]
E. Knox. The detection of space-time interactions. Applied Statistics, 13:25--29, 1964.
[10]
M. Kulldorff. A spatial scan statistic. Communications in Statistics: Theory and Methods, 26(6):1481--1496, 1997.
[11]
M. Kulldorff. Spatial scan statistics: models, calculations, and applications. In J. Glaz and N. Balakrishnan, editors, Scan Statistics and Applications, pages 303--322. Birkhauser, 1999.
[12]
M. Kulldorff. Prospective time-periodic geographical disease surveillance using a scan statistic. Journal of the Royal Statistical Society A, 164:61--72, 2001.
[13]
M. Kulldorff, W. Athas, E. Feuer, B. Miller, and C. Key. Evaluating cluster alarms: a space-time scan statistic and cluster alarms in los alamos. American Journal of Public Health, 88:1377--1380, 1998.
[14]
M. Kulldorff, R. Heffernan, J. Hartman, R. Assuncao, and F. Mostashari. A space-time permutation scan statistic for the early detection of disease outbreaks. PLoS Medicine, 2005, in press.
[15]
M. Kulldorff and N. Nagarwalla. Spatial disease clusters: detection and inference. Statistics in Medicine, 14:799--810, 1995.
[16]
N. Mantel. The detection of cancer clustering and the generalized regression approach. Cancer Research, 27:209--220, 1967.
[17]
D. B. Neill and A. W. Moore. Detecting space-time clusters: prior work and new directions. Technical report, Carnegie Mellon University, 2004.
[18]
D. B. Neill and A. W. Moore. Rapid detection of significant spatial clusters. In Proc. 10th ACM SIGKDD Conf. on Knowledge Discovery and Data Mining, pages 256--265, 2004.
[19]
D. B. Neill, A. W. Moore, F. Pereira, and T. Mitchell. Detecting significant multidimensional spatial clusters. In Advances in Neural Information Processing Systems 17, pages 969--976, 2005.

Cited By

View all
  • (2024)Bayesian cluster geographically weighted regression for spatial heterogeneous dataRoyal Society Open Science10.1098/rsos.23178011:6Online publication date: 19-Jun-2024
  • (2024)Enhanced cluster detection and noise reduction for geospatial time series data of COVID-19Multimedia Tools and Applications10.1007/s11042-023-15901-083:2(5621-5652)Online publication date: 1-Jan-2024
  • (2023)A Comprehensive Survey on Graph Anomaly Detection With Deep LearningIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2021.311881535:12(12012-12038)Online publication date: 1-Dec-2023
  • Show More Cited By

Index Terms

  1. Detection of emerging space-time clusters

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    KDD '05: Proceedings of the eleventh ACM SIGKDD international conference on Knowledge discovery in data mining
    August 2005
    844 pages
    ISBN:159593135X
    DOI:10.1145/1081870
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 21 August 2005

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. biosurveillance
    2. cluster detection
    3. space-time scan statistics

    Qualifiers

    • Article

    Conference

    KDD05

    Acceptance Rates

    Overall Acceptance Rate 1,133 of 8,635 submissions, 13%

    Upcoming Conference

    KDD '25

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)23
    • Downloads (Last 6 weeks)4
    Reflects downloads up to 03 Mar 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)Bayesian cluster geographically weighted regression for spatial heterogeneous dataRoyal Society Open Science10.1098/rsos.23178011:6Online publication date: 19-Jun-2024
    • (2024)Enhanced cluster detection and noise reduction for geospatial time series data of COVID-19Multimedia Tools and Applications10.1007/s11042-023-15901-083:2(5621-5652)Online publication date: 1-Jan-2024
    • (2023)A Comprehensive Survey on Graph Anomaly Detection With Deep LearningIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2021.311881535:12(12012-12038)Online publication date: 1-Dec-2023
    • (2023)Social Media Driven Big Data Analysis for Disaster Situation Awareness: A TutorialIEEE Transactions on Big Data10.1109/TBDATA.2022.31584319:1(1-21)Online publication date: 1-Feb-2023
    • (2022)Statistically-Robust Clustering Techniques for Mapping Spatial Hotspots: A SurveyACM Computing Surveys10.1145/348789355:2(1-38)Online publication date: 18-Jan-2022
    • (2022)Public transportation network scan for rapid surveillanceBiostatistics & Epidemiology10.1080/24709360.2022.20694587:1Online publication date: 2-Jun-2022
    • (2022)Public transportation network scan for rapid surveillanceBiostatistics & Epidemiology10.1080/24709360.2022.20656287:1Online publication date: 27-May-2022
    • (2022)Statistical analysis of teleconsultations in the context of the COVID-19 epidemicCommunications in Statistics: Case Studies, Data Analysis and Applications10.1080/23737484.2022.20565478:3(381-392)Online publication date: 8-Apr-2022
    • (2022)Spatio-temporal clustering analysis using two different scanning windows: A case study of dengue fever in Peninsular MalaysiaSpatial and Spatio-temporal Epidemiology10.1016/j.sste.2022.10049641(100496)Online publication date: Jun-2022
    • (2021)Scalable Data Analysis Application to Web Usage DataMultimedia and Sensory Input for Augmented, Mixed, and Virtual Reality10.4018/978-1-7998-4703-8.ch014(261-274)Online publication date: 2021
    • Show More Cited By

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media