skip to main content
10.1145/1014052.1014124acmconferencesArticle/Chapter ViewAbstractPublication PageskddConference Proceedingsconference-collections
Article

On detecting space-time clusters

Published:22 August 2004Publication History

ABSTRACT

Detection of space-time clusters is an important function in various domains (e.g., epidemiology and public health). The pioneering work on the spatial scan statistic is often used as the basis to detect and evaluate such clusters. State-of-the-art systems based on this approach detect clusters with restrictive shapes that cannot model growth and shifts in location over time. We extend these methods significantly by using the flexible square pyramid shape to model such effects. A heuristic search method is developed to detect the most likely clusters using a randomized algorithm in combination with geometric shapes processing. The use of Monte Carlo methods in the original scan statistic formulation is continued in our work to address the multiple hypothesis testing issues. Our method is applied to a real data set on brain cancer occurrences over a 19 year period. The cluster detected by our method shows both growth and movement which could not have been modeled with the simpler cylindrical shapes used earlier. Our general framework can be extended quite easily to handle other flexible shapes for the space-time clusters.

References

  1. R. Agrawal, J. Gehrke, D. Gunopulos, and P. Raghavan. Automatic subspace clustering of high dimensional data for data mining applications. In Proceedings of the ACM-SIGMOD International Conference on Management of Data, pages 94--105, 1998. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. L. Duczmal and R. Assuncao. A simulated annealing strategy for the detection of arbitrary shaped spatial clusters. Computational Statistics and Data Analysis, March 2003.Google ScholarGoogle Scholar
  3. J. Fleiss. Statistical methods for Rates and Proportions. John Wiley & Sons, 1981.Google ScholarGoogle Scholar
  4. J. Glaz and N. Balakrishnan. Scan Statistics and Applications. Birkhauser, 1999. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. D. Goldberg. Genetic Algorithms in Search, Optimization and Machine Learning. Addison-Wesley, 1989. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. M. Kulldorff. A spatial scan statistic. Communications in Statistics: Theory and Methods, 26(6):1481--1496, 1997.Google ScholarGoogle ScholarCross RefCross Ref
  7. M. Kulldorff. Spatial scan statistics: models, calculations, and applications. In Scan Statistics and Applications, edited by Glaz and Balakrishnan, 1999.Google ScholarGoogle Scholar
  8. M. Kulldorff, W. Athas, E. Feuer, B. Miller, and C. Key. Evaluating cluster alarms: A space-time scan statistic and brain cancer in Los Alamos. American Journal of Public Health, 88:1377--1380, 1998.Google ScholarGoogle ScholarCross RefCross Ref
  9. M. Kulldorff and Information Management Services Inc. Satscan v. 3.1: Software for the spatial and space-time scan statistics. Technical report, 2002. http://www.satscan.org/.Google ScholarGoogle Scholar
  10. National Cancer Institute. Brain cancer in New Mexico. Technical Report Data set (1973-1991), Division of Cancer Prevention, Biometry Research Group.Google ScholarGoogle Scholar
  11. D. Neill and A. Moore. A fast multi-resolution method for detection of significant spatial overdensities. Technical Report Carnegie Mellon CSD Technical Report CMU-CS-03-154 (Abbreviated version to appear in NIPS 2003), Carnegie Mellon University, June 2003.Google ScholarGoogle Scholar
  12. P. van Laarhoven and E. Aarts. Simulated Annealing: Theory and Applications. D. Reidel Publishing Company, 1987. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. D. Wilson and B. Rudin. Introduction to the IBM Optimization Subroutine Library. IBM Systems Journal, 31(1):4--10, 1992. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. On detecting space-time clusters

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Conferences
      KDD '04: Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining
      August 2004
      874 pages
      ISBN:1581138881
      DOI:10.1145/1014052

      Copyright © 2004 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 22 August 2004

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • Article

      Acceptance Rates

      Overall Acceptance Rate1,133of8,635submissions,13%

      Upcoming Conference

      KDD '24

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader