skip to main content
10.1145/2661829.2662007acmconferencesArticle/Chapter ViewAbstractPublication PagescikmConference Proceedingsconference-collections
research-article

Maximizing Multi-scale Spatial Statistical Discrepancy

Authors Info & Claims
Published:03 November 2014Publication History

ABSTRACT

Detecting anomalous events from spatial data has important applications in real world. The spatial scan statistic methods are popular in this area. With maximizing the spatial statistical discrepancy by comparing observed data with a given baseline data distribution, significant spatial overdensity and underdensity can be detected. In reality, the spatial discrepancy is often irregularly shaped and has a structure of multiple spatial scales. However, a large-scale discrepancy pattern may not be significant when conducting fine granularity analysis. Meanwhile, local irregular boundaries of a maximized discrepancy cannot be well approximated with a coarse granularity analysis. Existing methods mostly work either on a fixed granularity, or with a regularly shaped scanning window. Thus, they have difficulties in characterizing such flexible spatial discrepancies. To solve the problem, in this paper we propose a novel discrepancy maximization algorithm, RefineScan. A grid hierarchy encoding multi-scale information is employed, making the algorithm capable of maximizing spatial discrepancies with multi-scale structures and irregular shapes. Experiments on a wide range of datasets demonstrate the advantages of RefineScan over the state-of-the-art algorithms: It always finds the largest discrepancy scores and remarkably better characterizes multi-scale discrepancy boundaries. Theoretical and empirical analyses also show that RefineScan has a moderate computational complexity and a good scalability.

References

  1. D. Agarwal, A. McGregor, J. Phillips, S. Venkatasubramanian, and Z. Zhu. Spatial scan statistics: approximations and performance study. In KDD'06, pages 24--33. ACM, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. J. Aldstadt and A. Getis. Using AMOEBA to create a spatial weights matrix and identify spatial clusters. Geographical Analysis, 38(4):327--343, 2006.Google ScholarGoogle ScholarCross RefCross Ref
  3. W. Chang, D. Zeng, and H. Chen. A stack-based prospective spatio-temporal data analysis approach. Decision Support Systems, 45(4):697--713, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. W. Dong, X. Zhang, Z. Jiang, W. Sun, L. Xie, and A. Hampapur. Detect irregularly shaped spatio-temporal clusters for decision support. In SOLI, pages 231--236. IEEE, 2011.Google ScholarGoogle ScholarCross RefCross Ref
  5. W. Dong, X. Zhang, L. Li, C. Sun, L. Shi, and W. Sun. Detecting irregularly shaped significant spatial and spatio-temporal clusters. In SDM, pages 732--743, 2012.Google ScholarGoogle ScholarCross RefCross Ref
  6. M. Ester, H. Kriegel, J. Sander, and X. Xu. A density-based algorithm for discovering clusters in large spatial databases with noise. In KDD, pages 226--231, 1996.Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. V. Iyengar. On detecting space-time clusters. In KDD'04, pages 587--592. ACM, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. V. Janeja and V. Atluri. Random walks to identify anomalous free-form spatial scan windows. IEEE Trans. Knowl. Data Eng., 20(10):1378--1392, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. M. Kulldorff. A spatial scan statistic. Comm. Statist. Theory Methods, 26(6):1481--1496, 1997.Google ScholarGoogle ScholarCross RefCross Ref
  10. M. Kulldorff, L. Huang, L. Pickle, and L. Duczmal. An elliptic spatial scan statistic. Statistics in Medicine, 25(22):3929--3943, 2006.Google ScholarGoogle ScholarCross RefCross Ref
  11. M. Kulldorff and Information Management Services, Inc. SaTScan™ v8.0: Software for the spatial and space-time scan statistics, 2009. http://www.satscan.org.Google ScholarGoogle Scholar
  12. D. Neill and A. Moore. Rapid detection of significant spatial clusters. In KDD'04, pages 256--265, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. D. Neill, A. Moore, K. Daniel, and R. Sabhnani. city4_applic software for Scan Statistics, 2011. Auton Lab, Carnegie Mellon University, http://www.autonlab.org/autonweb/downloads/software.html.Google ScholarGoogle Scholar
  14. D. Neill, A. Moore, M. Sabhnani, and K. Daniel. Detection of emerging space-time clusters. In KDD'05, pages 218--227, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. S. Openshaw. The modifiable areal unit problem. Geo Books, 1984.Google ScholarGoogle Scholar
  16. G. Sheikholeslami, S. Chatterjee, and A. Zhang. Wavecluster: A multi-resolution clustering approach for very large spatial databases. In VLDB, pages 428--439, 1998. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. T. Tango and K. Takahashi. A flexibly shaped spatial scan statistic for detecting clusters. International Journal of Health Geographics, 4(1):11, 2005.Google ScholarGoogle ScholarCross RefCross Ref
  18. W. Wang, J. Yang, and R. Muntz. STING: A statistical information grid approach to spatial data mining. In VLDB, pages 186--195, 1997. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Maximizing Multi-scale Spatial Statistical Discrepancy

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Conferences
      CIKM '14: Proceedings of the 23rd ACM International Conference on Conference on Information and Knowledge Management
      November 2014
      2152 pages
      ISBN:9781450325981
      DOI:10.1145/2661829

      Copyright © 2014 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 3 November 2014

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article

      Acceptance Rates

      CIKM '14 Paper Acceptance Rate175of838submissions,21%Overall Acceptance Rate1,861of8,427submissions,22%

      Upcoming Conference

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader