ABSTRACT
Detecting anomalous events from spatial data has important applications in real world. The spatial scan statistic methods are popular in this area. With maximizing the spatial statistical discrepancy by comparing observed data with a given baseline data distribution, significant spatial overdensity and underdensity can be detected. In reality, the spatial discrepancy is often irregularly shaped and has a structure of multiple spatial scales. However, a large-scale discrepancy pattern may not be significant when conducting fine granularity analysis. Meanwhile, local irregular boundaries of a maximized discrepancy cannot be well approximated with a coarse granularity analysis. Existing methods mostly work either on a fixed granularity, or with a regularly shaped scanning window. Thus, they have difficulties in characterizing such flexible spatial discrepancies. To solve the problem, in this paper we propose a novel discrepancy maximization algorithm, RefineScan. A grid hierarchy encoding multi-scale information is employed, making the algorithm capable of maximizing spatial discrepancies with multi-scale structures and irregular shapes. Experiments on a wide range of datasets demonstrate the advantages of RefineScan over the state-of-the-art algorithms: It always finds the largest discrepancy scores and remarkably better characterizes multi-scale discrepancy boundaries. Theoretical and empirical analyses also show that RefineScan has a moderate computational complexity and a good scalability.
- D. Agarwal, A. McGregor, J. Phillips, S. Venkatasubramanian, and Z. Zhu. Spatial scan statistics: approximations and performance study. In KDD'06, pages 24--33. ACM, 2006. Google ScholarDigital Library
- J. Aldstadt and A. Getis. Using AMOEBA to create a spatial weights matrix and identify spatial clusters. Geographical Analysis, 38(4):327--343, 2006.Google ScholarCross Ref
- W. Chang, D. Zeng, and H. Chen. A stack-based prospective spatio-temporal data analysis approach. Decision Support Systems, 45(4):697--713, 2008. Google ScholarDigital Library
- W. Dong, X. Zhang, Z. Jiang, W. Sun, L. Xie, and A. Hampapur. Detect irregularly shaped spatio-temporal clusters for decision support. In SOLI, pages 231--236. IEEE, 2011.Google ScholarCross Ref
- W. Dong, X. Zhang, L. Li, C. Sun, L. Shi, and W. Sun. Detecting irregularly shaped significant spatial and spatio-temporal clusters. In SDM, pages 732--743, 2012.Google ScholarCross Ref
- M. Ester, H. Kriegel, J. Sander, and X. Xu. A density-based algorithm for discovering clusters in large spatial databases with noise. In KDD, pages 226--231, 1996.Google ScholarDigital Library
- V. Iyengar. On detecting space-time clusters. In KDD'04, pages 587--592. ACM, 2004. Google ScholarDigital Library
- V. Janeja and V. Atluri. Random walks to identify anomalous free-form spatial scan windows. IEEE Trans. Knowl. Data Eng., 20(10):1378--1392, 2008. Google ScholarDigital Library
- M. Kulldorff. A spatial scan statistic. Comm. Statist. Theory Methods, 26(6):1481--1496, 1997.Google ScholarCross Ref
- M. Kulldorff, L. Huang, L. Pickle, and L. Duczmal. An elliptic spatial scan statistic. Statistics in Medicine, 25(22):3929--3943, 2006.Google ScholarCross Ref
- M. Kulldorff and Information Management Services, Inc. SaTScan™ v8.0: Software for the spatial and space-time scan statistics, 2009. http://www.satscan.org.Google Scholar
- D. Neill and A. Moore. Rapid detection of significant spatial clusters. In KDD'04, pages 256--265, 2004. Google ScholarDigital Library
- D. Neill, A. Moore, K. Daniel, and R. Sabhnani. city4_applic software for Scan Statistics, 2011. Auton Lab, Carnegie Mellon University, http://www.autonlab.org/autonweb/downloads/software.html.Google Scholar
- D. Neill, A. Moore, M. Sabhnani, and K. Daniel. Detection of emerging space-time clusters. In KDD'05, pages 218--227, 2005. Google ScholarDigital Library
- S. Openshaw. The modifiable areal unit problem. Geo Books, 1984.Google Scholar
- G. Sheikholeslami, S. Chatterjee, and A. Zhang. Wavecluster: A multi-resolution clustering approach for very large spatial databases. In VLDB, pages 428--439, 1998. Google ScholarDigital Library
- T. Tango and K. Takahashi. A flexibly shaped spatial scan statistic for detecting clusters. International Journal of Health Geographics, 4(1):11, 2005.Google ScholarCross Ref
- W. Wang, J. Yang, and R. Muntz. STING: A statistical information grid approach to spatial data mining. In VLDB, pages 186--195, 1997. Google ScholarDigital Library
Index Terms
- Maximizing Multi-scale Spatial Statistical Discrepancy
Recommendations
Detecting and interpreting clusters of economic activity in rural areas using scan statistic and LISA under a unified framework
The primary aim of this paper is to expose the use and the value of spatial statistical analysis in business and especially in designing economic policies in rural areas. Specifically, we aim to present under a unified framework, the use of both point ...
Multi-scale GEOBIA with very high spatial resolution digital aerial imagery: scale, texture and image objects
This study used geographic object-based image analysis (GEOBIA) with very high spatial resolution (VHR) aerial imagery (0.3 m spatial resolution) to classify vegetation, channel and bare mud classes in a salt marsh. Three classification issues were ...
On the limiting distribution of the spatial scan statistic
Bootstrap is the standard method in the spatial scan test. However, because the spatial scan statistic lacks theoretical properties, its development and connection to mainstream statistics has been limited. Using the methods of empirical processes with ...
Comments