skip to main content
10.1145/3557918.3565870acmconferencesArticle/Chapter ViewAbstractPublication PagesgisConference Proceedingsconference-collections
research-article

Density-based cluster detection at multiple spatial scales via kullback-leibler divergence of reachability profiles

Published:14 November 2022Publication History

ABSTRACT

Density-based clustering methods are frequently used to define spatial clusters and outliers (noise) for location-only data. Different algorithms for solving this problem emerged over the past few decades, with their main difference being the numerical representation of the spatial density. A problem not addressed by conventional density-based clustering methods is defining alternate spatial cluster maps at statistically significant spatial scales. This problem differs from conventional clustering, as the goal of finding alternate clusters is to define different spatial cluster maps for all statistically significant spatial scales. Knowledge of distinct spatial scales pertinent to clustering is important for understanding various scales underlying the data. In addition, alternate clusters with different spatial scales can inform decisions that require to be made at different spatial granularity. In this paper, we introduce a statistical test that uses Kullback-Leibler (KL) divergence loss between different spatial density profiles to identify all statistically significant spatial scales at which clustering occurs. The proposed method defines different clustering maps that reflect different scales at which spatial clusters occur. We define the divergence on a 1-D representation of cluster density, the reachability profile, to cluster spatial units with varying spatial scales. We illustrate the use of multiple spatial clustering at different scales by comparing the proposed method to the state-of-the-art for defining a single map of multiscale clusters, HDBScan. We conclude the paper by applying the proposed method to physical and human geography problems, area of interest delineation, and wildfire cluster modeling, respectively.

References

  1. Mihael Ankerst, Markus M Breunig, Hans-Peter Kriegel, and Jörg Sander. 1999. OPTICS: Ordering points to identify the clustering structure. ACM Sigmod record 28, 2 (1999), 49--60.Google ScholarGoogle Scholar
  2. Ricardo JGB Campello, Davoud Moulavi, and Jörg Sander. 2013. Density-based clustering based on hierarchical density estimates. In Pacific-Asia conference on knowledge discovery and data mining. Springer, 160--172.Google ScholarGoogle ScholarCross RefCross Ref
  3. Eunjoon Cho, Seth A Myers, and Jure Leskovec. 2011. Friendship and mobility: user movement in location-based social networks. In Proceedings of the 17th ACM SIGKDD international conference on Knowledge discovery and data mining. 1082--1090.Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Xiaowen Dong, Dimitrios Mavroeidis, Francesco Calabrese, and Pascal Frossard. 2015. Multiscale event detection in social media. Data Mining and Knowledge Discovery 29, 5 (2015), 1374--1405.Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Jeff Eidenshink, Brian Schwind, Ken Brewer, Zhi-Liang Zhu, Brad Quayle, and Stephen Howard. 2007. A project for monitoring trends in burn severity. Fire ecology 3, 1 (2007), 3--21.Google ScholarGoogle Scholar
  6. Martin Ester, Hans-Peter Kriegel, Jörg Sander, Xiaowei Xu, et al. 1996. A density-based algorithm for discovering clusters in large spatial databases with noise. In kdd, Vol. 96. 226--231.Google ScholarGoogle Scholar
  7. Daniela P González, Mauricio Monsalve, Roberto Moris, and Cristóbal Herrera. 2018. Risk and Resilience Monitor: Development of multiscale and multilevel indicators for disaster risk management for the communes and urban areas of Chile. Applied geography 94 (2018), 262--271.Google ScholarGoogle Scholar
  8. Yingjie Hu, Song Gao, Krzysztof Janowicz, Bailang Yu, Wenwen Li, and Sathya Prasad. 2015. Extracting and understanding urban areas of interest using geo-tagged photos. Computers, Environment and Urban Systems 54 (2015), 240--254.Google ScholarGoogle ScholarCross RefCross Ref
  9. Yuhao Kang, Song Gao, Yunlei Liang, Mingxiao Li, Jinmeng Rao, and Jake Kruse. 2020. Multiscale dynamic human mobility flow dataset in the US during the COVID-19 epidemic. Scientific data 7, 1 (2020), 1--13.Google ScholarGoogle Scholar
  10. Teuvo Kohonen. 1990. The self-organizing map. Proc. IEEE 78, 9 (1990), 1464--1480.Google ScholarGoogle ScholarCross RefCross Ref
  11. Solomon Kullback and Richard A Leibler. 1951. On information and sufficiency. The annals of mathematical statistics 22, 1 (1951), 79--86.Google ScholarGoogle Scholar
  12. Pabitra Mitra, CA Murthy, and Sankar K Pal. 2002. Density based multiscale data condensation. (2002).Google ScholarGoogle Scholar
  13. Radford M Neal and Geoffrey E Hinton. 1998. A view of the EM algorithm that justifies incremental, sparse, and other variants. In Learning in graphical models. Springer, 355--368.Google ScholarGoogle Scholar
  14. Marc-André Parisien and Max A Moritz. 2009. Environmental controls on the distribution of wildfire at multiple spatial scales. Ecological Monographs 79, 1 (2009), 127--154.Google ScholarGoogle ScholarCross RefCross Ref
  15. Bin Peng, Kaiyu Guan, Jinyun Tang, Elizabeth A Ainsworth, Senthold Asseng, Carl J Bernacchi, Mark Cooper, Evan H Delucia, Joshua W Elliott, Frank Ewert, et al. 2020. Towards a multiscale crop modelling framework for climate change adaptation assessment. Nature Plants 6, 4 (2020), 338--348.Google ScholarGoogle ScholarCross RefCross Ref
  16. Tian-Tian Zhang and Bo Yuan. 2018. Density-based multiscale analysis for clustering in strong noise settings with varying densities. IEEE Access 6 (2018), 25861--25873.Google ScholarGoogle ScholarCross RefCross Ref

Index Terms

  1. Density-based cluster detection at multiple spatial scales via kullback-leibler divergence of reachability profiles

          Recommendations

          Comments

          Login options

          Check if you have access through your login credentials or your institution to get full access on this article.

          Sign in
          • Published in

            cover image ACM Conferences
            GeoAI '22: Proceedings of the 5th ACM SIGSPATIAL International Workshop on AI for Geographic Knowledge Discovery
            November 2022
            101 pages
            ISBN:9781450395328
            DOI:10.1145/3557918

            Copyright © 2022 ACM

            Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

            Publisher

            Association for Computing Machinery

            New York, NY, United States

            Publication History

            • Published: 14 November 2022

            Permissions

            Request permissions about this article.

            Request Permissions

            Check for updates

            Qualifiers

            • research-article

            Acceptance Rates

            Overall Acceptance Rate17of25submissions,68%
          • Article Metrics

            • Downloads (Last 12 months)21
            • Downloads (Last 6 weeks)1

            Other Metrics

          PDF Format

          View or Download as a PDF file.

          PDF

          eReader

          View online with eReader.

          eReader