Abstract
In spatial and urban sciences, it is customary to partition study areas into sub-areas—so-called regions, zones, neighborhoods, or communities—to carry out analyses of the spatial patterns and processes of socio-spatial phenomena. The purposeful delineation of these sub-areas is critical because of the potential biases associated with MAUP and UGCoP. If one agrees to characterize these sub-areas based on their homogeneity on a certain attribute, these homogeneous areas (patches) can be detected by data-driven algorithms. This study aims to evaluate the effectiveness and performance of five popular spatial clustering and regionalization algorithms (AZP, ARISeL, max-p-regions, AMOEBA, and SOM) for detecting attribute-based homogeneous patches of different sizes, shapes, and those with homogeneous values. The evaluation follows a quasi-experimental approach: It is based on 68 simulated data sets where the true distribution of patches is known and focuses purely on the capability of algorithms to successfully detect patches rather than computational costs. It is the most comprehensive assessment to-date thanks to a systematic control of various conditions so that a true baseline is available for comparison purposes. Among the tested algorithms, SOM and AMOEBA were found to perform very well in detecting patches of different sizes, different shapes, including those with holes, and different homogeneous values.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Openshaw S (1983) The modifiable areal unit problem. Geo Books, Norwich
Kwan M-P (2012) The uncertain geographic context problem. Ann Assoc Am Geogr 102:958–968
MacQueen JB (1967) Some methods for classification and analysis of multivariate observations. In: Proceedings of 5th Berkeley symposium on mathematical statistics and probability, University of California Press, pp 281–297
Kaufman L, Rousseeuw P (1990) Finding groups in data: an introduction to cluster analysis. Wiley, Hoboken
Yu D, Chatterjee S, Sheikholeslami G, Zhang A (1998) Efficiently detecting arbitrary-shaped clusters in very large datasets with high dimensions. Technical Report 98-8, State University of New York at Buffalo, Department of Computer Science and Engineering
Openshaw S (1977) Algorithm 3: a procedure to generate pseudo random aggregations of N zones into M zones where M is less than N. Environ Plan A 9:1423–1428
Duque JC, Church RL (2004) A new heuristic model for designing analytical regions. In: North American meeting of the regional science association international, Seattle, WA, November
Duque JC, Church RL, Middleton RS (2011) The p-regions problem. Geogr Anal 43(1):104–126
Guo D, Peuquet D, Gahegan M (2003) ICEAGE: interactive clustering and exploration of large and high-dimensional geodata. Geoinformatica 7:229–253
Guo D (2008) Regionalization with dynamically constrained agglomerative clustering and partitioning (REDCAP). Int J Geogr Inf Sci 22:801–823
Anselin L (1995) Local indicators of spatial association—LISA. Geogr Anal 27:93–115
Getis A, Ord J (1992) The analysis of spatial association by use of distance statistics. Geogr Anal 24(3):189–206
Getis A, Aldstadt J (2004) Constructing the spatial weights matrix using a local statistic. Geogr Anal 36:90–104
Kohonen T (2001) Self-organizing maps, 3rd edn. Springer, Berlin
Bação F, Lobo V, Painho M (2004) Geo-self-organizing map (Geo-SOM) for building and exploring homogeneous regions. In: Egenhofer M, Miller H, Freksa C (eds) Geographic information science. Lecture Notes in Computer Science. Springer, Berlin, pp 22–37
Hagenauer J, Helbich M (2013) Contextual neural gas for spatial clustering and analysis. Int J Geogr Inf Sci 27:251–266
Han J, Kamber M, Tung AKH (2001) Spatial clustering methods in data mining: a survey. In: Miller HJ, Han J (eds) Geographic data mining and knowledge discovery. Taylor & Francis, London, pp 188–217
Duque J, Ramos R, Surinach J (2007) Supervised regionalization methods: a survey. Int Reg Sci Rev 30:195–220
Grubesic TH, Wei R, Murray AT (2014) Spatial clustering overview and comparison: accuracy, sensitivity, and computational expense. Ann Assoc Am Geogr 104(6):1134–1156
Rogerson P, Yamada I (2009) Statistical detection and surveillance of geographic clusters. Taylor & Francis Group, London and New York
Craglia M, Haining R, Wiles P (2000) A comparative evaluation of approaches to urban crime pattern analysis. Urban Stud 37(4):711–729
Murray AT, Grubesic TH (2013) Exploring spatial patterns of crime using non-hierarchical cluster analysis. In: Leitner M (ed) Crime modeling and mapping using geospatial technologies. Springer, Berlin, pp 105–124
Guo D, Wang H (2011) Automatic region building for spatial analysis. Trans GIS 15:29–45
Openshaw S, Rao L (1995) Algorithms for reengineering 1991 census geography. Environ Plan A 27:425–446
Duque JC, Dev B, Betancourt A, Franco JL (2011) ClusterPy: library of spatially constrained clustering algorithms, Version 0.9.9. RiSE-group (Research in Spatial Economics). EAFIT University. http://www.rise-group.org
Arthur D, Vassilvitskii S (2007) k-means++: the advantages of careful seeding. Proceedings of the eighteenth annual ACM-SIAM symposium on discrete algorithms. Society for Industrial and Applied Mathematics, Philadelphia, PA, USA, pp 1027–1035
Glover F (1977) Heuristic for integer programming using surrogate constraints. Decis Sci 8:156–166
Duque JC, Anselin L, Rey S (2010) The max-p region problem. Working paper, GeoDa Center for Geospatial Analysis and Computation, Arizona State University, Tempe, AZ
Aldstadt J, Getis A (2006) Using AMOEBA to create a spatial weights matrix and identify spatial clusters. Geogr Anal 38:327–343
Ord J, Getis A (1995) Local spatial autocorrelation statistics: distributional issues and application. Geogr Anal 27(4):286–306
Duque JC, Aldstadt J, Velasquez E, Franco JL, Betancourt A (2011) A computationally efficient method for delineating irregularly shaped spatial clusters. J Geogr Syst 13:355–372
Dao THD, Thill J-C (2016) The SpatialARMED framework: handling complex spatial components in spatial association rule mining. Geogr Anal 48:248–274
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer International Publishing AG
About this paper
Cite this paper
Dao, T.H.D., Thill, JC. (2018). Detecting Attribute-Based Homogeneous Patches Using Spatial Clustering: A Comparison Test. In: Popovich, V., Schrenk, M., Thill, JC., Claramunt, C., Wang, T. (eds) Information Fusion and Intelligent Geographic Information Systems (IF&IGIS'17). Lecture Notes in Geoinformation and Cartography. Springer, Cham. https://doi.org/10.1007/978-3-319-59539-9_4
Download citation
DOI: https://doi.org/10.1007/978-3-319-59539-9_4
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-59538-2
Online ISBN: 978-3-319-59539-9
eBook Packages: Earth and Environmental ScienceEarth and Environmental Science (R0)