Abstract
A co-location pattern indicates a group of spatial features whose instances are frequently located together in proximate geographic area. Spatial co-location pattern mining (SCPM) is valuable for many practical applications. Numerous previous SCPM studies emphasize the equal participation per feature. As a result, the interesting co-locations with rare features cannot be captured. In this paper, we propose a novel interest measure, i.e., the weighted participation index (WPI), to identify co-locations with or without rare features. The WPI measure possesses a conditional anti-monotone property which can be utilized to prune the search space. In addition, a fast row instance identification mechanism based on the ordered NR-tree is proposed to enhance efficiency. Subsequently, the ordered NR-tree-based algorithm is developed. To further improve efficiency and process massive spatial data, we break the ordered NR-tree into multiple independent subtrees, and parallelize the ordered NR-tree-based algorithm on MapReduce framework. Extensive experiments are conducted on both real and synthetic datasets to verify the effectiveness, efficiency and scalability of our techniques.













Similar content being viewed by others
References
Andrzejewski W, Boinski P (2015) Parallel GPU-based plane-sweep algorithm for construction of iCPI-Trees. J Database Manag 26(3):1–20
Andrzejewski W, Boinski P (2018) Efficient spatial co-location pattern mining on multiple GPUs. Expert Syst Appl 93:465–483
Andrzejewski W, Boinski P (2019) Parallel approach to incremental co-location pattern mining. Inf Sci 496:485–505
Barua S, Sander J (2014) Mining statistically significant co-location and segregation patterns. IEEE Trans Knowl Data Eng 26(5):1185–1199
Cai J, Liu Q, Deng M, Tang J, He Z (2018) Adaptive detection of statistically significant regional spatial co-location patterns. Comput Environ Urban Syst 68:53–63
Chan HK, Long C, Yan D, Wong RC (2019) Fraction-score: a new support measure for co-location pattern mining. In: IEEE international conference on data engineering (ICDE), pp 1514–1525
Fang Y, Wang L, Wang X, Zhou L (2017) Mining co-location patterns with dominant features. In: International conference on web information systems engineering (WISE), pp 183–198
Feng L, Wang L, Gao S (2012) A new approach of mining co-location patterns in spatial datasets with rare features. J Nanjing Univ Nat Sci 48(1):99–107 ((in Chinese))
Ge Y, Yao Z, Li H (2021) Computing co-location patterns in spatial data with extended objects: a scalable buffer-based approach. IEEE Trans Knowl Data Eng 33(2):401–414
Huang Y, Pei J, Xiong H (2006) Mining co-location patterns with rare events from spatial data sets. GeoInformatica 10(3):239–260
Huang Y, Shekhar S, Xiong H (2004) Discovering colocation patterns from spatial data sets: a general approach. IEEE Trans Knowl Data Eng 16(12):1472–1485
Li J, Adilmagambetov A, Jabbar MSM, Osornio-Vargas A, Wine O (2016) On discovering co-location patterns in datasets: a case study of pollutants and child cancers. Geoinformatica 20(4):651–692
Liu B, Chen L, Liu C, Zhang C, Qiu W (2015) RCP mining: towards the summarization of spatial co-location patterns. In: International symposium on spatial and temporal databases (SSTD), pp 451–469
Lu J, Wang L, Fang Y, Li M (2017) Mining competitive pairs hidden in co-location patterns from dynamic spatial databases. In: Pacific Asia knowledge discovery and data mining (PAKDD), pp 467–480
Lu J, Wang L, Fang Y, Zhao J (2018) Mining strong symbiotic patterns hidden in spatial prevalent co-location patterns. Knowl Based Syst 146:190–202
Ouyang Z, Wang L, Wu P (2017) Spatial co-location pattern discovery from fuzzy objects. Int J Artif Intell Tools 26(2):1750003. https://doi.org/10.1142/S0218213017500038
Shekhar S, Huang Y (2001) Discovering spatial co-location patterns: a summary of results. In: International symposium on spatial and temporal databases (SSTD), pp 236–256
Wang L, Bao X, Cao L (2018) Interactive probabilistic post-mining of user-preferred spatial co-location patterns. In: IEEE international conference on data engineering (ICDE), pp 1256–1259
Wang L, Bao X, Chen H, Cao L (2018) Effective lossless condensed representation and discovery of spatial co-location patterns. Inf Sci 436:197–213
Wang L, Bao X, Zhou L (2018) Redundancy reduction for prevalent co-location patterns. IEEE Trans Knowl Data Eng 30(1):142–155
Wang L, Bao X, Zhou L, Chen H (2019) Mining maximal sub-prevalent co-location patterns. World Wide Web 22(5):1971–1997
Wang L, Bao Y, Lu J, Yip J (2008) A new join-less approach for co-location pattern mining. In: IEEE international conference on computer and information technology (CIT), pp 197–202
Wang L, Bao Y, Lu Z (2009) Efficient discovery of spatial co-location patterns using the iCPI-tree. Open Inf Syst J 3(1):69–80
Yang P, Wang L, Wang X (2018) A parallel spatial co-location pattern mining approach based on ordered clique growth. In: International conference on database systems for advanced applications (DASFAA), pp 734–742
Yang P, Wang L, Wang X (2019) An effective approach on mining co-location patterns from spatial databases with rare features. In: IEEE international conference on mobile data management (MDM), pp 53–62
Yang P, Wang L, Wang X, Fang Y (2018) A parallel joinless algorithm for co-location pattern mining based on group-dependent shard. In: International conference on web information systems engineering (WISE), pp 240–250
Yang P, Zhang T, Wang L (2018) TSRS: trip service recommended system based on summarized co-location patterns. In: APWEB/WAIM, pp 451–455
Yao X, Chen L, Peng L, Chi T (2017) A co-location pattern-mining algorithm with a density-weighted distance thresholding consideration. Inf Sci 396:144–161
Yoo JS, Boulware D, Kimmey D (2020) Parallel co-location mining with MapReduce and NoSQL systems. Knowl Inf Syst 62:1433–1463
Yoo JS, Shekhar S (2004) A partial join approach for mining co-location patterns. In: the 12th Annual ACM international workshop on geographic information systems, pp 241–249
Yoo JS, Shekhar S (2006) A joinless approach for mining spatial colocation patterns. IEEE Trans Knowl Data Eng 18(10):1323–1337
Yu W (2016) Spatial co-location pattern mining for location-based services in road networks. Expert Syst Appl 46:324–335
Acknowledgements
This work is supported by the National Natural Science Foundation of China (61966036, 61662086, 61762090), and the Project of Innovative Research Team of Yunnan Province (2018HC019).
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Yang, P., Wang, L., Wang, X. et al. Efficient discovery of co-location patterns from massive spatial datasets with or without rare features. Knowl Inf Syst 63, 1365–1395 (2021). https://doi.org/10.1007/s10115-021-01559-3
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10115-021-01559-3