Skip to main content
Log in

Efficient discovery of co-location patterns from massive spatial datasets with or without rare features

  • Regular Paper
  • Published:
Knowledge and Information Systems Aims and scope Submit manuscript

Abstract

A co-location pattern indicates a group of spatial features whose instances are frequently located together in proximate geographic area. Spatial co-location pattern mining (SCPM) is valuable for many practical applications. Numerous previous SCPM studies emphasize the equal participation per feature. As a result, the interesting co-locations with rare features cannot be captured. In this paper, we propose a novel interest measure, i.e., the weighted participation index (WPI), to identify co-locations with or without rare features. The WPI measure possesses a conditional anti-monotone property which can be utilized to prune the search space. In addition, a fast row instance identification mechanism based on the ordered NR-tree is proposed to enhance efficiency. Subsequently, the ordered NR-tree-based algorithm is developed. To further improve efficiency and process massive spatial data, we break the ordered NR-tree into multiple independent subtrees, and parallelize the ordered NR-tree-based algorithm on MapReduce framework. Extensive experiments are conducted on both real and synthetic datasets to verify the effectiveness, efficiency and scalability of our techniques.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13

Similar content being viewed by others

References

  1. Andrzejewski W, Boinski P (2015) Parallel GPU-based plane-sweep algorithm for construction of iCPI-Trees. J Database Manag 26(3):1–20

    Article  Google Scholar 

  2. Andrzejewski W, Boinski P (2018) Efficient spatial co-location pattern mining on multiple GPUs. Expert Syst Appl 93:465–483

    Article  Google Scholar 

  3. Andrzejewski W, Boinski P (2019) Parallel approach to incremental co-location pattern mining. Inf Sci 496:485–505

    Article  Google Scholar 

  4. Barua S, Sander J (2014) Mining statistically significant co-location and segregation patterns. IEEE Trans Knowl Data Eng 26(5):1185–1199

    Article  Google Scholar 

  5. Cai J, Liu Q, Deng M, Tang J, He Z (2018) Adaptive detection of statistically significant regional spatial co-location patterns. Comput Environ Urban Syst 68:53–63

    Article  Google Scholar 

  6. Chan HK, Long C, Yan D, Wong RC (2019) Fraction-score: a new support measure for co-location pattern mining. In: IEEE international conference on data engineering (ICDE), pp 1514–1525

  7. Fang Y, Wang L, Wang X, Zhou L (2017) Mining co-location patterns with dominant features. In: International conference on web information systems engineering (WISE), pp 183–198

  8. Feng L, Wang L, Gao S (2012) A new approach of mining co-location patterns in spatial datasets with rare features. J Nanjing Univ Nat Sci 48(1):99–107 ((in Chinese))

    Google Scholar 

  9. Ge Y, Yao Z, Li H (2021) Computing co-location patterns in spatial data with extended objects: a scalable buffer-based approach. IEEE Trans Knowl Data Eng 33(2):401–414

    Article  Google Scholar 

  10. Huang Y, Pei J, Xiong H (2006) Mining co-location patterns with rare events from spatial data sets. GeoInformatica 10(3):239–260

    Article  Google Scholar 

  11. Huang Y, Shekhar S, Xiong H (2004) Discovering colocation patterns from spatial data sets: a general approach. IEEE Trans Knowl Data Eng 16(12):1472–1485

    Article  Google Scholar 

  12. Li J, Adilmagambetov A, Jabbar MSM, Osornio-Vargas A, Wine O (2016) On discovering co-location patterns in datasets: a case study of pollutants and child cancers. Geoinformatica 20(4):651–692

    Article  Google Scholar 

  13. Liu B, Chen L, Liu C, Zhang C, Qiu W (2015) RCP mining: towards the summarization of spatial co-location patterns. In: International symposium on spatial and temporal databases (SSTD), pp 451–469

  14. Lu J, Wang L, Fang Y, Li M (2017) Mining competitive pairs hidden in co-location patterns from dynamic spatial databases. In: Pacific Asia knowledge discovery and data mining (PAKDD), pp 467–480

  15. Lu J, Wang L, Fang Y, Zhao J (2018) Mining strong symbiotic patterns hidden in spatial prevalent co-location patterns. Knowl Based Syst 146:190–202

    Article  Google Scholar 

  16. Ouyang Z, Wang L, Wu P (2017) Spatial co-location pattern discovery from fuzzy objects. Int J Artif Intell Tools 26(2):1750003. https://doi.org/10.1142/S0218213017500038

    Article  Google Scholar 

  17. Shekhar S, Huang Y (2001) Discovering spatial co-location patterns: a summary of results. In: International symposium on spatial and temporal databases (SSTD), pp 236–256

  18. Wang L, Bao X, Cao L (2018) Interactive probabilistic post-mining of user-preferred spatial co-location patterns. In: IEEE international conference on data engineering (ICDE), pp 1256–1259

  19. Wang L, Bao X, Chen H, Cao L (2018) Effective lossless condensed representation and discovery of spatial co-location patterns. Inf Sci 436:197–213

    Article  MathSciNet  Google Scholar 

  20. Wang L, Bao X, Zhou L (2018) Redundancy reduction for prevalent co-location patterns. IEEE Trans Knowl Data Eng 30(1):142–155

    Article  Google Scholar 

  21. Wang L, Bao X, Zhou L, Chen H (2019) Mining maximal sub-prevalent co-location patterns. World Wide Web 22(5):1971–1997

    Article  Google Scholar 

  22. Wang L, Bao Y, Lu J, Yip J (2008) A new join-less approach for co-location pattern mining. In: IEEE international conference on computer and information technology (CIT), pp 197–202

  23. Wang L, Bao Y, Lu Z (2009) Efficient discovery of spatial co-location patterns using the iCPI-tree. Open Inf Syst J 3(1):69–80

    Google Scholar 

  24. Yang P, Wang L, Wang X (2018) A parallel spatial co-location pattern mining approach based on ordered clique growth. In: International conference on database systems for advanced applications (DASFAA), pp 734–742

  25. Yang P, Wang L, Wang X (2019) An effective approach on mining co-location patterns from spatial databases with rare features. In: IEEE international conference on mobile data management (MDM), pp 53–62

  26. Yang P, Wang L, Wang X, Fang Y (2018) A parallel joinless algorithm for co-location pattern mining based on group-dependent shard. In: International conference on web information systems engineering (WISE), pp 240–250

  27. Yang P, Zhang T, Wang L (2018) TSRS: trip service recommended system based on summarized co-location patterns. In: APWEB/WAIM, pp 451–455

  28. Yao X, Chen L, Peng L, Chi T (2017) A co-location pattern-mining algorithm with a density-weighted distance thresholding consideration. Inf Sci 396:144–161

    Article  Google Scholar 

  29. Yoo JS, Boulware D, Kimmey D (2020) Parallel co-location mining with MapReduce and NoSQL systems. Knowl Inf Syst 62:1433–1463

    Article  Google Scholar 

  30. Yoo JS, Shekhar S (2004) A partial join approach for mining co-location patterns. In: the 12th Annual ACM international workshop on geographic information systems, pp 241–249

  31. Yoo JS, Shekhar S (2006) A joinless approach for mining spatial colocation patterns. IEEE Trans Knowl Data Eng 18(10):1323–1337

    Article  Google Scholar 

  32. Yu W (2016) Spatial co-location pattern mining for location-based services in road networks. Expert Syst Appl 46:324–335

    Article  Google Scholar 

Download references

Acknowledgements

This work is supported by the National Natural Science Foundation of China (61966036, 61662086, 61762090), and the Project of Innovative Research Team of Yunnan Province (2018HC019).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Lizhen Wang.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Yang, P., Wang, L., Wang, X. et al. Efficient discovery of co-location patterns from massive spatial datasets with or without rare features. Knowl Inf Syst 63, 1365–1395 (2021). https://doi.org/10.1007/s10115-021-01559-3

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10115-021-01559-3

Keywords