Abstract
Traditional prevalent co-location pattern mining (PCPM) methods generate complete table instances (TIs) of all candidates, which is both time and space consuming. Existing Apriori-like methods focus on improving the efficiency of TI generation, while existing Clique-based methods still contain many repeated traversing processes, and therefore neither of them can very well detect the overlap in TIs. To address these challenges, this paper first proposes the concept of extended maximal cliques (EMCs) to detect instance overlap situations, and designs a hash-based storage structure SSHT to reduce mining consumption. Second, a novel approach PCPM-EMC is introduced to detect all prevalent co-location patterns (PCPs), which uses the proposed P-BKp,d algorithm to generate EMCs, and adopts a bidirectional pruning strategy for PCP detection. Lastly, extensive experiments on both real-world and synthetic datasets show that the proposed approach is efficient, and reducing more over 80% space consumption and more than 50% time consumption than existing methods, especially in dense datasets.
The first author is a student.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Wang, L., Fang, Y., Zhou, L.: Preference-Based Spatial Co-location Pattern Mining. Springer, Singapore (2022). https://doi.org/10.1007/978-981-16-7566-9
Chan, H.K., Long, C., Yan, D., Wong, R.C.: Fraction-score: a new support measure for co-location pattern mining. In: Proceedings of 35th IEEE International Conference on Data Engineering (ICDE), Macao, China, pp. 1514–1525. IEEE (2019)
Shekhar, S., Huang, Y.: Discovering spatial co-location patterns: a summary of results. In: Jensen, C.S., Schneider, M., Seeger, B., Tsotras, V.J. (eds.) SSTD 2001. LNCS, vol. 2121, pp. 236–256. Springer, Heidelberg (2001). https://doi.org/10.1007/3-540-47724-1_13
Huang, Y., Shekhar, S., Xiong, H.: Discovering colocation patterns from spatial data sets: a general approach. IEEE Trans. Knowl. Data Eng. 16(12), 1472–1485 (2004)
Yoo, J.S., Shekhar, S.: A joinless approach for mining spatial colocation patterns. IEEE Trans. Knowl. Data Eng. 18(10), 1323–1337 (2006)
Wang, L., Bao, Y., Lu, Z.: Efficient discovery of spatial co-location patterns using the iCPI-tree. Open Inf. Syst. J. 3(1), 69–80 (2009)
Yang, P., Wang, L., Wang, X., Zhou, L.: A spatial co-location pattern mining approach based on column calculation. Scientia Sinica Informationis 52(06), 1053–1068 (2022)
Wang, L., Zhou, L., Lu, J., Yip, J.: An order-clique-based approach for mining maximal co-locations. Inf. Sci. 179(19), 3370–3382 (2009)
Yao, X., Peng, L., Yang, L., Chi, T.: A fast space-saving algorithm for maximal co-location pattern mining. Expert Syst. Appl. 63, 310–323 (2016)
Bao, X., Wang, L.: A clique-based approach for co-location pattern mining. Inf. Sci. 490, 244–264 (2019)
Wu, P., Wang, L., Zou, M.: A maximal ordered ego-clique based approach for prevalent co-location pattern mining. Inf. Sci. 608, 630–654 (2022)
Tran, V., Wang, L., Chen, H., Xiao, Q.: MCHT: A maximal clique and hash table-based maximal prevalent co-location pattern mining algorithm. Expert Syst. Appl. 175, 114830 (2021)
Zhang, S., Wang, L., Tran, V.: CPM-MCHM: a spatial co-location pattern mining algorithm based on maximal clique and hash map. Chinese J. Comput. 45(3), 526–541 (2022)
Wang, X., Lei, L., Wang, L., Yang, P., Chen, H.: Spatial colocation pattern discovery incorporating fuzzy theory. IEEE Trans. Fuzzy Syst. 30(6), 2055–2072 (2021)
Wang, L., Bao, X., Zhou, L.: Redundancy reduction for prevalent co-location patterns. IEEE Trans. Knowl. Data Eng. 30(1), 142–155 (2018)
Tomita, E., Tanaka, A., Takahashi, H.: The worst-case time complexity for generating all maximal cliques and computational experiments. Theor. Comput. Sci. 363(1), 28–42 (2006)
Li, Y., Shao, Z., Yu, D., Liao, X., Jin, H.: Fast maximal clique enumeration for real-world graphs. In: Li, G., Yang, J., Gama, J., Natwichai, J., Tong, Y. (eds.) DASFAA 2019. LNCS, vol. 11446, pp. 641–658. Springer, Cham (2019). https://doi.org/10.1007/978-3-030-18576-3_38
Cheng, J., Zhu, L., Ke, Y., Chu, S.: Fast algorithms for maximal clique enumeration with limited memory. In: Proceedings of 18th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Beijing, China, pp. 1240–1248. ACM (2012)
Xu, S., Liao, X., Shao, Z., Hua, Q., Jin, H.: Maximal clique enumeration problem on graphs: status and challenges. Scientia Sinica Informationis 52, 784–803 (2022)
Acknowledgments
This work was supported by the National Natural Science Foundation of China (62276227, 61966036, 62062066), the Project of Innovative Research Team of Yunnan Province (2018HC019), and the Yunnan Fundamental Research Projects (202201AS070015).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Li, J., Wang, L., Tran, V., Li, J., Jiang, X. (2023). Fast Mining Prevalent Co-location Patterns Over Dense Spatial Datasets. In: Meng, X., et al. Spatial Data and Intelligence. SpatialDI 2023. Lecture Notes in Computer Science, vol 13887. Springer, Cham. https://doi.org/10.1007/978-3-031-32910-4_13
Download citation
DOI: https://doi.org/10.1007/978-3-031-32910-4_13
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-32909-8
Online ISBN: 978-3-031-32910-4
eBook Packages: Computer ScienceComputer Science (R0)