Abstract
Traditional point-of-interest (POI) data are collected by professional surveying and mapping organizations and are distributed in electronic maps. With the booming Internet and the development of crowdsourcing, the POI data defined in various formats are issued by some Internet companies and non-profit organizations. Due to the multiple sources and diverse formats of POI data, some problems occur in the data fusion process, such as conceptual definition differences, inconsistent classification, inefficient fusion algorithms, inaccurate fusion results, etc. To overcome the challenges of multi-source POI data fusion, this paper proposes a standardized POI data model and an ontology-based POI category system. Furthermore, a fusion framework and a fusion algorithm based on a two-stage clustering approach are proposed. The proposed method is compared with existing algorithms using datasets of different sizes, including POI surveying and mapping data from Kunming, China, Weibo check-in POI data, and real estate POI data. The experimental results demonstrate that the fusion effects of the proposed algorithm are superior to those of existing algorithms in terms of different evaluation indexes and operational efficiency.
Similar content being viewed by others
Change history
19 October 2021
A Correction to this paper has been published: https://doi.org/10.1007/s10489-021-02791-8
References
Jiang S, Alves A, Rodrigues F, et al. (2015) Mining point-of-interest data from social networks for urban land use classification and disaggregation. Comput Environ Urban Syst 53:36–46
Beil F, Ester M, Xu X (2002) Frequent term-based text clustering. Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining. ACM, pp 436–442
Beeri C, Doytsher Y, Kanza Y, et al. (2005) Finding corresponding objects when integrating several geo-spatial datasets. ACM international workshop on Geographic information systems Proceedings of the 13th annual. ACM, pp 87–96
Cai L, Pan J, Wei B, et al. (2018) Visualization Analysis for Spatio-temporal Pattern of Hotspots and Sentiment Change towards Microblog Check-in Data. J Chin Comput Syst 39(9):1889–1894
Chen R (2014) Study on the method of matching and fusion based on the multi-source POI data, M.Sc. Thesis, 68. Lanzhou Jiaotong University, Lanzhou
Ding Z, Jia Y, Zhou B (2014) Research review of micro-blog data mining. Comput Res Dev 51(4):691–706
Guarino N (1998) Formal ontology in information systems: Proceedings of the first international conference, vol 341. IOS press, Trento
Gao X (2013) Study on fusion of multi-source POI based on the spatial location information, M.Sc Thesis, vol 71. Ocean University of China, Qingdao
Huang M (2006) Key issues and applied research of geographic ontology, vol 151. China University of Science and Technology Press, Beijing
Lee JH, Kim MH, Lee YJ (1994) Ranking documents in thesaurus-based Boolean retrieval systems. Inf Process Manag 30(1):79–91
Liu Z, Liu L (2012) An empirical study of Chinese micro blog emotion classification based on machine learning. Comput Eng Appl 1:1–4
Metz CE (1978) Basic principles of ROC analysis. Semin Nuclear Med 8(4):283–298
Nachouki G, Quafafou M (2008) Multi-data source fusion. Inf Fusion 9(4):523–537
Sehgal V, Getoor L, Viechnicki PD (2006) Entity resolution in geospatial data integration. Proceedings of the 14th annual ACM international symposium on Advances in geographic information systems. ACM, New York, pp 83–90
Safra E, Kanza Y, Sagiv Y, et al. (2010) Location-based algorithms for finding sets of corresponding objects over several geo-spatial data sets. Int J Geograph Inf Sci 24(1):69–106
Scheffler T, Schirru R, Lehmann P (2012) Matching points of interest from different social networking sites. The 2012 International Conference on Artificial Intelligence. Springer, Las Vegas, pp 245–248
Salton G, Wong A, Yang C-S (1975) A vector space model for automatic indexing. Commun ACM 18(11):613–620
Levenshtein VI (1966) Binary codes capable of correcting deletions, insertions, and reversals. Sov Phys Doklady 10(8):707–710
Varshney PK (1997) Multisensor data fusion. Electron Commun Eng J 9(6):245–253. IET
WU C, REN F, DU Q, et al. (2014) Classification method of POI data based on formal ontology theory. Geogr Geo-Inf Sci 6:3
Zhu Q (2015) Research on the Ontology-oriented Automatic Semantic Classification of Geographic Information, Ph.D Dissertation, vol 143. Wuhan University, Wuhan
Zhong S, Fang Z, Zhu M, et al. (2017) A geo-ontology-based approach to decision-making in emergency management of meteorological disasters. Nat Hazards 89(2):531–554
Catriel B, Yaron K, Eliyahu S, et al. (2004) Object fusion in geographic information systems. Proceedings of the Thirtieth international conference on Very large data bases, vol 816-827. VLDB Endowment, Toronto
Botts M, Robin A, Greenwood J et al (2014) OGC SensorML: model and XML encoding standard. Techn Stand 2:12– 000
Auer S, Lehmann J, Hellmann S (2009) Linkedgeodata: Adding a spatial dimension to the web of data. The 8th International Semantic Web Conference, vol 731-746. Springer, Chantilly
Heikkinen A, Okkonen A, Karhu A, et al. (2014) A distributed POI data model based on the entity-component approach 2014. IEEE Symposium on Computers and Communications, vol 1–6. IEEE, Madeira
Xu S, Zhang Q, Li Y, et al. (2018) Fusion algorithm of multi-source interest points based on distance category. Comput Appl 38(5):118–122
ISO/TC211 (2003) Geographic Information–Spatial Referencing by Geographic Identifiers, vol 4. International Organization for Standardization, Geneva
Yang W (2004) Information Processing and Management Multisensor Data Fusion and its Applications, vol 205. Xi’an university of electronic science and technology press, Xi’an
Wu M (2007) Research on classification and query of agricultural geographic information based on multi-source heterogeneous spatio-temporal data driven by ontology, M.Sc Thesis, vol 37. Chinese academy of sciences, Beijing
Li D (2010) Research on reputation dimension discovery based on text clustering and corpus, M.Sc Thesis, vol 51. Huazhong University of science and technology, Wuhan
Dai W (2008) Semantic web information organization technology and method, vol 318. Xuelin Verlag, Shanghai
Sester M, Anders KH, Walter V (1998) Linking objects of different spatial data sets by integration and aggregation. GeoInformatica 2:335–358
Li C, Liu L, Dai Z, et al. (2020) Different sourcing point of interest matching method considering multiple constraints. Int J Geo-Inf 9(214):1–16
Piech M, Smywinski-Pohl A, Marcjan R, et al. (2020) Towards automatic points of interest matching. Int J Geo-Inf 9(291):1–29
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Informed Consent
On behalf of all authors, the corresponding author states that there is no conflict of interest.
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
The original online version of this article was revised: There are errors in the online article.
Rights and permissions
About this article
Cite this article
Cai, L., Zhu, L., Jiang, F. et al. Research on multi-source POI data fusion based on ontology and clustering algorithms. Appl Intell 52, 4758–4774 (2022). https://doi.org/10.1007/s10489-021-02561-6
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10489-021-02561-6