Skip to main content
Log in

Research on multi-source POI data fusion based on ontology and clustering algorithms

  • Published:
Applied Intelligence Aims and scope Submit manuscript

A Correction to this article was published on 19 October 2021

This article has been updated

Abstract

Traditional point-of-interest (POI) data are collected by professional surveying and mapping organizations and are distributed in electronic maps. With the booming Internet and the development of crowdsourcing, the POI data defined in various formats are issued by some Internet companies and non-profit organizations. Due to the multiple sources and diverse formats of POI data, some problems occur in the data fusion process, such as conceptual definition differences, inconsistent classification, inefficient fusion algorithms, inaccurate fusion results, etc. To overcome the challenges of multi-source POI data fusion, this paper proposes a standardized POI data model and an ontology-based POI category system. Furthermore, a fusion framework and a fusion algorithm based on a two-stage clustering approach are proposed. The proposed method is compared with existing algorithms using datasets of different sizes, including POI surveying and mapping data from Kunming, China, Weibo check-in POI data, and real estate POI data. The experimental results demonstrate that the fusion effects of the proposed algorithm are superior to those of existing algorithms in terms of different evaluation indexes and operational efficiency.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10

Similar content being viewed by others

Change history

References

  1. Jiang S, Alves A, Rodrigues F, et al. (2015) Mining point-of-interest data from social networks for urban land use classification and disaggregation. Comput Environ Urban Syst 53:36–46

    Article  Google Scholar 

  2. Beil F, Ester M, Xu X (2002) Frequent term-based text clustering. Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining. ACM, pp 436–442

  3. Beeri C, Doytsher Y, Kanza Y, et al. (2005) Finding corresponding objects when integrating several geo-spatial datasets. ACM international workshop on Geographic information systems Proceedings of the 13th annual. ACM, pp 87–96

  4. Cai L, Pan J, Wei B, et al. (2018) Visualization Analysis for Spatio-temporal Pattern of Hotspots and Sentiment Change towards Microblog Check-in Data. J Chin Comput Syst 39(9):1889–1894

    Google Scholar 

  5. Chen R (2014) Study on the method of matching and fusion based on the multi-source POI data, M.Sc. Thesis, 68. Lanzhou Jiaotong University, Lanzhou

    Google Scholar 

  6. Ding Z, Jia Y, Zhou B (2014) Research review of micro-blog data mining. Comput Res Dev 51(4):691–706

    Google Scholar 

  7. Guarino N (1998) Formal ontology in information systems: Proceedings of the first international conference, vol 341. IOS press, Trento

    Google Scholar 

  8. Gao X (2013) Study on fusion of multi-source POI based on the spatial location information, M.Sc Thesis, vol 71. Ocean University of China, Qingdao

    Google Scholar 

  9. Huang M (2006) Key issues and applied research of geographic ontology, vol 151. China University of Science and Technology Press, Beijing

  10. Lee JH, Kim MH, Lee YJ (1994) Ranking documents in thesaurus-based Boolean retrieval systems. Inf Process Manag 30(1):79–91

    Article  Google Scholar 

  11. Liu Z, Liu L (2012) An empirical study of Chinese micro blog emotion classification based on machine learning. Comput Eng Appl 1:1–4

    Google Scholar 

  12. Metz CE (1978) Basic principles of ROC analysis. Semin Nuclear Med 8(4):283–298

    Article  Google Scholar 

  13. Nachouki G, Quafafou M (2008) Multi-data source fusion. Inf Fusion 9(4):523–537

    Article  Google Scholar 

  14. Sehgal V, Getoor L, Viechnicki PD (2006) Entity resolution in geospatial data integration. Proceedings of the 14th annual ACM international symposium on Advances in geographic information systems. ACM, New York, pp 83–90

  15. Safra E, Kanza Y, Sagiv Y, et al. (2010) Location-based algorithms for finding sets of corresponding objects over several geo-spatial data sets. Int J Geograph Inf Sci 24(1):69–106

    Article  Google Scholar 

  16. Scheffler T, Schirru R, Lehmann P (2012) Matching points of interest from different social networking sites. The 2012 International Conference on Artificial Intelligence. Springer, Las Vegas, pp 245–248

  17. Salton G, Wong A, Yang C-S (1975) A vector space model for automatic indexing. Commun ACM 18(11):613–620

    Article  Google Scholar 

  18. Levenshtein VI (1966) Binary codes capable of correcting deletions, insertions, and reversals. Sov Phys Doklady 10(8):707–710

    MathSciNet  Google Scholar 

  19. Varshney PK (1997) Multisensor data fusion. Electron Commun Eng J 9(6):245–253. IET

  20. WU C, REN F, DU Q, et al. (2014) Classification method of POI data based on formal ontology theory. Geogr Geo-Inf Sci 6:3

    Google Scholar 

  21. Zhu Q (2015) Research on the Ontology-oriented Automatic Semantic Classification of Geographic Information, Ph.D Dissertation, vol 143. Wuhan University, Wuhan

  22. Zhong S, Fang Z, Zhu M, et al. (2017) A geo-ontology-based approach to decision-making in emergency management of meteorological disasters. Nat Hazards 89(2):531–554

    Article  Google Scholar 

  23. Catriel B, Yaron K, Eliyahu S, et al. (2004) Object fusion in geographic information systems. Proceedings of the Thirtieth international conference on Very large data bases, vol 816-827. VLDB Endowment, Toronto

  24. Botts M, Robin A, Greenwood J et al (2014) OGC SensorML: model and XML encoding standard. Techn Stand 2:12– 000

    Google Scholar 

  25. Auer S, Lehmann J, Hellmann S (2009) Linkedgeodata: Adding a spatial dimension to the web of data. The 8th International Semantic Web Conference, vol 731-746. Springer, Chantilly

  26. Heikkinen A, Okkonen A, Karhu A, et al. (2014) A distributed POI data model based on the entity-component approach 2014. IEEE Symposium on Computers and Communications, vol 1–6. IEEE, Madeira

  27. Xu S, Zhang Q, Li Y, et al. (2018) Fusion algorithm of multi-source interest points based on distance category. Comput Appl 38(5):118–122

    Google Scholar 

  28. ISO/TC211 (2003) Geographic Information–Spatial Referencing by Geographic Identifiers, vol 4. International Organization for Standardization, Geneva

  29. Yang W (2004) Information Processing and Management Multisensor Data Fusion and its Applications, vol 205. Xi’an university of electronic science and technology press, Xi’an

  30. Wu M (2007) Research on classification and query of agricultural geographic information based on multi-source heterogeneous spatio-temporal data driven by ontology, M.Sc Thesis, vol 37. Chinese academy of sciences, Beijing

    Google Scholar 

  31. Li D (2010) Research on reputation dimension discovery based on text clustering and corpus, M.Sc Thesis, vol 51. Huazhong University of science and technology, Wuhan

    Google Scholar 

  32. Dai W (2008) Semantic web information organization technology and method, vol 318. Xuelin Verlag, Shanghai

  33. Sester M, Anders KH, Walter V (1998) Linking objects of different spatial data sets by integration and aggregation. GeoInformatica 2:335–358

    Article  Google Scholar 

  34. Li C, Liu L, Dai Z, et al. (2020) Different sourcing point of interest matching method considering multiple constraints. Int J Geo-Inf 9(214):1–16

    Google Scholar 

  35. Piech M, Smywinski-Pohl A, Marcjan R, et al. (2020) Towards automatic points of interest matching. Int J Geo-Inf 9(291):1–29

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Li Cai.

Ethics declarations

Informed Consent

On behalf of all authors, the corresponding author states that there is no conflict of interest.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

The original online version of this article was revised: There are errors in the online article.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Cai, L., Zhu, L., Jiang, F. et al. Research on multi-source POI data fusion based on ontology and clustering algorithms. Appl Intell 52, 4758–4774 (2022). https://doi.org/10.1007/s10489-021-02561-6

Download citation

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10489-021-02561-6

Keywords

Navigation