Skip to main content

Integrating Data by Discovering Topological and Proximity Relations Among Spatiotemporal Entities

  • Chapter
  • First Online:
Big Data Analytics for Time-Critical Mobility Forecasting

Abstract

Link discovery (LD) is the process of identifying relations (links) between entities that originate from different data sources, thereby facilitating several tasks, such as data deduplication, record linkage, and data integration. Existing LD frameworks facilitate data integration tasks over multidimensional data. However, limited work has focused on spatial or spatiotemporal LD, which is typically much more processing-intensive due to the complexity of spatial relations. This chapter targets spatiotemporal link discovery, focusing on topological and proximity relations, proposing a framework with several salient features: support both for streaming and archival data, support of spatial relations in 2D and 3D, flexibility in terms of input consumption, improved filtering techniques, use of blocking techniques, proximity-based LD instead of merely topological LD, and a data-parallel design and implementation. The efficiency of the proposed spatiotemporal LD framework is demonstrated by means of experiments on real-life data from the maritime and aviation domains.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 149.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 199.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 199.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Allen, J.F.: Maintaining knowledge about temporal intervals. Commun. ACM 26(11), 832–843 (1983)

    Article  Google Scholar 

  2. Bleiholder, J., Naumann, F.: Data fusion. ACM Comput. Surv. 41(1), 1:1–1:41 (2008)

    Google Scholar 

  3. Carbone, P., Katsifodimos, A., Ewen, S., Markl, V., Haridi, S., Tzoumas, K.: Apache flink™: stream and batch processing in a single engine. IEEE Data Eng. Bull. 38(4), 28–38 (2015)

    Google Scholar 

  4. Christen, P.: A survey of indexing techniques for scalable record linkage and deduplication. IEEE Trans. Knowl. Data Eng. 24(9), 1537–1555 (2012)

    Article  Google Scholar 

  5. Christophides, V., Efthymiou, V., Stefanidis, K.: Entity Resolution in the Web of Data. Synthesis Lectures on the Semantic Web: Theory and Technology. Morgan & Claypool Publishers, San Rafael (2015)

    Google Scholar 

  6. Dong, X.L., Srivastava, D.: Big Data Integration. Synthesis Lectures on Data Management. Morgan & Claypool Publishers, San Rafael (2015)

    Book  Google Scholar 

  7. Elmagarmid, A.K., Ipeirotis, P.G., Verykios, V.S.: Duplicate record detection: a survey. IEEE Trans. Knowl. Data Eng. 19(1), 1–16 (2007)

    Article  Google Scholar 

  8. Isele, R., Jentzsch, A., Bizer, C.: Efficient multidimensional blocking for link discovery without losing recall. In: Proceedings of WebDB (2011)

    Google Scholar 

  9. Jagadish, H.V., Gehrke, J., Labrinidis, A., Papakonstantinou, Y., Patel, J.M., Ramakrishnan, R., Shahabi, C.: Big data and its technical challenges. Commun. ACM 57(7), 86–94 (2014)

    Article  Google Scholar 

  10. Mamoulis, N.: Spatial Data Management. Synthesis Lectures on Data Management. Morgan & Claypool Publishers, San Rafael (2011)

    Book  Google Scholar 

  11. Nentwig, M., Hartung, M., Ngomo, A.N., Rahm, E.: A survey of current link discovery frameworks. Semant. Web 8(3), 419–436 (2017)

    Article  Google Scholar 

  12. Ngomo, A.N.: A time-efficient hybrid approach to link discovery. In: Proceedings of OM (2011)

    Google Scholar 

  13. Ngomo, A.N.: Link discovery with guaranteed reduction ratio in affine spaces with Minkowski measures. In: Proceedings of ISWC, pp. 378–393 (2012)

    Google Scholar 

  14. Ngomo, A.N.: ORCHID - reduction-ratio-optimal computation of geo-spatial distances for link discovery. In: Proceedings of ISWC, pp. 395–410 (2013)

    Google Scholar 

  15. Ngomo, A.N., Auer, S.: LIMES - a time-efficient approach for large-scale link discovery on the web of data. In: Proceedings of IJCAI, pp. 2312–2317 (2011)

    Google Scholar 

  16. Papadakis, G., Skoutas, D., Thanos, E., Palpanas, T.: A survey of blocking and filtering techniques for entity resolution. CoRR abs/1905.06167 (2019)

    Google Scholar 

  17. Santipantakis, G.M., Kotis, K.I., Vouros, G.A., Doulkeridis, C.: RDF-gen: Generating RDF from streaming and archival data. In: Proceedings of the 8th International Conference on Web Intelligence, Mining and Semantics, WIMS 2018, Novi Sad, 25–27 June 2018, pp. 28:1–28:10 (2018)

    Google Scholar 

  18. Santipantakis, G.M., Vlachou, A., Doulkeridis, C., Artikis, A., Kontopoulos, I., Vouros, G.A.: A stream reasoning system for maritime monitoring. In: 25th International Symposium on Temporal Representation and Reasoning, TIME 2018, Warsaw, 15–17 October 2018, pp. 20:1–20:17 (2018)

    Google Scholar 

  19. Santipantakis, G.M., Glenis, A., Doulkeridis, C., Vlachou, A., Vouros, G.A.: stLD: towards a spatio-temporal link discovery framework. In: Proceedings of the International Workshop on Semantic Big Data, SBD@SIGMOD 2019, Amsterdam, 5 July 2019, pp. 4:1–4:6 (2019)

    Google Scholar 

  20. Sherif, M.A., Dreßler, K., Smeros, P., Ngomo, A.N.: Radon - rapid discovery of topological relations. In: Proceedings of AAAI, pp. 175–181 (2017)

    Google Scholar 

  21. Smeros, P., Koubarakis, M.: Discovering spatial and temporal links among RDF data. In: Proceedings of LDOW (2016)

    Google Scholar 

Download references

Acknowledgements

The research work has been supported by the datAcron project, which has received funding from the European Union’s Horizon 2020 research and innovation programme under grant agreement No 687591 and by the Hellenic Foundation for Research and Innovation (H.F.R.I.) under the “First Call for H.F.R.I. Research Projects to support Faculty members and Researchers and the procurement of high-cost research equipment grant” (Project Number: HFRI-FM17-81).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Christos Doulkeridis .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2020 Springer Nature Switzerland AG

About this chapter

Check for updates. Verify currency and authenticity via CrossMark

Cite this chapter

Santipantakis, G.M., Doulkeridis, C., Vlachou, A., Vouros, G.A. (2020). Integrating Data by Discovering Topological and Proximity Relations Among Spatiotemporal Entities. In: Vouros, G., et al. Big Data Analytics for Time-Critical Mobility Forecasting. Springer, Cham. https://doi.org/10.1007/978-3-030-45164-6_6

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-45164-6_6

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-45163-9

  • Online ISBN: 978-3-030-45164-6

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics