Skip to main content

Big-Data Aggregating, Linking, Integrating and Representing Using Semantic Web Technologies

  • Conference paper
  • First Online:
The International Conference on Advanced Machine Learning Technologies and Applications (AMLTA2018) (AMLTA 2018)

Abstract

Semantic web provides information for humans as well as computers to semantically maintain a large-scale of data and provide a meaningful content of unstructured data. It offers new benefits for big-data research and applications. Big data is a new term refers to a massive collection of datasets from various sources in structured, semi-structured, and unstructured data collection. Their integration faces many problems such as the structural and the semantic heterogeneity as the processing of these data is difficult using traditional databases and software techniques. In this paper, the data resources are extracted and aggregated from different sources on the web following by using the geospatial ontology to transform this data into RDF format. RDF format is used to integrate the data semantically and construct the big-data semantic model that is used to store data. The major contribution of this research is to aggregate, integrate, and represent geospatial data semantically. A case study of cities data is used to illustrate the proposed workflow functionalities. The main result of this research is to solve the heterogeneous problem in different data sources with improving the data aggregation, integration, and representation.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 349.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 449.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    For e.g., a serious method to conceptualize field knowledge and modeling that can be used to describe the data semantics.

  2. 2.

    The alignment API 4.0 [28, 29].

  3. 3.

    The data used in Fig. 6 are extracted from [25].

References

  1. Wu, H., Yamaguchi, A.: Semantic web technologies for the big data in life sciences. Biosci. Trends 8(4), 192–201 (2014)

    Article  Google Scholar 

  2. Ahmed, Z., Gerhard, D.: Web to Semantic Web & Role of Ontology (2010). arXiv preprint: arXiv:1008.1331

  3. Jain, V., Singh, M.: Ontology-based information retrieval in semantic web: a survey. Int. J. Inf. Technol. Comput. Sci. (IJITCS) 5(10), 62 (2013)

    Google Scholar 

  4. Di Martino, B., Esposito, A., Nacchia, S., Maisto, S.A.: A semantic model for business process patterns to support cloud deployment. Comput. Sci. Res. Dev. 32(3–4), 257–267 (2017)

    Article  Google Scholar 

  5. Kang, L., Yi, L., Dong, L.: Research on construction methods of big data semantic model. In: Proceedings of the World Congress on Engineering (WCE 2014), vol. 1, London, UK (2014)

    Google Scholar 

  6. Bansal, S.K.: Towards a semantic extract-transform-load (ETL) framework for big data integration. In: 2014 IEEE International Congress on Big Data (BigData Congress), Anchorage, pp. 522–529. IEEE (2014)‏

    Google Scholar 

  7. Bertino, E.: Big data – opportunities and challenges. In: IEEE 37th Annual Computer Software and Applications Conference, Kyoto, Japan, pp. 479–480 (2013)

    Google Scholar 

  8. Thirunarayan, K., Sheth, A.: Semantics-empowered approaches to big data processing for physical-cyber-social applications. In: Semantics for Big Data: Papers from the AAAI Symposium. AAAI Technical report FS-13-04, Arlington, Virginia, USA, pp. 68–75 (2013)

    Google Scholar 

  9. Arputhamary, B., Arockiam, L.: A review on big data integration. Int. J. Comput. Appl., 21–26 (2014)

    Google Scholar 

  10. Bizer, C., Boncz, P., Brodie, M.L., Erling, O.: The meaningful use of big data: four perspectives–four challenges. ACM SIGMOD Rec. 40(4), 56–60 (2012)

    Article  Google Scholar 

  11. Bansal, S.K., Kagemann, S.: Integrating big data: a semantic extract-transform-load framework. Computer 48(3), 42–50 (2014)

    Article  Google Scholar 

  12. Bergamaschi, S., Guerra, F., Orsini, M., Sartori, C., Vincini, M.: A semantic approach to ETL technologies. Data Knowl. Eng. 70(8), 717–731 (2011)

    Article  Google Scholar 

  13. Jiang, L., Cai, H., Xu, B.: A domain ontology approach in the ETL process of data warehousing. In: 2010 IEEE 7th International Conference on e-Business Engineering (ICEBE), Shanghai, pp. 30–35 (2010)

    Google Scholar 

  14. Huang, O.R., Du, Y.L., Zhang, M.H., Zhang, C.: Application of ontology-based automatic ETL in marine data integration. In: IEEE Symposium on Electrical & Electronics Engineering (EEESYM), Kuala Lumpur, Malaysia, pp. 11–13 (2012)

    Google Scholar 

  15. Cruz, I.F., Ganesh, V.R., Mirrezaei, S.I.: Semantic extraction of geographic data from web tables for big data integration. In: Proceedings of the 7th Workshop on Geographic Information Retrieval, Orlando, FL, USA, pp. 19–26. ACM (2013)

    Google Scholar 

  16. Zhang, Y., Chiang, Y.Y., Szekely, P., Knoblock, C.A.: A semantic approach to retrieving, linking, and integrating heterogeneous geospatial data. In: Joint Proceedings of the Workshop on AI Problems and Approaches for Intelligent Environments and Workshop on Semantic Cities, pp. 31–37. ACM (2013)

    Google Scholar 

  17. Boury-Brisset, A.-C.: Managing semantic big data for intelligence. In: STIDS, pp. 41–47 (2013)

    Google Scholar 

  18. Xiong, J., Liu, Y., Liu, W.: Ontology-based integration and sharing of big data educational resources. In: IEEE 11th Web Information System and Application Conference (WISA), Tianjin, China, pp. 245–248 (2014)

    Google Scholar 

  19. Gollapudi, S.: Aggregating financial services data without assumptions: a semantic data reference architecture. In: 2015 IEEE International Conference on Semantic Computing (ICSC), Anaheim, CA, USA, pp. 312–315 (2015)

    Google Scholar 

  20. Saradha, A.: Semantic integration of heterogeneous web data for tourism domain using ontology-based resource description language. J. Comput. Appl. 3(3), 1 (2010)

    Google Scholar 

  21. Jadhao, H., Aghav, D.J., Vegiraju, A.: Semantic tool for analysing unstructured data. Int. J. Sci. Eng. Res. 3(8), 1–7 (2012)

    Google Scholar 

  22. MapCruzin data Homepage. http://www.mapcruzin.com/. Accessed 21 Oct 2017

  23. DATA.GOV Homepage. https://catalog.data.gov/. Accessed 20 Oct 2017

  24. United States Census Homepage. https://www.census.gov/. Accessed 1 Oct 2017

  25. OST/SEC Homepage. http://www.nws.noaa.gov/. Accessed 20 Oct 2017

  26. Cities data Homepage. https://www.uscitieslist.org/. Accessed 19 Oct 2017

  27. Gaslamp media Homepage. https://www.gaslampmedia.com. Accessed 19 Oct 2017

  28. David, J., Euzenat, J., Scharffe, F., Trojahn dos Santos, C.: The alignment API 4.0. Semant. Web Interoperability Usability Appl. 2(1), 3–10 (2011)

    Google Scholar 

  29. Euzenat, J.: An API for ontology alignment. In: International Semantic Web Conference, pp. 698–712. Springer, Heidelberg (2004)

    Google Scholar 

  30. Matthews, B.: Semantic web technologies. E-learning 6(6), 8 (2005)

    MathSciNet  Google Scholar 

  31. RDF Homepage. https://www.w3.org/RDF/. Accessed 15 Oct 2017

  32. RDF Homepage. http://www.webopedia.com/TERM/R/RDF.html. Accessed 1 Oct 2017

  33. OWL Homepage. https://www.w3.org/2001/sw/wiki/OWL. Accessed 21 Oct 2017

  34. SPARQL Query Language for RDF Homepage. https://www.w3.org/TR/rdf-sparql-query/. Accessed 20 Oct 2017

  35. Protégé Homepage. http://protegewiki.stanford.edu/wiki/Main_Page. Accessed 19 Oct 2017

  36. Do, H.H., Melnik, S., Rahm, E.: Comparison of schema matching evaluations. In: Net. ObjectDays: International Conference on Object-Oriented and Internet-Based Technologies, Concepts, and Applications for a Networked World, pp. 221–237. Springer, Heidelberg (2002)‏

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Abeer Saber .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2018 Springer International Publishing AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Saber, A., Al-Zoghby, A.M., Elmougy, S. (2018). Big-Data Aggregating, Linking, Integrating and Representing Using Semantic Web Technologies. In: Hassanien, A., Tolba, M., Elhoseny, M., Mostafa, M. (eds) The International Conference on Advanced Machine Learning Technologies and Applications (AMLTA2018). AMLTA 2018. Advances in Intelligent Systems and Computing, vol 723. Springer, Cham. https://doi.org/10.1007/978-3-319-74690-6_33

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-74690-6_33

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-74689-0

  • Online ISBN: 978-3-319-74690-6

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics