ABSTRACT
The emergence of web technologies is generating a data deluge called Big Data. All this data is in fact a gold mine to be exploited. However, we are confronted with huge volumes of heterogeneous data (various formats) and varied data (various sources) and in continuous expansion. To deal with this, some research works have introduced ontologies: this is the purpose of this paper. We present the Big Data concept on the one hand and the ontology concept on the other. We first recalled the definitions of Big Data, its main dimensions known by the 3 V (volume, velocity, variety), the fields of application as well as the various problems related to it. We reviewed the different solutions proposed as well as the existing tools by using the NoSQL and the Map-Reduce paradigm implemented in Hadoop and Spark.
We then looked at the concept of ontology, starting by recalling the definition of ontology, so an ontology is a conceptual model to represent reality and on which it is possible to develop systems that can be shared and reused. Ontologies are used to represent a domain and reason about its entities.
Finally, we presented and discussed some research works that combined ontologies and Big Data. We have found that there is a very abundant literature that deals with big data and ontologies separately, but few studies combine the two concepts together. We will therefore focus on the latter in order to enrich the scientific literature in the domain.
- K. Davis and D. Patterson. 2012. Ethics of Big Data: Balancing Risk and Innovation. O'Reilly Media, 2012. Google ScholarDigital Library
- G. Halevi and H. Moed. 2012. The evolution of big data as a research and scientific topic: Overview of the literature. Res. Trends (2012) 3--6.Google Scholar
- K. Krishnan. 2013. Data warehousing in the age of big data. In: The Morgan Kaufmann Series on Business Intelligence, Elsevier Science, 2013. Google ScholarDigital Library
- A. Reeve. 2013. Managing Data in Motion: Data Integration Best Practice Techniques and Technologies. Morgan Kaufmann. Google ScholarDigital Library
- Cheikh Kacfah Emani, Nadine Cullot and Christophe Nicolle. 2015. Understandable Big Data: A survey. Computer Science Review Elsevier 17 (2015) 70 -- 81. Google ScholarDigital Library
- Hadi Hashem. 2016. Modélisation intégratrice du traitement BigData. Modélisation et simulation. Université Paris-Saclay, Français. <NNT: 2016SACLL005>. <tel-01378609>.Google Scholar
- Radenski, A., Gurov, T., Kaloyanova, K., Kirov, N., Nisheva, M., Stanchev, P. and Stoimenova, E. 2016. Big data techniques, systems, applications, and platforms: Case studies from academia. In: Computer Science and Information Systems (FedCSIS), 2016 Federated Conference On. IEEE, pp. 883--888. ResearchGate Link, n.d.Google Scholar
- Gruber, T. R. 1993. A translation approach to portable ontologies. Knowledge Acquisition. 5(2):199--220. Google ScholarDigital Library
- InterOp, I. 2006. State of the art and state of the practice including initial possible research orientations. d8.1. Rapport technique, InterOp, Interoperability Research for Networked Enterprises Applications and Software.Google Scholar
- J. Manyika, M. Chui, B. Brown, J. Bughin, R. Dobbs, C. Roxburgh and A.H. Byers. 2011. Big Data: The Next Frontier for Innovation, Competition, and Productivity. McKinsey Global Institute.Google Scholar
- P. Zikopoulos and C. Eaton. 2011. Understanding Big Data: Analytics for Enterprise Class Hadoop and Streaming Data. McGraw-Hill Education. Google ScholarDigital Library
- I. O'Reilly Media. 2014. Big Data Now. 2014 Edition, O'Reilly Media.Google Scholar
- Yannick Prié. 2010. Ingénierie ontologique. UFR Informatique, Université Claude Bernard Lyon 1, 2010.Google Scholar
- L. Rodríguez-Mazahua, C.-A. Rodríguez-Enríquez, J. L. Sánchez-Cervantes, J. Cervantes, J. L. García-Alcaraz, and G. Alor-Hernández. 2016. A general perspective of Big Data: applications, tools, challenges and trends ", J Supercomput, vol. 72, no 8, p. 3073--3113, août 2016. Google ScholarDigital Library
- Fernandez, M., Gomez-Pérez, A. and Juristo, N. 1997. METHONTOLOGY: From Ontological Art Towards Ontological Engineering. Proceedings of the AAAI-97 Spring Symposium Series on Ontological Engineering, Stanford, CA, USA 1997, p 33--40. 1997.Google Scholar
- Nathalie H. and Josiane M. 2006. TtoO: une méthodologie de construction d'ontologie de domaine à partir d'un thésaurus et d'un corpus de référence ", IRIT, Toulouse- France 2006.Google Scholar
- Gruninger M. and Fox. M. 1995. The logic of enterprise modeling. In Brown, J. & O'Sullivan, D., (Eds.), Reengineering the Enterprise, Chapman and Hall, pages 83--98, 1995.Google Scholar
- Bachimont, B. 2000. Engagement sémantique et engagement ontologique: conception et réalisation d'ontologie en ingénierie des connaissances. In J. Charlet, M. Zacklad, G.Kassel et D. Bourigoult (éd.), "Ingénierie des connaissances. Evolutions récentes et nouveaux défis", Eyrolles, pp. 305--323. 2000.Google Scholar
- Xu Z., Cao X., Dong Y., and Su W. 2004. Formal Approach and Automated Tool for Translating ER Schemata into OWL Ontologies. Advances in Knowledge Discovery and Data Mining, Springer-Verlag Berlin Heidelberg, 2004, pp. 464--475.Google Scholar
- Li Kang, Li Yi and Liu Dong. 2014. Research on construction methods of big data semantic model. Proceedings of the World Congress on Engineering 2014 Vol I, WCE 2014, July 2 - 4, 2014, London, U.K.Google Scholar
- T. Malyuta, B. Smith and R. Rudnicki. n.d. The role of ontology in the era of big data. http://ncor.buffalo.edu/OI2/slides/Big%20Ontology%20-%20Malyuta.pdfGoogle Scholar
- Koutsomitropoulos, D.A. and Kalou, A.K. 2017. A standards-based ontology and support for Big Data Analytics in the insurance industry. ICT Express 2017 3, 57--61.Google Scholar
- Hussain, M., Ahmed, M., Khattak, H.A., Imran, M., Khan, A., Din, S., Ahmad, A., Jeon, G. and Reddy, A. 2018. Towards Ontology-based Multilingual URL Filtering: A Big Data Problem. The Journal of Supercomputing. 2018. https://doi.org/10.1007/s11227--018--2338--1 Google ScholarDigital Library
- Lytvyn, Vasyl, Victoria Vysotska, and Oleh Veres. 2018. Ontology of Big Data Analytics. Edited by Zoran Čekerevac. MEST Journal (MESTE) 2018. 6 (1): 41--60. doi:10.12709/mest.06.06.01.06. ResearchGate. URL https://www.researchgate.net/publication/322507142_ONTOLOGY_OF_BIG_DATA_ANALYTICSGoogle ScholarCross Ref
- Volk, M., Pohl, M. and Turowski, K. 2018. Classifying Big Data Technologies - An Ontology-based Approach. AMCIS 2018 Proceedings.Google Scholar
- Kulcu, S., Dogdu, E. and Ozbayoglu, A.M. 2016. A survey on semantic Web and big data technologies for social network analysis. In: IEEE International Conference on Big Data (Big Data). Presented at the 2016 IEEE International Conference on Big Data (Big Data), pp. 1768--1777. https://doi.org/10.1109/Big Data.2016.7840792.Google ScholarCross Ref
- Wongthongtham, P. and Abu-Salih, B. 2018. Ontology-based Approach for Semantic Data Extraction from Social Big Data: State-of-the-art and Research Directions. CoRR abs/1801.01624.Google Scholar
- Krishna D. Madhuri. 2016. A novel approach for processing Big Data. International Journal of Database Management Systems (IJDMS) Vol.8, No.5.Google Scholar
Recommendations
A Spark-Based Big Data Platform for Massive Remote Sensing Data Processing
ICDS 2015: Proceedings of the Second International Conference on Data Science - Volume 9208With the fast development of remote sensing techniques, the volume of acquired data grows exponentially. This brings a big challenge to process massive remote sensing data. In the paper, an in-memory computing framework is proposed to address this ...
Query Processing over Large RDF using SPARQL in Big Data
ICTCS '16: Proceedings of the Second International Conference on Information and Communication Technology for Competitive StrategiesInternet search is done by exploring the link graph and keyword frequency. In 2012, Google released "Knowledge Graph" --Semantic Web. The human reasoning can be enhanced by the use semantic web an emerging area. Most of the current applications link ...
Evaluating SQL-on-Hadoop for Big Data Warehousing on Not-So-Good Hardware
IDEAS '17: Proceedings of the 21st International Database Engineering & Applications SymposiumBig Data is currently conceptualized as data whose volume, variety or velocity impose significant difficulties in traditional techniques and technologies. Big Data Warehousing is emerging as a new concept for Big Data analytics. In this context, SQL-on-...
Comments