Abstract
The Big Data era has greatly accelerated the development of large, high-quality and valuable Knowledge Bases (\(\mathcal {K}\mathcal {B}\)) by academia (e.g., Cyc, DBpedia, Freebase, and YAGO) and industry (e.g., Knowledge Graph). At the same time, studies have identified the crucial role of \(\mathcal {K}\mathcal {B}\) in analytical tasks, since they offer analysts more entities (people, places, products, etc.). The availability of a huge, high-quality and valuable \(\mathcal {K}\mathcal {B}\) may contribute to designing value-added approaches for business intelligence applications. In this paper, we first propose a novel approach for semantic data warehouse (\(\mathcal {D}\mathcal {W}\)) design that considers \(\mathcal {K}\mathcal {B}\) in the life cycle. Second, based on a graph formalization adapted to \(\mathcal {K}\mathcal {B}\), we produce a conceptual multidimensional design and a semantic ETL process that orchestrates the graph data flows from the data sources to the \(\mathcal {D}\mathcal {W}\) storage. Finally, all steps of our approach are illustrated using the YAGO \(\mathcal {K}\mathcal {B}\) and deployed in Oracle RDF Semantic Graph 12c.
Copyright information
© 2016 Springer International Publishing Switzerland
About this paper
Cite this paper
Berkani, N., Bellatreche, L., Benatallah, B. (2016). A Value-Added Approach to Design BI Applications. In: Madria, S., Hara, T. (eds) Big Data Analytics and Knowledge Discovery. DaWaK 2016. Lecture Notes in Computer Science(), vol 9829. Springer, Cham. https://doi.org/10.1007/978-3-319-43946-4_24
DOI: https://doi.org/10.1007/978-3-319-43946-4_24
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-43945-7
Online ISBN: 978-3-319-43946-4
eBook Packages: Computer Science (R0)