Abstract
In the context of the emergent Web of Data, a large number of organizations, institutes and companies (e.g., DBpedia, Data.gov, GeoNames, PubMed) adopt the Linked Data practices. Utilizing the Semantic Web (SW) technologies, they publish their data and offer SPARQL endpoints (i.e., SPARQL-based search services). On the other hand, the dominant standard for information exchange in the Web today is XML. Additionally, many international standards (e.g., Dublin Core, MPEG-7, METS, TEI, IEEE LOM) in several domains (e.g., Digital Libraries, GIS, Multimedia, e-Learning) have been expressed in XML Schema. The aforementioned have led to an increasing emphasis on XML data, accessed using the XQuery query language. The SW and XML worlds and their developed infrastructures are based on different data models, semantics and query languages. Thus, it is crucial to develop interoperability mechanisms that allow the Web of Data users to access XML datasets, using SPARQL, from their own working environments. It is unrealistic to expect that all the existing legacy data (e.g., Relational, XML, etc.) will be transformed into SW data. Therefore, publishing legacy data as Linked Data and providing SPARQL endpoints over them has become a major research challenge. In this direction, we introduce the SPARQL2XQuery Framework which creates an interoperable environment, where SPARQL queries are automatically translated to XQuery queries, in order to access XML data across the Web. The SPARQL2XQuery Framework provides a mapping model for the expression of OWL–RDF/S to XML Schema mappings as well as a method for SPARQL to XQuery translation. To this end, our Framework supports both manual and automatic mapping specification between ontologies and XML Schemas. In the automatic mapping specification scenario, the SPARQL2XQuery exploits the XS2OWL component which transforms XML Schemas into OWL ontologies. Finally, extensive experiments have been conducted in order to evaluate the schema transformation, mapping generation, query translation and query evaluation efficiency, using both real and synthetic datasets.
Similar content being viewed by others
References
Aguilera, V., Cluet, S., Milo, T., Veltri, P., Vodislav, D.: Views in a large-scale XML repository. VLDB J. 11(3), 271–280 (2002)
Akhtar, W., Kopecký, J., Krennwallner, T., Polleres, A.: XSPARQL: traveling between the XML and RDF worlds - and avoiding the XSLT pilgrimage. In Proc. of ESWC 2008
Amann, B., Beeri, C., Fundulaki, I., Scholl, M.: Querying XML sources using an ontology-based mediator. In Proc. of CoopIS 2002
Angles, R., Gutierrez, C.: The expressive power of SPARQL. In the Proc. of ISWC 2008
Arenas, M., Libkin, L.: XMLdata Exchange: consistency and query answering. J ACM 55(2) Article No. 7, (2008)
Bamford, R., Borkar, V., Brantner, M., Fischer, P., Florescu, D., Graf, D., Kossmann, D., Kraska, T., Muresan, D., Nasoi, S., Zacharioudakis, M.: XQuery Reloaded. PVLDB 2(2), 1342–1353 (2009)
Baral, C.: Knowledge representation, reasoning and declarative problem solving. Cambridge University Press, (2003). doi:10.1017/CBO9780511543357
Battle, S.: Gloze: XML to RDF and back again. In Jena User Conference 2006
Beckett, D. (ed.): SPARQL query results XML format. W3C Rec., Jan. 2008, http://www.w3.org/TR/rdf-sparql-XMLres/
Bedini, I., Gardarin, G., Nguyen, B.: Deriving ontologies from XML schema. In Proc. EDA 2008 Vol. B-4
Bedini, I., Matheus, C., Patel-Schneider, P., Boran, A., Nguyen, B.: Transforming XML schema to OWL using patterns. In Proc. of ICSC 2011
Berglund, A. et al. (eds.): XML Path Language (XPath) 2.0. W3C Rec., Jan. 2007. http://www.w3.org/TR/xpath20/
Bernd, A., Beeri, C., Fundulaki, I., Scholl, M.: Ontology-based integration of XML web resources. In Proc. of ISWC 2002
Berrueta, D., Labra, J.E., Herman, I.: XSLT+SPARQL: scripting the semantic web with SPARQL embedded into XSLT stylesheets. In Proc. of 4th Workshop on Scripting for the Semantic Web 2008
Bikakis, N., Gioldasis, N., Tsinaraki, C., Christodoulakis, S.: Querying XML data with SPARQL. In Proc. of 20th International Conference on Database and Expert Systems Applications (DEXA 2009)
Bikakis, N., Tsinaraki, C., Stavrakantonakis, I., Gioldasis, N., Christodoulakis, S.: The SPARQL2XQuery interoperability framework. Technical Report 2012, Available at: http://www.dblab.ntua.gr/~bikakis/SPARQL2XQueryTR2012.pdf
Bikakis N., Tsinaraki C., Gioldasis N., Stavrakantonakis I., Christodoulakis S.: The XML and Semantic Web Worlds: Technologies, Interoperability and Integration. A survey of the State of the Art. In Semantic Hyper/Multi-media Adaptation: Schemes and Applications, Springer 2013
Bischof S., Lopes N., Polleres A.: Improve Efficiency of Mapping Data between XML and RDF with XSPARQL. In RR 2011
Bischof S., Decker S., Krennwallner T., Lopes N., Polleres A.: Mapping between RDF and XML with XSPARQL. J. Data Semantics 1(3), (2012)
Bizer, C., Cyganiak, R.: D2R Server - Publishing Relational Databases on the Semantic Web. In Proc. of ISWC 2006
Bizer, C., Schultz, A.: The Berlin SPARQL benchmark. Int. J. Semantic Web Inf. Systems - Special Issue on Scalability and Performance of Semantic Web Systems, 2009
Blakeley C.: Mapping relational data to RDF with Virtuoso’s RDF Views. OpenLink Software 2007
Bohannon, P., Fan, W., Flaster, M., Narayan, P.: Information preserving XML Schema embedding. In Proc. of VLDB 2005
Bohring, H., Auer, S.: Mapping XML to OWL ontologies. Leipziger Informatik-Tage 2005
Bonifati, A., Chang, E.Q., Ho, T., Lakshmanan, L.V.S., Pottinger, R., Chung, Y.: Schema mapping and query translation in heterogeneous P2P XML databases. VLDB J. 19(2), 231–256 (2010)
Bray, T. et al. (eds.): Extensible Markup Language (XML) 1.1. W3C Rec., Sept. 2006. http://www.w3.org/TR/xml11/
Brickley, D., Guha, R.V. (eds.): RDF Vocabulary Description Language 1.0: RDF Schema. W3C Rec., Feb. 2004. http://www.w3.org/TR/rdf-schema
Chebotko, A., Lub, S., Fotouhib, F.: Semantics preserving SPARQL-to-SQL translation. Data Knowl. Eng. 68(10), 973–1000 (2009)
Chen, H., Wu, Z., Wang, H., Mao, Y.: RDF/RDFS-based relational database integration. In Proc. of ICDE 2006
Christophides, V., Karvounarakis, G., Koffina, I., Kokkinidis, G., Magkanaraki, A., Plexousakis, D., Serfiotis, G., Tannen, V.: The ICS-FORTH SWIM: a powerful semantic web integration middleware. In Proc. of SWDB 2003
Christophides, V., Karvounarakis, G., Magkanaraki, A., Plexousakis, D., Tannen, V.: The ICS-FORTH Semantic Web Integration Middleware (SWIM). IEEE Data Eng. Bull. 26(4), 11–18 (2003)
Connolly, D. (ed.): Gleaning resource descriptions from dialects of languages. W3C Rec., W3C, Sept. 2007. http://www.w3.org/TR/grddl/
Corby, O., Kefi-Khelif, L., Cherfi, H., Gandon, F., Khelif, K.: Querying the semantic web of data using SPARQL, RDF and XML. Technical Report INRIA 2009
Cruz, I., Huiyong, X., Hsu, F.: An ontology-based framework for XML semantic integration. In Proc. of IDEAS 2004
Cruz, C., Nicolle, C.: Ontology enrichment and automatic population from XML data. In Proc. of 4th Int. Workshop on Ontology-based Techniques, ODBIS 2008
Cruz. I., Xiao, H., Hsu, F.: Peer-to-peer semantic integration of XML and RDF data sources. In Proc. of 3rd AP2PC Workshop 2004
Cyganiak, R.: A relational algebra for SPARQL. Hewlett-Packard Laboratories Technical Report. 2005. http://www.hpl.hp.com/techreports/2005/HPL-2005-170.html
Das, S., Sundara, S., Cyganiak, R. (eds.): R2RML: RDB to RDF Mapping Language. W3C Recommendation, 27 Sept. 2012, http://www.w3.org/TR/r2rml/
DeHaan, D., Toman, D., Consens, M., Özsu, T.: A comprehensive XQuery to SQL translation using dynamic interval encoding. In Proc. of SIGMOD 2003
Deursen, D.V., Poppe, C., Martens, G., Mannens, E., Walle, R.V.d.: XML to RDF conversion: a generic approach. In: Proc. of AXMEDI S 2008
Deutsch, A., Tannen, V.: Reformulation of XML queries and constraints. In Proc. of ICDT 2003
Droop, M., Flarer, M., Groppe, J., Groppe, S., Linnemann, V., Pinggera, J., Santner, F., Schier, M., Schopf, F., Staffler, H., Zugal, S.: Bringing the XML and semantic web worlds closer: transforming XML into RDF and embedding XPath into SPARQL. In Proc. of ICEIS 2008
Droop, M., Flarer, M., Groppe, J., Groppe, S., Linnemann, V., Pinggera, J., Santner, F., Schier, M., Schöpf, F., Staffler, H., Zugal, S.: Embedding XPATH queries into SPARQL queries. In Proc. of the 10th International Conference on Enterprise Information Systems (ICEIS) 2008
Droop, M., Flarer, M., Groppe, J., Groppe, S., Linnemann, V., Pinggera, J., Santner, F., Schier, M., Schopf, F., Staffler, H., Zugal, S.: Translating XPath queries into SPARQL queries. In Proc. of OTM Workshops 2007
Dublin Core Metadata Element Set, Version 1.1. Dublin Core Metadata Initiative, http://dublincore.org/documents/dces/
Elliott, B., Cheng, E., Thomas-Ogbuji, C., Ozsoyoglu, Z.M.: A complete translation from SPARQL into efficient SQL. In Proc. of International Database Engineering & Applications Symposium IDEAS 2009
Encoded Archival Description (EAD). Library of Congress (LOC), http://www.loc.gov/ead/
Fagin R., Kolaitis P., Miller R., Popa L.: Data exchange: semantics and query answering. Theor. Comput. Sci. (TCS) 336(1), (2005)
Fallside, D. (ed.): XML Schema Part 0: Primer. W3C Rec., Oct. 2004. http://www.w3.org/TR/xmlschema-0/
Farrell, J., Lausen, H. (eds.): Semantic annotations for WSDL and XML schema. W3C Rec., W3C, Aug. 2007. http://www.w3.org/TR/sawsdl/
Ferdinand, M., Zirpins, C., Trastour, D.: Lifting XML Schema to OWL. In Proc. of ICWE 2004
Gao, S., Sperberg-McQueen, C.M., Thompson, H.S. (eds.): W3C XML Schema Definition Language (XSD) 1.1 Part 1: Structures. W3C Rec., Apr. 2012, http://www.w3.org/TR/xmlschema11-1/
Garcia, R., Celma, O.: Semantic integration and retrieval of multimedia meta- data. In Proc. of Fifth International Workshop on Knowledge Markup and Semantic Annotation 2005
Glimm, B., Ogbuji, C. (eds.): SPARQL 1.1 Entailment Regimes. W3C Working Draft, Jan. 2012, http://www.w3.org/TR/sparql11-entailment/
Groppe, S., Groppe, J., Linnemann, V., et al.: Embedding SPARQL into XQuery/XSLT. In Proc. of ACM SAC 2008
Gruber, T.R.: A translation approach to portable ontology specifications. Knowl Acquis 5(2), 199–220 (1993)
Halevy, A., Ives, Z., Mork, P., Tatarinov, I.: Piazza: data management infrastructure for semantic web applications. In Proc. WWW 2003
Halevy, A.Y., Ives, Z.G., Suciu, D., Tatarinov, I.: Schema mediation in peer data management systems. In Proc. of ICDE, 2003
Harris, S., Seaborne, A. (eds.): SPARQL 1.1 Query Language. W3C Working Draft, July 2012, http://www.w3.org/TR/sparql11-query/
IEEE WG-12: IEEE Standard for Learning Object Metadata (LOM): 1484.12.1-2002, http://ltsc.ieee.org/wg12/
Ives, Z., Halevy, A., Mork, P., Tatarinov, I.: Piazza: mediation and integration infrastructure for Semantic Web data. J. Web Sem. 1(2), 155–175 (2004)
Kappel, G., Kapsammer, E., Retschitzegger, W.: Integrating XML and relational database systems. WWW J. 7(4), (2004
Klein, M.: Interpreting XML via an RDF schema. In Proc. of Second International Workshop on Electronicy Business Hubs 2002
Koffina, I., Serfiotis, G., Christophides, V., Tannen, V.: Mediating RDF/S queries to relational and XML sources. Int. J. Semantic Web Inf. Syst. 2(4), 68–91 (2006)
Krishnamoorthy, K., Kumar, R., Dua, S.R.: Converting SPARQL queries to SQL queries. Microsoft Corporation, U.S. Patent 7818352, 2010
Krishnamurthy, R., Kaushik, R., Naughton, J.: XML-SQL query translation literature: the state of the art and open problems. In Proc. of Xsym 2003
Lehti, P., Fankhauser, P.: XML data integration with OWL: experiences and challenges. In Proc. of 3rd International Symposium on Applications and the Internet 2004
Lenzerini, M.: Data integration: a theoretical perspective. In Proc. of PODS 2002
Liu, C., Vincent, M., Liu, J.: Constraint preserving transformation from relational schema to XML Schema. WWW J. 9(1), 2006
Ma, L., Wang, C., Lu, J., et al.: Effective and Efficient Semantic Web Data Management over DB2. In Proc. of SIGMOD 2008
Makris K., Bikakis N., Gioldasis N., Christodoulakis S.: SPARQL-RW: transparent query access over mapped RDF data sources. In Proc. of 15th International Conference on Extending Database Technology (EDBT) 2012
Makris, K., Bikakis, N., Gioldasis, N., Tsinaraki, C., Christodoulakis, S.: Towards a Mediator based on OWL and SPARQL. In Proc. of 2nd World Summit on the Knowledge Society 2009
Makris, K., Gioldasis, N., Bikakis, N., Christodoulakis, S.: Ontology mapping and SPARQL Rewriting for Querying Federated RDF data sources. In Proc. of ODBASE 2010
Makris, K., Gioldasis, N., Bikakis, N., Christodoulakis, S.: SPARQL Rewriting for Query Mediation over mapped ontologies. Technical Report 2010. Available at: http://www.music.tuc.gr/reports/SPARQLREWRITING.PDF
Malhotra, A. et al. (eds.): XQuery 1.0 and XPath 2.0 Functions and Operators. W3C Rec., Dec. 2010. http://www.w3.org/TR/xpath-functions/
Manola, F., Milles E. (eds.): RDF Primer. W3C Rec., Feb. 2004. http://www.w3.org/TR/rdf-primer
MARC 21 concise format for bibliographic metadata. Library of Congress (LOC), http://www.loc.gov/marc/bibliographic/ecbdhome.html
McGuinness, D.L., van Harmelen, F. (eds.): OWL Web Ontology Language: overview. W3C Rec., Feb. 2004. http://www.w3.org/TR/owl-features
Metadata Authority Description Standard (MADS). Library of Congress (LOC), http://www.loc.gov/standards/mads/
Metadata Encoding and Transmission Standard (METS). Library of Congress (LOC), http://www.loc.gov/standards/mets/
Metadata Object Description Schema (MODS). Library of Congress (LOC), http://www.loc.gov/standards/mods/
Miller, R.J., Haas, L.M., Hernández, M.A.: Schema mapping as query discovery. In Proc. of VLDB 2000
Motik, B., Schneider, P.F.P., Parsia, B. (eds.): OWL 2 Web Ontology Language: Structural Specification and Functional-Style Syntax. W3C Rec., Oct. 2009, http://www.w3.org/TR/owl2-syntax/
MPEG-21 multimedia framework. ISO 21000–17:2003–2007. International Standardization Organization (ISO)
MPEG-7 multimedia content description interface. ISO 15938-1-11:2002–2007. International Standardization Organization (ISO)
NISO Metadata for Images in XML (MIX). Library of Congress (LOC), http://www.loc.gov/standards/mix/
Perez, J., Arenas, M., Gutierrez, C.: Semantics and complexity of SPARQL. ACM Trans. Database Syst. (TODS) 34(3), (2009)
Picalausa, F., Vansummeren, S.: What are real SPARQL queries like? In Proc. of Int. Workshop on Semantic Web Inf. Mngmt. 2011
Polleres, A.: From SPARQL to Rules (and back). In Proc. of WWW 2007
Popa, L., Velegrakis, Y., Miller, R.J., Hernández, M.A., Fagin, R.: Translating Web Data. In Proc. of VLDB 2000
Prud’hommeaux, E., Seaborne, A. (eds.): SPARQL query language for RDF. W3C Rec., Jan. 2008. http://www.w3.org/TR/rdf-sparql-query/
“RDB2RDF” - W3C Working Group. http://www.w3.org/2001/sw/rdb2rdf/
Reif, G., Jazayeri, M., Gall, H.: Towards semantic Web Engineering: WEESA-mapping XML schema to ontologies. In Proc. of WWW2004 Workshop on Application Design, Development, and Implementation Issues in the Semantic Web. 2004
Robie, J., Chamberlin, D., et al. (eds.): XQuery Update Facility 1.0. W3C Rec., March 2011. http://www.w3.org/TR/xquery-update-10/
Rodrigues, T., Rosa, P., Cardoso, J.: Mapping XML to exiting OWL ontologies. International Conference WWW/Internet 2006
Rodrigues, T., Rosa, P., Cardoso, J.: Moving from syntactic to semantic organizations using JXML2OWL. Comput. Ind. 59(8): 808–819 (2008)
Rodriguez-Muro M., Hardi J., Calvanese D.: Quest: Effcient SPARQL-to-SQL for RDF and OWL. In Proc. of ISWC 2012
Sahoo, S., Halb, W., Hellmann, S., Idehen, K., Thibodeau, T., Auer, S., Sequeda, J., Ezzat, A.: A survey of current approaches for mapping of relational databases to RDF. Technical Report. RDB2RDF W3C Working Group, 2009
Schenk, S., Gearon, P., Passant, A. (eds.): SPARQL 1.1 Update. W3C Rec., March 2013. http://www.w3.org/TR/sparql11-update/
Schmidt, M., Hornung, T, Lausen, G., Pinkel, C.: SP2Bench: a SPARQL performance benchmark. In Proc. of ICDE 2009
Sequeda, J., Depena, R., Miranker, D.: Ultrawrap: using SQL views for RDB2RDF. In Proc. of ISWC 2009
Sharable Content Reference Model (SCORM). Advanced Distributed Learning Initiative (ADL), http://www.adlnet.gov/scorm/index.aspx
Siméon, J., Chamberlin, D. (eds.): XQuery 1.0: an XML Query Language. W3C Rec., Jan. 2007. http://www.w3.org/TR/xquery/
Spanos, D.E., Stavrou, P., Mitrou, N.: Bringing relational databases into the Semantic Web: a survey. Semantic Web J. 3(2), 169–209 (2012)
Stavrakantonakis, I., Tsinaraki, C., Bikakis, N., Gioldasis, N., Christodoulakis, S.: SPARQL2XQuery 2.0: Supporting Semantic-based Queries over XML Data. In the Proc. of SMAP 2010
Tatarinov, I., Halevy, A.: Efficient query reformulation in peer data management systems. In Proc. of SIGMOD 2004
Technical Metadata for Text (TextMD). Library of Congress (LOC), http://www.loc.gov/standards/textMD/
Text Encoding and Interchange (TEI). TEI Consortium, http://www.tei-c.org /
Thiran, P., Estievenart, F., Hainaut, J.L., Houben, G.J.: A generic framework for extracting XML data from legacy databases. J. Web Eng. (JWE) 4(3), 205–223 (2005)
Thuy, P.T.T., Lee, Y.K., Lee, S., Jeong, B.S.: Exploiting XML schema for interpreting XML documents as RDF. In Proc. of International Conference on Services Computing 2008
Thuy, P.T.T., Lee, Y.K., Lee, S., Jeong, B.S.: Transforming valid XML documents into RDF via RDF schema. In Proc. of International Conference on Next Generation Web Services Practices 2007
Thuy, P.T.T, Lee, Y.K., Lee, S.: DTD2OWL: automatic transforming XML documents into OWL ontology. In Proc. of Int. Conf. Interaction Sciences 2009
Tsinaraki, C., Christodoulakis, S.: Interoperability of XML schema applications with OWL domain knowledge and semantic web tools. In Proc. of ODBASE 2007
Tsinaraki, C., Christodoulakis, S.: Support for interoperability between OWL based and XML Schema based applications. In the Proc. of DELOS Conference II, 2007
Unbehauen J., Stadler C., Auer S.: Accessing Relational Data on the Web with SparqlMap. JIST 2012
VRA Core 4.0. Visual Resources Association’s (VRA) Data Standards Committee, http://www.vraweb.org/projects/vracore4/index.html
Wache H., Voegele T., Visser U., Stuckenschmidt H., Schuster G., Neumann H., Huebner S.: Ontology-based integration of information - A survey of existing approaches. In Proc. of workshop on Ontologies and Information Sharing 2001
Xiao, H., Cruz, I.: RDF-based metadata management in peer-to- peer systems. In Proc of 2nd IST MMGPS Workshop 2004
Xiao, H., Cruz. I.: Integrating and exchanging XML data using ontologies. J Data Semantics VI (2006)
Yu, C., Popa, L.: Constraint-based XML Query Rewriting for Data Integration. In Proc. of SIGMOD 2004
Zemke, F.: Converting SPARQL to SQL. Technical Report, October 2006. http://lists.w3.org/Archives/Public/public-rdf-dawg/2006OctDec/att-0058/sparql-to-sql.pdf
Author information
Authors and Affiliations
Corresponding author
Additional information
Part of this work was done while Nikos Bikakis and Ioannis Stavrakantonakis were members of MUSIC/TUC Lab at Technical University of Crete.
Rights and permissions
About this article
Cite this article
Bikakis, N., Tsinaraki, C., Stavrakantonakis, I. et al. The SPARQL2XQuery interoperability framework. World Wide Web 18, 403–490 (2015). https://doi.org/10.1007/s11280-013-0257-x
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11280-013-0257-x