Skip to main content
Log in

The SPARQL2XQuery interoperability framework

Utilizing Schema Mapping, Schema Transformation and Query Translation to Integrate XML and the Semantic Web

  • Published:
World Wide Web Aims and scope Submit manuscript

Abstract

In the context of the emergent Web of Data, a large number of organizations, institutes and companies (e.g., DBpedia, Data.gov, GeoNames, PubMed) adopt the Linked Data practices. Utilizing the Semantic Web (SW) technologies, they publish their data and offer SPARQL endpoints (i.e., SPARQL-based search services). On the other hand, the dominant standard for information exchange in the Web today is XML. Additionally, many international standards (e.g., Dublin Core, MPEG-7, METS, TEI, IEEE LOM) in several domains (e.g., Digital Libraries, GIS, Multimedia, e-Learning) have been expressed in XML Schema. The aforementioned have led to an increasing emphasis on XML data, accessed using the XQuery query language. The SW and XML worlds and their developed infrastructures are based on different data models, semantics and query languages. Thus, it is crucial to develop interoperability mechanisms that allow the Web of Data users to access XML datasets, using SPARQL, from their own working environments. It is unrealistic to expect that all the existing legacy data (e.g., Relational, XML, etc.) will be transformed into SW data. Therefore, publishing legacy data as Linked Data and providing SPARQL endpoints over them has become a major research challenge. In this direction, we introduce the SPARQL2XQuery Framework which creates an interoperable environment, where SPARQL queries are automatically translated to XQuery queries, in order to access XML data across the Web. The SPARQL2XQuery Framework provides a mapping model for the expression of OWL–RDF/S to XML Schema mappings as well as a method for SPARQL to XQuery translation. To this end, our Framework supports both manual and automatic mapping specification between ontologies and XML Schemas. In the automatic mapping specification scenario, the SPARQL2XQuery exploits the XS2OWL component which transforms XML Schemas into OWL ontologies. Finally, extensive experiments have been conducted in order to evaluate the schema transformation, mapping generation, query translation and query evaluation efficiency, using both real and synthetic datasets.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  1. Aguilera, V., Cluet, S., Milo, T., Veltri, P., Vodislav, D.: Views in a large-scale XML repository. VLDB J. 11(3), 271–280 (2002)

    Google Scholar 

  2. Akhtar, W., Kopecký, J., Krennwallner, T., Polleres, A.: XSPARQL: traveling between the XML and RDF worlds - and avoiding the XSLT pilgrimage. In Proc. of ESWC 2008

  3. Amann, B., Beeri, C., Fundulaki, I., Scholl, M.: Querying XML sources using an ontology-based mediator. In Proc. of CoopIS 2002

  4. Angles, R., Gutierrez, C.: The expressive power of SPARQL. In the Proc. of ISWC 2008

  5. Arenas, M., Libkin, L.: XMLdata Exchange: consistency and query answering. J ACM 55(2) Article No. 7, (2008)

  6. Bamford, R., Borkar, V., Brantner, M., Fischer, P., Florescu, D., Graf, D., Kossmann, D., Kraska, T., Muresan, D., Nasoi, S., Zacharioudakis, M.: XQuery Reloaded. PVLDB 2(2), 1342–1353 (2009)

    Google Scholar 

  7. Baral, C.: Knowledge representation, reasoning and declarative problem solving. Cambridge University Press, (2003). doi:10.1017/CBO9780511543357

  8. Battle, S.: Gloze: XML to RDF and back again. In Jena User Conference 2006

  9. Beckett, D. (ed.): SPARQL query results XML format. W3C Rec., Jan. 2008, http://www.w3.org/TR/rdf-sparql-XMLres/

  10. Bedini, I., Gardarin, G., Nguyen, B.: Deriving ontologies from XML schema. In Proc. EDA 2008 Vol. B-4

  11. Bedini, I., Matheus, C., Patel-Schneider, P., Boran, A., Nguyen, B.: Transforming XML schema to OWL using patterns. In Proc. of ICSC 2011

  12. Berglund, A. et al. (eds.): XML Path Language (XPath) 2.0. W3C Rec., Jan. 2007. http://www.w3.org/TR/xpath20/

  13. Bernd, A., Beeri, C., Fundulaki, I., Scholl, M.: Ontology-based integration of XML web resources. In Proc. of ISWC 2002

  14. Berrueta, D., Labra, J.E., Herman, I.: XSLT+SPARQL: scripting the semantic web with SPARQL embedded into XSLT stylesheets. In Proc. of 4th Workshop on Scripting for the Semantic Web 2008

  15. Bikakis, N., Gioldasis, N., Tsinaraki, C., Christodoulakis, S.: Querying XML data with SPARQL. In Proc. of 20th International Conference on Database and Expert Systems Applications (DEXA 2009)

  16. Bikakis, N., Tsinaraki, C., Stavrakantonakis, I., Gioldasis, N., Christodoulakis, S.: The SPARQL2XQuery interoperability framework. Technical Report 2012, Available at: http://www.dblab.ntua.gr/~bikakis/SPARQL2XQueryTR2012.pdf

  17. Bikakis N., Tsinaraki C., Gioldasis N., Stavrakantonakis I., Christodoulakis S.: The XML and Semantic Web Worlds: Technologies, Interoperability and Integration. A survey of the State of the Art. In Semantic Hyper/Multi-media Adaptation: Schemes and Applications, Springer 2013

  18. Bischof S., Lopes N., Polleres A.: Improve Efficiency of Mapping Data between XML and RDF with XSPARQL. In RR 2011

  19. Bischof S., Decker S., Krennwallner T., Lopes N., Polleres A.: Mapping between RDF and XML with XSPARQL. J. Data Semantics 1(3), (2012)

  20. Bizer, C., Cyganiak, R.: D2R Server - Publishing Relational Databases on the Semantic Web. In Proc. of ISWC 2006

  21. Bizer, C., Schultz, A.: The Berlin SPARQL benchmark. Int. J. Semantic Web Inf. Systems - Special Issue on Scalability and Performance of Semantic Web Systems, 2009

  22. Blakeley C.: Mapping relational data to RDF with Virtuoso’s RDF Views. OpenLink Software 2007

  23. Bohannon, P., Fan, W., Flaster, M., Narayan, P.: Information preserving XML Schema embedding. In Proc. of VLDB 2005

  24. Bohring, H., Auer, S.: Mapping XML to OWL ontologies. Leipziger Informatik-Tage 2005

  25. Bonifati, A., Chang, E.Q., Ho, T., Lakshmanan, L.V.S., Pottinger, R., Chung, Y.: Schema mapping and query translation in heterogeneous P2P XML databases. VLDB J. 19(2), 231–256 (2010)

    Google Scholar 

  26. Bray, T. et al. (eds.): Extensible Markup Language (XML) 1.1. W3C Rec., Sept. 2006. http://www.w3.org/TR/xml11/

  27. Brickley, D., Guha, R.V. (eds.): RDF Vocabulary Description Language 1.0: RDF Schema. W3C Rec., Feb. 2004. http://www.w3.org/TR/rdf-schema

  28. Chebotko, A., Lub, S., Fotouhib, F.: Semantics preserving SPARQL-to-SQL translation. Data Knowl. Eng. 68(10), 973–1000 (2009)

    Google Scholar 

  29. Chen, H., Wu, Z., Wang, H., Mao, Y.: RDF/RDFS-based relational database integration. In Proc. of ICDE 2006

  30. Christophides, V., Karvounarakis, G., Koffina, I., Kokkinidis, G., Magkanaraki, A., Plexousakis, D., Serfiotis, G., Tannen, V.: The ICS-FORTH SWIM: a powerful semantic web integration middleware. In Proc. of SWDB 2003

  31. Christophides, V., Karvounarakis, G., Magkanaraki, A., Plexousakis, D., Tannen, V.: The ICS-FORTH Semantic Web Integration Middleware (SWIM). IEEE Data Eng. Bull. 26(4), 11–18 (2003)

    Google Scholar 

  32. Connolly, D. (ed.): Gleaning resource descriptions from dialects of languages. W3C Rec., W3C, Sept. 2007. http://www.w3.org/TR/grddl/

  33. Corby, O., Kefi-Khelif, L., Cherfi, H., Gandon, F., Khelif, K.: Querying the semantic web of data using SPARQL, RDF and XML. Technical Report INRIA 2009

  34. Cruz, I., Huiyong, X., Hsu, F.: An ontology-based framework for XML semantic integration. In Proc. of IDEAS 2004

  35. Cruz, C., Nicolle, C.: Ontology enrichment and automatic population from XML data. In Proc. of 4th Int. Workshop on Ontology-based Techniques, ODBIS 2008

  36. Cruz. I., Xiao, H., Hsu, F.: Peer-to-peer semantic integration of XML and RDF data sources. In Proc. of 3rd AP2PC Workshop 2004

  37. Cyganiak, R.: A relational algebra for SPARQL. Hewlett-Packard Laboratories Technical Report. 2005. http://www.hpl.hp.com/techreports/2005/HPL-2005-170.html

  38. Das, S., Sundara, S., Cyganiak, R. (eds.): R2RML: RDB to RDF Mapping Language. W3C Recommendation, 27 Sept. 2012, http://www.w3.org/TR/r2rml/

  39. DeHaan, D., Toman, D., Consens, M., Özsu, T.: A comprehensive XQuery to SQL translation using dynamic interval encoding. In Proc. of SIGMOD 2003

  40. Deursen, D.V., Poppe, C., Martens, G., Mannens, E., Walle, R.V.d.: XML to RDF conversion: a generic approach. In: Proc. of AXMEDI S 2008

  41. Deutsch, A., Tannen, V.: Reformulation of XML queries and constraints. In Proc. of ICDT 2003

  42. Droop, M., Flarer, M., Groppe, J., Groppe, S., Linnemann, V., Pinggera, J., Santner, F., Schier, M., Schopf, F., Staffler, H., Zugal, S.: Bringing the XML and semantic web worlds closer: transforming XML into RDF and embedding XPath into SPARQL. In Proc. of ICEIS 2008

  43. Droop, M., Flarer, M., Groppe, J., Groppe, S., Linnemann, V., Pinggera, J., Santner, F., Schier, M., Schöpf, F., Staffler, H., Zugal, S.: Embedding XPATH queries into SPARQL queries. In Proc. of the 10th International Conference on Enterprise Information Systems (ICEIS) 2008

  44. Droop, M., Flarer, M., Groppe, J., Groppe, S., Linnemann, V., Pinggera, J., Santner, F., Schier, M., Schopf, F., Staffler, H., Zugal, S.: Translating XPath queries into SPARQL queries. In Proc. of OTM Workshops 2007

  45. Dublin Core Metadata Element Set, Version 1.1. Dublin Core Metadata Initiative, http://dublincore.org/documents/dces/

  46. Elliott, B., Cheng, E., Thomas-Ogbuji, C., Ozsoyoglu, Z.M.: A complete translation from SPARQL into efficient SQL. In Proc. of International Database Engineering & Applications Symposium IDEAS 2009

  47. Encoded Archival Description (EAD). Library of Congress (LOC), http://www.loc.gov/ead/

  48. Fagin R., Kolaitis P., Miller R., Popa L.: Data exchange: semantics and query answering. Theor. Comput. Sci. (TCS) 336(1), (2005)

  49. Fallside, D. (ed.): XML Schema Part 0: Primer. W3C Rec., Oct. 2004. http://www.w3.org/TR/xmlschema-0/

  50. Farrell, J., Lausen, H. (eds.): Semantic annotations for WSDL and XML schema. W3C Rec., W3C, Aug. 2007. http://www.w3.org/TR/sawsdl/

  51. Ferdinand, M., Zirpins, C., Trastour, D.: Lifting XML Schema to OWL. In Proc. of ICWE 2004

  52. Gao, S., Sperberg-McQueen, C.M., Thompson, H.S. (eds.): W3C XML Schema Definition Language (XSD) 1.1 Part 1: Structures. W3C Rec., Apr. 2012, http://www.w3.org/TR/xmlschema11-1/

  53. Garcia, R., Celma, O.: Semantic integration and retrieval of multimedia meta- data. In Proc. of Fifth International Workshop on Knowledge Markup and Semantic Annotation 2005

  54. Glimm, B., Ogbuji, C. (eds.): SPARQL 1.1 Entailment Regimes. W3C Working Draft, Jan. 2012, http://www.w3.org/TR/sparql11-entailment/

  55. Groppe, S., Groppe, J., Linnemann, V., et al.: Embedding SPARQL into XQuery/XSLT. In Proc. of ACM SAC 2008

  56. Gruber, T.R.: A translation approach to portable ontology specifications. Knowl Acquis 5(2), 199–220 (1993)

    Article  Google Scholar 

  57. Halevy, A., Ives, Z., Mork, P., Tatarinov, I.: Piazza: data management infrastructure for semantic web applications. In Proc. WWW 2003

  58. Halevy, A.Y., Ives, Z.G., Suciu, D., Tatarinov, I.: Schema mediation in peer data management systems. In Proc. of ICDE, 2003

  59. Harris, S., Seaborne, A. (eds.): SPARQL 1.1 Query Language. W3C Working Draft, July 2012, http://www.w3.org/TR/sparql11-query/

  60. IEEE WG-12: IEEE Standard for Learning Object Metadata (LOM): 1484.12.1-2002, http://ltsc.ieee.org/wg12/

  61. Ives, Z., Halevy, A., Mork, P., Tatarinov, I.: Piazza: mediation and integration infrastructure for Semantic Web data. J. Web Sem. 1(2), 155–175 (2004)

    Google Scholar 

  62. Kappel, G., Kapsammer, E., Retschitzegger, W.: Integrating XML and relational database systems. WWW J. 7(4), (2004

  63. Klein, M.: Interpreting XML via an RDF schema. In Proc. of Second International Workshop on Electronicy Business Hubs 2002

  64. Koffina, I., Serfiotis, G., Christophides, V., Tannen, V.: Mediating RDF/S queries to relational and XML sources. Int. J. Semantic Web Inf. Syst. 2(4), 68–91 (2006)

    Article  Google Scholar 

  65. Krishnamoorthy, K., Kumar, R., Dua, S.R.: Converting SPARQL queries to SQL queries. Microsoft Corporation, U.S. Patent 7818352, 2010

  66. Krishnamurthy, R., Kaushik, R., Naughton, J.: XML-SQL query translation literature: the state of the art and open problems. In Proc. of Xsym 2003

  67. Lehti, P., Fankhauser, P.: XML data integration with OWL: experiences and challenges. In Proc. of 3rd International Symposium on Applications and the Internet 2004

  68. Lenzerini, M.: Data integration: a theoretical perspective. In Proc. of PODS 2002

  69. Liu, C., Vincent, M., Liu, J.: Constraint preserving transformation from relational schema to XML Schema. WWW J. 9(1), 2006

  70. Ma, L., Wang, C., Lu, J., et al.: Effective and Efficient Semantic Web Data Management over DB2. In Proc. of SIGMOD 2008

  71. Makris K., Bikakis N., Gioldasis N., Christodoulakis S.: SPARQL-RW: transparent query access over mapped RDF data sources. In Proc. of 15th International Conference on Extending Database Technology (EDBT) 2012

  72. Makris, K., Bikakis, N., Gioldasis, N., Tsinaraki, C., Christodoulakis, S.: Towards a Mediator based on OWL and SPARQL. In Proc. of 2nd World Summit on the Knowledge Society 2009

  73. Makris, K., Gioldasis, N., Bikakis, N., Christodoulakis, S.: Ontology mapping and SPARQL Rewriting for Querying Federated RDF data sources. In Proc. of ODBASE 2010

  74. Makris, K., Gioldasis, N., Bikakis, N., Christodoulakis, S.: SPARQL Rewriting for Query Mediation over mapped ontologies. Technical Report 2010. Available at: http://www.music.tuc.gr/reports/SPARQLREWRITING.PDF

  75. Malhotra, A. et al. (eds.): XQuery 1.0 and XPath 2.0 Functions and Operators. W3C Rec., Dec. 2010. http://www.w3.org/TR/xpath-functions/

  76. Manola, F., Milles E. (eds.): RDF Primer. W3C Rec., Feb. 2004. http://www.w3.org/TR/rdf-primer

  77. MARC 21 concise format for bibliographic metadata. Library of Congress (LOC), http://www.loc.gov/marc/bibliographic/ecbdhome.html

  78. McGuinness, D.L., van Harmelen, F. (eds.): OWL Web Ontology Language: overview. W3C Rec., Feb. 2004. http://www.w3.org/TR/owl-features

  79. Metadata Authority Description Standard (MADS). Library of Congress (LOC), http://www.loc.gov/standards/mads/

  80. Metadata Encoding and Transmission Standard (METS). Library of Congress (LOC), http://www.loc.gov/standards/mets/

  81. Metadata Object Description Schema (MODS). Library of Congress (LOC), http://www.loc.gov/standards/mods/

  82. Miller, R.J., Haas, L.M., Hernández, M.A.: Schema mapping as query discovery. In Proc. of VLDB 2000

  83. Motik, B., Schneider, P.F.P., Parsia, B. (eds.): OWL 2 Web Ontology Language: Structural Specification and Functional-Style Syntax. W3C Rec., Oct. 2009, http://www.w3.org/TR/owl2-syntax/

  84. MPEG-21 multimedia framework. ISO 21000–17:2003–2007. International Standardization Organization (ISO)

  85. MPEG-7 multimedia content description interface. ISO 15938-1-11:2002–2007. International Standardization Organization (ISO)

  86. NISO Metadata for Images in XML (MIX). Library of Congress (LOC), http://www.loc.gov/standards/mix/

  87. Perez, J., Arenas, M., Gutierrez, C.: Semantics and complexity of SPARQL. ACM Trans. Database Syst. (TODS) 34(3), (2009)

  88. Picalausa, F., Vansummeren, S.: What are real SPARQL queries like? In Proc. of Int. Workshop on Semantic Web Inf. Mngmt. 2011

  89. Polleres, A.: From SPARQL to Rules (and back). In Proc. of WWW 2007

  90. Popa, L., Velegrakis, Y., Miller, R.J., Hernández, M.A., Fagin, R.: Translating Web Data. In Proc. of VLDB 2000

  91. Prud’hommeaux, E., Seaborne, A. (eds.): SPARQL query language for RDF. W3C Rec., Jan. 2008. http://www.w3.org/TR/rdf-sparql-query/

  92. “RDB2RDF” - W3C Working Group. http://www.w3.org/2001/sw/rdb2rdf/

  93. Reif, G., Jazayeri, M., Gall, H.: Towards semantic Web Engineering: WEESA-mapping XML schema to ontologies. In Proc. of WWW2004 Workshop on Application Design, Development, and Implementation Issues in the Semantic Web. 2004

  94. Robie, J., Chamberlin, D., et al. (eds.): XQuery Update Facility 1.0. W3C Rec., March 2011. http://www.w3.org/TR/xquery-update-10/

  95. Rodrigues, T., Rosa, P., Cardoso, J.: Mapping XML to exiting OWL ontologies. International Conference WWW/Internet 2006

  96. Rodrigues, T., Rosa, P., Cardoso, J.: Moving from syntactic to semantic organizations using JXML2OWL. Comput. Ind. 59(8): 808–819 (2008)

    Google Scholar 

  97. Rodriguez-Muro M., Hardi J., Calvanese D.: Quest: Effcient SPARQL-to-SQL for RDF and OWL. In Proc. of ISWC 2012

  98. Sahoo, S., Halb, W., Hellmann, S., Idehen, K., Thibodeau, T., Auer, S., Sequeda, J., Ezzat, A.: A survey of current approaches for mapping of relational databases to RDF. Technical Report. RDB2RDF W3C Working Group, 2009

  99. Schenk, S., Gearon, P., Passant, A. (eds.): SPARQL 1.1 Update. W3C Rec., March 2013. http://www.w3.org/TR/sparql11-update/

  100. Schmidt, M., Hornung, T, Lausen, G., Pinkel, C.: SP2Bench: a SPARQL performance benchmark. In Proc. of ICDE 2009

  101. Sequeda, J., Depena, R., Miranker, D.: Ultrawrap: using SQL views for RDB2RDF. In Proc. of ISWC 2009

  102. Sharable Content Reference Model (SCORM). Advanced Distributed Learning Initiative (ADL), http://www.adlnet.gov/scorm/index.aspx

  103. Siméon, J., Chamberlin, D. (eds.): XQuery 1.0: an XML Query Language. W3C Rec., Jan. 2007. http://www.w3.org/TR/xquery/

  104. Spanos, D.E., Stavrou, P., Mitrou, N.: Bringing relational databases into the Semantic Web: a survey. Semantic Web J. 3(2), 169–209 (2012)

    Google Scholar 

  105. Stavrakantonakis, I., Tsinaraki, C., Bikakis, N., Gioldasis, N., Christodoulakis, S.: SPARQL2XQuery 2.0: Supporting Semantic-based Queries over XML Data. In the Proc. of SMAP 2010

  106. Tatarinov, I., Halevy, A.: Efficient query reformulation in peer data management systems. In Proc. of SIGMOD 2004

  107. Technical Metadata for Text (TextMD). Library of Congress (LOC), http://www.loc.gov/standards/textMD/

  108. Text Encoding and Interchange (TEI). TEI Consortium, http://www.tei-c.org /

  109. Thiran, P., Estievenart, F., Hainaut, J.L., Houben, G.J.: A generic framework for extracting XML data from legacy databases. J. Web Eng. (JWE) 4(3), 205–223 (2005)

    Google Scholar 

  110. Thuy, P.T.T., Lee, Y.K., Lee, S., Jeong, B.S.: Exploiting XML schema for interpreting XML documents as RDF. In Proc. of International Conference on Services Computing 2008

  111. Thuy, P.T.T., Lee, Y.K., Lee, S., Jeong, B.S.: Transforming valid XML documents into RDF via RDF schema. In Proc. of International Conference on Next Generation Web Services Practices 2007

  112. Thuy, P.T.T, Lee, Y.K., Lee, S.: DTD2OWL: automatic transforming XML documents into OWL ontology. In Proc. of Int. Conf. Interaction Sciences 2009

  113. Tsinaraki, C., Christodoulakis, S.: Interoperability of XML schema applications with OWL domain knowledge and semantic web tools. In Proc. of ODBASE 2007

  114. Tsinaraki, C., Christodoulakis, S.: Support for interoperability between OWL based and XML Schema based applications. In the Proc. of DELOS Conference II, 2007

  115. Unbehauen J., Stadler C., Auer S.: Accessing Relational Data on the Web with SparqlMap. JIST 2012

  116. VRA Core 4.0. Visual Resources Association’s (VRA) Data Standards Committee, http://www.vraweb.org/projects/vracore4/index.html

  117. Wache H., Voegele T., Visser U., Stuckenschmidt H., Schuster G., Neumann H., Huebner S.: Ontology-based integration of information - A survey of existing approaches. In Proc. of workshop on Ontologies and Information Sharing 2001

  118. Xiao, H., Cruz, I.: RDF-based metadata management in peer-to- peer systems. In Proc of 2nd IST MMGPS Workshop 2004

  119. Xiao, H., Cruz. I.: Integrating and exchanging XML data using ontologies. J Data Semantics VI (2006)

  120. Yu, C., Popa, L.: Constraint-based XML Query Rewriting for Data Integration. In Proc. of SIGMOD 2004

  121. Zemke, F.: Converting SPARQL to SQL. Technical Report, October 2006. http://lists.w3.org/Archives/Public/public-rdf-dawg/2006OctDec/att-0058/sparql-to-sql.pdf

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Nikos Bikakis.

Additional information

Part of this work was done while Nikos Bikakis and Ioannis Stavrakantonakis were members of MUSIC/TUC Lab at Technical University of Crete.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Bikakis, N., Tsinaraki, C., Stavrakantonakis, I. et al. The SPARQL2XQuery interoperability framework. World Wide Web 18, 403–490 (2015). https://doi.org/10.1007/s11280-013-0257-x

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11280-013-0257-x

Keywords

Navigation