Abstract
Schema mapping management is an important research area in data transformation, integration, and cleaning systems. The reasons for its success can be found in the declarative nature of its building block (thus enabling clean semantics and easy to use design tools) paired with the efficiency and modularity in the deployment step. In this chapter we cover the evolution of schema-mappings through what we identify as three main ages. We start presenting the foundations of schema mapping tools and the first tools aimed at translating data from a source to a target schema in the first, heroic age. We then discuss the silver age, when schema mapping tools have grown their way into complex systems and have been translated into both commercial and open-source tools. Finally, we show how recent results in schema-mapping are stimulating a third, golden age, with novel research opportunities and a new generation of systems capable of dealing with a significantly larger class of real-life applications.
Keywords
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsNotes
- 1.
Given the importance of XQuery engines in practice, we will treat them as their relational counterpart, even if the two platforms cannot be compared in terms of performance.
- 2.
Available at http://www.db.unibas.it/projects/spicy/ and http://sourceforge.net/projects/openii/, respectively.
- 3.
- 4.
- 5.
References
B. Alexe, M.A. Hernández, L. Popa, W.C. Tan, MapMerge: correlating independent schema mappings. PVLDB 3(1), 81–92 (2010)
B. Alexe, W. Tan, Y. Velegrakis, Comparing and evaluating mapping systems with STBenchmark. PVLDB 1(2), 1468–1471 (2008)
S. Amano, C. David, L. Libkin, F. Murlak, XML schema mappings: data exchange and metadata management. J. ACM 61(2), 12:1–12:48 (2014)
M. Arenas, L. Libkin, XML data exchange: consistency and query answering. J. ACM 55(2), 1–72 (2008)
M. Arenas, J. Pérez, J. Reutter, C. Riveros, Query language-based inverses of schema mappings: semantics, computation, and closure properties. VLDB J. 21(6), 823–842 (2012)
P.C. Arocena, B. Glavic, R. Ciucanu, R.J. Miller, The ibench integration metadata generator. PVLDB 9(3), 108–119 (2015)
C. Beeri, M. Vardi, A proof procedure for data dependencies. J. ACM 31(4), 718–741 (1984)
M. Benedikt, G. Konstantinidis, G. Mecca, B. Motik, P. Papotti, D. Santoro, E. Tsamoura, Benchmarking the chase, in PODS (2017)
P.A. Bernstein, S. Melnik, Model management 2.0: manipulating richer mappings, in SIGMOD (2007), pp. 1–12
J. Bleiholder, F. Naumann, Data fusion. ACM Comp. Surv. 41(1), 1–41 (2008)
A. Bonifati, I. Ileana, M. Linardi, Functional dependencies unleashed for scalable data exchange, in SSDBM (2016)
A. Bonifati, G. Mecca, A. Pappalardo, S. Raunich, G. Summa, Schema mapping verification: the spicy way, in EDBT (2008), pp. 85–96
R. Chirkova, L. Libkin, J. Reutter, Tractable XML data exchange via relations, in CIKM (2011)
S. Dessloch, M.A. Hernandez, R. Wisnesky, A. Radwan, J. Zhou, Orchid: integrating schema mapping and ETL, in ICDE (2008), pp. 1307–1316
R. Fagin, P. Kolaitis, R. Miller, L. Popa, Data exchange: semantics and query answering. TCS 336(1), 89–124 (2005)
R. Fagin, P. Kolaitis, A. Nash, L. Popa, Towards a theory of schema-mapping optimization, in ACM PODS (2008), pp. 33–42
R. Fagin, P. Kolaitis, L. Popa, Data exchange: getting to the core. ACM TODS 30(1), 174–210 (2005)
R. Fagin, P. Kolaitis, L. Popa, W. Tan, Composing schema mappings: second-order dependencies to the rescue. ACM TODS 30(4), 994–1055 (2005)
R. Fagin, P.G. Kolaitis, L. Popa, W.C. Tan, Schema matching and mapping, chapter Schema Mapping Evolution Through Composition and Inversion (Springer, Berlin, 2011), pp. 191–222
W. Fan, F. Geerts, Foundations of Data Quality Management (Morgan & Claypool Publishers, San Rafael, 2012)
A. Fuxman, M.A. Hernández, C.T. Howard, R.J. Miller, P. Papotti, L. Popa, Nested mappings: schema mapping reloaded, in VLDB (2006), pp. 67–78
H. Galhardas, D. Florescu, D. Shasha, E. Simon, C.-A. Saita, Declarative data cleaning: language, model, and algorithms, in VLDB (2001), pp. 371–380
F. Geerts, G. Mecca, P. Papotti, D. Santoro, The LLUNATIC data-cleaning framework. PVLDB 6(9), 625–636 (2013)
F. Geerts, G. Mecca, P. Papotti, D. Santoro, Mapping and cleaning, in ICDE (2014), pp. 232–243
F. Geerts, G. Mecca, P. Papotti, D. Santoro, That’s all folks! LLUNATIC goes open source. PVLDB 7(13), 1565–1568 (2014)
G. Gottlob, A. Nash, Efficient core computation in data exchange. J. ACM 55(2), 1–49 (2008)
L.M. Haas, M.A. Hernández, H. Ho, L. Popa, M. Roth, Clio grows up: from research prototype to industrial tool, in SIGMOD (2005), pp. 805–810
M.A. Hernández, P. Papotti, W.C. Tan, Data exchange with data-metadata translations. PVLDB 1(1), 260–273 (2008)
B. Kimelfeld, E. Livshits, L. Peterfreund, Detecting ambiguity in prioritized database repairing, in ICDT (2017)
B. Marnette, G. Mecca, P. Papotti, Scalable data exchange with functional dependencies. PVLDB 3(1), 105–116 (2010)
B. Marnette, G. Mecca, P. Papotti, S. Raunich, D. Santoro, ++Spicy: an opensource tool for second-generation schema mapping and data exchange. PVLDB 4(11), 1438–1441 (2011)
G. Mecca, P. Papotti, S. Raunich, Core schema mappings, in SIGMOD (2009), pp. 655–668
G. Mecca, P. Papotti, S. Raunich, D. Santoro, What is the IQ of your data transformation system? in CIKM (2012), pp. 872–881
G. Mecca, G. Rull, D. Santoro, E. Teniente, Semantic-based mappings, in Proceedings of the Conceptual Modeling - 32th International Conference, ER 2013, Hong-Kong, China, 11–13 November, 2013 (2013), pp. 255–269
R.J. Miller, L.M. Haas, M.A. Hernandez, Schema mapping as query discovery, in VLDB (2000), pp. 77–99
R. Pichler, V. Savenkov, DEMo: data exchange modeling tool. PVLDB 2(2), 1606–1609 (2009)
L. Popa, Y. Velegrakis, R.J. Miller, M.A. Hernandez, R. Fagin, Translating web data, in VLDB (2002), pp. 598–609
A. Roth, M.F. Korth, A. Silberschatz, Extended Algebra and calculus for nested relational databases. ACM TODS 13, 389–417 (1988)
L. Seligman, P. Mork, A. Halevy, K. Smith, M.J. Carey, K. Chen, C. Wolf, J. Madhavan, A. Kannan, D. Burdick, OpenII: an open source information integration toolkit, in SIGMOD (2010), pp. 1057–1060
N.C. Shu, B.C. Housel, R.W. Taylor, S.P. Ghosh, V.Y. Lum, EXPRESS: a data EXtraction, processing and REstructuring system. ACM TODS 2(2), 134–174 (1977)
B. ten Cate, L. Chiticariu, P. Kolaitis, W.C. Tan, Laconic schema mappings: computing core universal solutions by means of SQL queries. PVLDB 2(1), 1006–1017 (2009)
R. Wisnesky, M.A. Hernández, L. Popa, Mapping polymorphism, in ICDT (2010), pp. 196–208
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer International Publishing AG
About this chapter
Cite this chapter
Mecca, G., Papotti, P., Santoro, D. (2018). Schema Mappings: From Data Translation to Data Cleaning. In: Flesca, S., Greco, S., Masciari, E., Saccà, D. (eds) A Comprehensive Guide Through the Italian Database Research Over the Last 25 Years. Studies in Big Data, vol 31. Springer, Cham. https://doi.org/10.1007/978-3-319-61893-7_12
Download citation
DOI: https://doi.org/10.1007/978-3-319-61893-7_12
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-61892-0
Online ISBN: 978-3-319-61893-7
eBook Packages: EngineeringEngineering (R0)