ABSTRACT
Model management is a generic approach to solving problems of data programmability where precisely engineered mappings are required. Applications include data warehousing, e-commerce, object-to-relational wrappers, enterprise information integration, database portals, and report generators. The goal is to develop a model management engine that can support tools for all of these applications. The engine supports operations to match schemas, compose mappings, diff schemas, merge schemas, translate schemas into different data models, and generate data transformations from mappings.
Much has been learned about model management since it was proposed seven years ago. This leads us to a revised vision that differs from the original in two main respects: the operations must handle more expressive mappings, and the runtime that executes mappings should be added as an important model management component. We review what has been learned from recent experience, explain the revised model management vision based on that experience, and identify the research problems that the revised vision opens up.
- S. Abiteboul and O. M. Duschka: Complexity of Answering Queries Using Materialized Views. PODS 1998: 254--263. Google ScholarDigital Library
- S. Abiteboul et al.: The Lowell Database Research Self-Assessment. Commun. ACM 48(5): 111--118 (2005). Google ScholarDigital Library
- S. Abiteboul, R. Hull, V. Vianu: Foundations of Databases. Addison-Wesley, 1995.Google ScholarDigital Library
- A. Adya, J. A. Blakeley, S. Melnik, S. Muralidhar, and the ADO.NET Team: Anatomy of the ADO.NET Entity Framework, SIGMOD 2007. Google ScholarDigital Library
- Altova, http://www.altova.com/Google Scholar
- P. Atzeni and R. Torlone: Management of Multiple Models in an Extensible Database Design Tool. EDBT 1996, 79--95. Google ScholarDigital Library
- P. Atzeni, P. Cappellari and P. Bernstein: ModelGen: Model Independent Schema Translation. EDBT 2006, 368--385. Google ScholarDigital Library
- F. Bancilhon and N. Spyratos: Update Semantics of Relational Views. ACM TODS 6(4): 557--575 (1981). Google ScholarDigital Library
- BEA Aqualogic User Interaction, http://www.bea.comGoogle Scholar
- P. A. Bernstein: Applying Model Management to Classical Meta Data Problems. CIDR 2003.Google Scholar
- P. A. Bernstein, T. Bergstraesser, J. Carlson, S. Pal, P. Sanders, D. Shutt: Microsoft Repository Version 2 and the Open Information Model. Inf. Syst. 24(2): 71--98 (1999). Google ScholarDigital Library
- P. A. Bernstein, M. L. Brodie, S. Ceri, D. J. DeWitt, M. J. Franklin, H. Garcia-Molina, J. Gray, G. Held, J. M. Hellerstein, H. V. Jagadish, M. Lesk, D. Maier, J. F. Naughton, H. Pirahesh, M. Stonebraker, J. D. Ullman: The Asilomar Report on Database Research. SIGMOD Record (27)4:74--80 (1998). Google ScholarDigital Library
- P. A. Bernstein, Dayal, U., DeWitt, D. J., Gawlick, D., Gray, J., Jarke, M., Lindsay, B. G., Lockemann, P. C., Maier, D., Neuhold, E. J., Reuter, A., Rowe, L. A., Schek, H. J., Schmidt, J. W., Schrefl, M., and Stonebraker: M. Future Directions in DBMS research. The Laguna Beach Participants. SIGMOD Record (18)1: 17--26 (1989).Google Scholar
- P. A. Bernstein, F. Giunchiglia, A. Kementsietsidis, J. Mylopoulos, L. Serafini, I. Zaihrayeu: Data Management for Peer-to-Peer Computing : A Vision. WebDB 2002: 89--94.Google Scholar
- P. A. Bernstein, T. J. Green, S. Melnik, A. Nash: Implementing Mapping Composition. VLDB 2006: 55--66. Google ScholarDigital Library
- P. A. Bernstein, L. M. Haas, M. Jarke, E. Rahm, G. Wiederhold: Panel: Is Generic Metadata Management Feasible? VLDB 2000: 660--662. Google ScholarDigital Library
- P. A. Bernstein, A. Y. Halevy, R. Pottinger: A Vision of Management of Complex Models. SIGMOD Record 29(4): 55--63 (2000). Google ScholarDigital Library
- P. A. Bernstein, S. Melnik, J. E. Churchill: Incremental Schema Matching. VLDB 2006: 1167--1170. Google ScholarDigital Library
- P. A. Bernstein, S. Melnik, and P. Mork: Interactive Schema Translation with Instance-Level Mappings. VLDB 2005: 1283--1286. Google ScholarDigital Library
- P. A. Bernstein and E. Rahm: Data Warehouse Scenarios for Model Management. ER 2000: 1--15.Google Scholar
- J. Biskup and B. Convent: A Formal View Integration Method. SIGMOD 1986: 398--407.Google Scholar
- J. A. Blakeley, D. Campbell, S. Muralidhar, A. Nori: The ADO.NET Entity Framework: Making the Conceptual Level Real. SIGMOD Record (35)4: 552--565 (2006). Google ScholarDigital Library
- M. S. Bloor and J. Owen: Product Data Exchange. CRC Press, 1995.Google Scholar
- P. Boannon, W. Fan, M. Flaster, P. Narayan: Information Preserving XML Schema Embedding. VLDB 2005: 85--96. Google ScholarDigital Library
- P. Buneman, S. B. Davidson, and A. Kosky: Theoretical Aspects of Schema Merging. EDBT 1992: 152--167. Google ScholarDigital Library
- M. J. Carey: Data delivery in a Service-Oriented World: the BEA AquaLogic Data Services Platform. SIGMOD 2006: 695--705. Google ScholarDigital Library
- T. Catarci and M. Lenzerini: Representing and Using Interschema Knowledge in Cooperative Information Systems. Int. J. Cooperative Inf. Syst. 2(4): 375--398 (1993).Google ScholarCross Ref
- R.G.G. Cattell and D.K. Barry (editors) et al.: The Object Data Standard: ODMG 3.0. Morgan Kaufmann, 2000.Google ScholarDigital Library
- S. Chaudhuri and U. Dayal: An Overview of Data Warehousing and OLAP Technology. SIGMOD Record 26(1): 65--74 (1997). Google ScholarDigital Library
- L. Chiticariu and W. -C. Tan: Debugging Schema Mappings with Routes. VLDB 2006: 79--90. Google ScholarDigital Library
- G.P. Copeland and D. Maier: Making Smalltalk a Database System. SIGMOD 1984: 316--325.Google Scholar
- Crystal Reports, http://www.businessobjects.com/ products/reporting/crystalreports/default.asp.Google Scholar
- S.B. Davidson, G. Christian Overton, V. Tannen, L. Wong: BioKleisli: A Digital Library for Biomedical Researchers. Int. J. on Digital Libraries 1(1): 36--53 (1997).Google ScholarCross Ref
- U. Dayal: Processing Queries Over Generalization Hierarchies in a Multidatabase System. VLDB 1983: 342--353. Google ScholarDigital Library
- L. Donelson, P. Tarczy-Hornoch, P. Mork, C. Dolan, JA Mitchell, M. Barrier, H. Mei: The BioMediator System as a Data Integration Tool to Answer Diverse Biologic Queries. Medinfo: 768--72, 2003.Google Scholar
- ETL Tool Survey 2006-2007, http://www.etltool.com/Google Scholar
- R. Fagin: Inverting Schema Mappings. PODS 2006: 50--59. Google ScholarDigital Library
- R. Fagin, P. G. Kolaitis, R. J. Miller, and L. Popa: Data Exchange: Semantics and Query Answering. Theor. Comput. Sci. 336(1): 89--124 (2005). Google ScholarDigital Library
- R. Fagin, P. G. Kolaitis, and L. Popa: Data exchange: Getting to the Core. ACM TODS 30(1): 174--210 (2005). Google ScholarDigital Library
- R. Fagin, P. G. Kolaitis, and L. Popa, W. C. Tan: Composing Schema Mappings: Second-order Dependencies to the Rescue. ACM TODS 30(4): 994--1055 (2005). Google ScholarDigital Library
- R. Fagin, P. G. Kolaitis, L. Popa, and W. C. Tan: Quasi-inverses of Schema Mappings. PODS 2007. Google ScholarDigital Library
- S. M. Falconer and M. Storey: Cognitive Support for Human-Guided Mapping Systems. Tech. Report DCS-318-IR, 2007, Univ. of Victoria , http://www.cs.uvic.ca/~seanf/ files/cog_support_mapping_systems.pdfGoogle Scholar
- M. J. Franklin, A. Y. Halevy, and D. Maier: From Databases to Dataspaces: A New Abstraction for Information Management. SIGMOD Record 34(4): 27--33 (2005). Google ScholarDigital Library
- M. Friedman, A. Y. Levy, and T. D. Millstein: Navigational Plans For Data Integration. AAAI/IAAI 1999: 67--73. Google ScholarDigital Library
- A. Fuxman, M.A. Hernàndez, C. T. H. Ho, R. J. Miller, P. Papotti, and L. Popa: Nested Mappings: Schema Mapping Reloaded. VLDB 2006: 67--78. Google ScholarDigital Library
- A. Gal: Managing Uncertainty in Schema Matching with Top-K Schema Mappings. J. Data Semantics VI: 90--114, Springer LNCS Vol. 4090/2006. Google ScholarDigital Library
- M. Gubanov, P. A. Bernstein: Structural Text Search and Comparison using Automatically Extracted Schema. WebDB 2006.Google Scholar
- L. M. Haas: Beauty and the Beast: The Theory and Practice of Information Integration. ICDT 2007: 28--43. Google ScholarDigital Library
- L. M. Haas, M. A. Hernàndez, H. Ho, L. Popa, and M. Roth: Clio Grows Up: From Research Prototype to Industrial Tool. SIGMOD 2005: 805--810. Google ScholarDigital Library
- A. Y. Halevy: Answering Queries Using Views: A Survey. VLDB J. 10(4): 270--294 (2001). Google ScholarDigital Library
- A. Y. Halevy, N. Ashish, D. Bitton, M. J. Carey, D. Draper, J. Pollock, A. Rosenthal, and Vishal Sikka: Enterprise Information Integration: Successes, Challenges and Controversies. SIGMOD 2005: 778--787. Google ScholarDigital Library
- A. Y. Halevy, M. J. Franklin, and D. Maier: Principles of Dataspace Systems. PODS 2006: 1--9. Google ScholarDigital Library
- A. Y. Halevy, Z. G. Ives, J. Madhavan, P. Mork, D. Suciu, and I. Tatarinov: The Piazza Peer Data Management System. IEEE Trans. Knowl. Data Eng. 16(7): 787--798 (2004). Google ScholarDigital Library
- Hibernate, http://www.hibernate.orgGoogle Scholar
- R. Hull: Relative Information Capacity of Simple Relational Database Schemata. SIAM J. Comput. 15(3): 856--886 (1986). Google ScholarDigital Library
- IBM FileNet Forms Manager, http://www.filenet.com/English/Products/Forms_Manager/.Google Scholar
- IBM Rational Data Architect, http://www--306.ibm.com/software/data/integration/rda/Google Scholar
- C. Keene: Data Services for Next-Generation SOAs. SOA WebServices Journal, 4(12), 2004. http://webservices.sys-con.com/read/47283.htmGoogle Scholar
- R. Kimball and J. Caserta. The Data Warehouse ETL Toolkit, Wiley and Sons, 2004.Google Scholar
- P. G. Kolaitis: Schema Mappings, Data Exchange, and Metadata Management. PODS 2005: 61--75. Google ScholarDigital Library
- J. Lechtenbörger and G. Vossen: On the computation of Relational View Complements. ACM TODS 28(2): 175--208 (2003). Google ScholarDigital Library
- F. Legler and F. Naumann: A Classification of Schema Mappings and Analysis of Mapping Tools. BTW 2007: 449--464.Google Scholar
- M. Lenzerini: Data Integration: A Theoretical Perspective. PODS 2002: 233--246. Google ScholarDigital Library
- J. Lin and A. O. Mendelzon: Merging Databases Under Constraints. Int. J. Cooperative Inf. Syst. 7(1): 55--76 (1998).Google ScholarCross Ref
- P. McBrien and A. Poulovassilis: A Uniform Approach to Inter-model Transformations. CAiSE 1999: 333--348. Google ScholarDigital Library
- S. Melnik: Generic Model Management: Concepts and Algorithms, Springer LNCS 2967, 2004.Google Scholar
- S. Melnik, P. A. Bernstein, A. Y. Halevy, and E. Rahm: Supporting Executable Mappings in Model Management. SIGMOD 2005: 167--178. Google ScholarDigital Library
- S. Melnik, P. A. Bernstein, A. Y. Halevy, and E. Rahm: A Semantics for Model Management Operators. MSR-TR-2004-59, http://research.microsoft.com, June 2004. An early but somewhat extended version of {67}.Google Scholar
- S. Melnik, E. Rahm, P. A. Bernstein: Rondo: A Programming Platform for Generic Model Management. SIGMOD 2003: 193--204. Google ScholarDigital Library
- S. Melnik, A. Adya and P. A. Bernstein, Compiling Mappings to Bridge Applications and Databases, SIGMOD 2007. Google ScholarDigital Library
- Microsoft BizTalk, http://www.microsoft.com/biztalk/Google Scholar
- Microsoft Office InfoPath, http://office.microsoft.com/en-us/infopathGoogle Scholar
- Microsoft Sharepoint Server, http://www.microsoft.com/sharepointGoogle Scholar
- Microsoft SQL Server Reporting Services, http://www.microsoft.com/sql/technologies/reporting/Google Scholar
- R. J. Miller, L. M. Haas, and M. A. Hernàndez: Schema Mapping as Query Discovery. VLDB 2000: 77--88. Google ScholarDigital Library
- R. J. Miller, M. A. Hernàndez, L. M. Haas, L-L. Yan, H. Ho, R. Fagin, L. Popa: The Clio Project: Managing Heterogeneity. SIGMOD Record 30(1): 78--83 (2001). Google ScholarDigital Library
- MSDN Library: The ADO.NET Entity Framework Overview. June 2006. http://msdn2.microsoft.com/en-us/library/aa697427(vs.80).aspxGoogle Scholar
- A. Nash, P. A. Bernstein, and S. Melnik: Composition of Mappings Given by Embedded Dependencies. PODS 2005: 172--183. Extended version to appear in ACM TODS. Google ScholarDigital Library
- Oracle Toplink, http://www.oracle.com/technology/products/ias/toplink/index.htmlGoogle Scholar
- OWL Web Ontology Language Reference, http://www.w3.org/TR/owl-ref/Google Scholar
- P. Papotti and R. Torlone: An Approach to Heterogeneous Data Translation based on XML Conversion. CAiSE Workshops (1) 2004: 7--19.Google Scholar
- R. Pottinger and P. A. Bernstein: Merging Models Based on Given Correspondences. VLDB 2003: 826--873.Google Scholar
- A. Poulovassilis and P. McBrien: A General Formal Framework for Schema Transformation. Data Knowl. Eng. 28(1): 47--71 (1998). Google ScholarDigital Library
- Query Tools: Products, http://www.bitpipe.com/plist/term/Query-Tools.htmlGoogle Scholar
- Resource Description Framework, http://www.w3.org/RDF/Google Scholar
- E. Rahm and P. A. Bernstein: A Survey of Approaches to Automatic Schema Matching. VLDB J. 10(4):334--350 (2001). Google ScholarDigital Library
- E. Rahm and P.A. Bernstein: An On-line Bibliography on Schema Evolution. SIGMOD Record 35(4):30--31, 2006. The full bibliography is at http://se-pubs.dbs.uni-leipzig.de/. Google ScholarDigital Library
- G. Robertson, M. Czerwinski, and J. Churchill: Visualization of Mappings Between Schemas. CHI 2005: 431--439. Google ScholarDigital Library
- M. Roth, M.A. Hernandez, P. Coulthard, L. Yan, L. Popa, H.C. T. Ho, and C.C. Salter: XML Mapping Technology: Making Connections in an XML-centric World. IBM Sys. J. (45,2), 389--409 (2006). Google ScholarDigital Library
- SAP Netweaver Portal, http://www.sap.com/usa/platform/netweaver/components/portal/index.epxGoogle Scholar
- Service Modeling Language, http://www.serviceml.org/Google Scholar
- P. Shvaiko and J. Euzenat: A Survey of Schema-based Matching Approaches. J. Data Semantics IV:146--171 (2005). Google ScholarDigital Library
- N. C. Shu, B. C. Housel, R. W. Taylor, S. P. Ghosh, and V. Y. Lum: EXPRESS: A Data EXtraction, Processing, and REStructuring System. ACM TODS 2(2): 134--174 (1977). Google ScholarDigital Library
- A. Silberschatz, M. Stonebraker, and J. D. Ullman: Database systems: Achievements and opportunities. Commun. ACM (34)10: 110--120 (1991). Google ScholarDigital Library
- J. M. Smith, P. A. Bernstein, U. Dayal, N. Goodman, T. Landers, K. W. T. Lin, E. Wong, "MULTIBASE-Integrating Heterogeneous Distributed Database Systems," Proc. of 1981 National Computer Conf., AFIPS Press, 487--499.Google Scholar
- Solidworks, http://www.solidworks.com/Google Scholar
- S. Spaccapietra and C. Parent: View Integration: A Step Forward in Solving Structural Conflicts. IEEE TKDE 6(2): 258--274 (1994). Google ScholarDigital Library
- Stylus Studio, http://www.stylusstudio.com/Google Scholar
- Y. Velegrakis, R. J. Miller, and L. Popa: Mapping Adaptation under Evolving Schemas. VLDB 2003: 584--595.Google Scholar
- G. Wiederhold: Mediators in the Architecture of Future Information Systems. IEEE Computer 25(3): 38--49 (1992). Google ScholarDigital Library
- Workshop on Information Integration, Oct. 2006, http://db.cis.upenn.edu/iiworkshop/index.htmGoogle Scholar
- L. -L. Yan, R. J. Miller, L. M. Haas, R. Fagin: Data-Driven Understanding and Refinement of Schema Mappings. SIGMOD 2001: 485--496.Google Scholar
- C. Yu and L. Popa: Semantic Adaptation of Schema Mappings when Schemas Evolve. VLDB 2005: 1006--1017. Google ScholarDigital Library
Index Terms
- Model management 2.0: manipulating richer mappings
Recommendations
Quasi-inverses of schema mappings
Schema mappings are high-level specifications that describe the relationship between two database schemas. Two operators on schema mappings, namely the composition operator and the inverse operator, are regarded as especially important. Progress on the ...
Structural characterizations of schema-mapping languages
ICDT '09: Proceedings of the 12th International Conference on Database TheorySchema mappings are declarative specifications that describe the relationship between two database schemas. In recent years, there has been an extensive study of schema mappings and of their applications to several different data inter-operability tasks,...
Quasi-inverses of schema mappings
PODS '07: Proceedings of the twenty-sixth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systemsSchema mappings are high-level specifications that describe the relationship between two database schemas. Two operators on schema mappings, namely the composition operator and the inverse operator, are regarded as especially important. Progress on the ...
Comments