Abstract
Global schema generation is the problem of generating a unified and merged schema based on existing heterogeneous local schemas and a set of correspondences which are generated by a schema-matching algorithm. The existing schema integration approaches only target the flat schema structure, but cannot handle hierarchical schema structure well, and may produce redundant information in the global schema due to the weak entities in local schemas. To deal with these kinds of problems, a framework for ontology-based top-k global schema generation is proposed in this paper. The framework employs ontology as a base-merging model to create the global schema with higher quality but less user involvement, because ontology can provide semantic and detailed constraints and develop ways to preserve the hierarchical structure and remove redundant entities. The proposed approach consists of three contiguous steps: (1) local schemas are converted to local ontologies; (2) local ontologies are merged and top-k global ontologies are generated; (3) the user chooses one merged global ontology, which is converted automatically back to the global schema. After comparing the proposed approach with an existing state-of-the-art schema integration approach, the proposed approach better preserves the hierarchical structure and removes the redundant information. As a result, the quality of the generated global schema is higher.
























Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.References
Alawan N (2011) Ontological approach for database integration. Ph.D. Dissertation, De Montfort University, United Kingdom
Aumueller D, Do H, Massmann S, Rahm E (2005) Schema and ontology matching with COMA++. In: The Proceedings of the 2005 ACM SIGMOD International Conference on Management of Data, pp 906–908
Cheatham M, Dragisic Z, Euzenat J, et al (2015) Results of the ontology alignment evaluation initiative 2015. In: Proceedings 10th ISWC workshop on ontology matching, pp 60–115
Chiticariu L, Kolaitis P, Popa L (2008) Interactive generation of integrated schemas. In: The Proceedings of the 2008 ACM SIGMOD International Conference on Management of Data, pp 833–846
Do H, Rahm E (2002) COMA: a system for flexible combination of schema matching approaches. In: The Proceedings of the 28th International Conference on Very Large Data Bases, pp 610–628
Fahad M (2008) ER2OWL: generating OWL ontology from ER diagram. IFIP Int Fed Inf Process 288:28–37
Ding G, Wang, Wang B (2010) Top-k generation of mediated schemas over multiple data sources. In: The Proceedings of the 15th International Conference on Database Systems for Advanced Applications, pp 143–155
Duchateau F, Coletta R, Bellahsene Z, Miller RJ (2009) (Not) yet another matcher. In: The Conference of Information and Knowledge Management, pp 1537–1540
Gottlob G, Pichler R, Savenkov V (2011) Normalization and optimization of schema mappings. Int J Very Large Data Bases 20(2):277–301
Gruber T (1995) Toward principles for the design of ontologies used for knowledge sharing. Int J Human-Comput Stud 43(5–6):907–928
Hakimpour F (2002) A Geppert global schema generation using formal ontologies. In: The Proceedings of the 21st International Conference on Conceptual Modeling
Hanif MS, Aono M (2009) An efficient and scalable algorithm for segmented alignment of ontologies of arbitrary size. J Web Semant 7(4):344–356
Kotis K, Vouros GA, Stergiou K (2006) Towards automatic merging of domain ontologies: The HCONE-merge approach”. Web Semant: Sci, Serv Agents on the World Wide Web 4:60–79
Lambrix P, Tan H (2006) SAMBO—a system for aligning and merging biomedical ontologies. J Web Semant 4(1):196–206
Lenat DB (1995) Cyc: a large-scale investment in knowledge infrastructure. Commun ACM 38(11):33–38
Madhavan J, Bernstein PA, Rahm E (2001) Generic schema matching with Cupid. In: The Proceedings of the 27th International Conference on Very Large Data Bases, pp 49–58
Mihoubi H, Simonet A, Simonet M (2000) An ontology driven approach to ontology translation. In: The Proceeding of the 11th International Conference on Database and Expert Systems Applications, pp 573–582
Nguyen H, Taniar D, Rahayu J, Nguyen K (2011) Double-layered schema integration of heterogeneous XML sources. J Syst Softw 84:63–76
Noy NF, Musen MA (2000) PROMPT: algorithm and tool for automated ontology merging and alignment. In: 17th National Conference on Artificial Intelligence
Nyulas C, Tu S (2007) DataMaster—a plug-in for importing schemas and data from relational dtabases into Protégé. In: The Proceedings of the 10th International Protégé Conference
Octavian U, Lise G (2007) Leveraging data and structure in ontology integration. In: The Proceedings of the 2007 ACM SIGMOD International Conference on Management of Data, pp 449–460
Otero-Cerdeira L, Rodríguez-Martínez FJ, Gómez-Rodríguez A (2015) Ontology matching: lliterature review. Expert Syst with Appl 42(2):949–971
Pottinger R, Bernstein P (2003) Merging models based on given correspondences. In: The Proceedings of the 29th Conference on Very Large Data Bases, pp 826–873
Radwan A, Popa L, Stanoi L, Younis A (2009) Top-k generation of integrated schemas based on directed and weighted correspondences. In: The Proceedings of 2009 ACM SIGMOD International Conference on Management of Data, pp 641–654
Raunich S, Rahm E (2011)ATOM: automatic target-driven ontology merging. In: Proceedings of International Conference on Data Engineering
Sharma (2006) Ontology matching using weighted graphs. In: The Proceedings of 1st International Conference on Digital Information Management, pp 121–124
Smith M, Welty C, McGuinness D (2004) OWL web ontology language guide. http://www.w3.org/TR/owl-guide/
Shvaiko P, Euzenat J (2013) Ontology matching: state of the art and future challenges. IEEE Trans Knowl Data Eng 25(1):158–176
Stoilos G, Stamou G, Kollias S (2005) A string metric for ontology alignment. In: The Proceedings of the 4th International Semantic Web Conference, pp 623–637
Stumme G, Madche A (2001) FCA-merge: bottom-up merging of ontologies. In: Proc. 17th International Joint Conference on Artificial Intelligence (IJCAI)
Tria F, Lefons E, Tangorra F (2013) Ontological approach to data warehouse source integration. In: Gelenbe E, Lent R (eds) Information sciences and systems. Lecture notes in electrical engineering, vol 264. Springer, pp 251–259
Winkler W (1990) String comparator metrics and enhanced decision rules in the Fellegi-Sunter model of record linkage. In: Proceedings of the section on survey research methods. American Statistical Association, pp 354–359
Zhang C, Zhao Z, Chen L, Jagadish HV, Cao C (2014) CrowdMatcher crowd-assisted schema matching. In: The proceedings of the 2014 ACM SIGMOD international conference on Management of data, pp 721–724
Zhao L, Ichise R (2014) Ontology integration for linked data. J Data Semant 3:237–254
Acknowledgments
This work was supported in part by Grants from NSF MRI 1429518 and NSF CRI 0708573, and also by the National Natural Science Foundation of China under Grant No. 61472315 and the National Key Technologies R&D Program of China under Grant No. 2013BAK09B01.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Li, L., Wei, Y. & Tian, F. A Framework for Ontology-Based Top-K Global Schema Generation. J Data Semant 6, 31–53 (2017). https://doi.org/10.1007/s13740-016-0075-2
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s13740-016-0075-2