Skip to main content
Log in

A Framework for Ontology-Based Top-K Global Schema Generation

  • Original Article
  • Published:
Journal on Data Semantics

Abstract

Global schema generation is the problem of generating a unified and merged schema based on existing heterogeneous local schemas and a set of correspondences which are generated by a schema-matching algorithm. The existing schema integration approaches only target the flat schema structure, but cannot handle hierarchical schema structure well, and may produce redundant information in the global schema due to the weak entities in local schemas. To deal with these kinds of problems, a framework for ontology-based top-k global schema generation is proposed in this paper. The framework employs ontology as a base-merging model to create the global schema with higher quality but less user involvement, because ontology can provide semantic and detailed constraints and develop ways to preserve the hierarchical structure and remove redundant entities. The proposed approach consists of three contiguous steps: (1) local schemas are converted to local ontologies; (2) local ontologies are merged and top-k global ontologies are generated; (3) the user chooses one merged global ontology, which is converted automatically back to the global schema. After comparing the proposed approach with an existing state-of-the-art schema integration approach, the proposed approach better preserves the hierarchical structure and removes the redundant information. As a result, the quality of the generated global schema is higher.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13
Fig. 14
Fig. 15
Fig. 16
Fig. 17
Fig. 18
Fig. 19
Fig. 20
Fig. 21
Fig. 22
Fig. 23
Fig. 24

Similar content being viewed by others

Explore related subjects

Discover the latest articles, news and stories from top researchers in related subjects.

Notes

  1. https://wordnet.princeton.edu/

  2. http://oaei.ontologymatching.org/2014/benchmarks/.

  3. http://oaei.ontologymatching.org/tests/.

References

  1. Alawan N (2011) Ontological approach for database integration. Ph.D. Dissertation, De Montfort University, United Kingdom

  2. Aumueller D, Do H, Massmann S, Rahm E (2005) Schema and ontology matching with COMA++. In: The Proceedings of the 2005 ACM SIGMOD International Conference on Management of Data, pp 906–908

  3. Cheatham M, Dragisic Z, Euzenat J, et al (2015) Results of the ontology alignment evaluation initiative 2015. In: Proceedings 10th ISWC workshop on ontology matching, pp 60–115

  4. Chiticariu L, Kolaitis P, Popa L (2008) Interactive generation of integrated schemas. In: The Proceedings of the 2008 ACM SIGMOD International Conference on Management of Data, pp 833–846

  5. Do H, Rahm E (2002) COMA: a system for flexible combination of schema matching approaches. In: The Proceedings of the 28th International Conference on Very Large Data Bases, pp 610–628

  6. Fahad M (2008) ER2OWL: generating OWL ontology from ER diagram. IFIP Int Fed Inf Process 288:28–37

    MATH  Google Scholar 

  7. Ding G, Wang, Wang B (2010) Top-k generation of mediated schemas over multiple data sources. In: The Proceedings of the 15th International Conference on Database Systems for Advanced Applications, pp 143–155

  8. Duchateau F, Coletta R, Bellahsene Z, Miller RJ (2009) (Not) yet another matcher. In: The Conference of Information and Knowledge Management, pp 1537–1540

  9. Gottlob G, Pichler R, Savenkov V (2011) Normalization and optimization of schema mappings. Int J Very Large Data Bases 20(2):277–301

    Google Scholar 

  10. Gruber T (1995) Toward principles for the design of ontologies used for knowledge sharing. Int J Human-Comput Stud 43(5–6):907–928

    Article  Google Scholar 

  11. Hakimpour F (2002) A Geppert global schema generation using formal ontologies. In: The Proceedings of the 21st International Conference on Conceptual Modeling

  12. Hanif MS, Aono M (2009) An efficient and scalable algorithm for segmented alignment of ontologies of arbitrary size. J Web Semant 7(4):344–356

    Article  Google Scholar 

  13. Kotis K, Vouros GA, Stergiou K (2006) Towards automatic merging of domain ontologies: The HCONE-merge approach”. Web Semant: Sci, Serv Agents on the World Wide Web 4:60–79

    Article  Google Scholar 

  14. Lambrix P, Tan H (2006) SAMBO—a system for aligning and merging biomedical ontologies. J Web Semant 4(1):196–206

    Article  Google Scholar 

  15. Lenat DB (1995) Cyc: a large-scale investment in knowledge infrastructure. Commun ACM 38(11):33–38

    Article  Google Scholar 

  16. Madhavan J, Bernstein PA, Rahm E (2001) Generic schema matching with Cupid. In: The Proceedings of the 27th International Conference on Very Large Data Bases, pp 49–58

  17. Mihoubi H, Simonet A, Simonet M (2000) An ontology driven approach to ontology translation. In: The Proceeding of the 11th International Conference on Database and Expert Systems Applications, pp 573–582

  18. Nguyen H, Taniar D, Rahayu J, Nguyen K (2011) Double-layered schema integration of heterogeneous XML sources. J Syst Softw 84:63–76

    Article  Google Scholar 

  19. Noy NF, Musen MA (2000) PROMPT: algorithm and tool for automated ontology merging and alignment. In: 17th National Conference on Artificial Intelligence

  20. Nyulas C, Tu S (2007) DataMaster—a plug-in for importing schemas and data from relational dtabases into Protégé. In: The Proceedings of the 10th International Protégé Conference

  21. Octavian U, Lise G (2007) Leveraging data and structure in ontology integration. In: The Proceedings of the 2007 ACM SIGMOD International Conference on Management of Data, pp 449–460

  22. Otero-Cerdeira L, Rodríguez-Martínez FJ, Gómez-Rodríguez A (2015) Ontology matching: lliterature review. Expert Syst with Appl 42(2):949–971

    Article  Google Scholar 

  23. Pottinger R, Bernstein P (2003) Merging models based on given correspondences. In: The Proceedings of the 29th Conference on Very Large Data Bases, pp 826–873

  24. Radwan A, Popa L, Stanoi L, Younis A (2009) Top-k generation of integrated schemas based on directed and weighted correspondences. In: The Proceedings of 2009 ACM SIGMOD International Conference on Management of Data, pp 641–654

  25. Raunich S, Rahm E (2011)ATOM: automatic target-driven ontology merging. In: Proceedings of International Conference on Data Engineering

  26. Sharma (2006) Ontology matching using weighted graphs. In: The Proceedings of 1st International Conference on Digital Information Management, pp 121–124

  27. Smith M, Welty C, McGuinness D (2004) OWL web ontology language guide. http://www.w3.org/TR/owl-guide/

  28. Shvaiko P, Euzenat J (2013) Ontology matching: state of the art and future challenges. IEEE Trans Knowl Data Eng 25(1):158–176

    Article  Google Scholar 

  29. Stoilos G, Stamou G, Kollias S (2005) A string metric for ontology alignment. In: The Proceedings of the 4th International Semantic Web Conference, pp 623–637

  30. Stumme G, Madche A (2001) FCA-merge: bottom-up merging of ontologies. In: Proc. 17th International Joint Conference on Artificial Intelligence (IJCAI)

  31. Tria F, Lefons E, Tangorra F (2013) Ontological approach to data warehouse source integration. In: Gelenbe E, Lent R (eds) Information sciences and systems. Lecture notes in electrical engineering, vol 264. Springer, pp 251–259

  32. Winkler W (1990) String comparator metrics and enhanced decision rules in the Fellegi-Sunter model of record linkage. In: Proceedings of the section on survey research methods. American Statistical Association, pp 354–359

  33. Zhang C, Zhao Z, Chen L, Jagadish HV, Cao C (2014) CrowdMatcher crowd-assisted schema matching. In: The proceedings of the 2014 ACM SIGMOD international conference on Management of data, pp 721–724

  34. Zhao L, Ichise R (2014) Ontology integration for linked data. J Data Semant 3:237–254

    Article  Google Scholar 

Download references

Acknowledgments

This work was supported in part by Grants from NSF MRI 1429518 and NSF CRI 0708573, and also by the National Natural Science Foundation of China under Grant No. 61472315 and the National Key Technologies R&D Program of China under Grant No. 2013BAK09B01.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Longzhuang Li.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Li, L., Wei, Y. & Tian, F. A Framework for Ontology-Based Top-K Global Schema Generation. J Data Semant 6, 31–53 (2017). https://doi.org/10.1007/s13740-016-0075-2

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s13740-016-0075-2

Keywords