skip to main content
research-article

High-level change detection in RDF(S) KBs

Published: 26 April 2013 Publication History

Abstract

With the increasing use of Web 2.0 to create, disseminate, and consume large volumes of data, more and more information is published and becomes available for potential data consumers, that is, applications/services, individual users and communities, outside their production site. The most representative example of this trend is Linked Open Data (LOD), a set of interlinked data and knowledge bases. The main challenge in this context is data governance within loosely coordinated organizations that are publishing added-value interlinked data on the Web, bringing together issues related to data management and data quality, in order to support the full lifecycle of data production, consumption, and management. In this article, we are interested in curation issues for RDF(S) data, which is the default data model for LOD. In particular, we are addressing change management for RDF(S) data maintained by large communities (scientists, librarians, etc.) which act as curators to ensure high quality of data. Such curated Knowledge Bases (KBs) are constantly evolving for various reasons, such as the inclusion of new experimental evidence or observations, or the correction of erroneous conceptualizations. Managing such changes poses several research problems, including the problem of detecting the changes (delta) between versions of the same KB developed and maintained by different groups of curators, a crucial task for assisting them in understanding the involved changes. This becomes all the more important as curated KBs are interconnected (through copying or referencing) and thus changes need to be propagated from one KB to another either within or across communities. This article addresses this problem by proposing a change language which allows the formulation of concise and intuitive deltas. The language is expressive enough to describe unambiguously any possible change encountered in curated KBs expressed in RDF(S), and can be efficiently and deterministically detected in an automated way. Moreover, we devise a change detection algorithm which is sound and complete with respect to the aforementioned language, and study appropriate semantics for executing the deltas expressed in our language in order to move backwards and forwards in a multiversion repository, using only the corresponding deltas. Finally, we evaluate through experiments the effectiveness and efficiency of our algorithms using real ontologies from the cultural, bioinformatics, and entertainment domains.

Supplementary Material

a1-papavasileiou-apndx.pdf (papavasileiou.zip)
Supplemental movie, appendix, image and software files for, High-level change detection in RDF(S) KBs

References

[1]
Arenas, M., Consens, M., and Mallea, A. 2010. Revisiting blank nodes in RDF to avoid the semantic mismatch with SPARQL. In Proceedings of the RDF Next Steps Workshop.
[2]
Auer, S. and Herre, H. 2006. A versioning and evolution framework for RDF knowledge bases. In Perspectives of Systems Informatics: Proceedings of the 6th International Andrei Ershov Memorial Conference.
[3]
Baader, F., Calvanese, D., McGuinness, D., Nardi, D., and Patel-Schneider, P., eds. 2002. The Description Logic Handbook: Theory, Implementation and Applications. Cambridge University Press.
[4]
Bairoch, A., Apweiler, R., et al. 2005. The universal protein resource (UniProt). Nucleic Acids Res. 38, D142--D148.
[5]
Banerjee, J., Kim, W., Kim, H., and Korth, H. 1987. Semantics and implementation of schema evolution in object-oriented databases. In Proceedings of the International Conference on Management of Data.
[6]
Berners-Lee, T., Hendler, J., and Lassila, O. 2001. The semantic web. Sci. Am. 284, 34--43.
[7]
Bizer, C., Heath, T., and Berners-Lee, T. 2009. Linked data: The story so far. Int. J. Semantic Web Inf. Syst. 5, 3, 1--22.
[8]
Brickley, D. and Guha, R. 2004. RDF vocabulary description language 1.0: RDF Schema. www.w3.org/TR/2004/REC-rdf-schema-20040210.
[9]
Buneman, P. 2008. Curated databases. In Proceedings of the 28th ACM SIGMOD/-SIGART Symposium on Principles of Database Systems.
[10]
Chawathe, S. and Garcia-Molina, H. 1997. Meaningful change detection in structured data. In Proceedings of the ACM SIGMOD International Conference on Management of Data.
[11]
Chawathe, S., Rajaraman, A., Garcia-Molina, H., and Widom, J. 1996. Change detection in hierarchically structured information. In Proceedings of the ACM SIGMOD International Conference on Management of Data.
[12]
Christophides, V., Karvounarakis, G., Plexousakis, D., Scholl, M., and Tourtounis, S. 2004. Optimizing taxonomic semantic web queries using labeling schemes. J. Web Semantics 1, 207--228.
[13]
CIDOC. 2010. The CIDOC Conceptual Reference Model. cidoc.ics.forth.gr/official_release_cidoc.html.
[14]
Cloran, R. and Irwin, B. 2005. Transmitting RDF graph deltas for a cheaper semantic web. In Proceedings of the 8th Annual Southern African Telecommunication Networks and Applications Conference (SATNAC'05).
[15]
Cobena, G., Abiteboul, S., and Marian, A. 2001. Detecting changes in XML documents. In Proceedings of the International Conference on Data Engineering (ICDE'01).
[16]
Curino, C., Moon, H., and Zaniolo, C. 2008. Graceful database schema evolution: the prism workbench. In Proceedings of the 34th International Conference on Very Large Data Bases.
[17]
Euzenat, J. and Shvaiko, P. 2007. Ontology Matching. Springer.
[18]
Ferrara, A., Nikolov, A., and Scharffe, F. 2011. Data linking for the semantic web. Int. J. Semantic Web Inf. Syst. 7, 3.
[19]
Flouris, G., Manakanatas, D., Kondylakis, H., Plexousakis, D., and Antoniou, G. 2008. Ontology change: Classification and survey. Knowl. Eng. Rev. 23, 2.
[20]
Franconi, E., Meyer, T., and Varzinczak, I. 2010. Semantic diff as the basis for knowledge base versioning. In Proceedings of the 13th International Workshop on Non-Monotonic Reasoning.
[21]
Hartig, O., Bizer, C., and Freytag, J. C. 2009. Executing SPARQL queries over the web of Linked Data. In Proceedings of the 8th International Semantic Web Conference (ISWC'09).
[22]
Hill, D., Smith, B., McAndrews-Hill, M., and Blake, J. 2008. Gene ontology annotations: What they mean and where they come from. BMC Bioinformatics.
[23]
Horrocks, I., Patel-Schneider, P., and Van Harmelen, F. 2003. From SHIQ and RDF to OWL: The making of a web ontology language. J. Web Semantics 1, 1, 7--26.
[24]
Klein, M. 2004. Change management for distributed ontologies. Ph.D. thesis, Vrije University.
[25]
Konev, B., Walther, D., and Wolter, F. 2008. The logical difference problem for description logic terminologies. In Proceedings of the 4th International Joint Conference on Automated Reasoning (IJCAR'08). 259--274.
[26]
Kontchakov, R., Wolter, F., and Zakharyaschev, M. 2008. Can you tell the difference between DL-Lite ontologies? In Proceedings of the 11th International Conference on Principles of Knowledge Representation and Reasoning (KR'08). 285--295.
[27]
Lausen, G., Meier, M., and Schmidt, M. 2008. SPARQLing constraints for RDF. In Proceedings of the 11th International Conference on Extending Database Technology (EDBT'08). 499--509.
[28]
Lerner, B. 2000. A model for compound type changes encountered in schema evolution. ACM Trans. Datab. Syst. 25, 1, 83--127.
[29]
Levenshtein, V. 1966. Binary Codes Capable of Correcting Deletions, Insertions, and Reversals. Sov. Phys. Dokl. 10.
[30]
Marian, A., Abiteboul, S., Cobena, G., and Mignet, L. 2001. Change-centric management of versions in an XML warehouse. In Proceedings of the International Conference on Very Large Data Bases (VLDB'01). 581--590.
[31]
McBride, B., Manola, F., and Miller, E. 2004. RDF primer. www.w3.org/TR/rdf-primer.
[32]
Motik, B., Horrocks, I., and Sattler, U. 2007. Bridging the gap between OWL and relational databases. In Proceedings of the 17th International World Wide Web Conference.
[33]
Nguyen, G. and Rieu, D. 1989. Schema evolution in object-oriented database systems. Data Knowl. Engin. 4, 1, 43--67.
[34]
Noy, N., Chugh, A., Liu, W., and Musen, M. 2006. A framework for ontology evolution in collaborative environments. In Proceedings of the 5th International Semantic Web Conference.
[35]
Noy, N. and Musen, M. 2002. PromptDiff: A fixed-point algorithm for comparing ontology versions. In Proceedings of the 18th National Conference on Artificial Intelligence (AAAI'02).
[36]
Palma, A., Haase, P., Wang, Y., and D'Aquin, M. 2007. D1.3.1 propagation models and strategies. tech. Rep. D1.3.1. NeOn Deliverable.
[37]
Papavasileiou, V., Flouris, G., Fundulaki, I., Kotzinos, D., and Christophides, V. 2009. On detecting high-level changes in RDF(S) KBs. In Proceedings of the 8th International Semantic Web Conference (ISWC'09).
[38]
Peters, R. and Ozsu, M. 1997. An axiomatic model of dynamic schema evolution in objectbase systems. ACM Trans. Datab. Syst. 22, 1, 75--114.
[39]
Plessers, P. and De Troyer, O. 2005. Ontology change detection using a version log. In Proceedings of the 4th International Semantic Web Conference (ISWC'05).
[40]
Plessers, P., De Troyer, O., and Casteleyn, S. 2007. Understanding ontology evolution: A change detection approach. In Web Semantics: Science, Services and Agents on the WWW: Selected Papers from the International Semantic Web Conference.
[41]
Protege. 2002. protege.stanford.edu.
[42]
Raimond, Y., Giasson, F., Jacobson, K., Fazekas, G., and Gangler, T. 2010. Music ontology specification. musicontology.com/.
[43]
Rogozan, D. and Paquette, G. 2005. Managing ontology changes on the semantic web. In Proceedings of the IEEE/WIC/ACM International Conference on Web Intelligence.
[44]
Schmedding, F. 2011. Incremental SPARQL evaluation for query answering on Linked Data. In Proceedings of the 2nd International Workshop on Consuming Linked Data (COLD'11).
[45]
Serfiotis, G., Koffina, I., Christophides, V., and Tannen, V. 2005. Containment and minimization of RDF(S) query patterns. In Proceedings of the 4th International Semantic Web Conference (ISWC'05).
[46]
Shadbolt, N., Berners-Lee, T., and Hall, W. 2006. The semantic web revisited. IEEE Intell. Syst. 21, 3, 96--101.
[47]
Skarra, A. and Zdonik, S. 1986. The management of changing types in an object-oriented database. In Proceedings of the Conference on Object-Oriented Programming Systems, Languages and Applications (OOPLSA'86).
[48]
Stojanovic, L. 2004. Methods and tools for ontology evolution. Ph.D. thesis, University of Karlsruhe.
[49]
Stojanovic, L., Maedche, A., Motik, B., and Stojanovic, N. 2002. User-driven ontology evolution management. In Proceedings of the 13th International Conference on Knowledge Engineering and Knowledge Management (EKAW'02). 285--300.
[50]
Tao, J., Sirin, E., Bao, J., and McGuinness, D. 2010. Extending OWL with integrity constraints. In Proceedings of the 23rd International Workshop on Description Logics.
[51]
Volkel, M., Winkler, W., Sure, Y., Kruk, S., and Synak, M. 2005. SemVersion: A versioning system for RDF and ontologies. In Proceedings of the 2nd European Semantic Web Conference.
[52]
Zeginis, D., Tzitzikas, Y., and Christophides, V. 2011. On computing deltas of RDF(S) knowledge bases. ACM Trans. Web.
[53]
Zhdanova, A. 2008. Community-driven ontology evolution: Gene ontology case study. In Proceedings of the 11th International Conference on Business Information Systems (BIS'08).

Cited By

View all
  • (2025)A change language for ontologies and knowledge graphsDatabase10.1093/database/baae1332025Online publication date: 22-Jan-2025
  • (2023)Analysis and implementation of the DynDiff tool when comparing versions of ontologyJournal of Biomedical Semantics10.1186/s13326-023-00295-714:1Online publication date: 28-Sep-2023
  • (2022)OWL ontology evolution: understanding and unifying the complex changesThe Knowledge Engineering Review10.1017/S026988892200006637Online publication date: 21-Nov-2022
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Transactions on Database Systems
ACM Transactions on Database Systems  Volume 38, Issue 1
April 2013
290 pages
ISSN:0362-5915
EISSN:1557-4644
DOI:10.1145/2445583
Issue’s Table of Contents
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 26 April 2013
Accepted: 01 September 2012
Revised: 01 October 2011
Received: 01 February 2011
Published in TODS Volume 38, Issue 1

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Change detection
  2. RDF(S)
  3. change management
  4. delta
  5. deterministic change detection
  6. diff
  7. high-level changes

Qualifiers

  • Research-article
  • Research
  • Refereed

Funding Sources

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)27
  • Downloads (Last 6 weeks)3
Reflects downloads up to 01 Mar 2025

Other Metrics

Citations

Cited By

View all
  • (2025)A change language for ontologies and knowledge graphsDatabase10.1093/database/baae1332025Online publication date: 22-Jan-2025
  • (2023)Analysis and implementation of the DynDiff tool when comparing versions of ontologyJournal of Biomedical Semantics10.1186/s13326-023-00295-714:1Online publication date: 28-Sep-2023
  • (2022)OWL ontology evolution: understanding and unifying the complex changesThe Knowledge Engineering Review10.1017/S026988892200006637Online publication date: 21-Nov-2022
  • (2021)Link maintenance for integrity in linked open data evolutionSemantic Web10.3233/SW-20039812:3(517-541)Online publication date: 1-Jan-2021
  • (2021)Knowledge GraphsSynthesis Lectures on Data, Semantics, and Knowledge10.2200/S01125ED1V01Y202109DSK02212:2(1-257)Online publication date: 8-Nov-2021
  • (2021)Ontology Versioning Driven by Instance Evolution in the τOWL FrameworkJournal of Information & Knowledge Management10.1142/S021964922250002221:01Online publication date: 4-Dec-2021
  • (2021)Beware of the hierarchy — An analysis of ontology evolution and the materialisation impact for biomedical ontologiesWeb Semantics: Science, Services and Agents on the World Wide Web10.1016/j.websem.2021.10065870:COnline publication date: 1-Jul-2021
  • (2021)Ontology-driven evolution of software securityData & Knowledge Engineering10.1016/j.datak.2021.101907134:COnline publication date: 1-Jul-2021
  • (2020)Data-driven assessment of structural evolution of RDF graphsSemantic Web10.3233/SW-20036811:5(831-853)Online publication date: 1-Jan-2020
  • (2020)Ontology Evolution in the Context of Model-Based Secure Software EngineeringResearch Challenges in Information Science10.1007/978-3-030-50316-1_26(437-454)Online publication date: 25-Jun-2020
  • Show More Cited By

View Options

Login options

Full Access

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media