Skip to main content
Log in

Curating the Specificity of Ontological Descriptions under Ontology Evolution

  • Original Article
  • Published:
Journal on Data Semantics

Abstract

Semantic Web Ontologies are not static, but evolve, however, this evolution usually happens independently of the ontological instance descriptions (usually referred as “metadata”) which are stored in the various Metadata Repositories or Knowledge Bases. Nevertheless, it is a common practice for a MR/KB to periodically update its ontologies to their latest versions by “migrating” the available instance descriptions to the latest ontology versions. Such migrations incur gaps regarding the specificity of the migrated metadata, i.e. inability to distinguish those descriptions that should be reexamined (for possible specialization as consequence of the migration) from those for which no reexamination is justified. Consequently, there is a need for principles, techniques, and tools for managing the uncertainty incurred by such migrations, specifically techniques for (a) identifying automatically the descriptions that are candidates for specialization, (b) computing, ranking and recommending possible specializations, and (c) flexible interactive techniques for updating the available descriptions (and their candidate specializations), after the user (curator of the repository) accepts/rejects such recommendations. This problem is especially important for curated knowledge bases which have increased quality requirements (as in E-science). In this paper, we formalize the problem and propose principles, rules, and algorithms for realizing the aforementioned scenario, for the RDF/S framework. Finally, we report experimental results demonstrating the feasibility of the approach.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13
Fig. 14
Fig. 15

Similar content being viewed by others

Notes

  1. In this paper, by the term of ontology we refer only to schema information.

  2. Note that according to RDF/S semantics [13], \(\le ^*_{cl}\) and \(\le ^*_{pr}\) are reflexive and transitive relations.

  3. A lower set (else called downward closed set) is a subset \(Y\) of a given partially ordered set \((X,\le )\) s.t. for all elements \(x\) and \(y,\) if \(x \le y\) and \(y\) is an element of \(Y,\) then \(x\) is also in \(Y.\)

  4. Upper set is the dual notion of lower set.

  5. Definition 5(ii) is satisfied because if it holds that \(\lnot valid(o, pr, o^{\prime }, K)\) and \(pr^{\prime } \le _{pr}^* pr\) then certainly it holds that \(\lnot valid(o, pr^{\prime }, o^{\prime }, K),\) due to (ii) of Definition 2. Thus, \( SubTriples(Invalid(K))=Invalid(K) .\) Definition 5(iii) is obvious.

  6. We discuss the notion of validity at Sect. 11 along with the discussion of the related works that focus on that issue.

  7. If \((o \;type \;c) \in K^{\prime }\) then \((c \;type \; Class ) \in \mathcal{C }(K^{\prime })\) and if \((o \;pr \;o^{\prime }) \in K^{\prime }\) then \((pr \;type \;Property) \in \mathcal{C }(K^{\prime }).\)

  8. NBC stands for non-backwards compatible.

  9. The relationships \(\le ^*_{cl}\) and \(\le ^*_{pr}\) hold in \(K^{\prime }.\)

  10. Note that \(t\not \in B_{K^{\prime }},\) if the class \(c\) or property \(pr\) appearing in \(t\) have been deleted in \(K^{\prime }.\)

  11. To keep notations simple, here and below we omit the subscripts \(K\) from the notations of \(\mathcal{C },\) \(P\) and \(M.\)

  12. In the figure, \(posCl(\mathtt{{John}})\) denotes the possible classes of \(\mathtt{{John}}.\)

  13. More information about the prototype is available through its web page: http://www.ics.forth.gr/isl/RIMQA.

  14. http://athena.ics.forth.gr:9090/SWKM.

  15. All experiments were carried out in an ordinary laptop with processor Pentium(R) Dual-Core CPU T4200 @2.0 Ghz, 2 GB Ram, running Windows Vista.

  16. http://www.musicontology.com.

  17. VR stands for Value vs Rank (it measures the relationship between the \(i^{th}\) biggest value and its rank \(i,\) assuming a descending order).

  18. PDF stands for Probability Density Function.

  19. CIDOC CRM (ISO 21127) is a core ontology describing the underlying semantics of data schemata and structures from all museum disciplines and archives (its RDF representation contains 78 classes and 250 properties from which 7 are literal-valued) (available from http://www.cidoc-crm.org/).

  20. For example, there are tools that scan Java bytecode for method calls and create a description of the dependencies between classes and the package/archive encoded in RDF. Other tools transform Maven POM (Project Object Model) files into RDF.

  21. http://leo.hua.gr/palc2012/?q=en.

  22. http://www.cidoc-crm.org/.

  23. XML Schema Definition.

References

  1. Alchourrón CE, Gärdenfors P, Makinson D (1985) On the logic of theory change: partial meet contraction and revision functions. J Symb Logic 50(2):510–530

    Article  MATH  Google Scholar 

  2. Bizer C, Heath T, Berners-Lee T (2009) Linked data-the story so far. Int J Semant Web Inform Syst 5(3):1–22

    Article  Google Scholar 

  3. Brickley D, Guha RV (2004) RDF Vocabulary Description Language 1.0: RDF Schema, W3C Recommendation, February 2004. http://www.w3.org/TR/rdf-schema/

  4. Buneman P, Cheney J, Wang Chiew T, Vansummeren S (2008) Curated Databases. In 27th ACM SIGMOD-SIGACT-SIGART symposium on principles of database systems (PODS-2008), pp 1–12

  5. Christophides V, Plexousakis D, Scholl M, Tourtounis S (2003) On labeling schemes for the semantic web. In: 12th International world wide web conference (WWW-2003), pp 544–555

  6. Dalal M (1988) Investigations into a theory of knowledge base revision: preliminary report. In: 7th National conference on artificial intelligence (AAAI-1988), pp 475–479

  7. Fernández JD, Martínez-Prieto MA, Gutierrez C (2010) Compact representation of large RDF data sets for publishing and exchange. In: 9th international semantic web conference (ISWC-2010), pp 193–208

  8. Ferré S, Rudolph S (2012) Advocatus diaboli-exploratory enrichment of ontologies with negative constraints. In: 18th international conference on knowledge engineering and, knowledge management (EKAW-2012), pp 42–56

  9. Flouris G (2006) On belief change and ontology evolution. PhD thesis, Computer Science Department, University of Crete, Greece

  10. Flouris G, Konstantinidis G, Antoniou G, Christophides V (2013) Formal foundations for RDF/S KB evolution. Int J Knowl Inform Syst (KAIS) 35(1):153–191

    Article  Google Scholar 

  11. Gutierrez C, Hurtado C, Mendelzon A (2004) Foundations of semantic web databases. In: 23rd ACM symposium on principles of database systems (PODS-2004), pp 95–106

  12. Hartung M, Terwilliger J, Rahm E (2011) Recent advances in schema and ontology evolution. Schema matching and mapping

  13. Hayes P (2004) RDF semantics, W3C recommendation, February 2004. http://www.w3.org/TR/rdf-mt/

  14. Jin R, Xiang Y, Ruan N, Wang H (2008) Efficiently answering reachability queries on very large directed graphs. In: ACM SIGMOD international conference on management of data (SIGMOD-2008), pp 595–608

  15. Klein MCA, Fensel D (2001) Ontology versioning on the semantic web. In: First semantic web working symposium (SWWS-2001), pp 75–91

  16. Konstantinidis G, Flouris, G, Antoniou G, Christophides V (2008) A formal approach for RDF/S ontology evolution. In: 18th European conference on artificial intelligence (ECAI-2008), pp 70–74. Patras, Greece, July 2008

  17. Neumann T, Weikum G (2008) RDF-3X: a RISC-style engine for RDF. Proc VLDB Endow (PVLDB) 1(1):647–659

    Google Scholar 

  18. Neumann T, Weikum G (2010) x-RDF-3X: fast querying, high update rates, and consistency for RDF databases. Proc VLDB Endow (PVLDB) 3(1):256–263

    Google Scholar 

  19. Noy NF, Klein MCA (2004) Ontology evolution: not the same as schema evolution. Knowl Inform Syst 6(4):428–440

    Google Scholar 

  20. APARSEN: Network of Excellence (FP7 No 269977) 2011–2014. Deliverable D26.1: Report and Strategy on Annotation, Reputation and Data Quality, 2012. http://www.alliancepermanentaccess.org/wp-content/uploads/downloads/2012/12/APARSEN-REP-D26_1-01-1_0.pdf

  21. Peterson D, (Sandy) Gao S, Malhotra A, Sperberg-McQueen, CM, Thompson HS (2009) W3C XML Schema Definition Language (XSD) 1.1 Part 2: Datatypes, W3C Working Draft 3, December 2009. http://www.w3.org/TR/xmlschema11-2/

  22. Qin L, Atluri V (2009) Evaluating the validity of data instances against ontology evolution over the Semantic Web. Inform Softw Technol 51(1):83–97

    Article  Google Scholar 

  23. Rada R, Mili H, Bicknell E, Blettner M (1989) Development and application of a metric on semantic nets. IEEE Trans Syst Man Cybern 19(1):17–30

    Article  Google Scholar 

  24. Rizzolo F, Velegrakis Y, Mylopoulos J, Bykau S (2009) Modeling concept evolution: a historical perspective. In: 28th International conference on conceptual modeling (ER-2009), pp 331–345

  25. Sahoo SS, Halb W, Hellmann S, Idehen K, Thibodeau T Jr, Auer S, Sequeda J, Ezzat A (2009) A Survey of current approaches for mapping of relational databases to RDF. Report by the W3C RDB2RDF incubator group. http://www.w3.org/2005/Incubator/rdb2rdf/RDB2RDF_SurveyReport.pdf

  26. Theodoridou M, Tzitzikas Y, Doerr M, Marketakis Y, Melessanakis V (2010) Modeling and querying provenance by extending CIDOC CRM. Distrib Parallel Databases 27(2):169–210

    Article  Google Scholar 

  27. Theoharis Y, Georgakopoulos G, Christophides V (2007) On the synthetic generation of semantic web schemas. In: Joint ODBIS & SWDB workshop on semantic web, ontologies, and databases. Collocated with VLDB2007, pp 98–116

  28. Theoharis Y, Tzitzikas Y, Kotzinos D, Christophides V (2008) On graph features of semantic web schemas. IEEE Trans Knowl Data Eng 20(5):692–702

    Article  Google Scholar 

  29. Tzitzikas Y, Kampouraki M, Analyti A (2012) Curating the specificity of metadata while World Models Evolve. In: 9th annual iPres conference on digital preservation (iPres-2012). http://www.ics.forth.gr/analyti/Local_Papers/CuratingSpecificity_iPres_CRC_pv.pdf

  30. Wang H, He H, Yang J, Yu PS, Yu JX (2006) Dual labeling: answering graph reachability queries in constant time. In: 22nd international conference on data engineering (ICDE-2006)

  31. Xuan DN, Bellatreche L, Pierra G (2006) A versioning management model for ontology-based data warehouses. In: 8th International conference on data warehousing and knowledge discovery (DaWaK 2006), pp 195–206

  32. Zeginis D, Tzitzikas Y, Christophides V (2007) On the foundations of computing deltas between RDF models. In: 6th international semantic web conference (ISWC-07), pp 637–651. Springer, Berlin

  33. Zeginis D, Tzitzikas Y, Christophides V (2011) On computing deltas of RDF/S knowledge bases. ACM Trans Web (TWEB) 5(3):14

    Google Scholar 

Download references

Acknowledgments

Work done in the context of NoE APARSEN (Alliance Permanent Access to the Records of Science in Europe, FP7, Proj. No 269977, 2011–2014). Many thanks to Xristina Lantzaki who participated in the user study.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Anastasia Analyti.

Electronic supplementary material

Below is the link to the electronic supplementary material.

Supplementary material 1 (PDF1088 KB)

Appendices

Appendix A: List of symbols

See Table 12.

Table 12 Symbols and description

Appendix B: Example of Applying the Algorithm for Backwards Compatible Migration

Example 9

Consider the example of Fig. 5 and suppose that \(P_K=\{\mathtt{(Fiat{\_}1\;type\;Vehicle)},\;\mathtt{(Bob\,uses\,BMW{\_}1)},\;\mathtt{(Alice\,works\,at\,FORTH)}\}.\) Executing Algorithm 1, step by step, we have that:

(1) \(K^{\prime } = (S_{K^{\prime }}, I_K);\)

(2) \(P_{K\_Add} = \emptyset ;\)

(3) \(P_{K\_Del} = \emptyset ;\)

(4) \(NC = \{\mathtt{Van,\;Jeep,\;Adult,\;Institute,\;University}\};\)

(6) \(P_{{K}{\_}Add} = P_{{K}{\_}Add} \cup \{\mathtt{(Fiat{\_}1\,type\,Van)}, \mathtt{(BMW{\_}1\,type\,Van)},\;\mathtt{(Fiat{\_}1\,type\,Jeep)}, \mathtt{(BMW{\_}1\,type\,Jeep)}, \mathtt{(Bob\,type\,Adult)}, \mathtt{(Alice type Adult)}, \mathtt{(Computer Science Department type } \mathtt{Institute)}, \mathtt{(FORTH } \mathtt{type Institute)}, \mathtt{(Computer Science Department type } \mathtt{University)}, \mathtt{(FORTH type University)}\};\)

(8) \(P_{K{\_}Del} = P_{K{\_}Del} \cup \{\emptyset \};\)

(9) \(NP = \{\mathtt{paid\,from}\};\)

(11) \(P_{K\_Add} = P_{K\_Add} \cup \{\mathtt{(Alice\,paid\;from\,Computer\,Science\,Department)}, \mathtt{(Bob\,paid\,from\,FORTH)}\};\)

(13) \(P_{K{\_}Del} = P_{K{\_}Del} \cup \{\mathtt{(Alice\,works\,at\,FORTH)}\};\)

Note that, \(\mathtt{(Alice related to FORTH)} \notin (P_K \cup \mathcal C _i(K^{\prime }))\) and \(\mathtt{works\;at}\) \(\le _{cl}\) \(\mathtt{related\;to}\) holds in \(K^{\prime }.\) So, we have to move \(\mathtt{(Alice\;works\;at\;FORTH)}\) from \(P_K\) to \(F_{K^{\prime }}\) (due to Rule \(R2\)).

(14) \(P_{K\_Del} = P_{K\_Del} \cup \{\mathtt{(Fiat\_1\;type\;Vehicle)},\;(\mathtt{Bob\;uses\;BMW\_1})\};\)

Note that \(P_{K\_Del}\) is updated by those triples that belong to \(P_K\) and now belong to \(\mathcal C _i(K^{\prime }).\) So, we have to remove them from \(P_{K}.\)

(15) \(P_{K^{\prime }} = P_K \setminus P_{K\_Del}\;=\)

\(\{\mathtt{(Fiat\_1\;type\;Vehicle),\;(Bob\;uses\;BMW\_1)},\;\mathtt{(Alice\;works\;at\;FORTH)}\} \setminus \)

\(\{\mathtt{(Fiat\_1\;type\;Vehicle)},\;\mathtt{(Bob\;uses\;\mathtt BMW\_1)},\;\mathtt{(Alice\;works\;at\;FORTH)}\} = \emptyset ;\)

(16) \(P_{K^{\prime }} = \{\mathtt{(Fiat\_1\;type\;Van)},\;\mathtt{(BMW\_1\;type\;Van)},\;\mathtt{(Fiat\_1\;type\;Jeep)},\)

\(\mathtt{(BMW\_1\;type\;Jeep)},\;\mathtt{(Bob\;type\;Adult)},\;\mathtt{(Alice\;type\;Adult)},\)

\(\mathtt{(Computer\;Science\;Department\;type\;Institute)},\;\mathtt{(FORTH\;type\;Institute)},\)

\(\mathtt{(Computer\;Science\;Department\;type\;University)},\;\mathtt{(FORTH\;type\;University)},\)

\(\mathtt{(Alice\;paid\;from\;Computer\;Science\;Department)},\;\mathtt{(Bob\;paid\;from\;FORTH)}\}\)

(17) Return \(P_{K^{\prime }};\)

In order to explain line 8 in part B of Algorithm 1, consider Fig. 5. Suppose that we have another version \(S_{K^{\prime \prime }}=S_{K^{\prime }} \cup \{(\mathtt{Van}\) \(\le _{cl}\) \(\mathtt{LoadCarrying\;Vehicle})\},\) where we have a new specialization relationship, i.e. \(\mathtt{Van}\) \(\le _{cl}\) \(\mathtt{LoadCarrying\;Vehicle}.\) Then, according to line 8 of Algorithm 1, we have that \(P_{K\_Del} = P_{K\_Del} \cup \{(\mathtt Fiat\_1\;type\;Van)\}.\) This is because it holds that \(\mathtt{(Fiat\_1}\) \(\mathtt{\;type\;LoadCarrying\;}\) \(\mathtt{Vehicle)}\not \in P_{K^{\prime }}\cup \mathcal C _i(K^{\prime \prime })\) and \((\mathtt{Van}\) \( \le _{cl}\) \(\mathtt{LoadCarrying\;}\) \(\mathtt{Vehicle})\in S_{K^{\prime \prime }}.\) So, we have to move \(\mathtt{(Fiat\_1\;}\) \(\mathtt{type\;Van)}\) from \(P_{K^{\prime }}\) to \(F_{K^{\prime \prime }}\) (due to Rule \(R1\)).

Note that if \((o\;type\;c) \in \mathcal C _i(K^{\prime })\) and \((o\;type\;c^{\prime })\not \in \mathcal C _i(K^{\prime }) \cup P_{K^{\prime }},\) for all \(c^{\prime }\le ^*_{cl} c\) and \(c^{\prime } \not = c,\) then the \( MSA \) property holds for the class instance triple \((o\;type\;c).\) Similarly, if \((o\;pr\;o^{\prime }) \in \mathcal C _i(K^{\prime })\) and \((o\;pr^{\prime }\;o^{\prime })\not \in \mathcal C _i(K^{\prime }) \cup P_{K^{\prime }},\) for all \(pr^{\prime }\le ^*_{pr} pr\) and \(pr^{\prime } \not = pr,\) then the \( MSA \) property holds for the property instance triple \((o\;pr\;o^{\prime }).\)

Rights and permissions

Reprints and permissions

About this article

Cite this article

Tzitzikas, Y., Kampouraki, M. & Analyti, A. Curating the Specificity of Ontological Descriptions under Ontology Evolution. J Data Semant 3, 75–106 (2014). https://doi.org/10.1007/s13740-013-0027-z

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s13740-013-0027-z

Keywords

Navigation