Determining relevance of imprecise temporal intervals for cultural heritage information retrieval

doi:10.1016/j.ijhcs.2010.03.002

International Journal of Human-Computer Studies

Volume 68, Issue 9, September 2010, Pages 549-560

https://doi.org/10.1016/j.ijhcs.2010.03.002 Get rights and content

Abstract

Time is an essential concept in cultural heritage applications. Instances of temporal concepts such as time intervals are used for the annotation of cultural objects and also for querying datasets containing information about these objects. Hence it is important to match query and annotation intervals by examining their similarity or closeness. One of the problems is that in many cases time intervals are imprecise. For example, the boundaries of the “Pre-Roman age” and the “Roman age” are inherently imprecise and it may be difficult to distinguish them with clear-cut intervals. In this paper we apply the fuzzy set theory to model imprecise time intervals in order to determine relevance of the relationship between two time intervals. We present a method for matching query and annotation intervals based on their weighted mutual overlapping and closeness. We present (1) methods for calculating these weights to produce a combined measure and (2) results of comparing the combined measure with human evaluators as a case study. The case study takes into consideration archaeological temporal information, which is in most cases inherently fuzzy, and therefore offers a particularly complex and challenging scenario. The results show that our new combined measure that utilizes different weighted measures together in rankings, performs the best in terms of precision and recall. It should be used when ranking annotation intervals according to a given query interval in cultural heritage information retrieval. Our approach intends to be generalizable: overlapping and closeness may be calculated between any two fuzzy temporal intervals. The presented procedure of using user evaluation results as a basis for assigning weights for overlapping and closeness could potentially be used to reveal weights in other domains and purposes as well.

Introduction

Time is one of the central concepts in ontologies representing the world, and hence should also be centric in annotations and queries of the Semantic Web. Time is especially important for managing historical collections, for example in visualizing them on a timeline (Hyvönen et al., 2009, Schreiber et al., 2006).

However, representing time in Semantic Web ontologies is not straightforward because the question of when a certain time was or will be is often uncertain, subjective or vague (Nagypál and Motik, 2003). For example, it may not be known when exactly a given archaeological artifact was manufactured (uncertainty), when “The Middle Ages” was according to opinions of different historians (subjectivity), or when the spring starts (vagueness, imprecision).

In addition, transitions between different phases, such as historical periods, are usually complex processes which are not identifiable by clear cut dates, even if conventional calendric markers are mostly used in order to simplify historical sequences. All these elements are at the basis of imprecise temporal representations especially in the cultural heritage, historical and archaeological contexts.

Nevertheless, representations of time are needed for representing and matching annotations and queries. The definition of temporal intervals and their relations are crucial in the context of archaeological chronologies. Checking whether two time intervals have something in common allows for answering queries like “find all artifacts manufactured around the middle of the I century B.C.”. At the same time, the often inherently fuzzy nature of historical and archaeological temporal information makes this scenario particularly complex and challenging.

In this paper we adapt the idea (Nagypál and Motik, 2003, Visser, 2004) of using fuzzy sets (Zadeh, 1965) for the representation of temporal annotations and queries. The structure of the paper is as follows. In Section 2 we present a new method for matching queries and annotations using their weighted mutual overlapping and closeness. In Section 3 we present a case study where the method has been implemented and provide results of its evaluation with human test subjects. The results in Section 5 suggest that a measure that combines different single, weighted measures together performs best in terms of precision and recall, and should hence be used when ranking annotation intervals with respect to a given query interval. In Section 6 we provide a discussion about the results, and Section 7 provides discussion about the related work. Finally, Section 8 concludes the paper.

Section snippets

Representing and reasoning about temporal overlappings

To represent and calculate overlappings between temporal intervals we use fuzzy sets (Zadeh, 1965) to represent temporal intervals, resulting in fuzzy temporal intervals (Nagypál and Motik, 2003, Visser, 2004). The fuzzy set theory enables modeling imprecise time ranges, such as “around 1950” that have vague boundaries. In the fuzzy set theory the grade of membership $μ$ of the item x in the given set A is a value in range [0,1], whereas in the traditional set theory an item x either belongs to a

Case study: Ancient Milan

In the evaluation of the method we considered a dataset from the “Ancient Milan” project (Bandini et al., 2009). “Ancient Milan” aims at improving access to information concerning the archaeological heritage of Milan (Italy) through the utilization of Semantic Web and Web 2.0 techniques and tools. This data comes from heterogeneous data sources, which are integrated by means of the CIDOC CRM (Crofts et al., 2009) ontology.

Temporal information is provided mostly in the form of chronologies that

Evaluation setting: participants, materials and methods

In order to measure the correlation between different measures and human opinions we used the following evaluation setting. Together there were twelve human evaluators that were given the task to evaluate each of the query intervals according to each of the annotation intervals.

The query periods represent relevant historical phases in the ancient history of Milan, while the annotation periods refer to the temporal intervals when artifacts were produced according to archaeologists. This latter

Ratings from evaluators

Twelve evaluators (E₁,…,E₁₂) evaluated all 12 query intervals with respect to all 66 unique annotation intervals as described in the previous section. In other words, each query/annotation pair was rated by each of the same twelve evaluators. Recall that all query/annotation pairs with no explicit rating were treated as having zero stars. The user agreement about relevance levels between queries and annotations was analyzed using weighted kappa (Cohen, 1968). Weighted kappa coefficients are

Discussion

The results of comparing the rankings given by the method and the ratings given by the evaluators seem promising as there is an apparent correlation between the rankings and the ratings. The precision and recall curve shows that the new combined measure performed best. Among single measures the closeness performed well in traditional precision and recall analysis. This might be due to the fact that other measures measure the level of overlap (or confidence) in different ways and simply do not

Related work

The general properties of time ontologies have been explored e.g. in Vila (1994). There have been discussions e.g. whether the basic primitive is the interval (period) or the point. Other properties for time are characterized by whether time is discrete or dense, bounded or unbounded, and what type of precedence the time ontology allows: linear, branching, parallel or circular (cyclic). Allen (1983) has presented a set of the 13 primitive interval relations that exclusively correspond to every

Conclusions

The proposal and experiments presented in this paper provide an approach for matching time intervals. Imprecision of temporal intervals was modeled using fuzzy sets and a method was developed in order to obtain a better match between human and machine interpretations of periods in information retrieval. For this we proposed to calculate overlappings and closenesses between annotation and query intervals, and showed how they can be combined together. The models and the method were tested in a

Acknowledgments

Discussions with the members of the Semantic Computing Research Group (SeCo) have greatly influenced the ideas and the research setting presented here. We gratefully acknowledge Stina Westman and Mari Laine-Hernandez for valuable comments. Moreover, the insightful comments from three anonymous referees provided useful suggestions to improve the content of the paper. Our research was done in the EU project SmartMuseum⁷ supported within the IST priority of the Seventh

References (22)

S. Schockaert et al.
Temporal reasoning about fuzzy intervals
Artificial Intelligence
(2008)
L.A. Zadeh
Fuzzy sets
Information and Control
(1965)
T. Accary-Barbier et al.
Building and using temporal knowledge in archaeological documentation
Journal of Intelligent Information Systems
(2008)
J.F. Allen
Maintaining knowledge about temporal intervals
Communications of the ACM
(1983)
D.G. Altman
Practical Statistics for Medical Research
(1991)
Bandini, S., Locatelli, M., Mantegari, G., Simone, C., Vizzari, G., 2009. An integrated approach to the valorization of...
Caporusso, D., Donati, M.T., Masseroli, S., Tibiletti, T., 2007. Immagini di Mediolanum. Civiche Raccolte Archeologiche...
J. Cohen
Weighted kappa: nominal scale agreement provision for scaled disagreement or partial credit
Psychological Bulletin
(1968)
Crofts, N., Doerr, M., Gill, T., Stead, S., Stiff, M., 2009. Definition of the CIDOC Conceptual Reference Model Version...
Doerr, M., Plexousakis, D., Kopaka, K., soula Bekiari, C., 2004. Supporting chronological reasoning in archaeology. In:...

D. Dubois et al.

Possibility Theory: An Approach to Computerized Processing of Uncertainty

(1988)

Cited by (0)

View full text

Determining relevance of imprecise temporal intervals for cultural heritage information retrieval

Abstract

Introduction

Section snippets

Representing and reasoning about temporal overlappings

Case study: Ancient Milan

Evaluation setting: participants, materials and methods

Ratings from evaluators

Discussion

Related work

Conclusions

Acknowledgments

Artificial Intelligence

Information and Control

Building and using temporal knowledge in archaeological documentation

Journal of Intelligent Information Systems

Maintaining knowledge about temporal intervals

Communications of the ACM

Practical Statistics for Medical Research

Weighted kappa: nominal scale agreement provision for scaled disagreement or partial credit

Psychological Bulletin

Possibility Theory: An Approach to Computerized Processing of Uncertainty