To read this content please select one of the options below:

Evaluating the degree of domain specificity of terms in large terminologies: The case of AGROVOC

David Martín-Moncunill (Department of Computer Science , University of Alcalá, Madrid, Spain)
Miguel-Ángel Sicilia-Urban (Department of Computer Science, University of Alcalá, Madrid, Spain)
Elena García-Barriocanal (Department of Computer Science, University of Alcalá Madrid, Spain)
Salvador Sánchez-Alonso (Department of Computer Science, University of Alcala, Alcala de Henares, Spain)

Online Information Review

ISSN: 1468-4527

Article publication date: 8 June 2015

302

Abstract

Purpose

Large terminologies usually contain a mix of terms that are either generic or domain specific, which makes the use of the terminology itself a difficult task that may limit the positive effects of these systems. The purpose of this paper is to systematically evaluate the degree of domain specificity of the AGROVOC controlled vocabulary terms as a representative of a large terminology in the agricultural domain and discuss the generic/specific boundaries across its hierarchy.

Design/methodology/approach

A user-oriented study with domain-experts in conjunction with quantitative and systematic analysis. First an in-depth analysis of AGROVOC was carried out to make a proper selection of terms for the experiment. Then domain-experts were asked to classify the terms according to their domain specificity. An evaluation was conducted to analyse the domain-experts’ results. Finally, the resulting data set was automatically compared with the terms in SUMO, an upper ontology and MILO, a mid-level ontology; to analyse the coincidences.

Findings

Results show the existence of a high number of generic terms. The motivation for several of the unclear cases is also depicted. The automatic evaluation showed that there is not a direct way to assess the specificity degree of a term by using SUMO and MILO ontologies, however, it provided additional validation of the results gathered from the domain-experts.

Research limitations/implications

The “domain-analysis” concept has long been discussed and it could be addressed from different perspectives. A resume of these perspectives and an explanation of the approach followed in this experiment is included in the background section.

Originality/value

The authors propose an approach to identify the domain specificity of terms in large domain-specific terminologies and a criterion to measure the overall domain specificity of a knowledge organisation system, based on domain-experts analysis. The authors also provide a first insight about using automated measures to determine the degree to which a given term can be considered domain specific. The resulting data set from the domain-experts’ evaluation can be reused as a gold standard for further research about these automatic measures.

Keywords

Citation

Martín-Moncunill, D., Sicilia-Urban, M.-Á., García-Barriocanal, E. and Sánchez-Alonso, S. (2015), "Evaluating the degree of domain specificity of terms in large terminologies: The case of AGROVOC", Online Information Review, Vol. 39 No. 3, pp. 326-345. https://doi.org/10.1108/OIR-02-2015-0052

Publisher

:

Emerald Group Publishing Limited

Copyright © 2015, Emerald Group Publishing Limited

Related articles