ABSTRACT
Knowledge organization systems such as thesauri and taxonomies are increasingly expressed in the Simple Knowledge Organization System (SKOS) and published as structured data on the Web. Search engines can exploit these vocabularies to improve search by expanding terms at query time or at document indexing time. We propose a SKOS-based term expansion and scoring technique that leverages the labels and semantic relationships in SKOS concept definitions, and we implemented this technique for Apache Lucene and Solr. Experiments with the Medical Subject Headings vocabulary and an early evaluation with Library of Congress Subject Headings indicated gains in precision when using SKOS-based expansion compared with pseudo-relevance feedback and with no expansion. Our findings are important for publishers and consumers of Web vocabularies who want to use them to improve search over Web documents.
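To make the idea of label-based expansion concrete, the following is a minimal sketch in Python, not the Lucene/Solr implementation described above. It assumes a hypothetical in-memory `Concept` structure and a hypothetical `BOOSTS` table; the weight values are illustrative assumptions and are not taken from the paper. A query term that matches a preferred or alternative label of a concept is expanded with the concept's other labels and with the labels of its broader, narrower, and related concepts, each weighted by label type.

```python
from dataclasses import dataclass, field


@dataclass
class Concept:
    """Tiny stand-in for a SKOS concept (hypothetical structure, labels only)."""
    pref_label: str
    alt_labels: list = field(default_factory=list)
    broader: list = field(default_factory=list)    # labels of broader concepts
    narrower: list = field(default_factory=list)   # labels of narrower concepts
    related: list = field(default_factory=list)    # labels of related concepts


# Assumed weights per label type; illustrative values, not from the paper.
BOOSTS = {"pref": 1.0, "alt": 0.8, "narrower": 0.6, "related": 0.4, "broader": 0.3}


def expand(term, vocabulary):
    """Return {expanded term: weight}, keeping the highest weight per term."""
    key = term.lower()
    expanded = {key: 1.0}  # the original term always keeps full weight
    for concept in vocabulary:
        own_labels = [concept.pref_label] + concept.alt_labels
        if key not in (label.lower() for label in own_labels):
            continue  # the term does not name this concept
        candidates = (
            [(concept.pref_label, "pref")]
            + [(label, "alt") for label in concept.alt_labels]
            + [(label, "narrower") for label in concept.narrower]
            + [(label, "related") for label in concept.related]
            + [(label, "broader") for label in concept.broader]
        )
        for label, kind in candidates:
            k = label.lower()
            expanded[k] = max(expanded.get(k, 0.0), BOOSTS[kind])
    return expanded


# Hypothetical mini-vocabulary for illustration only.
VOCAB = [
    Concept(
        pref_label="Neoplasms",
        alt_labels=["Tumors", "Cancer"],
        broader=["Diseases"],
        narrower=["Carcinoma"],
    )
]

if __name__ == "__main__":
    print(expand("cancer", VOCAB))
```

In a search engine, the resulting weights would typically become per-term boosts in the expanded query or in the index; how the actual implementation scores expanded terms is not specified in this abstract.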
Index Terms
Using SKOS vocabularies for improving web search