skip to main content
10.1145/2362499.2362520acmotherconferencesArticle/Chapter ViewAbstractPublication PagessemanticsConference Proceedingsconference-collections
research-article

Are SKOS concept schemes ready for multilingual retrieval applications?

Published:05 September 2012Publication History

ABSTRACT

This article describes our approach to accessing Knowledge Organization Systems expressed using the Simple Knowledge Organization System (SKOS) data model. We share the view that the Web is becoming a multilingual lexical resource and a distribution infrastructure for knowledge resources. We aim to tap into this for the particular use case of Cross-Language Information Retrieval systems. The SKOS framework allows the description of monolingual or multilingual thesauri, controlled vocabularies and other classification systems in a simple machine-understandable representation. It has support for decentralized distribution on the Web of any resource described with it and includes mechanisms to interconnect different concept schemes. Yet, when building our prototype CLIR system different processes require more than the existing content of a SKOS resource: concept descriptions, labels and basic inter-concept relations. For example the SKOS concept indexing phase entails identifying potential occurrences of a SKOS concept in a text and to disambiguate based on the semantics referenced to in the overall SKOS scheme. By design, the SKOS data model does not formally define semantics of its concepts thus we have built a set of three algorithms that help generate a multilingual dataset linking to the original SKOS dataset and providing more details about the lexical entities that describe concepts. This new dataset contains specific RDF triples that aid concept identification, disambiguation and translation in CLIR.

References

  1. SKOS Simple Knowledge Organization System eXtension for Labels (SKOS-XL). http://www.w3.org/TR/skos-reference/skos-xl.html, March 2009.Google ScholarGoogle Scholar
  2. SKOS Simple Knowledge Organization System Reference. http://www.w3.org/TR/skos-reference/, February 2010.Google ScholarGoogle Scholar
  3. N. Calzolari. Initiatives, tendencies and driving forces for a lexical web as part of a language infrastructure. In T. Tokunaga and A. Ortega, editors, Large-Scale Knowledge Resources. Construction and Application, volume 4938 of Lecture Notes in Computer Science, pages 90--105. Springer Berlin / Heidelberg, 2008. 10.1007/978-3-540-78159-2 10. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. C. Fellbaum and P. Vossen. Connecting the universal to the specific: Towards the global grid. In Intercultural Collaboration I: Lecture Notes in Computer Science, Springer-Verlag, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. E. Gabrilovich and S. Markovitch. Computing semantic relatedness using wikipedia-based explicit semantic analysis. In In Proceedings of the 20th International Joint Conference on Artificial Intelligence, pages 1606--1611, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. A. J. G. Gray, N. Gray, and I. Ounis. Searching and exploring controlled vocabularies. In ESAIR "09: Proceedings of the WSDM "09 Workshop on Exploiting Semantic Annotations in Information Retrieval, pages 1--5, New York, NY, USA, 2009. ACM. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. N. Guarino and P. Giaretta. Ontologies and Knowledge Bases: Towards a Terminological Clarification. Towards Very Large Knowledge Bases: Knowledge Building and Knowledge Sharing, pages 25--32, 1995.Google ScholarGoogle Scholar
  8. D. He and J. Wang. Information Retrieval: Searching in the 21st Century, chapter Cross-Language Information Retrieval. John Wiley & Sons, 2007.Google ScholarGoogle Scholar
  9. D. Lenat. The Dimensions of Context-Space. Cycorp, 1998.Google ScholarGoogle Scholar
  10. G.-A. Levow, D. W. Oard, and P. Resnik. Dictionary-based techniques for cross-language information retrieval. Information Processing Management, 41(3):523--547, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. A. Miles and D. Brickley. SKOS Core Guide. World Wide Web Consortium, Working Draft WD-swbp-skos-core-guide-20051102, November 2005.Google ScholarGoogle Scholar
  12. E. Montiel-Ponsoda, J. Gracia, G. Aguado-De-Cea, and A. Gómez-Pérez. Representing translations on the semantic web. In The 10th International Semantic Web Conference, October 2011.Google ScholarGoogle Scholar
  13. V. Petras, N. Perelman, and F. C. Gey. Using thesauri in cross-language retrieval of german and french indexed collections. In CLEF, pages 349--362, 2002.Google ScholarGoogle Scholar
  14. U. P. School and U. Priss. Lattice-based information retrieval. Knowledge Organization, 27:132--142, 2000.Google ScholarGoogle Scholar
  15. C. Wartena, R. Brussee, L. Gazendam, and W.-O. Huijsen. Apolda: A practical tool for semantic annotation. In Proceedings of the 18th International Conference on Database and Expert Systems Applications, DEXA "07, pages 288--292, Washington, DC, USA, 2007. IEEE Computer Society. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. W. A. Woods. Conceptual indexing: A better way to organize knowledge. Technical report, Mountain View, CA, USA, 1997. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Are SKOS concept schemes ready for multilingual retrieval applications?

        Recommendations

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in
        • Published in

          cover image ACM Other conferences
          I-SEMANTICS '12: Proceedings of the 8th International Conference on Semantic Systems
          September 2012
          215 pages
          ISBN:9781450311120
          DOI:10.1145/2362499

          Copyright © 2012 ACM

          Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          • Published: 5 September 2012

          Permissions

          Request permissions about this article.

          Request Permissions

          Check for updates

          Qualifiers

          • research-article

          Acceptance Rates

          Overall Acceptance Rate40of182submissions,22%

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader