skip to main content
10.1145/1008992.1009038acmconferencesArticle/Chapter ViewAbstractPublication PagesirConference Proceedingsconference-collections
Article

Information retrieval using word senses: root sense tagging approach

Published:25 July 2004Publication History

ABSTRACT

Information retrieval using word senses is emerging as a good research challenge on semantic information retrieval. In this paper, we propose a new method using word senses in information retrieval: root sense tagging method. This method assigns coarse-grained word senses defined in WordNet to query terms and document terms by unsupervised way using co-occurrence information constructed automatically. Our sense tagger is crude, but performs consistent disambiguation by considering only the single most informative word as evidence to disambiguate the target word. We also allow multiple-sense assignment to alleviate the problem caused by incorrect disambiguation.Experimental results on a large-scale TREC collection show that our approach to improve retrieval effectiveness is successful, while most of the previous work failed to improve performances even on small text collection. Our method also shows promising results when is combined with pseudo relevance feedback and state-of-the-art retrieval function such as BM25.

References

  1. S. C. Deerwester, S. T. Dumais, T. K. Landauer, G. W. Furnas, and R. A. Harshman. Indexing by latent semantic analysis. Journal of the American Society of Information Science 41(6):391--407, 1990.Google ScholarGoogle ScholarCross RefCross Ref
  2. T. Hofmann. Probabilistic Latent Semantic Indexing. In Proceedings of the 22nd Annual ACM Conference on Research and Development in Information Retrieval pages 50--57, Berkeley, California, August 1999. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. K. S. Jones, S. Walker, and S. E. Robertson. A probabilistic model of information retrieval: development and comparative experiments -part 1. Information Processing and Management 36(6):779--808, 2000. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. R. Krovetz and W. B. Croft. Lexical ambiguity and information retrieval. Information Systems 10(2):115--141, 1992. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. C. D. Manning and H. Schutze. Foundations of statistical natural language processing MIT Press, 1999. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. S. E. Robertson and S. Walker. Okapi/keenbow at trec-8. In Proceedings of TREC-8, 8th Text Retrieval Conference pages 151--161, Gaithersburg, US, 2000.Google ScholarGoogle Scholar
  7. M. Sanderson. Retrieving with good sense. Inf. Retr., 2(1):49--69, 2000. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. M. Sanderson and C. J. V. Rijsbergen. The impact on retrieval effectiveness of skewed frequency distributions. ACM Transactions on Information Systems 17(4):440--465, 1999. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. H.Schutze and J. Pedersen. Information retrieval based on word senses. In Proceedings of the 4th Annual Symposium on Document Analysis and Information Retrieval pages 161--175, 1995.Google ScholarGoogle Scholar
  10. C. Stokoe, M. P. Oakes, and J. Tait. Word sense disambiguation in information retrieval revisited. In Proceedings of the 26th annual international ACM SIGIR conference on Research and development in informaion retrieval pages 159--166. ACM Press, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. E. M. Voorhees. Using wordnet to disambiguate word senses for text retrieval. In Proceedings of the 16th annual international ACM SIGIR conference on Research and development in information retrieval pages 171--180. ACM Press, 1993. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. P. Wallis. Information retrieval based on paraphrase. In Proceedings of the 1st Pacific Association for Computational Linguistics Conference 1993.Google ScholarGoogle Scholar
  13. D. Yarowsky. Unsupervised word sense disambiguation rivaling supervised methods. In Proceedings of 33rd Annual Meeting of the Association for Computational Linguistics pages 189--196, 1995. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Information retrieval using word senses: root sense tagging approach

          Recommendations

          Comments

          Login options

          Check if you have access through your login credentials or your institution to get full access on this article.

          Sign in
          • Published in

            cover image ACM Conferences
            SIGIR '04: Proceedings of the 27th annual international ACM SIGIR conference on Research and development in information retrieval
            July 2004
            624 pages
            ISBN:1581138814
            DOI:10.1145/1008992

            Copyright © 2004 ACM

            Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

            Publisher

            Association for Computing Machinery

            New York, NY, United States

            Publication History

            • Published: 25 July 2004

            Permissions

            Request permissions about this article.

            Request Permissions

            Check for updates

            Qualifiers

            • Article

            Acceptance Rates

            Overall Acceptance Rate792of3,983submissions,20%

          PDF Format

          View or Download as a PDF file.

          PDF

          eReader

          View online with eReader.

          eReader