skip to main content
10.1145/1507509.1507519acmconferencesArticle/Chapter ViewAbstractPublication PageswsdmConference Proceedingsconference-collections
research-article

Using query logs and click data to create improved document descriptions

Published:09 February 2009Publication History

ABSTRACT

Logfiles of search engines are a promising resource for data mining, since they provide raw data associated to users and web documents. In this paper we focus on the latter aspect and explore how the information in logfiles could be used to improve document descriptions. A pilot experiment demonstrated that document descriptors extracted from the queries that are associated with documents by clicks provide useful semantic information about documents in addition to document descriptors extracted from the full text of the web pages.

References

  1. R. Baeza-Yates and A. Tiberi. Extracting semantic relations from query logs. In KDD '07: Proceedings of the 13th ACM SIGKDD international conference on Knowledge discovery and data mining, pages 76--85, New York, NY, USA, 2007. ACM. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. O. Boydell and B. Smyth. From social bookmarking to social summarization: an experiment in community-based summary generation. In IUI '07: Proceedings of the 12th international conference on Intelligent user interfaces, pages 42--51, New York, NY, USA, 2007. ACM. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. S. Buttcher, C. Clarke, and G. Cormack. Domain-specific synonym expansion and validation for biomedical information retrieval (multitext experiments for trec 2004). 2004.Google ScholarGoogle Scholar
  4. H. Cui, J. Wen, J. Nie, and W. Ma. Probabilistic query expansion using query logs. In Proceedings of the 11th international conference on World Wide Web, pages 325--332. ACM Press New York, NY, USA, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. H. Daume and E. Brill. Web search intent induction via automatic query reformulation. In Human Language Technology Conference/North American chapter of the Association for Computational Linguistics (HTL/NAACL), 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. L. Denoyer and P. Gallinari. The Wikipedia XML corpus. ACM SIGIR Forum, 40(1):64--69, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. C. Huang, L. Chien, and Y. Oyang. Relevant term suggestion in interactive web search based on contextual information in query session logs. JASTIS, 54(7):638--649, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. T. Joachims. Optimizing search engines using clickthrough data. In KDD '02: Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining, pages 133--142, New York, NY, USA, 2002. ACM. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. T. Joachims, L. Granka, B. Pan, H. Hembrooke, and G. Gay. Accurately interpreting clickthrough data as implicit feedback. In Proceedings of the 28th annual international ACM SIGIR conference, pages 154--161. ACM New York, NY, USA, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. J. Mostert and V. Hollink. Effects of Goal-Oriented Search Suggestions. In Proceedings of the Belgian-Dutch Conference on Artificial Intelligence (BNAIC 2008), 2008.Google ScholarGoogle Scholar
  11. F. Radlinski and T. Joachims. Active exploration for learning rankings from clickthrough data. In Proceedings of the 13th ACM SIGKDD international conference on Knowledge discovery and data mining, pages 570--579. ACM Press New York, NY, USA, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. J. Sun, D. Shen, H. Zeng, Q. Yang, Y. Lu, and Z. Chen. Web-page summarization using clickthrough data. In Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval, pages 194--201. ACM New York, NY, USA, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. T. Tomokiyo and M. Hurst. A Language Model Approach to Keyphrase Extraction. In Proceedings of the ACL Workshop on Multiword Expressions, pages 34--40, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. G. Xue, H. Zeng, Z. Chen, Y. Yu, W. Ma, W. Xi, and W. Fan. Optimizing web search using web click-through data. In Proceedings of the thirteenth ACM international conference on Information and knowledge management, pages 118--126. ACM New York, NY, USA, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Using query logs and click data to create improved document descriptions

        Recommendations

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in
        • Published in

          cover image ACM Conferences
          WSCD '09: Proceedings of the 2009 workshop on Web Search Click Data
          February 2009
          95 pages
          ISBN:9781605584348
          DOI:10.1145/1507509

          Copyright © 2009 ACM

          Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          • Published: 9 February 2009

          Permissions

          Request permissions about this article.

          Request Permissions

          Check for updates

          Qualifiers

          • research-article

          Upcoming Conference

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader