skip to main content
10.1145/1995966.1995972acmconferencesArticle/Chapter ViewAbstractPublication PageshtConference Proceedingsconference-collections
research-article

Implicit association via crowd-sourced coselection

Authors Info & Claims
Published:06 June 2011Publication History

ABSTRACT

The interaction of vast numbers of search engine users with sets of search results sets is a potential source of significant quantities of resource classification data. In this paper we discuss work which uses coselection data (i.e. multiple click-through events generated by the same user on a single search engine result page) as an indicator of mutual relevance between web resources and a means for the automatic clustering of sense-singular resources. The results indicate that coselection can be used in this way. We ground-truthed unambiguous query clustering, forming a foundation for work on automatic ambiguity detection based on the resulting number of generated clusters. Using the cluster overlap by population principle, the extension of previous work allowed determination of synonyms or lingual translations where overlapping clusters indicated the mutual relevance in coselection and subsequently the irrelevance of the actual label inherited from the user query.

References

  1. E. Agichtein, E. Brill, and S. Dumais. Improving web search ranking by incorporating user behavior information. Proc. SIGIR, Jan 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. H. Ashman, M. Antunovic, C. Donner, R. Frith, E. Rebelos, J.-F. Schmakeit, G. Smith, and M. Truran, Are clickthroughs useful for image labelling?, Proceedings of IEEE/WIC/ACM Web Intelligence 09, Milan. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. H. Ashman, D. Zhou, J. Goulding, T. Brailsford and M. Truran, The Global Perpetual Dictionary of Everything, Proc. Ausweb 2007, http://ausweb.scu.edu.au/aw07/pa-pers/refereed/ashman/paper.html, 2007.Google ScholarGoogle Scholar
  4. D. Beeferman and A. Berger, Agglomerative clustering of a search engine query log, Proc of SIGKDD, 2000, pp 407--416 Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. V. Bush, As we may think, Atlantic Monthly, July 1945.Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. W. S. Chan, W. T. Leung and D. L. Lee, Clustering Search Engine Query Log Containing Noisy Clickthroughs, Proc Int. Symposium on Applications and the Internet, 2004, p. 4.Google ScholarGoogle Scholar
  7. O. Chapelle and Y. Zhang, A Dynamic Bayesian Network Click Model for web Search Ranking, Proc ACM WWW, 2009 Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. P Alexandru Chirita, W. Nejdl, R. Paiu, and C. Kohlschütter. Using odp metadata to personalize search. Proceedings SIGIR 2005, Jan 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. N. Craswell, O. Zoeter, M. Taylor and B. Ramsey, An experimental comparision of click position-bias models, Proc of WSDM, 2008, pp 87--94 Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. S. Cronen-Townsend and W Bruce Croft. Quantifying query ambiguity. Proc. second international conference on Human Language Technology Research, Jan 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. G. Dupret and B. Piwowarski, User browsing model to predict search engine click data from past observations, Proc SIGIR, 2008 Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. L Earl. The resolution of syntactic ambiguity in automatic language processing. Inf. Storage and Retrieval, Jan 1972.Google ScholarGoogle ScholarCross RefCross Ref
  13. L Earl. Use of word government in resolving syntactic and semantic ambiguities. Inf. Storage and Retr., Jan 1973.Google ScholarGoogle ScholarCross RefCross Ref
  14. M. Ester, H. P. Kriegel, J. Sander, and X. Xu, A Density-Based Algorithm for Discovering Clusters in Large Spatial Databases with Noise, in Proc. KDDM 1996, pp. 226--231.Google ScholarGoogle Scholar
  15. D. Fallows, 2008. Search Soars, Challenging Email as a Favorite Internet Activity. http://pewresearch.org/pubs/921/internet-searchGoogle ScholarGoogle Scholar
  16. S. Fox, K. Karnawat, M. Mydland, S. Dumais and T. White, Evaluating implicit measures to improve web search, ACM TOIS, 2005, vol 63, pp147--481 Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. W. Gale, K. Ward Church, and D. Yarowsky. Estimating upper and lower bounds on the performance of word-sense disambiguation programs. Proc ACL '92, 1992. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. Google Image Labeler, http://images.goo-gle.com/imagelabeler/, accessed 19/01/2010Google ScholarGoogle Scholar
  19. F. Guo, C. Liu, A. Kannan, T. Minka, M. Taylor, Y. Wang and C. Faloutsos, Click Chain Model in Web Search, Proc ACM WWW, 2009 Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. T. H. Haveliwala. Topic-sensitive pagerank: A context-sensitive ranking algorithm for web search. IEEE transactions on knowledge and data engineering, Jan 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. N. Ide and J. Vronis. Introduction to the special issue on word sense disambiguation: the state of the art. Computational linguistics, Jan 1998. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. T. Joachims, L. Granka, B. Pan, H. Hembrooke, and G. Gay. Accurately interpreting clickthrough data as implicit feedback, In SIGIR, pages 154--161, Brazil, 2005. ACM. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. A. Kaplan. An experimental study of ambiguity and context. Published 1955 in Mechanical Translation, 2(2):39--46, 1950.Google ScholarGoogle Scholar
  24. H. Lieberman, Letizia: An agent that assists web browsing, IJCAI, 1995, vol 14, pp 924--929 Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. S. Liu, C. Yu, and W. Meng. Word sense disambiguation in queries. Proc. ACM Int. conference on Information and knowledge management, Jan 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. M. Sanderson. Word sense disambiguation and information retrieval. Proc. ACM SIGIR 1994, Jan 1994. Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. M. Sanderson. Word sense disambiguation and information retrieval. PhD Thesis, 1996.Google ScholarGoogle Scholar
  28. M. Sanderson, Ambiguous queries: test collections need more sense, in Proc. ACM SIGIR, ACM, 2008, pp. 499--506. Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. F. Scholer, M. Shokouhi, B. Billerbeck and A. Turpin, Using clicks as implicit judgments: Expectations versus observations. ECIR, 2008, vol 4956, pp 28--39 Google ScholarGoogle ScholarDigital LibraryDigital Library
  30. D. Shen, M. Qin, W. Chen, Q. Yang and Z. Chen, Mining web query hierarchies from clickthrough data, AAAI, 2007, vol 22, pp 341 Google ScholarGoogle ScholarDigital LibraryDigital Library
  31. G. Smith, M. Antunovic, and H. Ashman, Classifying Images with Image and Text Search Clickthrough Data, Proc. Int. Conf. on Active Media Technology, 2009 Google ScholarGoogle ScholarDigital LibraryDigital Library
  32. G. Smith and H. Ashman, Evaluating implicit judgments from Web search interactions, Proc. Web Sci., 2009.Google ScholarGoogle Scholar
  33. G. Smith, T. Brailsford, C. Donner, D. Hooijmaijers, M. Truran, J. Goulding and H. Ashman, Generating unambiguous URL clusters from Web search, Proc. workshop on Web Search Click Data, pp 28--34, 2009, ACM. Google ScholarGoogle ScholarDigital LibraryDigital Library
  34. R. Song, Z. Luo, Y.-Y. Nie, Y. Yu, and H.-W.Hon. Identification of ambiguous queries in web search. Information Processing & Mgmt, Jan 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  35. K. Sparck-Jones, S. E Robertson, and M. Sanderson. Ambiguous requests: implications for retrieval tests, systems and theories. ACM SIGIR forum, Jan 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  36. M. Truran, The Theory and Practice of Co-active Search, doctoral thesis, University of Nottingham, 2005.Google ScholarGoogle Scholar
  37. M. Truran, J. Goulding and H. Ashman, Co-active Intelligence for Image Retrieval, Proc of Multimedia 05, ACM, 2005, pp 547--550. Google ScholarGoogle ScholarDigital LibraryDigital Library
  38. T. Tsikrika, C. Diou, A de Vries and A. Delopoulos, Image annotation using clickth rough data, CIVR 09, ACM, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  39. W. Weaver. Translation (published 1955). Machine Translation of Languages: Fourteen Essays, W. N. Locke and A. D. Booth, Eds.(Technology Press of MIT, Cambridge, MA):15--23, 1949Google ScholarGoogle Scholar
  40. J. Wen, J. Nie and H. Zhang, Clustering user queries of a search engine, Proc of ACM WWW, 2001, pp 162--168 Google ScholarGoogle ScholarDigital LibraryDigital Library
  41. J. R. Wen, J. Y. Nie and H. J. Zhang, Query clustering using user logs, ACM Trans. Inf. Syst., vol. 20, pp. 59--81, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  42. J. R. Wen and H.-J. Zhang. Query Clustering in the Web Context, in Inf. Retrieval and Clustering, Kluwer, 2002Google ScholarGoogle Scholar

Index Terms

  1. Implicit association via crowd-sourced coselection

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in
      • Published in

        cover image ACM Conferences
        HT '11: Proceedings of the 22nd ACM conference on Hypertext and hypermedia
        June 2011
        348 pages
        ISBN:9781450302562
        DOI:10.1145/1995966
        • General Chair:
        • Paul De Bra,
        • Program Chair:
        • Kaj Grønbæk

        Copyright © 2011 ACM

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 6 June 2011

        Permissions

        Request permissions about this article.

        Request Permissions

        Check for updates

        Qualifiers

        • research-article

        Acceptance Rates

        Overall Acceptance Rate378of1,158submissions,33%

        Upcoming Conference

        HT '24
        35th ACM Conference on Hypertext and Social Media
        September 10 - 13, 2024
        Poznan , Poland

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader