skip to main content
10.1145/1135777.1135900acmconferencesArticle/Chapter ViewAbstractPublication PageswwwConference Proceedingsconference-collections
Article

Using annotations in enterprise search

Published:23 May 2006Publication History

ABSTRACT

A major difference between corporate intranets and the Internet is that in intranets the barrier for users to create web pages is much higher. This limits the amount and quality of anchor text, one of the major factors used by Internet search engines, making intranet search more difficult. The social phenomenon at play also means that spam is relatively rare. Both on the Internet and in intranets, users are often willing to cooperate with the search engine in improving the search experience. These characteristics naturally lead to considering using user feedback to improve search quality in intranets. In this paper we show how a particular form of feedback, namely user annotations, can be used to improve the quality of intranet search. An annotation is a short description of the contents of a web page, which can be considered a substitute for anchor text. We propose two ways to obtain user annotations, using explicit and implicit feedback, and show how they can be integrated into a search engine. Preliminary experiments on the IBM intranet demonstrate that using annotations improves the search quality.

References

  1. Anchor text optimization. www.seo-gold.com/tutorial/anchor-text-optimization.htmlGoogle ScholarGoogle Scholar
  2. Google enterprise solutions. http://www.google.com/enterprise/http://www.google.com/enterprise/.Google ScholarGoogle Scholar
  3. IBM OmniFind solution for enterprise search. http://www-306.ibm.com/software/data/integration/db2ii/editions_womnifind.htmlGoogle ScholarGoogle Scholar
  4. NCSA mosaic: Annotations overview. http://archive.ncsa.uiuc.edu/SDG/Software/XMosaic/Annotations/overview.htmlGoogle ScholarGoogle Scholar
  5. Panoptic enterprise search engine. http://www.panopticsearch.comhttp://www.panopticsearch.com.Google ScholarGoogle Scholar
  6. StumbleUpon. http://www.stumbleupon.comhttp://www.stumbleupon.com.Google ScholarGoogle Scholar
  7. Verity enterprise search solution. http://www.verity.com/products/search/enterprise_web_search/index.htmlGoogle ScholarGoogle Scholar
  8. Yahoo! MyWeb 2.0 BETA. http://myweb2.search.yahoo.com/Google ScholarGoogle Scholar
  9. Sergey Brin and Lawrence Page. The anatomy of a large-scale hypertextual web search engine. In Proc. Proc. 7th World Wide Web Conference, Brisbane, Australia, 1998, pages 107--117, 1998. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Vannevar Bush. As we may think. In The Atlantic Monthly, July 1945.Google ScholarGoogle Scholar
  11. Junghoo Cho and Sourashis Roy. Impact of search engines on page popularity. In Proc. 13th World Wide Web Conference, pages 20--29, May 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. Thomas H. Cormen, Charles E. Leiserson, Ronald L. Rivest, and Clifford Stein. Introduction to Algorithms. The MIT Press, Cambridge, MA, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Laurent Denoue and Laurence Vignollet. New ways of using web annotations. In Proc. 9th World Wide Web Conference, Amsterdam, 2000.Google ScholarGoogle Scholar
  14. Nadav Eiron and Kevin S. McCurley. Analysis of anchor text for web search. In Proc. 26th ACM Conference on Research and Development in Information Retrieval, pages 459--460, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. Ronald Fagin, Ravi Kumar, Kevin S. McCurley, Jasmine Novak, D. Sivakumar, John A. Tomlin, and David P. Williamson. Searching the workplace web. In Proc. 12th World Wide Web Conference, Budapest, Hungary, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. Susan Feldman and Chris Sherman. The high cost of not finding information. In IDC Technical Report 29127, 2003.Google ScholarGoogle Scholar
  17. Marcus Fontoura, Eugene J. Shekita, Jason Y. Zien, Sridhar Rajagopalan, and Andreas Neumann. High performance index build algorithms for intranet search engines. In VLDB, pages 1158--1169, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. David Hawking. Challenges in enterprise search. In Fifteenth Australian Database Conference, Dunedin, NZ, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. Thorsten Joachims. Optimizing search engines using clickthrough data. In Proc. 8th ACM Conference on Knowledge Discovery and Data Mining, Alberta, Canada, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. Thorsten Joachims, Dayne Freitag, and Tom Mitchell. Webwatcher: A tour guide for the world wide web. In Proc. International Joint Conference on Artificial Intelligence, Nagoya, Japan, 1997.Google ScholarGoogle Scholar
  21. Thorsten Joachims, Laura Granka, Bing Pang, Helene Hembrooke, and Geri Gay. Accurately interpreting clickthrough data as implicit feedback. In Proc. 28th ACM Conference on Research and Development in Information Retrieval, Salvador, Brazil, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. Charles Kemp and Kotagiri Ramamohanarao. Long-time learning for web search engines. In Proc. 6th European Conference on Principles and Practice of Knowledge Discovery in Databases, Helsinki, Finland, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. Hannes Marais and Krishna Bharat. Supporting cooperative and personal surfing with a desktop assistant. In 10th annual ACM symposium on User Interface Software and Technology, Banff, Alberta, Canada, 1997. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. Lawrence Page, Sergey Brin, Rajeev Motwani, and Terry Winograd. The PageRank citation ranking: Bringing order to the web. Technical report, Stanford Digital Library Technologies Project, 1998. Paper SIDL-WP-1999-0120 (version of 11/11/1999).Google ScholarGoogle Scholar
  25. Filip Radlinski and Thorsten Joachims. Query chains: Learning to rank from implicit feedback. In Proc. 11th ACM Conference on Knowledge Discovery and Data Mining, Chicago, IL, USA, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. Robert Sedgewick. Algorithms in C++. Addison-Wesley Publishing Company, Boston, MA, 1998. Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. Venu Vasudevan and Mark Palmer. On web annotations: Promises and pitfalls of current web infrastructure. In 32nd Hawaii International Conference on Systems Sciences, Maui, Hawaii, 1999. Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. Vishwa Vinay, Ken Wood, Natasa Milic-Frayling, and Ingemar J. Cox. Comparing relevance feedback algorithms for web search. In Proc. 14th World Wide Web Conference, Chiba, Japan, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. I. Witten, A. Moffat, and T. Bell. Managing Gigabytes. Morgan Kaufmann, 1999.Google ScholarGoogle Scholar

Index Terms

  1. Using annotations in enterprise search

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Conferences
      WWW '06: Proceedings of the 15th international conference on World Wide Web
      May 2006
      1102 pages
      ISBN:1595933239
      DOI:10.1145/1135777

      Copyright © 2006 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 23 May 2006

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • Article

      Acceptance Rates

      Overall Acceptance Rate1,899of8,196submissions,23%

      Upcoming Conference

      WWW '24
      The ACM Web Conference 2024
      May 13 - 17, 2024
      Singapore , Singapore

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader