Article

Using annotations in enterprise search

Authors: Pavel A. Dmitriev, Marcus Fontoura, Eugene Shekita

WWW '06: Proceedings of the 15th international conference on World Wide Web

Pages 811 - 817

https://doi.org/10.1145/1135777.1135900

Published: 23 May 2006

Abstract

A major difference between corporate intranets and the Internet is that in intranets the barrier for users to create web pages is much higher. This limits the amount and quality of anchor text, one of the major factors used by Internet search engines, and makes intranet search more difficult. The social phenomenon at play also means that spam is relatively rare. Both on the Internet and in intranets, users are often willing to cooperate with the search engine in improving the search experience. These characteristics naturally suggest using user feedback to improve search quality in intranets. In this paper we show how a particular form of feedback, namely user annotations, can be used to improve the quality of intranet search. An annotation is a short description of the contents of a web page, which can be considered a substitute for anchor text. We propose two ways to obtain user annotations, using explicit and implicit feedback, and show how they can be integrated into a search engine. Preliminary experiments on the IBM intranet demonstrate that using annotations improves search quality.

Index Terms

Using annotations in enterprise search
1. Information systems
  1. Information retrieval

Published In

WWW '06: Proceedings of the 15th international conference on World Wide Web

May 2006

1102 pages

ISBN:1595933239

DOI:10.1145/1135777

General Chairs: Leslie Carr (University of Southampton), David De Roure (University of Southampton), Arun Iyengar (IBM Research)

Program Chairs: Carole Goble (University of Manchester, UK), Mike Dahlin (University of Texas at Austin)

Copyright © 2006 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Conference

WWW06: The 15th International World Wide Web Conference 2006

May 23 - 26, 2006

Edinburgh, Scotland

Acceptance Rates

Overall Acceptance Rate 1,899 of 8,196 submissions, 23%
