skip to main content
10.1145/1645953.1646110acmconferencesArticle/Chapter ViewAbstractPublication PagescikmConference Proceedingsconference-collections
research-article

Beyond hyperlinks: organizing information footprints in search logs to support effective browsing

Published: 02 November 2009 Publication History

Abstract

While current search engines serve known-item search such as homepage finding very well, they generally cannot support exploratory search effectively. In exploratory search, users do not know their information needs precisely and also often lack the needed knowledge to formulate effective queries, thus querying alone, as supported by the current search engines, is insufficient, and browsing into related information would be very useful. Currently, browsing is mostly done by following hyperlinks embedded on Web pages. In this paper, we propose to leverage search logs to allow a user to browse beyond hyperlinks with a multi-resolution topic map constructed based on search logs. Specifically, we treat search logs as "footprints" left by previous users in the information space and build a multi-resolution topic map to semantically capture and organize them in multiple granularities. Such a topic map can support a user to zoom in, zoom out, and navigate horizontally over the information space, and thus provide flexible and effective browsing capabilities for end users. To test the effectiveness of the proposed methods of supporting browsing, we rely on real search logs and a commercial search engine to implement our proposed methods. Our experimental results show that the proposed topic map is effective to support browsing beyond hyperlinks.

References

[1]
E. Agichtein, E. Brill, and S. T. Dumais. Improving web search ranking by incorporating user behavior information. In SIGIR, pages 19--26, 2006.
[2]
J. A. Aslam, E. Pelekov, and D. Rus. The star clustering algorithm for static and dynamic information organization. Journal of Graph Algorithms and Applicatins, 8(1):95--129, 2004.
[3]
M. J. Bates. The design of browsing and berrypicking techniques for the online search interface. Online Review, 13:407--424, 1989.
[4]
D. Beeferman and A. L. Berger. Agglomerative clustering of a search engine query log. In KDD, pages 407--416, 2000.
[5]
N. J. Belkin. Interaction with texts: Information retrieval as information-seeking behavior. In Information Retrieval, pages 55--66, 1993.
[6]
M. Bilenko and R. W. White. Mining the search trails of surfing crowds: identifying relevant websites from user activity. In WWW, pages 51--60, 2008.
[7]
P. Brusilovsky and R. Rizzo. Map-based horizontal navigation in educational hypertext. In HYPERTEXT, pages 1--10, 2002.
[8]
H. Chen and S. T. Dumais. Bringing order to the web: automatically categorizing search results. In CHI, pages 145--152, 2000.
[9]
H. Chen, A. L. Houston, R. R. Sewell, and B. R. Schatz. Internet browsing and searching: user evaluations of category map and concept space techniques. JASIS, 49(7):582--603, 1998.
[10]
E. Cutrell, D. Robbins, S. Dumais, and R. Sarin. Fast, flexible filtering with phlat. In CHI, pages 261--270, 2006.
[11]
W. Dakka and P. Ipeirotis. Automatic extraction of useful facet terms from text documents. In ICDE, 2008.
[12]
D. Downey, S. Dumais, D. Liebling, and E. Horvitz. Understanding the relationship between searchers' queries and information goals. In CIKM, pages 449--458, 2008.
[13]
S. T. Dumais, E. Cutrell, and H. Chen. Optimizing search by showing results in context. In CHI, pages 277--284, 2001.
[14]
E. Duval and H. Olivié. Towards the integration of a query mechanism and navigation for retrieval of data on multimedia documents. SIGIR Forum, 26(2):8--25, 1992.
[15]
J. L. Elsas, J. Arguello, J. Callan, and J. G. Carbonell. Retrieval and feedback models for blog feed search. In SIGIR, pages 347--354, 2008.
[16]
J. English, M. A. Hearst, R. R. Sinha, K. Swearingen, and K.-P. Yee. Hierarchical faceted metadata in site search interfaces. In CHI Extended Abstracts, pages 628--639, 2002.
[17]
J. Freyne, R. Farzan, P. Brusilovsky, B. Smyth, and M. Coyle. Collecting community wisdom: integrating social search&social navigation. In IUI, pages 52--61, 2007.
[18]
G. W. Furnas. Effective view navigation. In CHI, pages 367--374, 1997.
[19]
J. Han and M. Kamber. Data Mining: Concepts and Techniques, 2nd Ed. Morgan Kaufmann, 2006.
[20]
M. Hearst. Clustering versus faceted categories for information exploration. CACM, 49(4):59--61, 2006.
[21]
M. Hearst and J. Pedersen. Reexamining the cluster hypothesis: Scatter/gather on retrieval results. In SIGIR, pages 76--84, 1996.
[22]
T. Joachims. Evaluating Retrieval Performance Using Clickthrough Data., pages 79--96. Physica/Springer Verlag, 2003. in J. Franke and G. Nakhaeizadeh and I. Renz, "Text Mining".
[23]
R. Jones, B. Rey, O. Madani, and W. Greiner. Generating query substitutions. In WWW, pages 387--396, 2006.
[24]
J. Lin and M. D. Smucker. How do users find things with ?: towards automatic utility evaluation with user simulations. In SIGIR, pages 19--26, 2008.
[25]
m. c. schraefel, D. A. Smith, A. Owens, A. Russell, C. Harris, and M. Wilson. The evolving mspace platform: leveraging the semantic web on the trail of the memex. In HYPERTEXT, pages 174--183, 2005.
[26]
Y. S. Maarek. Organizing documents to support browsing in digital libraries. SIGOIS Bull., 16(2):36--37, 1995.
[27]
J. D. Mackinlay and P. T. Zellweger. Browsing vs. search: can we find a synergy? (panel session). In CHI, pages 179--180, 1995.
[28]
G. Marchionini. Exploratory search: from finding to understanding. CACM, 49(4):41--46, 2006.
[29]
V. L. O'Day and R. Jeffries. Orienteering in an information landscape: how information seekers get from here to there. In INTERCHI, pages 438--445, 1993.
[30]
C. Olston and E. H. Chi. Scenttrails: Integrating browsing and searching on the web. ACM Trans. Comput.-Hum. Interact., 10(3):177--197, 2003.
[31]
S. Pandit and C. Olston. Navigation-aided retrieval. In WWW, pages 391--400, 2007.
[32]
F. Radlinski and T. Joachims. Query chains: learning to rank from implicit feedback. In KDD, pages 239--248, 2005.
[33]
S. E. Robertson and S. Walker. Some simple effective approximations to the 2-poisson model for probabilistic weighted retrieval. In SIGIR, pages 232--241, 1994.
[34]
M. Sahami and T. D. Heilman. A web-based kernel function for measuring the similarity of short text snippets. In WWW, pages 377--386, 2006.
[35]
G. Salton, A. Wong, and C. S. Yang. A vector space model for automatic indexing. Commun. ACM, 18(11):613--620, 1975.
[36]
M. Sanderson and B. Croft. Deriving concept hierarchies from text. In SIGIR, pages 206--213, New York, NY, USA, 1999. ACM.
[37]
D. Shen, M. Qin, W. Chen, Q. Yang, and Z. Chen. Mining web query hierarchies from clickthrough data. In AAAI, pages 341--346, 2007.
[38]
R. Shen, N. S. Vemuri, W. Fan, R. da S. Torres, and E. A. Fox. Exploring digital libraries: integrating browsing, searching, and visualization. In JCDL, pages 1--10, 2006.
[39]
X. Shen, B. Tan, and C. Zhai. Context-sensitive information retrieval using implicit feedback. In SIGIR, pages 43--50, 2005.
[40]
B. Smyth, E. Balfe, J. Freyne, P. Briggs, M. Coyle, and O. Boydell. Exploiting query repetition and regularity in an adaptive community-based web search engine. User Modeling and User-Adapted Interaction, 14(5):383--423, 2005.
[41]
J. Teevan, C. Alvarado, M. S. Ackerman, and D. R. Karger. The perfect search engine is not enough: a study of orienteering behavior in directed search. In CHI, pages 415--422, 2004.
[42]
X. Wang and C. Zhai. Learn from web search logs to organize search results. In SIGIR, pages 87--94, 2007.
[43]
X. Wang and C. Zhai. Mining term association patterns from search logs for effective query reformulation. In CIKM, pages 479--488, 2008.
[44]
X. Wang and C. Zhai. Massive implicit feedback: Organizing search logs into topic maps for collaborative surfing. In ACM SIGIR workshop on Understanding the Users (Demo Description), 2009.
[45]
J.-R. Wen, J.-Y. Nie, and H. Zhang. Clustering user queries of a search engine. In WWW, pages 162--168, 2001.
[46]
A. Wexelblat and P. Maes. Footprints: history-rich tools for information foraging. In CHI, pages 270--277, 1999.
[47]
R. W. White, M. Bilenko, and S. Cucerzan. Studying the use of popular destinations to enhance web search interaction. In SIGIR, pages 159--166, 2007.
[48]
R. W. White and R. A. Roth. Exploratory Search: Beyond the Query-Response Paradigm. Morgan and Claypool, 2009.
[49]
P. Willett. Recent trends in hierarchic document clustering: a critical review. Inf. Process. Manage., 24(5):577--597, 1988.
[50]
K.-P. Yee, K. Swearingen, K. Li, and M. Hearst. Faceted metadata for image search and browsing. In CHI, pages 401--408, 2003.
[51]
X. Yuan and N. J. Belkin. Supporting multiple information-seeking strategies in a single system framework. In SIGIR, pages 247--254, 2007.
[52]
O. Zamir and O. Etzioni. Web document clustering: A feasibility demonstration. In SIGIR, pages 46--54, 1998.
[53]
H.-J. Zeng, Q.-C. He, Z. Chen, W.-Y. Ma, and J. Ma. Learning to cluster web search results. In SIGIR, pages 210--217, 2004.

Cited By

View all
  • (2018)Social SearchSocial Information Access10.1007/978-3-319-90092-6_7(213-276)Online publication date: 3-May-2018
  • (2017)An advent of data mining to prognosticate users' successive questions using association rules2017 International Conference on Energy, Communication, Data Analytics and Soft Computing (ICECDS)10.1109/ICECDS.2017.8389594(1033-1038)Online publication date: Aug-2017
  • (2015)BeomapProceedings of the 15th International Conference on Engineering the Web in the Big Data Era - Volume 911410.1007/978-3-319-19890-3_14(200-218)Online publication date: 23-Jun-2015
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
CIKM '09: Proceedings of the 18th ACM conference on Information and knowledge management
November 2009
2162 pages
ISBN:9781605585123
DOI:10.1145/1645953
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 02 November 2009

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. beyond hyperlinks
  2. effective browsing
  3. information footprints
  4. multi-resolution topic maps

Qualifiers

  • Research-article

Conference

CIKM '09
Sponsor:

Acceptance Rates

Overall Acceptance Rate 1,861 of 8,427 submissions, 22%

Upcoming Conference

CIKM '25

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)4
  • Downloads (Last 6 weeks)0
Reflects downloads up to 02 Mar 2025

Other Metrics

Citations

Cited By

View all
  • (2018)Social SearchSocial Information Access10.1007/978-3-319-90092-6_7(213-276)Online publication date: 3-May-2018
  • (2017)An advent of data mining to prognosticate users' successive questions using association rules2017 International Conference on Energy, Communication, Data Analytics and Soft Computing (ICECDS)10.1109/ICECDS.2017.8389594(1033-1038)Online publication date: Aug-2017
  • (2015)BeomapProceedings of the 15th International Conference on Engineering the Web in the Big Data Era - Volume 911410.1007/978-3-319-19890-3_14(200-218)Online publication date: 23-Jun-2015
  • (2013)Robust feature description and matching using local graph2013 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference10.1109/APSIPA.2013.6694289(1-4)Online publication date: Oct-2013
  • (2013)Clustering user queries into conceptual space2013 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference10.1109/APSIPA.2013.6694197(1-7)Online publication date: Oct-2013
  • (2012)Role-explicit query identification and intent role annotationProceedings of the 21st ACM international conference on Information and knowledge management10.1145/2396761.2398416(1163-1172)Online publication date: 29-Oct-2012
  • (2012)How Does Mobile Context Affect People's Web Search Behavior?Proceedings of the 2012 IEEE 26th International Conference on Advanced Information Networking and Applications10.1109/AINA.2012.134(245-252)Online publication date: 26-Mar-2012
  • (2012)Fast and effective soft linksSoftware: Practice and Experience10.1002/spe.212243:5(577-593)Online publication date: 11-Apr-2012
  • (2011)Predicting Next Search Actions with Search Engine Query LogsProceedings of the 2011 IEEE/WIC/ACM International Conferences on Web Intelligence and Intelligent Agent Technology - Volume 0110.1109/WI-IAT.2011.15(227-234)Online publication date: 22-Aug-2011
  • (2010)Assessing the scenic routeProceedings of the 33rd international ACM SIGIR conference on Research and development in information retrieval10.1145/1835449.1835548(587-594)Online publication date: 19-Jul-2010
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media