Exploiting Query Repetition and Regularity in an Adaptive Community-Based Web Search Engine

User Modeling and User-Adapted Interaction

Abstract

Search engines continue to struggle with the challenges presented by Web search: vague queries, impatient users and an enormous and rapidly expanding collection of unmoderated, heterogeneous documents all make for an extremely hostile search environment. In this paper we argue that conventional approaches to Web search (those that adopt a traditional, document-centric, information retrieval perspective) are limited by their refusal to consider the past search behaviour of users during future search sessions. In particular, we argue that in many circumstances the search behaviour of users is repetitive and regular; the same sort of queries tend to recur and the same type of results are often selected. We describe how this observation can lead to a novel approach to a more adaptive form of search, one that leverages past search behaviours as a means to re-rank future search results in a way that recognises the implicit preferences of communities of searchers. We describe and evaluate the I-SPY search engine, which implements this approach to collaborative, community-based search. We show that it offers potential improvements in search performance, especially in certain situations where communities of searchers share similar information needs and use similar queries to express these needs. We also show that I-SPY benefits from important advantages when it comes to user privacy. In short, we argue that I-SPY strikes a useful balance between search personalization and user privacy, by offering a unique form of anonymous personalization, and in doing so may very well provide privacy-conscious Web users with an acceptable approach to personalized search.

Author information

Corresponding author

Correspondence to Barry Smyth.

About this article

Cite this article

Smyth, B., Balfe, E., Freyne, J. et al. Exploiting Query Repetition and Regularity in an Adaptive Community-Based Web Search Engine. User Model User-Adap Inter 14, 383–423 (2004). https://doi.org/10.1007/s11257-004-5270-4

  • DOI: https://doi.org/10.1007/s11257-004-5270-4
