skip to main content
10.1145/1150402.1150493acmconferencesArticle/Chapter ViewAbstractPublication PageskddConference Proceedingsconference-collections
Article

Mining long-term search history to improve search accuracy

Published: 20 August 2006 Publication History

Abstract

Long-term search history contains rich information about a user's search preferences, which can be used as search context to improve retrieval performance. In this paper, we study statistical language modeling based methods to mine contextual information from long-term search history and exploit it for a more accurate estimate of the query language model. Experiments on real web search data show that the algorithms are effective in improving search accuracy for both fresh and recurring queries. The best performance is achieved when using clickthrough data of past searches that are related to the current query.

References

[1]
J. Allan et al. Challenges in information retrieval. In SIGIR Forum, volume 37, 2003.]]
[2]
A. Z. Broder. A taxonomy of web search. SIGIR Forum, 36(2):3--10, 2002.]]
[3]
T. Joachims. Optimizing search engines using clickthrough data. In Proceedings of SIGKDD 2002, pages 133--142, 2002.]]
[4]
D. Kelly and J. Teevan. Implicit feedback for inferring user preference: A bibliography. SIGIR Forum, 37(2):18--28, 2003.]]
[5]
X. Shen, B. Tan, and C. Zhai. Context-sensitive information retrieval using implicit feedback. In Proceedings of SIGIR 2005, pages 43--50, 2005.]]
[6]
X. Shen, B. Tan, and C. Zhai. Implicit user modeling for personalized search. In Proceedings of CIKM 2005, 2005.]]
[7]
X. Shen, B. Tan, and C. Zhai. Ucair toolbar: A personalized search toolbar (demo). In Proceedings of SIGIR 2005, page 681, 2005.]]
[8]
K. Sparck Jones and P. Willett, editors. Readings in Information Retrieval. Morgan Kaufmann Publishers, 1997.]]
[9]
K. Sugiyama, K. Hatano, and M. Yoshikawa. Adaptive web search based on user profile constructed without any effort from users. In Proceedings of WWW 2004, pages 675--684, 2004.]]
[10]
J. Teevan, S. T. Dumais, and E. Horvitz. Personalizing search via automated analysis of interests and activites. In Proceedings of SIGIR 2005, 2005.]]
[11]
C. Zhai and J. Lafferty. Model-based feedback in KL divergence retrieval model. In Proceedings of the CIKM 2001, pages 403--410, 2001.]]
[12]
C. Zhai and J. Lafferty. A study of smoothing methods for language models applied to ad hoc information retrieval. In Proceedings of ACM SIGIR'01, pages 334--342, Sept 2001.]]

Cited By

View all
  • (2024)Google Search in India: Unveiling the Geo-Personalized WebProceedings of the 7th Joint International Conference on Data Science & Management of Data (11th ACM IKDD CODS and 29th COMAD)10.1145/3632410.3632420(403-411)Online publication date: 4-Jan-2024
  • (2023)Personalized and Diversified: Ranking Search Results in an Integrated WayACM Transactions on Information Systems10.1145/363198942:3(1-25)Online publication date: 9-Nov-2023
  • (2023)Patient Clustering via Integrated Profiling of Clinical and Digital DataProceedings of the 32nd ACM International Conference on Information and Knowledge Management10.1145/3583780.3615262(3818-3822)Online publication date: 21-Oct-2023
  • Show More Cited By

Index Terms

  1. Mining long-term search history to improve search accuracy

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    KDD '06: Proceedings of the 12th ACM SIGKDD international conference on Knowledge discovery and data mining
    August 2006
    986 pages
    ISBN:1595933395
    DOI:10.1145/1150402
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 20 August 2006

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. context
    2. query expansion
    3. search history

    Qualifiers

    • Article

    Conference

    KDD06

    Acceptance Rates

    Overall Acceptance Rate 1,133 of 8,635 submissions, 13%

    Upcoming Conference

    KDD '25

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)24
    • Downloads (Last 6 weeks)5
    Reflects downloads up to 02 Mar 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)Google Search in India: Unveiling the Geo-Personalized WebProceedings of the 7th Joint International Conference on Data Science & Management of Data (11th ACM IKDD CODS and 29th COMAD)10.1145/3632410.3632420(403-411)Online publication date: 4-Jan-2024
    • (2023)Personalized and Diversified: Ranking Search Results in an Integrated WayACM Transactions on Information Systems10.1145/363198942:3(1-25)Online publication date: 9-Nov-2023
    • (2023)Patient Clustering via Integrated Profiling of Clinical and Digital DataProceedings of the 32nd ACM International Conference on Information and Knowledge Management10.1145/3583780.3615262(3818-3822)Online publication date: 21-Oct-2023
    • (2023)Incorporating Explicit Subtopics in Personalized SearchProceedings of the ACM Web Conference 202310.1145/3543507.3583488(3364-3374)Online publication date: 30-Apr-2023
    • (2023)WellFactor: Patient Profiling using Integrative Embedding of Healthcare Data2023 IEEE International Conference on Big Data (BigData)10.1109/BigData59044.2023.10386138(616-625)Online publication date: 15-Dec-2023
    • (2023)Private Web Search Using Proxy-Query Based Query Obfuscation SchemeIEEE Access10.1109/ACCESS.2023.323500011(3607-3625)Online publication date: 2023
    • (2022)SPBERTQA: A Two-Stage Question Answering System Based on Sentence Transformers for Medical TextsKnowledge Science, Engineering and Management10.1007/978-3-031-10986-7_30(371-382)Online publication date: 19-Jul-2022
    • (2021)I Know What You Need: Investigating Document Retrieval Effectiveness with Partial Session ContextsACM Transactions on Information Systems10.1145/348866740:3(1-30)Online publication date: 17-Nov-2021
    • (2021)Are Topics Interesting or Not? An LDA-based Topic-graph Probabilistic Model for Web Search PersonalizationACM Transactions on Information Systems10.1145/347610640:3(1-24)Online publication date: 30-Dec-2021
    • (2021)Personalized Semantic Retrieval System based on Statistical Language Model2021 IEEE/ACIS 20th International Fall Conference on Computer and Information Science (ICIS Fall)10.1109/ICISFall51598.2021.9627486(253-257)Online publication date: 13-Oct-2021
    • Show More Cited By

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media