Skip to main content
Log in

Long short-term search session-based document re-ranking model

  • Regular Paper
  • Published:
Knowledge and Information Systems Aims and scope Submit manuscript

Abstract

Document re-ranking is a core task in session search. However, most existing methods only focus on the short-term session and ignore the long-term history sessions. This leads to inadequate understanding of the user’s search intent, which affects the performance of model re-ranking. At the same time, these methods have weaker capability in understanding user queries. In this paper, we propose a long short-term search session-based re-ranking model (LSSRM). Firstly, we utilize the BERT model to predict the topic relevance between the query and candidate documents, in order to improve the model’s understanding of user queries. Secondly, we initialize the reading vector with topic relevance and use the personalized memory encoder module to model the user’s long-term search intent. Thirdly, we input the user’s current session interaction sequence into Transformer to obtain the vector representation of the user’s short-term search intent. Finally, the user’s search intent and topical relevance information are hierarchically fused to obtain the final document ranking scores. Then re-rank the documents according to this score. We conduct extensive experiments on two real-world session datasets. The experimental results show that our method outperforms the baseline models for the document re-ranking task.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2

Similar content being viewed by others

Availability of data and materials

The datasets we use are all public.

Notes

  1. http://www.thuir.cn/tiangong-qref/.

  2. https://www.kaggle.com/c/yandex-personalized-web-search-challenge.

References

  1. Huang P-S, He X, Gao J, Deng L, Acero A, Heck L (2013) Learning deep structured semantic models for web search using clickthrough data. In: Proceedings of the 22nd ACM international conference on information & knowledge management. CIKM ’13. Association for Computing Machinery, New York, NY, USA, pp 2333–2338. https://doi.org/10.1145/2505515.2505665

  2. Mitra B, Diaz F, Craswell N (2017) Learning to match using local and distributed representations of text for web search. In: Proceedings of the 26th international conference on world wide web. WWW ’17. International World Wide Web Conferences Steering Committee, Republic and Canton of Geneva, CHE, pp 1291–1299. https://doi.org/10.1145/3038912.3052579

  3. Hu B, Lu Z, Li H, Chen Q (2014) Convolutional neural network architectures for matching natural language sentences. Advances in neural information processing systems 27. 28th Conference on Neural Information Processing Systems (NIPS), Montreal, CANADA, DEC 08-13, 2014

  4. Xiong C, Dai Z, Callan J, Liu Z, Power R (2017) End-to-end neural ad-hoc ranking with kernel pooling. In: Proceedings of the 40th international ACM SIGIR conference on research and development in information retrieval. SIGIR ’17. Association for Computing Machinery, New York, NY, USA, pp 55–64. https://doi.org/10.1145/3077136.3080809

  5. Nogueira R, Cho K (2019) Passage re-ranking with bert. arXiv preprint arXiv:1901.04085. Preprint at arXiv:1901.04085

  6. Li M, Popa DN, Chagnon J, Cinar YG, Gaussier E (2023) The power of selecting key blocks with local pre-ranking for long document information retrieval. ACM Trans Inf Syst. https://doi.org/10.1145/3568394

    Article  Google Scholar 

  7. Yilmaz ZA, Yang W, Zhang H, Lin J (2019) Cross-domain modeling of sentence-level evidence for document retrieval. In: Proceedings of the 2019 conference on empirical methods in natural language processing and the 9th international joint conference on natural language processing (EMNLP-IJCNLP), vol. Proceedings of the 2019 conference on empirical methods in natural language processing and the 9th international joint conference on natural language processing (EMNLP-IJCNLP). Association for Computational Linguistics, Hong Kong, China, pp 3490–3496

  8. Baeza-Yates R, Hurtado C, Mendoza M (2005) Query recommendation using query logs in search engines. In: Lindner W, Mesiti M, Türker C, Tzitzikas Y, Vakali AI (eds) Current trends in database technology—EDBT 2004 workshops. Springer, Berlin, Heidelberg, pp 588–596

    MATH  Google Scholar 

  9. Croft WB, Metzler D, Strohman T (2010) Search engines: information retrieval in practice vol. 520. Addison-Wesley Reading

  10. Jansen Bernard J, Spink A, Blakely C, Koshman S (2007) Defining a session on web search engines: research articles. J Am Soc Inf Sci Technol 58(6):862–871

    Article  MATH  Google Scholar 

  11. Jones R, Klinkner KL (2008) Beyond the session timeout: automatic hierarchical segmentation of search topics in query logs. In: Proceedings of the 17th ACM conference on information and knowledge management. CIKM ’08. Association for Computing Machinery, New York, NY, USA, pp 699–708. https://doi.org/10.1145/1458082.1458176

  12. Ahmad WU, Chang K-W, Wang H (2018) Multi-task learning for document ranking and query suggestion. In: International conference on learning representations. https://openreview.net/forum?id=SJ1nzBeA-

  13. Zhu Y, Nie J-Y, Dou Z, Ma Z, Zhang X, Du P, Zuo X, Jiang H (2021) Contrastive learning of user behavior sequence for context-aware document ranking. In: Proceedings of the 30th ACM international conference on information & knowledge management. CIKM ’21. Association for Computing Machinery, New York, NY, USA, pp 2780–2791. https://doi.org/10.1145/3459637.3482243

  14. Bennett PN, White RW, Chu W, Dumais ST, Bailey P, Borisyuk F, Cui X (2012) Modeling the impact of short- and long-term behavior on search personalization. SIGIR ’12. Association for Computing Machinery, New York, NY, USA, pp 185–194. https://doi.org/10.1145/2348283.2348312

  15. Van Gysel C, Kanoulas E, Rijke M (2016) Lexical query modeling in session search. In: Proceedings of the 2016 ACM international conference on the theory of information retrieval. ICTIR ’16. Association for Computing Machinery, New York, NY, USA, pp 69–72. https://doi.org/10.1145/2970398.2970422

  16. Shen X, Tan B, Zhai C (2005) Context-sensitive information retrieval using implicit feedback. In: Proceedings of the 28th annual international ACM SIGIR conference on research and development in information retrieval. SIGIR ’05. Association for Computing Machinery, New York, NY, USA, pp 43–50. https://doi.org/10.1145/1076034.1076045

  17. White RW, Chu W, Hassan A, He X, Song Y, Wang H (2013) Enhancing personalized search by mining and modeling task behavior. In: Proceedings of the 22nd international conference on world wide web. WWW ’13. Association for Computing Machinery, New York, NY, USA, pp 1411–1420. https://doi.org/10.1145/2488388.2488511

  18. Zhou Y, Dou Z, Wen J-R (2020) Enhancing re-finding behavior with external memories for personalized search. In: Proceedings of the 13th international conference on web search and data mining. WSDM ’20. Association for Computing Machinery, New York, NY, USA, pp 789–797. https://doi.org/10.1145/3336191.3371794

  19. Chen H, Dou Z, Zhu Q, Zuo X, Wen J-R (2023) Integrating representation and interaction for context-aware document ranking. ACM Trans Inf Syst. https://doi.org/10.1145/3529955

    Article  MATH  Google Scholar 

  20. Zuo X, Dou Z, Wen J-R (2022) Improving session search by modeling multi-granularity historical query change. In: Proceedings of the fifteenth ACM international conference on web search and data mining. WSDM ’22. Association for Computing Machinery, New York, NY, USA, pp 1534–1542. https://doi.org/10.1145/3488560.3498415

  21. Vaswani A, Shazeer N, Parmar N, Uszkoreit J, Jones L, Gomez AN, Kaiser Ł, Polosukhin I (2017) Attention is all you need. Adv Neural Inf Process Syst 30

  22. Sun F, Liu J, Wu J, Pei C, Lin X, Ou W, Jiang P (2019) Bert4rec: sequential recommendation with bidirectional encoder representations from transformer. In: Proceedings of the 28th ACM international conference on information and knowledge management. CIKM ’19. Association for Computing Machinery, New York, NY, USA, pp 1441–1450. https://doi.org/10.1145/3357384.3357895

  23. Kang W-C, McAuley J (2018) Self-attentive sequential recommendation. In: 2018 IEEE international conference on data mining (ICDM), pp 197–206. https://doi.org/10.1109/ICDM.2018.00035

  24. Bennett PN, White RW, Chu W, Dumais ST, Bailey P, Borisyuk F, Cui X (2012) Modeling the impact of short- and long-term behavior on search personalization. In: Proceedings of the 35th international ACM SIGIR conference on research and development in information retrieval. SIGIR ’12. Association for Computing Machinery, New York, NY, USA, pp 185–194. https://doi.org/10.1145/2348283.2348312

  25. Li L, Yang Z, Wang B, Kitsuregawa M (2007) Dynamic adaptation strategies for long-term and short-term user profile to personalize search. In: Dong G, Lin X, Wang W, Yang Y, Yu JX (eds) Advances in data and web management. Springer, Berlin, Heidelberg, pp 228–240

    Chapter  MATH  Google Scholar 

  26. White RW, Bennett PN, Dumais ST (2010) Predicting short-term interests using activity-based search context. In: Proceedings of the 19th ACM international conference on information and knowledge management. CIKM ’10. Association for Computing Machinery, New York, NY, USA, pp 1009–1018. https://doi.org/10.1145/1871437.1871565

  27. Cheng Q, Ren Z, Lin Y, Ren P, Chen Z, Liu X, Rijke Md (2021) Long short-term session search: joint personalized reranking and next query prediction. In: Proceedings of the web conference 2021. WWW ’21. Association for Computing Machinery, New York, NY, USA, pp 239–248. https://doi.org/10.1145/3442381.3449941

  28. Deng C, Zhou Y, Dou Z (2022) Improving personalized search with dual-feedback network. WSDM ’22. Association for Computing Machinery, New York, NY, USA, pp 210–218. https://doi.org/10.1145/3488560.3498447

  29. Ge S, Dou Z, Jiang Z, Nie J-Y, Wen J-R (2018) Personalizing search results using hierarchical rnn with query-aware attention. In: Proceedings of the 27th ACM international conference on information and knowledge management. CIKM ’18. Association for Computing Machinery, New York, NY, USA, pp 347–356. https://doi.org/10.1145/3269206.3271728

  30. Zhou Y, Dou Z, Wen J-R (2020) Encoding history with context-aware representation learning for personalized search. In: Proceedings of the 43rd international ACM SIGIR conference on research and development in information retrieval. SIGIR ’20. Association for Computing Machinery, New York, NY, USA, pp 1111–1120. https://doi.org/10.1145/3397271.3401175

  31. Ma Z, Dou Z, Bian G, Wen J-R (2020) Pstie: Time information enhanced personalized search. In: Proceedings of the 29th ACM international conference on information & knowledge management. CIKM ’20. Association for Computing Machinery, New York, NY, USA, pp 1075–1084. https://doi.org/10.1145/3340531.3411877

  32. Chen H, Dou Z, Zhu Q, Zuo X, Wen J-R (2023) Integrating representation and interaction for context-aware document ranking. ACM Trans Inf Syst 41(1):1–23

    MATH  Google Scholar 

  33. Chen H, Dou Z, Zhu Y, Cao Z, Cheng X, Wen J-R (2022) Enhancing user behavior sequence modeling by generative tasks for session search. In: Proceedings of the 31st ACM international conference on information & knowledge management, pp 180–190

  34. Chen J, Mao J, Liu Y, Zhang F, Zhang M, Ma S (2021) Towards a better understanding of query reformulation behavior in web search. In: Proceedings of the web conference 2021. WWW ’21. Association for Computing Machinery, New York, NY, USA, pp 743–755. https://doi.org/10.1145/3442381.3450127

  35. Ahmad WU, Chang K-W, Wang H (2019) Context attentive document ranking and query suggestion. In: Proceedings of the 42nd international ACM SIGIR conference on research and development in information retrieval. SIGIR’19. Association for Computing Machinery, New York, NY, USA, pp 385–394. https://doi.org/10.1145/3331184.3331246

Download references

Acknowledgements

We thank Tsinghua University for providing us with the TianGong-QRef dataset.

Funding

“This work was supported by the Key scientific research projects of North Minzu University “Study on grape cluster detection and leaf disease identification based on semi- supervised transfer learning” under Grant 2023ZRLG12; in part by the Ministry of Education Industry-University Cooperation Synergistic Education Program titled “Teaching Content Reform of ‘Intelligent Recommendation System’ Based on Cloud Big Data Platform” under Grant 220802539112039; in part by the Starting Project of Scientific Research in the North Minzu University titled “Research of Information Retrieval Model Based on the Decision Process” under Grant 2020KYQD37.”

Author information

Authors and Affiliations

Authors

Contributions

J.L. and M.W. wrote the main manuscript text, J.W. translated the full text, Y.W. prepared figures, and X.C. prepared tables. All authors reviewed the manuscript.

Corresponding author

Correspondence to Meng Wang.

Ethics declarations

Conflict of interest

The authors declare no conflict of interest.

Ethical approval

Not applicable.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Liu, J., Wang, M., Wang, J. et al. Long short-term search session-based document re-ranking model. Knowl Inf Syst 67, 223–245 (2025). https://doi.org/10.1007/s10115-024-02205-4

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10115-024-02205-4

Keywords