Skip to main content

A Dwell Time-Based Technique for Personalised Ranking Model

  • Conference paper
  • First Online:
Database and Expert Systems Applications (Globe 2015, DEXA 2015)

Abstract

The aim of a Personalised Ranking Model (PRM) is to filter the top-k set of documents from a number of relevant documents matching the search query. Dwell times of previously clicked results have been shown to be valuable for estimating documents’ relevance. The indexing structure of the dwell time is an important parameter. We propose a dwell time-based scoring scheme called Dwell-tf-idf to index text and non-text data, based on which search results are ranked. The effectiveness of incorporating into the ranking process the proposed Dwell-tf-idf scheme is validated by a controlled experiment which shows a significant improvement in the search results within the top-k rank.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    A commonly-used threshold is a dwell of at least 30 s [10, 13], a manual check of our data set indicated the longest dwell to be less than 15 min - Time range used is thus 30″- 15′.

  2. 2.

    http://www.kaggle.com/c/yandex-personalised-web-search-challenge

  3. 3.

    The terms ‘F-Measure’ and ‘F-Score’ are used interchangeably for convenience throughout this paper as in some literature reviews.

References

  1. Kelly, D., Belkin, N.J.: Display time as implicit feedback: understanding task effects. In: Proceedings of the 27th Annual International SIGIR Conference on Research and Development in Information Retrieval, pp. 377–384, ACM (2004)

    Google Scholar 

  2. Agichtein, E., Brill, E., Dumais, S.: Improving web search ranking by incorporating user behaviour information. In: Proceedings of the 29th Annual International SIGIR Conference on Research and Development in Information Retrieval, pp. 19–26, ACM (2006)

    Google Scholar 

  3. Collins-Thompson, K., Bennett, P.N., White, R.W., De La Chica, S., Sontag, D.: Personalising web search results by reading level. In: Proceedings of the 20th International Conference on Information and Knowledge Management, pp. 403–412, ACM (2011)

    Google Scholar 

  4. Hassan, A., White, R.W.: Personalised models of search satisfaction. In: Proceedings of the 22nd International Conference on Information and Knowledge Management, pp. 2009–2018, ACM (2013)

    Google Scholar 

  5. Al Sharji, S., Beer, M., Uruchurtu, E.: Enhancing the degree of personalisation through vector space model and profile ontology. In: IEEE RIVF International Conference Computing and Communication Technologies, Research, Innovation, and Vision for the Future (RIVF), pp. 248–252, IEEE (2013)

    Google Scholar 

  6. Xu, S., Jiang, H., Lau, F.: Mining user dwell time for personalised web search re-ranking. In: Proceedings of the 22nd International Joint Conference on Artificial Intelligence, vol. 3, pp. 2367–2372, AAAI Press (2011)

    Google Scholar 

  7. Khodaei, A., Shahabi, C., Li, C.: SKIF-P: a point-based indexing and ranking of web documents for spatial-keyword search. GeoInformatica 16(3), 563–596 (2012). GeoInformatica

    Article  Google Scholar 

  8. Zobel, J., Moffat, A.: Inverted files for text search engines. Comput. Surv. (CSUR) 38(2), 6 (2006). ACM

    Article  Google Scholar 

  9. Jiang, D., Pei, J., Li, H.: Mining search and browse logs for web search: a survey. Trans. Intell. Syst. Technol. (TIST) 4(4), 57 (2013). ACM

    Google Scholar 

  10. Guo, Q., Agichtein, E.: Beyond dwell time: estimating document relevance from cursor movements and other post-click searcher behaviour. In: Proceedings of the 21st International Conference on World Wide Web, pp. 569–578, ACM (2012)

    Google Scholar 

  11. Silverstein, C., Marais, H., Henzinger, M., Moricz, M.: Analysis of a very large web search engine query log. SIGIR Forum 33(1), 6–12 (1999). ACM

    Article  Google Scholar 

  12. Manning, C.D., Raghavan, P., Schütze, H.: Introduction to Information Retrieval. Cambridge University Press, Cambridge (2008)

    Book  MATH  Google Scholar 

  13. Hassan, A., Jones, R., Klinkner, K.L.: Beyond DCG: user behaviour as a predictor of a successful search. In: Proceedings of the 3rd International Conference on Web Search and Data Mining, pp. 221–230, ACM (2010)

    Google Scholar 

  14. Salton, G.: Automatic text processing: the transformation, analysis, and retrieval of information by computer. Addison-Wesley (1989)

    Google Scholar 

  15. Hu, Y., Qian, Y., Li, H., Jiang, D., Pei, J., Zheng, Q.: Mining query subtopics from search log data. In: Proceedings of the 35th International SIGIR Conference on Research and Development in Information Retrieval, pp. 305–314, ACM (2012)

    Google Scholar 

  16. Ageev, M., Guo, Q., Lagun, D., Agichtein, E.: Find it if you can: a game for modeling different types of web search success using interaction data. In: Proceedings of the 34th International SIGIR Conference on Research and Development in Information Retrieval, pp. 345–354, ACM (2011)

    Google Scholar 

  17. Elsweiler, D., Ruthven, I.: Towards task-based personal information management evaluations. In: Proceedings of the 30th Annual International SIGIR Conference on Research and Development in Information Retrieval, pp. 23–30, ACM (2007)

    Google Scholar 

  18. Chia-Jung, L., Teevan, J., De La Chica, S.: Characterising multi-click search behavior and the risks and opportunities of changing results during use. In: Proceedings of the 37th International SIGIR Conference on Research and Development in Information Retrieval, pp. 515–524, ACM (2014)

    Google Scholar 

  19. Bengio, Y., Grandvalet, Y.: No unbiased estimator of the variance of k-fold cross-validation. J. Mach. Learn. Res. 5, 1089–1105 (2004)

    MathSciNet  MATH  Google Scholar 

Download references

Acknowledgement

The authors extend their sincere thanks to the Dean, the Head of ETC and staff at the NCT in Oman for their cooperation and support during the data collection.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Safiya Al-Sharji .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2015 Springer International Publishing Switzerland

About this paper

Cite this paper

Al-Sharji, S., Beer, M., Uruchurtu, E. (2015). A Dwell Time-Based Technique for Personalised Ranking Model. In: Chen, Q., Hameurlain, A., Toumani, F., Wagner, R., Decker, H. (eds) Database and Expert Systems Applications. Globe DEXA 2015 2015. Lecture Notes in Computer Science(), vol 9262. Springer, Cham. https://doi.org/10.1007/978-3-319-22852-5_18

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-22852-5_18

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-22851-8

  • Online ISBN: 978-3-319-22852-5

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics