A Dwell Time-Based Technique for Personalised Ranking Model

Al-Sharji, Safiya; Beer, Martin; Uruchurtu, Elizabeth

doi:10.1007/978-3-319-22852-5_18

Safiya Al-Sharji¹⁸,
Martin Beer¹⁸ &
Elizabeth Uruchurtu¹⁸

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 9262))

Included in the following conference series:

840 Accesses
1 Citations

Abstract

The aim of a Personalised Ranking Model (PRM) is to filter the top-k set of documents from a number of relevant documents matching the search query. Dwell times of previously clicked results have been shown to be valuable for estimating documents’ relevance. The indexing structure of the dwell time is an important parameter. We propose a dwell time-based scoring scheme called Dwell-tf-idf to index text and non-text data, based on which search results are ranked. The effectiveness of incorporating into the ranking process the proposed Dwell-tf-idf scheme is validated by a controlled experiment which shows a significant improvement in the search results within the top-k rank.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

1.
A commonly-used threshold is a dwell of at least 30 s [10, 13], a manual check of our data set indicated the longest dwell to be less than 15 min - Time range used is thus 30″- 15′.
2.
http://www.kaggle.com/c/yandex-personalised-web-search-challenge
3.
The terms ‘F-Measure’ and ‘F-Score’ are used interchangeably for convenience throughout this paper as in some literature reviews.

References

Kelly, D., Belkin, N.J.: Display time as implicit feedback: understanding task effects. In: Proceedings of the 27th Annual International SIGIR Conference on Research and Development in Information Retrieval, pp. 377–384, ACM (2004)
Google Scholar
Agichtein, E., Brill, E., Dumais, S.: Improving web search ranking by incorporating user behaviour information. In: Proceedings of the 29th Annual International SIGIR Conference on Research and Development in Information Retrieval, pp. 19–26, ACM (2006)
Google Scholar
Collins-Thompson, K., Bennett, P.N., White, R.W., De La Chica, S., Sontag, D.: Personalising web search results by reading level. In: Proceedings of the 20th International Conference on Information and Knowledge Management, pp. 403–412, ACM (2011)
Google Scholar
Hassan, A., White, R.W.: Personalised models of search satisfaction. In: Proceedings of the 22nd International Conference on Information and Knowledge Management, pp. 2009–2018, ACM (2013)
Google Scholar
Al Sharji, S., Beer, M., Uruchurtu, E.: Enhancing the degree of personalisation through vector space model and profile ontology. In: IEEE RIVF International Conference Computing and Communication Technologies, Research, Innovation, and Vision for the Future (RIVF), pp. 248–252, IEEE (2013)
Google Scholar
Xu, S., Jiang, H., Lau, F.: Mining user dwell time for personalised web search re-ranking. In: Proceedings of the 22^nd International Joint Conference on Artificial Intelligence, vol. 3, pp. 2367–2372, AAAI Press (2011)
Google Scholar
Khodaei, A., Shahabi, C., Li, C.: SKIF-P: a point-based indexing and ranking of web documents for spatial-keyword search. GeoInformatica 16(3), 563–596 (2012). GeoInformatica
Article Google Scholar
Zobel, J., Moffat, A.: Inverted files for text search engines. Comput. Surv. (CSUR) 38(2), 6 (2006). ACM
Article Google Scholar
Jiang, D., Pei, J., Li, H.: Mining search and browse logs for web search: a survey. Trans. Intell. Syst. Technol. (TIST) 4(4), 57 (2013). ACM
Google Scholar
Guo, Q., Agichtein, E.: Beyond dwell time: estimating document relevance from cursor movements and other post-click searcher behaviour. In: Proceedings of the 21st International Conference on World Wide Web, pp. 569–578, ACM (2012)
Google Scholar
Silverstein, C., Marais, H., Henzinger, M., Moricz, M.: Analysis of a very large web search engine query log. SIGIR Forum 33(1), 6–12 (1999). ACM
Article Google Scholar
Manning, C.D., Raghavan, P., Schütze, H.: Introduction to Information Retrieval. Cambridge University Press, Cambridge (2008)
Book MATH Google Scholar
Hassan, A., Jones, R., Klinkner, K.L.: Beyond DCG: user behaviour as a predictor of a successful search. In: Proceedings of the 3rd International Conference on Web Search and Data Mining, pp. 221–230, ACM (2010)
Google Scholar
Salton, G.: Automatic text processing: the transformation, analysis, and retrieval of information by computer. Addison-Wesley (1989)
Google Scholar
Hu, Y., Qian, Y., Li, H., Jiang, D., Pei, J., Zheng, Q.: Mining query subtopics from search log data. In: Proceedings of the 35th International SIGIR Conference on Research and Development in Information Retrieval, pp. 305–314, ACM (2012)
Google Scholar
Ageev, M., Guo, Q., Lagun, D., Agichtein, E.: Find it if you can: a game for modeling different types of web search success using interaction data. In: Proceedings of the 34th International SIGIR Conference on Research and Development in Information Retrieval, pp. 345–354, ACM (2011)
Google Scholar
Elsweiler, D., Ruthven, I.: Towards task-based personal information management evaluations. In: Proceedings of the 30th Annual International SIGIR Conference on Research and Development in Information Retrieval, pp. 23–30, ACM (2007)
Google Scholar
Chia-Jung, L., Teevan, J., De La Chica, S.: Characterising multi-click search behavior and the risks and opportunities of changing results during use. In: Proceedings of the 37^th International SIGIR Conference on Research and Development in Information Retrieval, pp. 515–524, ACM (2014)
Google Scholar
Bengio, Y., Grandvalet, Y.: No unbiased estimator of the variance of k-fold cross-validation. J. Mach. Learn. Res. 5, 1089–1105 (2004)
MathSciNet MATH Google Scholar

Download references

Acknowledgement

The authors extend their sincere thanks to the Dean, the Head of ETC and staff at the NCT in Oman for their cooperation and support during the data collection.

Author information

Authors and Affiliations

Communication and Computing Research Institute, Sheffield Hallam University, 153 Arundel Street, Sheffield , S1 2NU, UK
Safiya Al-Sharji, Martin Beer & Elizabeth Uruchurtu

Authors

Safiya Al-Sharji
View author publications
You can also search for this author in PubMed Google Scholar
Martin Beer
View author publications
You can also search for this author in PubMed Google Scholar
Elizabeth Uruchurtu
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Safiya Al-Sharji .

Editor information

Editors and Affiliations

Hewlett-Packard Enterprise, Sunnyvale, California, USA
Qiming Chen
Paul Sabatier University, Toulouse, France
Abdelkader Hameurlain
Blaise Pascal University, Aubiere, France
Farouk Toumani
University of Linz, Linz, Austria
Roland Wagner
Universidad Politécnica de Valencia, Valencia, Spain
Hendrik Decker

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Al-Sharji, S., Beer, M., Uruchurtu, E. (2015). A Dwell Time-Based Technique for Personalised Ranking Model. In: Chen, Q., Hameurlain, A., Toumani, F., Wagner, R., Decker, H. (eds) Database and Expert Systems Applications. Globe DEXA 2015 2015. Lecture Notes in Computer Science(), vol 9262. Springer, Cham. https://doi.org/10.1007/978-3-319-22852-5_18

Download citation

DOI: https://doi.org/10.1007/978-3-319-22852-5_18
Published: 11 August 2015
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-22851-8
Online ISBN: 978-3-319-22852-5
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics