LongEval: Longitudinal Evaluation of Model Performance at CLEF 2023

Alkhalifa, Rabab; Bilal, Iman; Borkakoty, Hsuvas; Camacho-Collados, Jose; Deveaud, Romain; El-Ebshihy, Alaa; Espinosa-Anke, Luis; Gonzalez-Saez, Gabriela; Galuščáková, Petra; Goeuriot, Lorraine; Kochkina, Elena; Liakata, Maria; Loureiro, Daniel; Tayyar Madabushi, Harish; Mulhem, Philippe; Piroi, Florina; Popel, Martin; Servan, Christophe; Zubiaga, Arkaitz

doi:10.1007/978-3-031-28241-6_58

Conference paper
First Online: 16 March 2023

Part of the book series: Lecture Notes in Computer Science (LNCS, volume 13982)

Abstract

In this paper, we describe the plans for the first LongEval CLEF 2023 shared task, dedicated to evaluating the temporal persistence of Information Retrieval (IR) systems and text classifiers. The task is motivated by recent research showing that the performance of these models drops as the test data becomes more temporally distant from the training data. LongEval differs from traditional shared IR and classification tasks by giving special consideration to evaluating models that aim to mitigate performance drop over time. We envisage that this task will draw the attention of the IR community and NLP researchers to the problem of temporal persistence of models, what enables or prevents it, potential solutions, and their limitations.

Notes

1. Qwant search engine: https://www.qwant.com/.
2. https://figshare.com/articles/dataset/TM-Senti/16438281.
3. https://huggingface.co/roberta-base.

Acknowledgements

This work is supported by the ANR Kodicare bilateral project, grant ANR-19-CE23-0029 of the French Agence Nationale de la Recherche, and by the Austrian Science Fund (FWF, grant I4471-N). This work is also supported by a UKRI/EPSRC Turing AI Fellowship to Maria Liakata (grant no. EP/V030302/1) and The Alan Turing Institute (grant no. EP/N510129/1) through project funding and its Enrichment PhD Scheme for Iman Bilal. This work used services provided by the LINDAT/CLARIAH-CZ Research Infrastructure (https://lindat.cz), supported by the Ministry of Education, Youth and Sports of the Czech Republic (Project No. LM2018101).

Author information

Authors and Affiliations

Queen Mary University of London, London, UK
Rabab Alkhalifa, Elena Kochkina, Maria Liakata & Arkaitz Zubiaga
Imam Abdulrahman Bin Faisal University, Dammam, Saudi Arabia
Rabab Alkhalifa
University of Warwick, Coventry, UK
Iman Bilal & Maria Liakata
Cardiff University, Cardiff, UK
Hsuvas Borkakoty, Jose Camacho-Collados, Luis Espinosa-Anke & Daniel Loureiro
Alan Turing Institute, London, UK
Elena Kochkina & Maria Liakata
Qwant, Paris, France
Romain Deveaud & Christophe Servan
Univ. Grenoble Alpes, CNRS, Grenoble INP, Institute of Engineering Univ. Grenoble Alpes, LIG, Grenoble, France
Gabriela Gonzalez-Saez, Petra Galuščáková, Lorraine Goeuriot & Philippe Mulhem
University of Bath, Bath, UK
Harish Tayyar Madabushi
Research Studios Austria, Data Science Studio, Vienna, Austria
Alaa El-Ebshihy & Florina Piroi
Charles University, Prague, Czech Republic
Martin Popel
Paris-Saclay University, CNRS, LISN, Gif-sur-Yvette, France
Christophe Servan
AMPLYFI, Cardiff, UK
Luis Espinosa-Anke

Corresponding author

Correspondence to Philippe Mulhem.

Editor information

Editors and Affiliations

University of Amsterdam, Amsterdam, The Netherlands
Jaap Kamps
Université Grenoble-Alpes, Saint-Martin-d’Hères, France
Lorraine Goeuriot
Università della Svizzera Italiana, Lugano, Switzerland
Fabio Crestani
University of Copenhagen, Copenhagen, Denmark
Maria Maistro
University of Tsukuba, Ibaraki, Japan
Hideo Joho
Dublin City University, Dublin, Ireland
Brian Davis
Dublin City University, Dublin, Ireland
Cathal Gurrin
Universität Regensburg, Regensburg, Germany
Udo Kruschwitz
Dublin City University, Dublin, Ireland
Annalina Caputo

About this paper

Cite this paper

Alkhalifa, R. et al. (2023). LongEval: Longitudinal Evaluation of Model Performance at CLEF 2023. In: Kamps, J., et al. Advances in Information Retrieval. ECIR 2023. Lecture Notes in Computer Science, vol 13982. Springer, Cham. https://doi.org/10.1007/978-3-031-28241-6_58

DOI: https://doi.org/10.1007/978-3-031-28241-6_58
Published: 16 March 2023
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-28240-9
Online ISBN: 978-3-031-28241-6
eBook Packages: Computer Science, Computer Science (R0)
