Overview of the CLEF-2023 LongEval Lab on Longitudinal Evaluation of Model Performance

Alkhalifa, Rabab; Bilal, Iman; Borkakoty, Hsuvas; Camacho-Collados, Jose; Deveaud, Romain; El-Ebshihy, Alaa; Espinosa-Anke, Luis; Gonzalez-Saez, Gabriela; Galuščáková, Petra; Goeuriot, Lorraine; Kochkina, Elena; Liakata, Maria; Loureiro, Daniel; Mulhem, Philippe; Piroi, Florina; Popel, Martin; Servan, Christophe; Tayyar Madabushi, Harish; Zubiaga, Arkaitz

doi:10.1007/978-3-031-42448-9_28

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 14163))

Included in the following conference series:

International Conference of the Cross-Language Evaluation Forum for European Languages

642 Accesses
2 Citations

Abstract

We describe the first edition of the LongEval CLEF 2023 shared task. This lab evaluates the temporal persistence of Information Retrieval (IR) systems and Text Classifiers. Task 1 requires IR systems to run on corpora acquired at several timestamps, and evaluates the drop in system quality (NDCG) along these timestamps. Task 2 tackles binary sentiment classification at different points in time, and evaluates the performance drop for different temporal gaps. Overall, 37 teams registered for Task 1 and 25 for Task 2. Ultimately, 14 and 4 teams participated in Task 1 and Task 2, respectively.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 69.99; Price excludes VAT (USA)

Softcover Book: USD 89.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

References

Alkhalifa, R., et al.: Longeval: longitudinal evaluation of model performance at CLEF 2023. In: Kamps, J., et al. (eds.) ECIR 2023. Lecture Notes in Computer Science, vol. 13982, pp. 499–505. Springer, Cham (2023). https://doi.org/10.1007/978-3-031-28241-6_58
Chapter Google Scholar
Alkhalifa, R., Kochkina, E., Zubiaga, A.: Opinions are made to be changed: temporally adaptive stance classification. In: Proceedings of the 2021 Workshop on Open Challenges in Online Social Networks, pp. 27–32 (2021)
Google Scholar
Alkhalifa, R., Kochkina, E., Zubiaga, A.: Building for tomorrow: Assessing the temporal persistence of text classifiers. arXiv preprint arXiv:2205.05435 (2022)
Alkhalifa, R., Zubiaga, A.: Capturing stance dynamics in social media: open challenges and research directions. Int. J. Digit. Hum. 3, 1–21 (2022)
Google Scholar
Galuščáková, P., et al.: Longeval-retrieval: French-English dynamic test collection for continuous web search evaluation (2023)
Google Scholar
Loureiro, D., Barbieri, F., Neves, L., Espinosa Anke, L., Camacho-collados, J.: TimeLMs: diachronic language models from Twitter. In: Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics: System Demonstrations, pp. 251–260. Association for Computational Linguistics, Dublin, Ireland (2022). https://doi.org/10.18653/v1/2022.acl-demo.25, https://aclanthology.org/2022.acl-demo.25
Urbano, J., Lima, H., Hanjalic, A.: A new perspective on score standardization. In: International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 1061–1064 (2019)
Google Scholar
Yin, W., Alkhalifa, R., Zubiaga, A.: The emojification of sentiment on social media: Collection and analysis of a longitudinal Twitter sentiment dataset. arXiv preprint arXiv:2108.13898 (2021)

Download references

Acknowledgements

This work is supported by the ANR Kodicare bi-lateral project, grant ANR-19-CE23-0029 of the French Agence Nationale de la Recherche, and by the Austrian Science Fund (FWF, grant I4471-N). This work is also supported by a UKRI/EPSRC Turing AI Fellowship to Maria Liakata (grant no. EP/V030302/1) and The Alan Turing Institute (grant no. EP/N510129/1) through project funding and its Enrichment PhD Scheme for Iman Bilal. This work has been using services provided by the LINDAT/CLARIAH-CZ Research Infrastructure (https://lindat.cz), supported by the Ministry of Education, Youth and Sports of the Czech Republic (Project No. LM2018101) and has been also supported by the Ministry of Education, Youth and Sports of the Czech Republic, Project No. LM2018101 LINDAT/CLARIAH-CZ.

Author information

Authors and Affiliations

Queen Mary University of London, London, UK
Rabab Alkhalifa, Elena Kochkina, Maria Liakata & Arkaitz Zubiaga
Imam Abdulrahman Bin Faisal University, Dammam, Saudi Arabia
Rabab Alkhalifa
University of Warwick, Coventry, UK
Iman Bilal & Maria Liakata
Cardiff University, Cardiff, UK
Hsuvas Borkakoty, Jose Camacho-Collados, Luis Espinosa-Anke & Daniel Loureiro
Alan Turing Institute, London, UK
Elena Kochkina & Maria Liakata
Qwant, Paris, France
Romain Deveaud & Christophe Servan
Univ. Grenoble Alpes, CNRS, Grenoble INP (Institute of Engineering Univ. Grenoble Alpes.), LIG, Grenoble, France
Gabriela Gonzalez-Saez, Petra Galuščáková, Lorraine Goeuriot & Philippe Mulhem
University of Bath, Bath, UK
Harish Tayyar Madabushi
Research Studios Austria, Data Science Studio, Vienna, Austria
Alaa El-Ebshihy & Florina Piroi
Charles University, Prague, Czech Republic
Martin Popel
Paris-Saclay University, CNRS, LISN, Orsay, France
Christophe Servan
AMPLYFI, Cardiff, UK
Luis Espinosa-Anke

Authors

Rabab Alkhalifa
View author publications
You can also search for this author in PubMed Google Scholar
Iman Bilal
View author publications
You can also search for this author in PubMed Google Scholar
Hsuvas Borkakoty
View author publications
You can also search for this author in PubMed Google Scholar
Jose Camacho-Collados
View author publications
You can also search for this author in PubMed Google Scholar
Romain Deveaud
View author publications
You can also search for this author in PubMed Google Scholar
Alaa El-Ebshihy
View author publications
You can also search for this author in PubMed Google Scholar
Luis Espinosa-Anke
View author publications
You can also search for this author in PubMed Google Scholar
Gabriela Gonzalez-Saez
View author publications
You can also search for this author in PubMed Google Scholar
Petra Galuščáková
View author publications
You can also search for this author in PubMed Google Scholar
Lorraine Goeuriot
View author publications
You can also search for this author in PubMed Google Scholar
Elena Kochkina
View author publications
You can also search for this author in PubMed Google Scholar
Maria Liakata
View author publications
You can also search for this author in PubMed Google Scholar
Daniel Loureiro
View author publications
You can also search for this author in PubMed Google Scholar
Philippe Mulhem
View author publications
You can also search for this author in PubMed Google Scholar
Florina Piroi
View author publications
You can also search for this author in PubMed Google Scholar
Martin Popel
View author publications
You can also search for this author in PubMed Google Scholar
Christophe Servan
View author publications
You can also search for this author in PubMed Google Scholar
Harish Tayyar Madabushi
View author publications
You can also search for this author in PubMed Google Scholar
Arkaitz Zubiaga
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Philippe Mulhem .

Editor information

Editors and Affiliations

Democritus University of Thrace, Xanthi, Greece
Avi Arampatzis
University of Amsterdam, Amsterdam, The Netherlands
Evangelos Kanoulas
CERTH-ITI, Thessaloniki, Greece
Theodora Tsikrika
CERTH-ITI, Thessaloniki, Greece
Stefanos Vrochidis
Utrecht University, Utrecht, The Netherlands
Anastasia Giachanou
Elsevier, Amsterdam, The Netherlands
Dan Li
University of Amsterdam, Amsterdam, The Netherlands
Mohammad Aliannejadi
University of Lausanne, Lausanne, Switzerland
Michalis Vlachos
University of Padua, Padova, Italy
Guglielmo Faggioli
University of Padua, Padova, Italy
Nicola Ferro

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Alkhalifa, R. et al. (2023). Overview of the CLEF-2023 LongEval Lab on Longitudinal Evaluation of Model Performance. In: Arampatzis, A., et al. Experimental IR Meets Multilinguality, Multimodality, and Interaction. CLEF 2023. Lecture Notes in Computer Science, vol 14163. Springer, Cham. https://doi.org/10.1007/978-3-031-42448-9_28

Download citation

DOI: https://doi.org/10.1007/978-3-031-42448-9_28
Published: 11 September 2023
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-42447-2
Online ISBN: 978-3-031-42448-9
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Overview of the CLEF-2023 LongEval Lab on Longitudinal Evaluation of Model Performance