Overview of PAN 2022: Authorship Verification, Profiling Irony and Stereotype Spreaders, Style Change Detection, and Trigger Detection

Bevendorff, Janek; Chulvi, Berta; Fersini, Elisabetta; Heini, Annina; Kestemont, Mike; Kredens, Krzysztof; Mayerl, Maximilian; Ortega-Bueno, Reyner; Pęzik, Piotr; Potthast, Martin; Rangel, Francisco; Rosso, Paolo; Stamatatos, Efstathios; Stein, Benno; Wiegmann, Matti; Wolska, Magdalena; Zangerle, Eva

doi:10.1007/978-3-030-99739-7_42

Overview of PAN 2022: Authorship Verification, Profiling Irony and Stereotype Spreaders, Style Change Detection, and Trigger Detection

Extended Abstract

Janek Bevendorff¹⁵,
Berta Chulvi¹⁶,
Elisabetta Fersini¹⁷,
Annina Heini¹⁸,
Mike Kestemont¹⁹,
Krzysztof Kredens¹⁸,
Maximilian Mayerl²⁰,
Reyner Ortega-Bueno¹⁶,
Piotr Pęzik¹⁸,
Martin Potthast²¹,
Francisco Rangel²²,
Paolo Rosso¹⁶,
Efstathios Stamatatos²³,
Benno Stein¹⁵,
Matti Wiegmann¹⁵,
Magdalena Wolska¹⁵ &
…
Eva Zangerle²⁰

Conference paper
First Online: 05 April 2022

2574 Accesses
3 Citations

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 13186))

Abstract

The paper gives a brief overview of the four shared tasks to be organized at the PAN 2022 lab on digital text forensics and stylometry hosted at the CLEF 2022 conference. The tasks include authorship verification across discourse types, multi-author writing style analysis, author profiling, and content profiling. Some of the tasks continue and advance past editions (authorship verification and multi-author analysis) and some are new (profiling irony and stereotypes spreaders and trigger detection). The general goal of the PAN shared tasks is to advance the state of the art in text forensics and stylometry while ensuring objective evaluation on newly developed benchmark datasets.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 89.00; Price excludes VAT (USA)

Softcover Book: USD 119.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Notes

1.
https://pan.webis.de/data.html.
2.
To generate the datasets, we have followed a methodology that complies with the EU General Data Protection Regulation [10].

References

Anzovino, M., Fersini, E., Rosso, P.: Automatic identification and classification of misogynistic language on twitter. In: Silberztein, M., Atigui, F., Kornyshova, E., Métais, E., Meziane, F. (eds.) NLDB 2018. LNCS, vol. 10859, pp. 57–64. Springer, Cham (2018). https://doi.org/10.1007/978-3-319-91947-8_6
Chapter Google Scholar
Bevendorff, J., et al.: Overview of PAN 2021: authorship verification, profiling hate speech spreaders on twitter, and style change detection. In: Candan, K.S., et al. (eds.) CLEF 2021. LNCS, vol. 12880, pp. 419–431. Springer, Cham (2021). https://doi.org/10.1007/978-3-030-85251-1_26
Chapter Google Scholar
Bevendorff, J., et al.: Overview of PAN 2020: authorship verification, celebrity profiling, profiling fake news spreaders on twitter, and style change detection. In: Arampatzis, A., et al. (eds.) CLEF 2020. LNCS, vol. 12260, pp. 372–383. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-58219-7_25
Chapter Google Scholar
Frenda, S., Cignarella, A., Basile, V., Bosco, C., Patti, V., Rosso, P.: The unbearable hurtfulness of sarcasm. Expert Syst. Appl. (2022). https://doi.org/10.1016/j.eswa.2021.116398
Kestemont, M., et al.: Overview of the author identification task at PAN 2018: cross-domain authorship attribution and style change detection. In: CLEF 2018 Labs and Workshops, Notebook Papers (2018)
Google Scholar
Koppel, M., Winter, Y.: Determining if two documents are written by the same author. J. Assoc. Inf. Sci. Technol. 65(1), 178–187 (2014)
Article Google Scholar
Potthast, M., Gollub, T., Wiegmann, M., Stein, B.: TIRA integrated research architecture. In: Ferro, N., Peters, C. (eds.) Information Retrieval Evaluation in a Changing World. TIRS, vol. 41, pp. 123–160. Springer, Cham (2019). https://doi.org/10.1007/978-3-030-22948-1_5
Chapter Google Scholar
Rangel, F., De-La-Peña-Sarracén, G.L., Chulvi, B., Fersini, E., Rosso, P.: Profiling hate speech spreaders on twitter task at PAN 2021. In: Faggioli, G., Ferro, N., Joly, A., Maistro, M., Piroi, F. (eds.) CLEF 2021 Labs and Workshops, Notebook Papers, CEUR-WS.org (2021)
Google Scholar
Rangel, F., Giachanou, A., Ghanem, B., Rosso, P.: Overview of the 8th author profiling task at PAN 2019: profiling fake news spreaders on twitter. In: CLEF 2020 Labs and Workshops, Notebook Papers. CEUR Workshop Proceedings (2020)
Google Scholar
Rangel, F., Rosso, P.: On the implications of the general data protection regulation on the organisation of evaluation tasks. Lang. Law/Linguagem e Direito 5(2), 95–117 (2019)
Google Scholar
Rangel, F., Rosso, P.: Overview of the 7th author profiling task at pan 2019: bots and gender profiling. In: CLEF 2019 Labs and Workshops, Notebook Papers (2019)
Google Scholar
Rangel, F., et al.: Overview of the 2nd author profiling task at PAN 2014. In: CLEF 2014 Labs and Workshops, Notebook Papers (2014)
Google Scholar
Rangel, F., Rosso, P., Montes-y-Gómez, M., Potthast, M., Stein, B.: Overview of the 6th author profiling task at PAN 2018: multimodal gender identification in twitter. In: CLEF 2019 Labs and Workshops, Notebook Papers (2018)
Google Scholar
Rangel, F., Rosso, P., Moshe Koppel, M., Stamatatos, E., Inches, G.: Overview of the author profiling task at PAN 2013. In: CLEF 2013 Labs and Workshops, Notebook Papers (2013)
Google Scholar
Rangel, F., Rosso, P., Potthast, M., Stein, B.: Overview of the 5th author profiling task at PAN 2017: gender and language variety identification in twitter. In: Working Notes Papers of the CLEF (2017)
Google Scholar
Rangel, F., Rosso, P., Potthast, M., Stein, B., Daelemans, W.: Overview of the 3rd author profiling task at PAN 2015. In: CLEF 2015 Labs and Workshops, Notebook Papers (2015)
Google Scholar
Rangel, F., Rosso, P., Verhoeven, B., Daelemans, W., Potthast, M., Stein, B.: Overview of the 4th author profiling task at PAN 2016: cross-genre evaluations. In: CLEF 2016 Labs and Workshops, Notebook Papers (2016). ISSN 1613–0073
Google Scholar
Reyes, A., Rosso, P.: On the difficulty of automatically detecting irony: beyond a simple case of negation. Knowl. Inf. Syst. 40(3), 595–614 (2014)
Article Google Scholar
Rodríguez-Sánchez, F., et al.: Overview of exist 2021: sexism identification in social networks. In: Procesamiento del Lenguaje Natural (SEPLN), no. 67, pp. 195–207 (2021)
Google Scholar
Rosso, P., Rangel, F., Potthast, M., Stamatatos, E., Tschuggnall, M., Stein, B.: Overview of PAN 2016–new challenges for authorship analysis: cross-genre profiling, clustering, diarization, and obfuscation. In: 7th International Conference of the CLEF Initiative on Experimental IR Meets Multilinguality, Multimodality, and Interaction (CLEF 2016) (2016)
Google Scholar
Stamatatos, E., Potthast, M., Rangel, F., Rosso, P., Stein, B.: Overview of the PAN/CLEF 2015 evaluation lab. In: Mothe, J., et al. (eds.) CLEF 2015. LNCS, vol. 9283, pp. 518–538. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-24027-5_49
Chapter Google Scholar
Sánchez-Junquera, J., Chulvi, B., Rosso, P., Ponzetto, S.: How do you speak about immigrants? taxonomy and stereoimmigrants dataset for identifying stereotypes about immigrants. Appl. Sci. 11(8), 3610 (2021)
Google Scholar
Tschuggnall, M., et al.: Overview of the author identification task at PAN 2017: style breach detection and author clustering. In: CLEF 2017 Labs and Workshops, Notebook Papers (2017)
Google Scholar
Zangerle, E., Mayerl, M., Potthast, M., Stein, B.: Overview of the style change detection task at PAN 2021. In: Faggioli, G., Ferro, N., Joly, A., Maistro, M., Piroi, F. (eds.) CLEF 2021 Labs and Workshops, Notebook Papers, CEUR-WS.org (2021)
Google Scholar
Zangerle, E., Mayerl, M., Specht, G., Potthast, M., Stein, B.: Overview of the style change detection task at PAN 2020. In: CLEF 2020 Labs and Workshops, Notebook Papers (2020)
Google Scholar
Zangerle, E., Tschuggnall, M., Specht, G., Stein, B., Potthast, M.: Overview of the style change detection task at PAN 2019. In: CLEF 2019 Labs and Workshops, Notebook Papers (2019)
Google Scholar

Download references

Acknowledgments

The contributions from Bauhaus-Universität Weimar and Leipzig University have been partially funded by the German Ministry for Science and Education (BMBF) project “Shared Tasks as an innovative approach to implement AI and Big Data-based applications within universities (SharKI)” (grant FKZ 16DHB4021). The Cross-DT corpus was developed at the Aston Institute for Forensic Linguistics with funding from Research England’s Expanding Excellence in England (E3) Fund. The work of the researchers from the Universitat Politècnica de València was partially funded by the Spanish MICINN under the project MISMIS-FAKEnHATE on MISinformation and MIScommunication in social media: FAKE news and HATE speech (PGC2018-096212-B-C31), and by the Generalitat Valenciana under the project DeepPattern (PROMETEO/2019/121). The work of Francisco Rangel has been partially funded by the Centre for the Development of Industrial Technology (CDTI) of the Spanish Ministry of Science and Innovation under the research project IDI-20210776 on Proactive Profiling of Hate Speech Spreaders - PROHATER (Perfilador Proactivo de Difusores de Mensajes de Odio).

Author information

Authors and Affiliations

Bauhaus-Universität Weimar, Weimar, Germany
Janek Bevendorff, Benno Stein, Matti Wiegmann & Magdalena Wolska
Universitat Politècnica de València, Valencia, Spain
Berta Chulvi, Reyner Ortega-Bueno & Paolo Rosso
Universitty Milano-Bicocca, Milan, Italy
Elisabetta Fersini
Aston University, Birmingham, UK
Annina Heini, Krzysztof Kredens & Piotr Pęzik
University of Antwerp, Antwerp, Belgium
Mike Kestemont
University of Innsbruck, Innsbruck, Austria
Maximilian Mayerl & Eva Zangerle
Leipzig University, Leipzig, Germany
Martin Potthast
Symanto Research, Nuremberg, Germany
Francisco Rangel
University of the Aegean, Mytilene, Greece
Efstathios Stamatatos

Authors

Janek Bevendorff
View author publications
You can also search for this author in PubMed Google Scholar
Berta Chulvi
View author publications
You can also search for this author in PubMed Google Scholar
Elisabetta Fersini
View author publications
You can also search for this author in PubMed Google Scholar
Annina Heini
View author publications
You can also search for this author in PubMed Google Scholar
Mike Kestemont
View author publications
You can also search for this author in PubMed Google Scholar
Krzysztof Kredens
View author publications
You can also search for this author in PubMed Google Scholar
Maximilian Mayerl
View author publications
You can also search for this author in PubMed Google Scholar
Reyner Ortega-Bueno
View author publications
You can also search for this author in PubMed Google Scholar
Piotr Pęzik
View author publications
You can also search for this author in PubMed Google Scholar
Martin Potthast
View author publications
You can also search for this author in PubMed Google Scholar
Francisco Rangel
View author publications
You can also search for this author in PubMed Google Scholar
Paolo Rosso
View author publications
You can also search for this author in PubMed Google Scholar
Efstathios Stamatatos
View author publications
You can also search for this author in PubMed Google Scholar
Benno Stein
View author publications
You can also search for this author in PubMed Google Scholar
Matti Wiegmann
View author publications
You can also search for this author in PubMed Google Scholar
Magdalena Wolska
View author publications
You can also search for this author in PubMed Google Scholar
Eva Zangerle
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Magdalena Wolska .

Editor information

Editors and Affiliations

Martin Luther University Halle-Wittenberg, Halle, Germany
Matthias Hagen
Leiden University, Leiden, The Netherlands
Suzan Verberne
University of Glasgow, Glasgow, UK
Craig Macdonald
University of Duisburg-Essen, Essen, Germany
Christin Seifert
University of Stavanger, Stavanger, Norway
Krisztian Balog
Norwegian University of Science and Technology, Trondheim, Norway
Kjetil Nørvåg
University of Stavanger, Stavanger, Norway
Vinay Setty

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Bevendorff, J. et al. (2022). Overview of PAN 2022: Authorship Verification, Profiling Irony and Stereotype Spreaders, Style Change Detection, and Trigger Detection. In: Hagen, M., et al. Advances in Information Retrieval. ECIR 2022. Lecture Notes in Computer Science, vol 13186. Springer, Cham. https://doi.org/10.1007/978-3-030-99739-7_42

Download citation

DOI: https://doi.org/10.1007/978-3-030-99739-7_42
Published: 05 April 2022
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-99738-0
Online ISBN: 978-3-030-99739-7
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics