Abstract
Easy-to-Read (E2R) is a method of enhancing the accessibility of written text through clear, direct, and simple language. E2R texts are designed to improve readability and accessibility, especially for individuals with cognitive disabilities. However, there is a significant lack of standardized evaluation methods for these texts. Traditional Automatic Text Simplification (ATS) evaluation metrics such as BLEU, SARI, or ROUGE present several limitations for E2R evaluation. Readability measures such as Flesch Reading Ease (FRE) and Flesch-Kincaid Grade Level (FKGL) rely on surface features such as word and sentence length and therefore do not take all document factors into account. Manual evaluation methods, such as Likert scales, are resource-intensive and lead to subjective assessments. This paper proposes a threefold evaluation method for E2R texts. The first step is an automatic evaluation that measures quantitative aspects of text complexity. The second step is a checklist-based manual evaluation that covers qualitative aspects. The third step is a user evaluation, focusing on the needs of end users and the understandability of the texts. Our methodology ensures a thorough assessment of E2R texts even when user evaluations are not feasible. This approach aims to bring standardization and reliability to the evaluation process of E2R texts.
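To make the limitation of surface-based readability measures concrete, here is a minimal sketch of FRE and FKGL in Python. The formulas are the standard published ones; the tokenizer and the vowel-group syllable heuristic are simplifying assumptions of this sketch, not the paper's tooling.

```python
import re

def counts(text):
    """Crude surface counts: words, sentences, syllables.
    Real readability tools use more careful syllable estimation."""
    sentences = max(1, len(re.findall(r"[.!?]+", text)))
    words = re.findall(r"[A-Za-z']+", text)
    def syllables(word):
        # rough heuristic: one syllable per vowel group, minimum one
        return max(1, len(re.findall(r"[aeiouy]+", word.lower())))
    return len(words), sentences, sum(syllables(w) for w in words)

def flesch_reading_ease(text):
    w, s, syl = counts(text)
    return 206.835 - 1.015 * (w / s) - 84.6 * (syl / w)

def flesch_kincaid_grade(text):
    w, s, syl = counts(text)
    return 0.39 * (w / s) + 11.8 * (syl / w) - 15.59
```

Because both formulas depend only on word, sentence, and syllable counts, a sentence and a scrambled version of it with the same words receive identical scores, which is precisely why such measures cannot capture all factors that make a document easy to read.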
Notes
3. Leichte Sprache is the German version of E2R.
5. TTR calculated with respect to the lemmata in the first 200 tokens of a document.
6. The value corresponds to the ratio of content words (nouns, proper nouns, verbs, adjectives, adverbs) to the total number of words in a document.
References
Alva-Manchego, F., Scarton, C., Specia, L.: Data-driven sentence simplification: survey and benchmark. Comput. Linguist. 46(1), 135–187 (2020)
Alva-Manchego, F., Scarton, C., Specia, L.: The (un)suitability of automatic evaluation metrics for text simplification. Comput. Linguist. 47(4), 861–889 (2021)
Amstad, T.: Wie verständlich sind unsere Zeitungen? Studenten-Schreib-Service (1978)
Bengoetxea, K., Gonzalez-Dios, I.: Multiaztertest: a multilingual analyzer on multiple levels of language for readability assessment. arXiv preprint arXiv:2109.04870 (2021)
Brunato, D., Cimino, A., Dell’Orletta, F., Venturi, G., Montemagni, S.: Profiling-UD: a tool for linguistic profiling of texts. In: Proceedings of the Twelfth Language Resources and Evaluation Conference, pp. 7145–7151 (2020)
Cumbicus-Pineda, O.M., Gonzalez-Dios, I., Soroa, A.: Linguistic capabilities for a checklist-based evaluation in automatic text simplification. In: CTTS@SEPLN (2021)
Fernández Huerta, J.: Medidas sencillas de lecturabilidad. Consigna 214, 29–32 (1959)
Grabar, N., Saggion, H.: Evaluation of automatic text simplification: where are we now, where should we go from here. In: Traitement Automatique des Langues Naturelles, pp. 453–463. ATALA (2022)
Hansen-Schirra, S., Maaß, C.: Easy language, plain language, easy language plus: perspectives on comprehensibility and stigmatisation. Easy Lang. Res. Text User Perspectives 2, 17 (2020)
Jindal, P., MacDermid, J.C.: Assessing reading levels of health information: uses and limitations of Flesch formula. Educ. Health 30(1), 84–88 (2017)
Lin, C.Y.: Rouge: a package for automatic evaluation of summaries. In: Text Summarization Branches Out, pp. 74–81 (2004)
Lin, C.Y., Hovy, E.: Automatic evaluation of summaries using n-gram co-occurrence statistics. In: Proceedings of the 2003 Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics, pp. 150–157 (2003)
Madina, M., Gonzalez-Dios, I., Siegel, M.: A preliminary study of ChatGPT for Spanish E2R text adaptation. In: LREC-Coling 2024 [forthcoming] (2024)
Morato, J., Campillo, A., Sanchez-Cuadrado, S., Iglesias, A., Berrios, O.: An accessible evaluation tool to detect easy-to-read barriers. In: Proceedings of the 9th International Conference on Software Development and Technologies for Enhancing Accessibility and Fighting Info-exclusion, pp. 55–60 (2020)
Papineni, K., Roukos, S., Ward, T., Zhu, W.J.: BLEU: a method for automatic evaluation of machine translation. In: Proceedings of the 40th Annual Meeting of the Association for Computational Linguistics, pp. 311–318 (2002)
Reichrath, E., Moonen, X.: Assessing the effects of Language for all. Nord. J. Linguist. 45(2), 232–248 (2022)
Suárez-Figueroa, M.C., Ruckhaus, E., López-Guerrero, J., Cano, I., Cervera, Á.: Towards the assessment of easy-to-read guidelines using artificial intelligence techniques. In: Miesenberger, K., Manduchi, R., Covarrubias Rodriguez, M., Peňáz, P. (eds.) ICCHP 2020. LNCS, vol. 12376, pp. 74–82. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-58796-3_10
Sulem, E., Abend, O., Rappoport, A.: BLEU is not suitable for the evaluation of text simplification. In: Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, pp. 738–744. Association for Computational Linguistics, Brussels, Belgium, October–November 2018. https://doi.org/10.18653/v1/D18-1081, https://aclanthology.org/D18-1081
Toborek, V., Busch, M., Boßert, M., Bauckhage, C., Welke, P.: A new aligned simple German corpus. In: Rogers, A., Boyd-Graber, J., Okazaki, N. (eds.) Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pp. 11393–11412. Association for Computational Linguistics, Toronto, Canada, July 2023. https://doi.org/10.18653/v1/2023.acl-long.638, https://aclanthology.org/2023.acl-long.638
UNE: UNE 153101:2018 EX Lectura fácil. Pautas y Recomendaciones para la Elaboración de Documentos. Madrid: Asociación Española de Normalización (2018). https://www.une.org/encuentra-tu-norma/busca-tu-norma/norma?c=N0060036
Xu, W., Napoles, C., Pavlick, E., Chen, Q., Callison-Burch, C.: Optimizing statistical machine translation for text simplification. Trans. Assoc. Comput. Linguist. 4, 401–415 (2016)
Acknowledgements
We would like to thank the evaluators who performed the manual evaluation. This work has been partially supported by the following projects: i) Ixa group, A-type research group (IT-1805-22), funded by the Basque Government; ii) DeepKnowledge (PID2021-127777OB-C21), funded by MCIN/AEI/10.13039/501100011033 and by FEDER; and iii) AWARE: Commonsense for a new generation of natural language understanding applications (TED2021-131617B-I00), funded by MCIN/AEI/10.13039/501100011033 and by the European Union NextGenerationEU/PRTR.
Copyright information
© 2024 The Author(s), under exclusive license to Springer Nature Switzerland AG
Cite this paper
Madina, M., Gonzalez-Dios, I., Siegel, M. (2024). Towards Reliable E2R Texts: A Proposal for Standardized Evaluation Practices. In: Miesenberger, K., Peňáz, P., Kobayashi, M. (eds) Computers Helping People with Special Needs. ICCHP 2024. Lecture Notes in Computer Science, vol 14751. Springer, Cham. https://doi.org/10.1007/978-3-031-62849-8_28
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-62848-1
Online ISBN: 978-3-031-62849-8