DOI: 10.1145/3594806.3596530
research-article

Easy-to-Read Language Resources and Tools for three European Languages

Published: 10 August 2023

ABSTRACT

Easy-to-Read (E2R) is a controlled language variant that makes any written text more accessible through the use of clear, direct and simple language. It makes content accessible to anyone with a minimum reading level, but it is mainly aimed at specific user groups. In recent years, important developments in the field of E2R have been carried out in many different languages in addition to English. The main objective of this paper is to present an updated overview of the existing tools and resources for E2R in German, Spanish and Basque. We aim to offer a benchmark for future studies that focus on these languages and their E2R variants.


Published in: PETRA '23: Proceedings of the 16th International Conference on PErvasive Technologies Related to Assistive Environments
July 2023, 797 pages
ISBN: 9798400700699
DOI: 10.1145/3594806

Copyright © 2023 ACM

Publisher: Association for Computing Machinery, New York, NY, United States
