skip to main content
10.1145/3439231.3439257acmotherconferencesArticle/Chapter ViewAbstractPublication PagesdsaiConference Proceedingsconference-collections
research-article

Word-Sense disambiguation system for text readability

Published: 09 June 2021 Publication History

Abstract

People with cognitive, language and learning disabilities face accessibility barriers when reading texts with complex or specialized words. In order for these needs to be addressed, and in accordance with accessibility guidelines, it is beneficial to provide the definitions of complex words to the user. In this sense, human language can, at times, be ambiguous, and many words may be interpreted in multiple ways depending on the context. To offer a correct definition, it is often necessary to carry out Word Sense Disambiguation. In this paper, a system that is based on Natural Language Processing and Language Resources in the field of easy reading to provide a contextualized definition for a complex word is presented. An expert linguistic, specialized in easy reading and plain language, has evaluated the Word Sense Disambiguation system. This research work is part of the EASIER project that offers an accessible platform to provide users with the easiest possible Spanish text to understand and read.

References

[1]
M. de E. OCDE (Organización para la Cooperación y el Desarrollo Económicos, “http://www.educacionyfp.gob.es/prensa/actualidad/2013/10/20131008-piaac.html,” 2013. .
[2]
“UNE, Asociación Española de Normalización, UNE 153101:2018 (Easy to read. Guidelines and recommendations for the elaboration of documents),” 2018. [Online]. Available: https://www.une.org/encuentra-tu-norma/busca-tu-norma/norma?c=N0060036 . [Accessed: 18-Nov-2020].
[3]
L. Moreno, R. Alarcon, I. Segura-Bedmar, and P. Martínez, “Lexical simplification approach to support the accessibility guidelines,” Proc. XX Int. Conf. Hum. Comput. Interact. (Interacción ’19), pp. 1–4, 2019.
[4]
W3c, “Making Content Usable for People with Cognitive and Learning Disabilities, W3C Working Draft,” 2020. [Online]. Available: https://www.w3.org/TR/coga-usable/ . [Accessed: 17-Jul-2020].
[5]
L. Moreno, P. Martínez, I. Segura-Bedmar, and R. Revert, “Exploring language technologies to provide support to WCAG 2.0 and E2R guidelines,” Proc. XVI Int. Conf. Hum. Comput. Interact. - Interacción ’15, pp. 1–8, 2015.
[6]
W3C, “Web Content Accesibility Guidelines (WCAG),” 2019. [Online]. Available: https://www.w3.org/WAI/standards-guidelines/wcag/.
[7]
R. Navigli, “Word Sense Disambiguation: A Survey,” vol. 41, no. 2, 2009.
[8]
M. Lesk, “Automatic Sense Disambiguation Using Machine Readable Dictionaries: How to Tell a Pine Cone from an Ice Cream Cone,” pp. 24–26, 1986.
[9]
Y. K. Lee, H. T. Ng, and T. K. Chia, “Supervised Word Sense Disambiguation with Support Vector Machines and Multiple Knowledge Sources,” no. July, 2004.
[10]
H. Kaji and Y. Morimoto, “Unsupervised Word Sense Disambiguation Using Bilingual Comparable Corpora,” 2005.
[11]
B. Moradi, E. Ansari, and Z. Zabokrtskˇy, “Unsupervised Word Sense Disambiguation Using Word Embeddings,” PROCEEDING 25TH Conf. Fruct Assoc., 2019.
[12]
R. Cao, J. Bai, and H. Shinnou, “Semi-supervised learning for all-words WSD using self-learning and fine-tuning,” no. Paclic 33, pp. 356–361, 2019.
[13]
S. S. Pradhan, D. Dligach, and M. Palmer, “SemEval-2007 Task 17: English Lexical Sample, SRL and All Words,” no. June, pp. 87–92, 2007.
[14]
R. Navigli, D. Jurgens, and D. Vannella, “SemEval-2013 Task 12: Multilingual Word Sense Disambiguation,” vol. 2, no. SemEval, pp. 222–231, 2013.
[15]
A. Moro and R. Navigli, “SemEval-2015 Task 13: Multilingual All-Words Sense Disambiguation and Entity Linking,” no. SemEval, pp. 288–297, 2015.
[16]
J. Devlin, M. Chang, K. Lee, and K. Toutanova, “BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding,” 2019.
[17]
J. Du, F. Qi, and M. Sun, “Using BERT for Word Sense Disambiguation,” no. 1, 2019.
[18]
J. Medina, “Towards Integrating People with Intellectual Disabilities in the Digital World,” no. Id, pp. 348–357, 2016.
[19]
H. Al-Mubaid and P. Chen, “Application of word prediction and disambiguation to improve text entry for people with physical disabilities (assistive technology),” Int. J. Soc. Humanist. Comput., vol. 1, no. 1, pp. 10–27, 2008.
[20]
L. Sevens, G. Jacobs, V. Vandeghinste, I. Schuurman, and F. Van Eynde, “Improving Text-to-Pictograph Translation Through Word Sense Disambiguation,” pp. 131–135, 2016.
[21]
L. Moreno, R. Alarcon, and P. Martínez, “EASIER system. Language resources for cognitive accessibility,” 22nd Int. ACM SIGACCESS Conf. Comput. Access. (virtual)., 2020.
[22]
R. Alarcon, L. Moreno, I. Segura-bedmar, and P. Martínez, “Lexical simplification approach using easy-to-read resources,” Proces. del Leng. Nat., pp. 95–102, 2019.

Cited By

View all
  • (2024)Analisis Keterbacaan Teks Buku Ajar Bahasa Indonesia SMP Kelas 9 Menggunakan Formula Grafik FryPubmedia Jurnal Penelitian Tindakan Kelas Indonesia10.47134/ptk.v1i3.4201:3(15)Online publication date: 17-May-2024
  • (2024)Generative Information Systems Are Great If You Can ReadProceedings of the 2024 Conference on Human Information Interaction and Retrieval10.1145/3627508.3638345(165-177)Online publication date: 10-Mar-2024
  • (2023)EASIER corpus: A lexical simplification resource for people with cognitive impairmentsPLOS ONE10.1371/journal.pone.028362218:4(e0283622)Online publication date: 12-Apr-2023
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences
DSAI '20: Proceedings of the 9th International Conference on Software Development and Technologies for Enhancing Accessibility and Fighting Info-exclusion
December 2020
245 pages
ISBN:9781450389372
DOI:10.1145/3439231
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 09 June 2021

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Cognitive disabilities
  2. Natural Language Processing
  3. Readability
  4. Tool
  5. WCAG
  6. Web accessibility
  7. Word-sense disambiguation

Qualifiers

  • Research-article
  • Research
  • Refereed limited

Conference

DSAI 2020

Acceptance Rates

Overall Acceptance Rate 17 of 23 submissions, 74%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)21
  • Downloads (Last 6 weeks)2
Reflects downloads up to 15 Feb 2025

Other Metrics

Citations

Cited By

View all
  • (2024)Analisis Keterbacaan Teks Buku Ajar Bahasa Indonesia SMP Kelas 9 Menggunakan Formula Grafik FryPubmedia Jurnal Penelitian Tindakan Kelas Indonesia10.47134/ptk.v1i3.4201:3(15)Online publication date: 17-May-2024
  • (2024)Generative Information Systems Are Great If You Can ReadProceedings of the 2024 Conference on Human Information Interaction and Retrieval10.1145/3627508.3638345(165-177)Online publication date: 10-Mar-2024
  • (2023)EASIER corpus: A lexical simplification resource for people with cognitive impairmentsPLOS ONE10.1371/journal.pone.028362218:4(e0283622)Online publication date: 12-Apr-2023
  • (2022)Dictionary data structure for a text analysis task using cross-references2022 IEEE 17th International Conference on Computer Sciences and Information Technologies (CSIT)10.1109/CSIT56902.2022.10000460(61-64)Online publication date: 10-Nov-2022
  • (2022)EASIER System. Evaluating a Spanish Lexical Simplification Proposal with People with Cognitive ImpairmentsInternational Journal of Human–Computer Interaction10.1080/10447318.2022.213407440:5(1195-1209)Online publication date: 24-Oct-2022
  • (2021)Multi-purpose search to determine the context of a text message based on the dictionary data structure2021 IEEE 16th International Conference on Computer Sciences and Information Technologies (CSIT)10.1109/CSIT52700.2021.9648803(65-68)Online publication date: 22-Sep-2021

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

HTML Format

View this article in HTML Format.

HTML Format

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media