Abstract
In this paper we study the effect of different lexical resources and strategies for selecting synonyms in a lexical simplification system for the Spanish language. The resources used for the experiments are the Spanish EuroWordNet, the Spanish Open Thesaurus and a combination of both. As for the synonym selection strategies, we have used both local and global contexts for word sense disambiguation. We present a novel evaluation framework in lexical simplification that takes into account the level of ambiguity of the word to be simplified. The evaluation compares various instances of the lexical simplification system, a gold standard, and a baseline. On the basis of our results we recommend different resources and word sense disambiguation methods depending on the ambiguity level of the target word to be simplified.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Aluísio, S.M., Gasperin, C.: Fostering digital inclusion and accessibility: the PorSimples project for simplification of Portuguese texts. In: NAACL/HLT, Young Investigators Workshop on Computational Approaches to Languages of the Americas, YIWCALA 2010, pp. 46–53. ACL, Stroudsburg (2010), http://dl.acm.org/citation.cfm?id=1868701.1868708
Barrón-Cedeño, A., Vila, M., Martí, M., Rosso, P.: Plagiarism meets paraphrasing: Insights for the next generation in automatic plagiarism detection. Computational Linguistics 39(4) (2013)
Bautista, S., León, C., Hervás, R., Gervás, P.: Empirical identification of text simplification strategies for reading-impaired people. In: European Conference for the Advancement of Assistive Technology (2011)
Biran, O., Brody, S., Elhadad, N.: Putting it simply: A context-aware approach to lexical simplificaion. In: ACL/HLT, pp. 496–501. ACL, Portland (2011), http://www.aclweb.org/anthology/P11-2087
Bott, S., Rello, L., Drndarević, B., Saggion, H.: Can Spanish Be Simpler? LexSiS: Lexical Simplification for Spanish. In: CoLing, December 8-16 (2012)
Burstein, J., Shore, J., Sabatini, J., Lee, Y.W., Ventura, M.: The automated text adaptation tool. In: NAACL/HLT (Demonstrations), pp. 3–4 (2007), http://dblp.uni-trier.de/db/conf/naacl/naacl2007.html#BursteinSSLV07
Carroll, J., Minnen, G., Canning, Y., Devlin, S., Tait, J.: Practical Simplification of English Newspaper Text to Assist Aphasic Readers. In: Proc. of AAAI 1998 Workshop on Integrating Artificial Intelligence and Assistive Technology, pp. 7–10 (1998)
Caseli, H.M., Pereira, T.F., Specia, L., Pardo, T.A.S., Gasperin, C., Aluísio, S.M.: Building a Brazilian Portuguese parallel corpus of original and simplified texts. In: CICLing (2009)
Chandrasekar, R., Doran, D., Srinivas, B.: Motivations and methods for text simplification. In: CoLing, vol. 2, pp. 1041–1044 (1996)
Coster, W., Kauchak, D.: Learning to simplify sentences using Wikipedia. In: Proceedings of the Workshop on Monolingual Text-To-Text Generation. ACL (2011)
De Belder, J., Deschacht, K., Moens, M.F.: Lexical simplification. In: Proceedings of Itec 2010: 1st International Conference on Interdisciplinary Research on Technology, Education and Communication (2010), https://lirias.kuleuven.be/handle/123456789/268437
Devlin, S., Unthank, G.: Helping aphasic people process online information. In: The International ACM SIGACCESS Conference on Computers and Accessibility, pp. 225–226 (2006)
Fleiss, J.L.: Measuring nominal scale agreement among many raters. Psychological Bulletin 76(5), 378–382 (1971)
Gale, W.A., Church, K.W., Yarowsky, D.: One sense per discourse. In: Proceedings of the Workshop on Speech and Natural Language, pp. 233–237. HLT (1992)
Krovetz, R.: More than one sense per discourse. In: NEC Princeton NJ Labs. Research Memorandum (1998)
Lal, P., Ruger, S.: Extract-based summarization with simplification. In: Proceedings of the ACL 2002 Automatic Summarization/DUC 2002 Workshop (2002)
Landis, J., Koch, G.: The measurement of observer agreement for categorical data. Biometrics, 159–174 (1977)
Miller, G., Beckwith, R., Fellbaum, C., Gross, D., Miller, K.: Introduction to WordNet: An on-line lexical database. International Journal of Lexicography 3(4), 235–244 (1990)
Padró, L., Collado, M., Reese, S., Lloberes, M., Castellón, I.: FreeLing 2.1: Five years of open-source language processing tools. In: LREC, Valletta, Malta (2010)
Rayner, K., Duffy, S.A.: Lexical complexity and fixation times in reading: Effects of word frequency, verb complexity, and lexical ambiguity. Memory & Cognition 14(3), 191–201 (1986)
Rello, L., Baeza-Yates, R., Dempere, L., Saggion, H.: Frequent words improve readability and short words improve understandability for people with dyslexia. In: INTERACT 2013, Cape Town, South Africa (2013)
Salton, G., Wong, A., Yang, C.: A vector space model for automatic indexing. Communications of the ACM 18(11), 613–620 (1975)
Siddharthan, A.: An architecture for a text simplification system. In: LREC 2002: Proceedings of the Language Engineering Conference, LEC 2002, pp. 64–71 (2002)
Specia, L.: Translating from complex to simplified sentences. In: Pardo, T.A.S., Branco, A., Klautau, A., Vieira, R., de Lima, V.L.S. (eds.) PROPOR 2010. LNCS, vol. 6001, pp. 30–39. Springer, Heidelberg (2010), http://dx.doi.org/10.1007/978-3-642-12320-7_5
Specia, L., Jauhar, S., Mihalcea, R.: Semeval-2012 task 1: English lexical simplification. In: SemEval 2012 (2012)
Turney, P., Pantel, P.: From frequency to meaning: Vector space models of semantics. Journal of Artificial Intelligence Research 37(1), 141–188 (2010)
Vossen, P. (ed.): EuroWordNet: A Multilingual Database with Lexical Semantic Networks, vol. 17. Oxford Univ. Press (2004)
Woodsend, K., Feng, Y., Lapata, M.: Generation with quasi-synchronous grammar. In: Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, pp. 513–523. ACL (2010)
Woodsend, K., Lapata, M.: WikiSimple: Automatic simplification of wikipedia articles. In: Proceedings of the Twenty-Fifth AAAI Conference on Artificial Intelligence (AAAI), pp. 927–932 (2011)
Wubben, S., van den Bosch, A., Krahmer, E.: Sentence simplification by monolingual machine translation. In: ACL (2012)
Yatskar, M., Pang, B., Danescu-Niculescu-Mizil, C., Lee, L.: For the sake of simplicity: Unsupervised extraction of lexical simplifications from wikipedia. In: ACL, pp. 365–368 (2010)
Zhu, Z., Bernhard, D., Gurevych, I.: A monolingual tree-based translation model for sentence simplification. In: CoLing, pp. 1353–1361 (2010)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Saggion, H., Bott, S., Rello, L. (2013). Comparing Resources for Spanish Lexical Simplification. In: Dediu, AH., Martín-Vide, C., Mitkov, R., Truthe, B. (eds) Statistical Language and Speech Processing. SLSP 2013. Lecture Notes in Computer Science(), vol 7978. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-39593-2_21
Download citation
DOI: https://doi.org/10.1007/978-3-642-39593-2_21
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-39592-5
Online ISBN: 978-3-642-39593-2
eBook Packages: Computer ScienceComputer Science (R0)