Abstract
The identification of the correct sense of a word is necessary for many tasks in automatic natural language processing like machine translation, information retrieval, speech and text processing. Automatic Word Sense Disambiguation (WSD) is difficult and accuracies with state-of-the art methods are substantially lower than in other areas of text understanding like part-of-speech tagging. One shortcoming of these methods is that they do not utilize substantial sources of background knowledge, such as semantic taxonomies and dictionaries, which are now available in electronic form (the methods largely use shallow syntactic features). Empirical results from the use of Inductive Logic Programming (ILP) have repeatedly shown the ability of ILP systems to use diverse sources of background knowledge. In this paper we investigate the use of ILP for WSD in two different ways: (a) as a stand-alone constructor of models for WSD; and (b) to build interesting features, which can then be used by standard model-builders such as SVM. In our experiments we examine a monolingual WSD task using the 32 English verbs contained in the SENSEVAL-3 benchmark data; and a bilingual WSD task using 7 highly ambiguous verbs in machine translation from English to Portuguese. Background knowledge available is from eight sources that provide a wide range of syntactic and semantic information. For both WSD tasks, experimental results show that ILP-constructed models and models built using ILP-generated features have higher accuracies than those obtained using a state-of-the art feature-based technique equipped with shallow syntactic features. This suggests that the use of ILP with diverse sources of background knowledge can provide one way for making substantial progress in the field of automatic WSD.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Agirre, E., Rigau, G.: Word Sense Disambiguation Using Conceptual Density. In: 16th International Conference on Computational Linguistics, Copenhagen (1996)
Bar-Hillel, Y.: Automatic Translation of Languages. In: Alt, F., Booth, D., Meagher, R.E. (eds.) Advances in Computers, Academic Press, New York (1960)
Ciaramita, M., Johnson, M.: Multi-component Word Sense Disambiguation. In: SENSEVAL-3: 3rd International Workshop on the Evaluation of Systems for the Semantic Analysis of Text, Barcelona, pp. 97–100 (2004)
Cottrell, G.W.: A Connectionist Approach to Word Sense Disambiguation. Research Notes in Artificial Intelligence. Morgan Kaufmann, San Mateo (1989)
Hayes, P.J.: A Process to Implement Some Word Sense Disambiguation. Institut pour les Etudes Semantiques et Cognitives, Geneve (1976)
Hirst, G.: Semantic Intepretation and the Resolution of Ambiguity. Natural Language Processing. Cambridge Universisty Press, Studies in (1987)
Kohavi, R., John, G.H.: Automatic Parameter Selection by Minimizing Estimated Error. In: 12th Int. Conference on Machine Learning, San Francisco (1995)
Kramer, S., Lavrac, N., Flach, P.: Propositionalization Approaches to Relational Data Mining, pp. 262–291. Springer, Heidelberg (2001)
Lamjiri, A., Demerdash, O., Kosseim, F.: Simple features for statistical Word Sense Disambiguation. In: SENSEVAL-3: 3rd International Workshop on the Evaluation of Systems for the Semantic Analysis of Text, Barcelona, pp. 133–136 (2004)
Lavrac, N., Dzeroski, S., Grobelnik, M.: Learning nonrecursive definitions of relations with LINUS. Technical report, Jozef Stefan Institute (1990)
Lesk, M.: Automated Sense Disambiguation Using Machine-readable Dictionaries: How to Tell a Pine Cone from an Ice Cream Cone. In: SIGDOC Conference, Toronto, pp. 24–26 (1986)
McRoy, S.: Using Multiple Knowledge Sources for Word Sense Discrimination. Computational Linguistics 18(1), 1–30 (1992)
Mihalcea, R., Chklovski, T., Kilgariff, A.: The SENSEVAL-3 English Lexical Sample Task. In: SENSEVAL-3: 3rd International Workshop on the Evaluation of Systems for Semantic Analysis of Text, Barcelona, pp. 25–28 (2004)
Fellbaum, C.: WordNet. An Electronic Lexical Database and some if its Applications. MIT Press, Massachusetts and London (1998)
Mohammad, S., Pedersen, T.: Complementarity of Lexical and Simple Syntactic Features: The SyntaLex Approach to SENSEVAL-3. In: SENSEVAL-3: 3rd International Workshop on the Evaluation of Systems for the Semantic Analysis of Text, Barcelona, pp. 159–162 (2004)
Mooney, R.J.: Inductive Logic Programming for Natural Language Processing. In: Inductive Logic Programming. LNCS, vol. 1314, pp. 3–24. Springer, Heidelberg (1997)
Muggleton, S.: Inductive Logic Programming: derivations, successes and shortcomings. SIGART Bulletin 5(1), 5–11 (1994)
Muggleton, S., Raedt, L.D.: Inductive logic programming: Theory and methods. Journal of Logic Programming 19,20, 629–679 (1994)
Nienhuys-Cheng, S., de Wolf, R.: Foundations of Inductive Logic Programming. Springer, Heidelberg (1997)
Parker, J., Stahel, M.: Password: English Dictionary for Speakers of Portuguese. Martins Fontes, São Paulo (1998)
Pedersen, T.A.: Baseline Methodology for Word Sense Disambiguation. In: 3rd International Conference on Intelligent Text Processing and Computational Linguistics, Mexico City (2002)
Procter, P.: Longman Dictionary of Contemporary English. Longman Group, Essex (1978)
Quillian, M.R.: A Design for an Understanding Machine, Colloquium of semantic problems in natural language. Cambridge University, Cambridge (1961)
Ratnaparkhi, A.: A Maximum Entropy Part-Of-Speech Tagger. In: Empirical Methods in NLP Conference, University of Pennsylvania (1996)
Resnik, P.: Disambiguating Noun Groupings with Respect to WordNet Senses. In: 3rd Workshop on Very Large Corpora. Cambridge, pp. 54–68 (1995)
Schutze, H.: Automatic Word Sense Discrimination. Computational Linguistics 24(1), 97–124 (1998)
Siegel, S.: Nonparametric Statistics for the Behavioural Sciences. McGraw-Hill, New York (1956)
Specia, L.: A Hybrid Relational Approach for WSD - First Results. In: Student Research Workshop at Coling-ACL, Sydney, pp. 55–60 (2006)
Specia, L., Nunes, M.G.V., Stevenson, M.: Exploiting Parallel Texts to Produce a Multilingual Sense-tagged Corpus for Word Sense Disambiguation. In: RANLP-05, Borovets, pp. 525–531 (2005)
Srinivasan, A.: The Aleph Manual (1999), available at http://www.comlab.ox.ac.uk/oucl/research/areas/machlearn/Aleph/
Stevenson, M., Wilks, Y.: The Interaction of Knowledge Sources for Word Sense Disambiguation. Computational Linguistics 27(3), 321–349 (2001)
Wilks, Y., Stevenson, M.: Combining Independent Knowledge Sources for Word Sense Disambiguation. In: 3rd Conference on Recent Advances in Natural Language Processing, Tzigov Chark, pp. 1–7 (1997)
Yarowsky, D.: Unsupervised Word Sense Disambiguation Rivaling Supervised Methods. In: 33rd Annual Meeting of the Association for Computational Linguistics, Cambridge, pp. 189–196 (1995)
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2007 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Specia, L., Srinivasan, A., Ramakrishnan, G., Volpe Nunes, M.d.G. (2007). Word Sense Disambiguation Using Inductive Logic Programming. In: Muggleton, S., Otero, R., Tamaddoni-Nezhad, A. (eds) Inductive Logic Programming. ILP 2006. Lecture Notes in Computer Science(), vol 4455. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-73847-3_37
Download citation
DOI: https://doi.org/10.1007/978-3-540-73847-3_37
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-73846-6
Online ISBN: 978-3-540-73847-3
eBook Packages: Computer ScienceComputer Science (R0)