Word Sense Disambiguation Using Inductive Logic Programming

Specia, Lucia; Srinivasan, Ashwin; Ramakrishnan, Ganesh; Volpe Nunes, Maria das Graças

doi:10.1007/978-3-540-73847-3_37

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 4455)

Included in the following conference series:

International Conference on Inductive Logic Programming

Abstract

Identifying the correct sense of a word is necessary for many tasks in automatic natural language processing, such as machine translation, information retrieval, and speech and text processing. Automatic Word Sense Disambiguation (WSD) is difficult, and accuracies with state-of-the-art methods are substantially lower than in other areas of text understanding, such as part-of-speech tagging. One shortcoming of these methods is that they largely rely on shallow syntactic features and do not utilize substantial sources of background knowledge, such as semantic taxonomies and dictionaries, which are now available in electronic form. Empirical results from the use of Inductive Logic Programming (ILP) have repeatedly shown the ability of ILP systems to use diverse sources of background knowledge. In this paper we investigate the use of ILP for WSD in two different ways: (a) as a stand-alone constructor of models for WSD; and (b) to build interesting features, which can then be used by standard model-builders such as SVM. In our experiments we examine a monolingual WSD task using the 32 English verbs contained in the SENSEVAL-3 benchmark data, and a bilingual WSD task using 7 highly ambiguous verbs in machine translation from English to Portuguese. The background knowledge available comes from eight sources that provide a wide range of syntactic and semantic information. For both WSD tasks, experimental results show that ILP-constructed models and models built using ILP-generated features have higher accuracies than those obtained using a state-of-the-art feature-based technique equipped with shallow syntactic features. This suggests that the use of ILP with diverse sources of background knowledge can provide one way of making substantial progress in the field of automatic WSD.
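
Use (b) in the abstract, where ILP builds features that a standard model-builder then consumes, can be illustrated with a minimal sketch. It is not the authors' implementation: the attribute names (`object_class`, `subject_class`, `prep`, `object_pos`) and the three clause bodies are hypothetical stand-ins for what an ILP system might learn from background knowledge. Each clause body is treated as a boolean test, and every example is mapped to a 0/1 vector with one entry per clause.

```python
from typing import Callable, Dict, List

# An example is a (hypothetical) description of the context of an ambiguous verb.
Example = Dict[str, str]
# A feature stands in for the body of one ILP-learned clause: a boolean test.
Feature = Callable[[Example], bool]

# Hypothetical clause bodies; real ones would be induced by an ILP system.
FEATURES: List[Feature] = [
    lambda e: e["object_class"] == "event",
    lambda e: e["subject_class"] == "person" and e["prep"] == "to",
    lambda e: e["object_pos"] == "noun" and e["object_class"] == "artifact",
]


def propositionalize(example: Example, features: List[Feature]) -> List[int]:
    """Map one example to a binary vector: 1 where the clause body holds."""
    return [1 if feature(example) else 0 for feature in features]


# Two made-up contexts, used only to show the shape of the output.
examples: List[Example] = [
    {"object_class": "event", "subject_class": "person",
     "prep": "to", "object_pos": "noun"},
    {"object_class": "artifact", "subject_class": "person",
     "prep": "of", "object_pos": "noun"},
]

vectors = [propositionalize(e, FEATURES) for e in examples]
print(vectors)
```

Binary vectors of this kind, paired with sense labels, are what a feature-based learner such as an SVM would take as input.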

Author information

Authors and Affiliations

ICMC - University of São Paulo, Trabalhador São-Carlense, 400, São Carlos, 13560-970, Brazil
Lucia Specia & Maria das Graças Volpe Nunes
IBM India Research Laboratory, Block 1, Indian Institute of Technology, New Delhi 110016, India
Ashwin Srinivasan & Ganesh Ramakrishnan
Dept. of Computer Science and Engineering & Centre for Health Informatics, University of New South Wales, Sydney, Australia
Ashwin Srinivasan

Editor information

Stephen Muggleton, Ramon Otero, Alireza Tamaddoni-Nezhad

About this paper

Cite this paper

Specia, L., Srinivasan, A., Ramakrishnan, G., Volpe Nunes, M.d.G. (2007). Word Sense Disambiguation Using Inductive Logic Programming. In: Muggleton, S., Otero, R., Tamaddoni-Nezhad, A. (eds) Inductive Logic Programming. ILP 2006. Lecture Notes in Computer Science, vol 4455. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-73847-3_37

DOI: https://doi.org/10.1007/978-3-540-73847-3_37
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-73846-6
Online ISBN: 978-3-540-73847-3
eBook Packages: Computer Science, Computer Science (R0)
