Abstract
Syntactic disambiguation frequently requires knowledge of the semantic categories of nouns, especially in languages with free word order. For example, in Spanish the phrases pintó un cuadro un pintor (lit. painted a picture a painter) and pintó un pintor un cuadro (lit. painted a painter a picture) mean the same thing: ‘a painter painted a picture’. The only way to tell the subject from the object is to know that pintor ‘painter’ is a causal agent and cuadro ‘picture’ is a thing. We present a method for extracting semantic information of this kind from existing machine-readable, human-oriented explanatory dictionaries. Applying this procedure to two different human-oriented Spanish dictionaries yields additional information compared with using Spanish EuroWordNet alone. In addition, we report the results of an experiment conducted to evaluate the similarity of the word classifications obtained with this method.
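The painter example can be sketched in a few lines of Python. This is only a minimal illustration of how a noun's semantic category might settle the subject/object choice; it is not the extraction method of the paper. The lexicon entries, the category labels, and the function name pick_subject are all hypothetical, and the rule "a causal agent is the subject" is a simplifying assumption made for this sketch.

```python
# Toy semantic-category lexicon. The entries and labels are hypothetical
# and stand in for categories extracted from a dictionary.
CATEGORY = {
    "pintor": "causal_agent",  # 'painter'
    "cuadro": "thing",         # 'picture'
}


def pick_subject(noun_a, noun_b, lexicon=CATEGORY):
    """Return (subject, object), preferring a causal agent as subject.

    Returns None when the categories do not settle the question,
    for example when neither noun or both nouns are causal agents.
    """
    a_is_agent = lexicon.get(noun_a) == "causal_agent"
    b_is_agent = lexicon.get(noun_b) == "causal_agent"
    if a_is_agent and not b_is_agent:
        return noun_a, noun_b
    if b_is_agent and not a_is_agent:
        return noun_b, noun_a
    return None


# Both word orders from the example yield the same analysis.
order_1 = pick_subject("cuadro", "pintor")  # pintó un cuadro un pintor
order_2 = pick_subject("pintor", "cuadro")  # pintó un pintor un cuadro
```

Under these assumptions both word orders resolve to pintor as subject and cuadro as object, while a pair of nouns with no distinguishing category is left undecided rather than guessed.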
Work done under partial support of the Mexican Government (CONACyT, SNI, PIFI-IPN, CGEPI-IPN), the Korean Government (KIPA Professorship for Visiting Faculty Positions in Korea), ITRI of Chung-Ang University, and RITOS-2. The second author is currently on sabbatical leave at Chung-Ang University.
Copyright information
© 2004 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Calvo, H., Gelbukh, A. (2004). Extracting Semantic Categories of Nouns for Syntactic Disambiguation from Human-Oriented Explanatory Dictionaries. In: Gelbukh, A. (eds) Computational Linguistics and Intelligent Text Processing. CICLing 2004. Lecture Notes in Computer Science, vol 2945. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-24630-5_32
DOI: https://doi.org/10.1007/978-3-540-24630-5_32
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-21006-1
Online ISBN: 978-3-540-24630-5
eBook Packages: Springer Book Archive