Abstract
This paper presents an approach to acquisition of some lexical and grammatical constraints from large corpora. Constraints that are discussed are related to grammatical features of a preposition and the corresponding noun phrase that constitute a prepositional phrase. The approach is based on the extraction of a textual environment of a preposition from a corpus, which is then tagged using the system of electronic dictionaries. An algorithm for computation of some kind of the minimal representation of grammatical features associated with the corresponding noun phrases is suggested. The resulting set of features describes the constraints that a noun phrase has to fulfil in order to form a correct prepositional phrase with a given preposition. This set can be checked against other corpora.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Burnard, L. et al: TEI Lite: An Introduction to Text Encoding for Interchange, doc. No: TEI U 5, June 1995.
Nenadić, G., Vitas, D.: Using Local Grammars for Agreement Modeling in Highly Inflective Languages, in Proc. of First Workshop on Text, Speech, Dialogue-TSD 98, Brno, 1998.
Nenadić, G., Vitas, D.: Formal Model of Noun Phrases in Serbo-Croatian, BULAG 23, Universite Franche-Compte, 1998.
Silberztein, M.: Dictionnaries électroniques et analyse automatique de textes: le systéme INTEX, Masson, Paris, 1993.
Silberztein, M.: INTEX 3.4: Reference manual, LADL, Universite Paris 7, 1996.
Stanojčić, Ž., Popovič, Lj.: Gramatika srpskoga jezika, Zavod za udžbenike i nastavna sredstva, Beograd, 1994. (in Serbo-Croatian).
Vitas, D.: Mathematical Model of Serbo-Croatian Morphology (Nominal Inflection), PhD thesis, Faculty of Mathematics, University of Belgrade, 1993. (in Serbo-Croatian).
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 1999 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Nenadić, G., Spasić, I. (1999). The Acquisition of Some Lexical Constraints from Corpora. In: Matousek, V., Mautner, P., Ocelíková, J., Sojka, P. (eds) Text, Speech and Dialogue. TSD 1999. Lecture Notes in Computer Science(), vol 1692. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-48239-3_21
Download citation
DOI: https://doi.org/10.1007/3-540-48239-3_21
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-66494-9
Online ISBN: 978-3-540-48239-0
eBook Packages: Springer Book Archive