Abstract
In this paper, we describe an approach to adaptive morphological analysis based on a lexical corpus acquired from the Internet. We focus on automating the categorization of words into morphological paradigms in flexive languages. This is done by inducing the possible forms of a word using a morphological knowledge base and by looking up those candidate inflections in a morphological lexicon.
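The categorization step can be illustrated with a minimal sketch. It assumes a toy knowledge base in which each paradigm is only a base-form ending plus a few inflectional endings; the paradigm table, the names `PARADIGMS`, `induce_forms`, and `rank_paradigms`, the diacritic-free spelling, and the scoring by share of attested forms are illustrative assumptions, not the knowledge base or ranking used in the paper.

```python
# Toy knowledge base: paradigm name -> (base-form ending, inflectional endings).
# The endings are a simplified, diacritic-free subset chosen for illustration;
# they are an assumption, not the paper's morphological knowledge base.
PARADIGMS = {
    "zena": ("a", ["a", "y", "e", "u", "ou", "ami"]),
    "mesto": ("o", ["o", "a", "u", "e", "om", "ami"]),
    "dub": ("", ["", "a", "u", "e", "om", "mi"]),
}


def induce_forms(word, base_ending, endings):
    """Return the forms the word would have under one paradigm, or None."""
    if not word.endswith(base_ending):
        return None
    stem = word[: len(word) - len(base_ending)]
    return [stem + ending for ending in endings]


def rank_paradigms(word, lexicon):
    """Score each applicable paradigm by the share of induced forms found."""
    scores = []
    for name, (base_ending, endings) in PARADIGMS.items():
        forms = induce_forms(word, base_ending, endings)
        if forms is None:
            continue  # the base form does not fit this paradigm
        found = sum(1 for form in forms if form in lexicon)
        scores.append((found / len(forms), name))
    return sorted(scores, reverse=True)


# Hypothetical lexicon content; a real one would come from collected texts.
lexicon = {"zena", "zeny", "zene", "zenu", "zenou", "zenami"}
ranking = rank_paradigms("zena", lexicon)
best_score, best_paradigm = ranking[0]
```

Here the paradigm whose induced forms are most often attested in the lexicon is ranked first. A fuller version would presumably also weight forms by their corpus frequency and handle stem alternations, which this sketch ignores.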
We developed a prototype system based on the proposed approach. Our system is general (it respects the language, but it performs better on a flexive language). We tested the system on the Slovak language. The system's lexicon is built by browsing Internet pages. Parsed texts that are recognized as written in Slovak are used to build a database of Slovak words together with their frequencies in the texts.
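The lexicon-building step can be sketched as follows. The abstract does not say how Slovak texts are recognized, so the function-word heuristic, the threshold, and the names `SLOVAK_HINTS`, `looks_slovak`, and `build_lexicon` are assumptions made only for illustration; page fetching and HTML parsing are left out.

```python
import re
from collections import Counter

# A few common Slovak function words, used as a crude language signal.
# This heuristic is an assumption; the paper's recognition method is not
# described in the abstract.
SLOVAK_HINTS = {"a", "je", "na", "sa", "v", "že", "to", "nie", "ako", "sú"}

# Runs of Unicode letters (no digits, no underscores).
TOKEN = re.compile(r"[^\W\d_]+")


def tokenize(text):
    """Split text into lowercase alphabetic tokens."""
    return [token.lower() for token in TOKEN.findall(text)]


def looks_slovak(tokens, threshold=0.2):
    """Accept a text when enough of its tokens are Slovak function words."""
    if not tokens:
        return False
    hits = sum(1 for token in tokens if token in SLOVAK_HINTS)
    return hits / len(tokens) >= threshold


def build_lexicon(pages):
    """Count word frequencies over the texts accepted as Slovak."""
    lexicon = Counter()
    for text in pages:
        tokens = tokenize(text)
        if looks_slovak(tokens):
            lexicon.update(tokens)
    return lexicon


# Hypothetical page texts standing in for downloaded Internet pages.
pages = [
    "Žena je na ceste a dieťa sa hrá v meste.",
    "The cat sat on the mat.",
    "Žena je v meste.",
]
lexicon = build_lexicon(pages)
```

The resulting `Counter` maps each word form to its frequency across the accepted texts, which is the kind of word-frequency database the abstract describes.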
The work reported here was partially supported by the Slovak Science Grant Agency, grant No. G1/4289/97.
© 1999 Springer-Verlag Berlin Heidelberg
Trabalka, M., Bieliková, M. (1999). Performing Adaptive Morphological Analysis Using Internet Resources. In: Matousek, V., Mautner, P., Ocelíková, J., Sojka, P. (eds) Text, Speech and Dialogue. TSD 1999. Lecture Notes in Computer Science(), vol 1692. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-48239-3_12
DOI: https://doi.org/10.1007/3-540-48239-3_12
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-66494-9
Online ISBN: 978-3-540-48239-0
eBook Packages: Springer Book Archive