Abstract
Although the focus on morpheme discovering techniques originated within those linguistic schools which inherited from Franz Boas the concern for the unknown languages of the NewWorld, automatic, unsupervised morphological segmentation remains a field of interest for the computational processing and engineering1 of natural languages, as well as for the plain exercise of getting to know them intimately.2
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
O. Cromm. Afixerkennung in deutschenWortformen. Eine Untersuchung zum nicht-lexikalischen Segmentierungsverfahren von N. D. Andreev. Abschluß des Ergänzungsstudiums Linguistische Datenverarbeitung, Frankfurt am Main, 1996.
J. de Kock and W. Bossaert. Introducción a la lingüýstica automática en las lenguas románicas, volume 202 of Estudios y Ensayos. Gredos, Madrid, 1974.
J. de Kock and W. Bossaert. The Morpheme. An Experiment in Quantitative and Computational Linguistics. Van Gorcum, Amsterdam, Madrid, 1978.
W. B. Frakes. Stemming Algorithms. In W. B. Frakes and R. Baeza, editors, Information Retrieval, Data Structures and Algorithms, pages 131–160. Prentice Hall, New Jersey, 1992.
A. Gelbukh, M. Alexandrov, and S. Y. Han. Detecting Infiection Patterns in Natural Language by Minimization of Morphological Model. In Congreso Iberoamericano de Reconocimiento de Patrones, CIARP-2004, LNCS, 2004.
J. Goldsmith. Unsupervised Learning of the Morphology of a Natural Language. Computational Linguistics, 27(2):153–198, 2001.
J. H. Greenberg. Essays in Linguistics. The University of Chicago Press, Chicago, 1967.
M. A. Hafer and S. F. Weiss. Word Segmentation by Letter Successor Varieties. Information Storage and Retrieval, 10:371–385, 1974.
Z. S. Harris. From Phoneme to Morpheme. Language, 31(2):190–222, 1955.
H. Johnson and J. Martin. Unsupervised Learning of Morphology for English and Inuktitut. In Proceedings of the 2003 Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics, 2003.
K. Kageura. Bigram Statistics Revisited: A Comparative Examination of Some Statistical Measures in Morphological Analysis of Japanese Kanji Sequences. Journal of Quantitative Linguistics, 6:149–166, 1999.
L. F. Lara and R. Ham Chande. Investigaciones lingüýsticas en lexicograf ýa, chapter Base estadýstica del Diccionario del Español de México, pages 5–39. Volume 89 of Jornadas [13], 1st edition, 1974.
L. F. Lara, R. Ham Chande, and M. I. Garcýa Hidalgo. Investigaciones lingüýsticas en lexicografýa, volume 89 of Jornadas. El Colegio de México, A. C., Mexico, 1st edition, 1979.
A. Medina-Urrea. Automatic Discovery of Afixes by Means of a Corpus: A Catalog of Spanish Afixes. Journal of Quantitative Linguistics, 7(2):97–114, 2000.
A. Medina-Urrea. Investigación cuantitativa de afijos y clýticos del español de México. Glutinometrýa en el Corpus del Español Mexicano Contemporáneo. PhD thesis, El Colegio de México, Mexico, April 2003.
A. Medina-Urrea and M. Alvarado Garcýa. Análisis cuantitativo y cualitativo de la derivación léxica en ralámuli. In Primer Coloquio Leonardo Manrique, Mexico, Conaculta-INAH, September 2004.
A. Medina-Urrea and E. C. Buenrostro Dýaz. Caracterýsticas cuantitativas de la fiexión verbal del chuj. Estudios de Lingüýstica Aplicada, 38:15–31, 2003.
A. Medina-Urrea and J. Hlaváčová. Automatic Recognition of Czech Derivational Prefixes. In Proceedings of CICLing 2005, volume 3406 of Lecture Notes in Computer Science, pages 189–197. Springer, Berlin/Heidelberg/New York, 2005.
M. P. Oakes. Statistics for Corpus Linguistics. Edinburgh University Press, Edinburgh, 1998.
B. B. Rieger. Computing Granular Word Meanings. A Fuzzy Linguistic Approach in Computational Semiotics. In P. Wang, editor, Computing with Words, pages 147–208. John Wiley & Sons, New York, 2001.
J. Rini. Motives for Linguistic Change in the Formation of the Spanish Object Pronouns. Juan de la Cuesta, Newark, Delaware, 1992.
E. Sapir. Language: An Introduction to the Study of Speech. Harcourt, Brace & Company, New York, 1921.
C. E. Shannon and W. Weaver. The Mathematical Theory of Communication. University of Illinois Press, Urbana, 1949.
A. Spencer and A. M. Zwicky. The Handbook of Morphology. Blackwell, Oxford, 1998.
Author information
Authors and Affiliations
Rights and permissions
Copyright information
© 2007 Springer
About this chapter
Cite this chapter
Medina-Urrea, A. (2007). Affix Discovery by Means of Corpora: Experiments for Spanish, Czech, Ralámuli and Chuj. In: Aspects of Automatic Text Analysis. Studies in Fuzziness and Soft Computing, vol 209. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-37522-7_13
Download citation
DOI: https://doi.org/10.1007/978-3-540-37522-7_13
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-37520-3
Online ISBN: 978-3-540-37522-7
eBook Packages: EngineeringEngineering (R0)