Abstract
We report on experiments which demonstrate that by abductive inference it is possible to learn enough simple phonotactics to distinguish words from non-words for a simplified set of Dutch, the monosyllables. The monosyllables are distinguished in input so that segmentation is not problematic. Frequency information is withheld as is negative data. The methods are all tested using ten-fold cross-validation as well as a fixed number of randomly generated strings. Orthographic and phonetic representations are compared. The work presented in this chapter is part of a larger project comparing different machine learning techniques on linguistic data.
This chapter is a revised compilation of part of the work described in (Tjong Kim Sang, 1998) and (Tjong KimSang & Nerbonne, 1999).
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Baayen, R., Piepenbrock, R., & van Rijn, H. (1993). The Celex Lexical Database (CD-ROM). Linguistic Data Consortium, University of Pennsylvania, Philadelphia, PA.
Cairns, C. E., & Feinstein, M. H. (1982). Markedness and the theory of syllable structure. Linguistic Inquiry, 13 (2).
Ellison, T. M. (1992). The Machine Learning of Phonological Structure. PhD thesis, University of Western Australia.
Finch, S. P. (1993). Finding Structure in Language. PhD thesis, University of Edinburgh.
Gilbers, D. (1992). Phonological Networks. PhD thesis, University of Groningen. ISSN 0928-0030.
Gold, E. (1967). Language identification in the limit. Information and Control, 16, 447–474.
Kazakov, D., & Manandhar, S. (1998). A hybrid approach to word segmentation. In Page, D. (Ed.), Proceedings of the ILP-98. Springer. Lectures Notes in Computer Science, vol. 1446.
Ladefoged, P. (1993). A Course in Linguistic Phonetics (3 edition). Philadelphia.
Muggleton, S. (1992). Inductive logic programming. In Muggleton, S. (Ed.), Inductive Logic Programming, pp. 3–27.
Pinker, S. (1994). The Language Instinct. W. Morrow and Co., New York.
Tjong Kim Sang, E. F. (1998). Machine Learning of Phonotactic Structure. Ph.D. thesis, University of Groningen.
Tjong Kim Sang, E. F., & Nerbonne, J. (1999). Learning simple phonotactics. In Neural, Symbolic, and Reinforcement Methods for Sequence Learning, pp. 41–46. Proc. IJCAI workshop.
van Zonneveld, R. (1988). Two level phonology: Structural stability and segmental variation in dutch child language. In van Besien, F. (Ed.), First Language Acquisition. ABLA papers no. 12, University of Antwerpen.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2000 Springer-Verlag Berlin Heidelberg
About this chapter
Cite this chapter
Tjong Kim Sang, E.F., Nerbonne, J. (2000). Learning the Logic of Simple Phonotactics. In: Cussens, J., Džeroski, S. (eds) Learning Language in Logic. LLL 1999. Lecture Notes in Computer Science(), vol 1925. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-40030-3_7
Download citation
DOI: https://doi.org/10.1007/3-540-40030-3_7
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-41145-1
Online ISBN: 978-3-540-40030-1
eBook Packages: Springer Book Archive