Learning the Logic of Simple Phonotactics

Tjong Kim Sang, Erik F.; Nerbonne, John

doi:10.1007/3-540-40030-3_7

Erik F. Tjong Kim Sang³ &
John Nerbonne⁴

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 1925))

Included in the following conference series:

International Conference on Learning Language in Logic

392 Accesses
1 Citations

Abstract

We report on experiments which demonstrate that by abductive inference it is possible to learn enough simple phonotactics to distinguish words from non-words for a simplified set of Dutch, the monosyllables. The monosyllables are distinguished in input so that segmentation is not problematic. Frequency information is withheld as is negative data. The methods are all tested using ten-fold cross-validation as well as a fixed number of randomly generated strings. Orthographic and phonetic representations are compared. The work presented in this chapter is part of a larger project comparing different machine learning techniques on linguistic data.

This chapter is a revised compilation of part of the work described in (Tjong Kim Sang, 1998) and (Tjong KimSang & Nerbonne, 1999).

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Baayen, R., Piepenbrock, R., & van Rijn, H. (1993). The Celex Lexical Database (CD-ROM). Linguistic Data Consortium, University of Pennsylvania, Philadelphia, PA.
Google Scholar
Cairns, C. E., & Feinstein, M. H. (1982). Markedness and the theory of syllable structure. Linguistic Inquiry, 13 (2).
Google Scholar
Ellison, T. M. (1992). The Machine Learning of Phonological Structure. PhD thesis, University of Western Australia.
Google Scholar
Finch, S. P. (1993). Finding Structure in Language. PhD thesis, University of Edinburgh.
Google Scholar
Gilbers, D. (1992). Phonological Networks. PhD thesis, University of Groningen. ISSN 0928-0030.
Google Scholar
Gold, E. (1967). Language identification in the limit. Information and Control, 16, 447–474.
Article Google Scholar
Kazakov, D., & Manandhar, S. (1998). A hybrid approach to word segmentation. In Page, D. (Ed.), Proceedings of the ILP-98. Springer. Lectures Notes in Computer Science, vol. 1446.
Google Scholar
Ladefoged, P. (1993). A Course in Linguistic Phonetics (3 edition). Philadelphia.
Google Scholar
Muggleton, S. (1992). Inductive logic programming. In Muggleton, S. (Ed.), Inductive Logic Programming, pp. 3–27.
Google Scholar
Pinker, S. (1994). The Language Instinct. W. Morrow and Co., New York.
Google Scholar
Tjong Kim Sang, E. F. (1998). Machine Learning of Phonotactic Structure. Ph.D. thesis, University of Groningen.
Google Scholar
Tjong Kim Sang, E. F., & Nerbonne, J. (1999). Learning simple phonotactics. In Neural, Symbolic, and Reinforcement Methods for Sequence Learning, pp. 41–46. Proc. IJCAI workshop.
Google Scholar
van Zonneveld, R. (1988). Two level phonology: Structural stability and segmental variation in dutch child language. In van Besien, F. (Ed.), First Language Acquisition. ABLA papers no. 12, University of Antwerpen.
Google Scholar

Download references

Author information

Authors and Affiliations

CNTS - Language Technology Group, University of Antwerp, Belgium
Erik F. Tjong Kim Sang
Alfa-informatica, BCN, University of Groningen, The Netherlands
John Nerbonne

Authors

Erik F. Tjong Kim Sang
View author publications
You can also search for this author in PubMed Google Scholar
John Nerbonne
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Computer Science, University of York, YO10 5DD, Heslington, York, UK
James Cussens
Jožef Stefan Institute, Jamova 39, 1000, Ljubljana, Slovenia
Sašo Džeroski

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Tjong Kim Sang, E.F., Nerbonne, J. (2000). Learning the Logic of Simple Phonotactics. In: Cussens, J., Džeroski, S. (eds) Learning Language in Logic. LLL 1999. Lecture Notes in Computer Science(), vol 1925. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-40030-3_7

Download citation

DOI: https://doi.org/10.1007/3-540-40030-3_7
Published: 01 February 2002
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-41145-1
Online ISBN: 978-3-540-40030-1
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics