Phon: A Computational Basis for Phonological Database Building and Model Testing

Rose, Yvan; Hedlund, Gregory J.; Byrne, Rod; Wareham, Todd; MacWhinney, Brian

doi:10.1007/978-3-642-31863-4_2

Yvan Rose⁵,
Gregory J. Hedlund⁵,
Rod Byrne⁶,
Todd Wareham⁶ &
…
Brian MacWhinney⁷

Part of the book series: Theory and Applications of Natural Language Processing ((NLP))

1010 Accesses

Abstract

This paper describes Phon, an open-source software program for the transcription, coding, and analysis of phonetically-transcribed speech corpora. Phon provides support for multimedia data linkage, utterance segmentation, multiple-blind transcription, transcription validation, syllabification, and alignment of target and actual forms. All functions are available through a user-friendly graphical interface. This program provides the basis for the building of PhonBank, a database project that seeks to broaden the scope of CHILDES into phonological development and disorders.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Hardcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

1.
Support for the Unix/Linux platform is currently compromised, primarily because of licensing issues related to the multimedia functions of the application.
2.
Previous experience in computational molecular biology and data mining suggests that, given the large amounts of data involved, various specialized algorithmic techniques will probably have to be invoked to allow time-series pattern matching to run in practical amounts of time and computer memory. The typical approach described in [11] is to simplify the given data, derive approximate analysis-results relative to this simplified data, and (hopefully with minimal effort) reconstruct exact analysis-results relative to the original data. However, there may be other options, such as using so-called fixed-parameter tractable algorithms [4, 16] whose running times are impractical in general but efficient under the restrictions present in learner time-series datasets.

References

Beesley, K. R., & Karttunen, L. (2003). Finite-state morphology. Stanford, CA: CSLI Publications.
Google Scholar
Boersma, P., & Weenink, D. (2011). Praat: Doing phonetics by computer [Computer program]. Version 5.2.18, retrieved 10 March 2011 from http://www.praat.org/.
Davis, S., & Hammond, M. (1995). On the status of onglides in American English. Phonology, 12, 159–182.
Article Google Scholar
Downey, R. G., & Fellows, M. R. (1999). Parameterized complexity. New York: Springer.
Book Google Scholar
Fikkert, P. (1994). On the acquisition of prosodic structure. Dordrecht: ICG Printing.
Google Scholar
Goad, H., & Rose, Y. (2004). Input elaboration, head faithfulness and evidence for representation in the acquisition of left-edge clusters in West Germanic. In R. Kager, J. Pater, & W. Zonneveld (Eds.), Constraints on phonological acquisition (pp. 109–157). Cambridge/ New York: Cambridge University Press.
Google Scholar
Gusfield, D. (1997). Algorithms on strings, trees, and sequences: Computer science and computational biology. Cambridge/New York: Cambridge University Press.
Book MATH Google Scholar
Hedlund, G. J., Maddocks, K., Rose, Y., & Wareham, T. (2005). Natural language syllable alignment: From conception to implementation. In Proceedings of the Fifteenth Annual Newfoundland Electrical and Computer Engineering Conference (NECEC 2005) http://www.ucs.mun.ca/~yrose/Research/Publications/files/2005-HedlundEtAl-SyllAlign.pdf.
Inkelas, S., & Rose, Y. (2007). Positional neutralization: A case study from child language. Language, 83, 707–736.
Article Google Scholar
Kaye, J., & Lowenstamm, J. (1984). De la syllabicité. In Forme sonore du langage (pp. 123–161). Paris: Hermann.
Google Scholar
Keogh, E. (2008). Indexing and mining time series data. In: S. Shekhar & H. Xiong (Eds.), Encyclopedia of GIS (pp. 493–497). New York: Springer.
Chapter Google Scholar
Kondrak, G. (2003). Phonetic alignment and similarity. Computers in the Humanities, 37, 273–291.
Article Google Scholar
Ladefoged, P., & Maddieson, I. (1996). The sounds of the world’s languages. Cambridge, MA: Blackwell.
Google Scholar
Maddocks, K. (2005). An effective algorithm for the alignment of target and actual syllables for the study of language acquisition. B.Sc.h. thesis. Memorial University of Newfoundland.
Google Scholar
Mitsa, T. (2010). Temporal data mining. Boca Raton, FL: Chapman and Hall/CRC.
Book MATH Google Scholar
Niedermeier, R. (2006). Invitation to fixed-parameter algorithms. Cambridge/New York: Oxford University Press.
Book MATH Google Scholar
Roddick, J. F., & Spiliopoulou, M. (2002). A survey of temporal knowledge discovery paradigms and methods. IEEE Transactions on Knowledge and Data Engineering, 14, 750–767.
Article Google Scholar
Rose, Y. (2000). Headedness and prosodic licensing in the L1 acquisition of phonology. Ph.D. dissertation. McGill University.
Google Scholar
Rose, Y., MacWhinney, B., Byrne, R., Hedlund, G. J., Maddocks, K., O’Brien, P., & Wareham, T. (2006). Introducing phon: A software solution for the study of phonological Acquisition. In Proceedings of the 30th Boston University Conference on Language Development (pp. 489–500). Somerville, MA: Cascadilla Press.
Google Scholar
Sankoff, D., & Kruskal, J. B. (Eds.). (1983). Time warps, string edits, and macromolecules: The theory and practice of string comparison. Reading, MA: Addison-Wesley.
Google Scholar
Selkirk, E. (1982). The syllable. In The structure of phonological representation (pp 337–385). Dordrecht: Foris.
Google Scholar
Selkirk, E. (1986). On derived domains in sentence phonology. Phonology, 3, 371–405.
Article Google Scholar

Download references

Acknowledgements

We would like thank the co-organisers of the original ACL workshop (namely, Afra Alishahi, Thierry Poibeau, Anna Korhonen and Aline Villavicencio) for their help and support through all the steps that brought us to this publication and Carla Peddle for assistance in preparing the final version presented here. We are also grateful to two anonymous reviewers for their useful feedback. Current development of Phon and PhonBank is supported by the National Institute of Health. Earlier development of Phon was funded by grants from National Science Foundation, Canada Fund for Innovation, Social Sciences and Humanities Research Council of Canada, Petro-Canada Fund for Young Innovators, and the Office of the Vice-President (Research) and the Faculty of Arts at Memorial University of Newfoundland. TW would also like to acknowledge support provided through NSERC Discovery Grant 228104.

Author information

Authors and Affiliations

Department of Linguistics, Memorial University of Newfoundland, St. John’s, NL, A1B 3X9, Canada
Yvan Rose & Gregory J. Hedlund
Department of Computer Science, Memorial University of Newfoundland, St. John’s, NL, A1B 3X5, Canada
Rod Byrne & Todd Wareham
Department of Psychology, Carnegie Mellon University, Pittsburgh, PA, 15213, USA
Brian MacWhinney

Authors

Yvan Rose
View author publications
You can also search for this author in PubMed Google Scholar
Gregory J. Hedlund
View author publications
You can also search for this author in PubMed Google Scholar
Rod Byrne
View author publications
You can also search for this author in PubMed Google Scholar
Todd Wareham
View author publications
You can also search for this author in PubMed Google Scholar
Brian MacWhinney
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Yvan Rose .

Editor information

Editors and Affiliations

Institute of Informatics, Federal University of Rio Grande do Sul, Av. Bento Gonçalves, Porto Alegre, 9500, Brazil
Aline Villavicencio
Universite Sorbonne Nouvelle, LATTICE-CNRS, Ecole Normale Superieure and, rue d'Ulm 45, Paris, 75005, France
Thierry Poibeau
Computer Laboratory, William Gates Building, University of Cambridge, Thomson Avenue 15 JJ, Cambridge, CB3 0FD, United Kingdom
Anna Korhonen
and Communication (TiCC), Tilburg University, Tilburg center for Cognition, Warandelaan 2, Tilburg, 5037, Netherlands
Afra Alishahi

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Rose, Y., Hedlund, G.J., Byrne, R., Wareham, T., MacWhinney, B. (2013). Phon: A Computational Basis for Phonological Database Building and Model Testing. In: Villavicencio, A., Poibeau, T., Korhonen, A., Alishahi, A. (eds) Cognitive Aspects of Computational Language Acquisition. Theory and Applications of Natural Language Processing. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-31863-4_2

Download citation

DOI: https://doi.org/10.1007/978-3-642-31863-4_2
Published: 27 September 2012
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-31862-7
Online ISBN: 978-3-642-31863-4
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics