Abstract
This paper describes Phon, an open-source software program for the transcription, coding, and analysis of phonetically transcribed speech corpora. Phon provides support for multimedia data linkage, utterance segmentation, multiple-blind transcription, transcription validation, syllabification, and alignment of target and actual forms. All functions are available through a user-friendly graphical interface. This program provides the basis for building PhonBank, a database project that seeks to broaden the scope of CHILDES into phonological development and disorders.
Notes
1. Support for the Unix/Linux platform is currently compromised, primarily because of licensing issues related to the multimedia functions of the application.
2. Previous experience in computational molecular biology and data mining suggests that, given the large amounts of data involved, various specialized algorithmic techniques will probably have to be invoked to allow time-series pattern matching to run in practical amounts of time and computer memory. The typical approach described in [11] is to simplify the given data, derive approximate analysis-results relative to this simplified data, and (hopefully with minimal effort) reconstruct exact analysis-results relative to the original data. However, there may be other options, such as using so-called fixed-parameter tractable algorithms [4, 16] whose running times are impractical in general but efficient under the restrictions present in learner time-series datasets.
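The simplify, approximate, then reconstruct strategy mentioned in note 2 can be illustrated with a small filter-and-refine range search. The sketch below is only one common instance of that idea, assuming piecewise aggregate approximation (PAA) as the simplification and Euclidean distance as the exact measure; it is not taken from Phon, and all function names (`paa`, `paa_lower_bound`, `range_search`) are hypothetical. Because the distance computed on the simplified data never exceeds the exact distance, pruning on it discards no true match, and the exact check on the survivors recovers the exact result.

```python
import math


def paa(series, segments):
    """Simplify a series to the mean of each of `segments` equal-width chunks."""
    n = len(series)
    if segments <= 0 or n % segments != 0:
        raise ValueError("series length must be a positive multiple of segments")
    width = n // segments
    return [sum(series[i * width:(i + 1) * width]) / width
            for i in range(segments)]


def euclidean(a, b):
    """Exact Euclidean distance between two equal-length series."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def paa_lower_bound(pa, pb, width):
    """Distance on the simplified data; never larger than the exact distance."""
    return math.sqrt(width * sum((x - y) ** 2 for x, y in zip(pa, pb)))


def range_search(query, dataset, radius, segments):
    """Return indices of series within `radius` of `query` (filter, then refine)."""
    width = len(query) // segments
    q_paa = paa(query, segments)
    hits = []
    for idx, candidate in enumerate(dataset):
        # Filter step: cheap test on the simplified representation.
        if paa_lower_bound(q_paa, paa(candidate, segments), width) > radius:
            continue  # safe to prune, the exact distance is at least this large
        # Refine step: exact test on the original data.
        if euclidean(query, candidate) <= radius:
            hits.append(idx)
    return hits
```

In a realistic setting the simplified representations would be computed once and indexed rather than recomputed per query; the loop above keeps everything inline only to make the two steps visible.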
References
Beesley, K. R., & Karttunen, L. (2003). Finite-state morphology. Stanford, CA: CSLI Publications.
Boersma, P., & Weenink, D. (2011). Praat: Doing phonetics by computer [Computer program]. Version 5.2.18, retrieved 10 March 2011 from http://www.praat.org/.
Davis, S., & Hammond, M. (1995). On the status of onglides in American English. Phonology, 12, 159–182.
Downey, R. G., & Fellows, M. R. (1999). Parameterized complexity. New York: Springer.
Fikkert, P. (1994). On the acquisition of prosodic structure. Dordrecht: ICG Printing.
Goad, H., & Rose, Y. (2004). Input elaboration, head faithfulness and evidence for representation in the acquisition of left-edge clusters in West Germanic. In R. Kager, J. Pater, & W. Zonneveld (Eds.), Constraints on phonological acquisition (pp. 109–157). Cambridge/New York: Cambridge University Press.
Gusfield, D. (1997). Algorithms on strings, trees, and sequences: Computer science and computational biology. Cambridge/New York: Cambridge University Press.
Hedlund, G. J., Maddocks, K., Rose, Y., & Wareham, T. (2005). Natural language syllable alignment: From conception to implementation. In Proceedings of the Fifteenth Annual Newfoundland Electrical and Computer Engineering Conference (NECEC 2005) http://www.ucs.mun.ca/~yrose/Research/Publications/files/2005-HedlundEtAl-SyllAlign.pdf.
Inkelas, S., & Rose, Y. (2007). Positional neutralization: A case study from child language. Language, 83, 707–736.
Kaye, J., & Lowenstamm, J. (1984). De la syllabicité. In Forme sonore du langage (pp. 123–161). Paris: Hermann.
Keogh, E. (2008). Indexing and mining time series data. In S. Shekhar & H. Xiong (Eds.), Encyclopedia of GIS (pp. 493–497). New York: Springer.
Kondrak, G. (2003). Phonetic alignment and similarity. Computers and the Humanities, 37, 273–291.
Ladefoged, P., & Maddieson, I. (1996). The sounds of the world’s languages. Cambridge, MA: Blackwell.
Maddocks, K. (2005). An effective algorithm for the alignment of target and actual syllables for the study of language acquisition. B.Sc.h. thesis. Memorial University of Newfoundland.
Mitsa, T. (2010). Temporal data mining. Boca Raton, FL: Chapman and Hall/CRC.
Niedermeier, R. (2006). Invitation to fixed-parameter algorithms. Cambridge/New York: Oxford University Press.
Roddick, J. F., & Spiliopoulou, M. (2002). A survey of temporal knowledge discovery paradigms and methods. IEEE Transactions on Knowledge and Data Engineering, 14, 750–767.
Rose, Y. (2000). Headedness and prosodic licensing in the L1 acquisition of phonology. Ph.D. dissertation. McGill University.
Rose, Y., MacWhinney, B., Byrne, R., Hedlund, G. J., Maddocks, K., O’Brien, P., & Wareham, T. (2006). Introducing Phon: A software solution for the study of phonological acquisition. In Proceedings of the 30th Boston University Conference on Language Development (pp. 489–500). Somerville, MA: Cascadilla Press.
Sankoff, D., & Kruskal, J. B. (Eds.). (1983). Time warps, string edits, and macromolecules: The theory and practice of string comparison. Reading, MA: Addison-Wesley.
Selkirk, E. (1982). The syllable. In The structure of phonological representation (pp. 337–385). Dordrecht: Foris.
Selkirk, E. (1986). On derived domains in sentence phonology. Phonology, 3, 371–405.
Acknowledgements
We would like to thank the co-organisers of the original ACL workshop (namely, Afra Alishahi, Thierry Poibeau, Anna Korhonen and Aline Villavicencio) for their help and support through all the steps that brought us to this publication, and Carla Peddle for assistance in preparing the final version presented here. We are also grateful to two anonymous reviewers for their useful feedback. Current development of Phon and PhonBank is supported by the National Institutes of Health. Earlier development of Phon was funded by grants from the National Science Foundation, Canada Fund for Innovation, Social Sciences and Humanities Research Council of Canada, Petro-Canada Fund for Young Innovators, and the Office of the Vice-President (Research) and the Faculty of Arts at Memorial University of Newfoundland. TW would also like to acknowledge support provided through NSERC Discovery Grant 228104.
Copyright information
© 2013 Springer-Verlag Berlin Heidelberg
About this chapter
Cite this chapter
Rose, Y., Hedlund, G.J., Byrne, R., Wareham, T., MacWhinney, B. (2013). Phon: A Computational Basis for Phonological Database Building and Model Testing. In: Villavicencio, A., Poibeau, T., Korhonen, A., Alishahi, A. (eds) Cognitive Aspects of Computational Language Acquisition. Theory and Applications of Natural Language Processing. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-31863-4_2
DOI: https://doi.org/10.1007/978-3-642-31863-4_2
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-31862-7
Online ISBN: 978-3-642-31863-4
eBook Packages: Computer Science, Computer Science (R0)