Skip to main content

Phon: A Computational Basis for Phonological Database Building and Model Testing

  • Chapter
  • First Online:
Cognitive Aspects of Computational Language Acquisition

Abstract

This paper describes Phon, an open-source software program for the transcription, coding, and analysis of phonetically-transcribed speech corpora. Phon provides support for multimedia data linkage, utterance segmentation, multiple-blind transcription, transcription validation, syllabification, and alignment of target and actual forms. All functions are available through a user-friendly graphical interface. This program provides the basis for the building of PhonBank, a database project that seeks to broaden the scope of CHILDES into phonological development and disorders.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 54.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    Support for the Unix/Linux platform is currently compromised, primarily because of licensing issues related to the multimedia functions of the application.

  2. 2.

    Previous experience in computational molecular biology and data mining suggests that, given the large amounts of data involved, various specialized algorithmic techniques will probably have to be invoked to allow time-series pattern matching to run in practical amounts of time and computer memory. The typical approach described in [11] is to simplify the given data, derive approximate analysis-results relative to this simplified data, and (hopefully with minimal effort) reconstruct exact analysis-results relative to the original data. However, there may be other options, such as using so-called fixed-parameter tractable algorithms [4, 16] whose running times are impractical in general but efficient under the restrictions present in learner time-series datasets.

References

  1. Beesley, K. R., & Karttunen, L. (2003). Finite-state morphology. Stanford, CA: CSLI Publications.

    Google Scholar 

  2. Boersma, P., & Weenink, D. (2011). Praat: Doing phonetics by computer [Computer program]. Version 5.2.18, retrieved 10 March 2011 from http://www.praat.org/.

  3. Davis, S., & Hammond, M. (1995). On the status of onglides in American English. Phonology, 12, 159–182.

    Article  Google Scholar 

  4. Downey, R. G., & Fellows, M. R. (1999). Parameterized complexity. New York: Springer.

    Book  Google Scholar 

  5. Fikkert, P. (1994). On the acquisition of prosodic structure. Dordrecht: ICG Printing.

    Google Scholar 

  6. Goad, H., & Rose, Y. (2004). Input elaboration, head faithfulness and evidence for representation in the acquisition of left-edge clusters in West Germanic. In R. Kager, J. Pater, & W. Zonneveld (Eds.), Constraints on phonological acquisition (pp. 109–157). Cambridge/ New York: Cambridge University Press.

    Google Scholar 

  7. Gusfield, D. (1997). Algorithms on strings, trees, and sequences: Computer science and computational biology. Cambridge/New York: Cambridge University Press.

    Book  MATH  Google Scholar 

  8. Hedlund, G. J., Maddocks, K., Rose, Y., & Wareham, T. (2005). Natural language syllable alignment: From conception to implementation. In Proceedings of the Fifteenth Annual Newfoundland Electrical and Computer Engineering Conference (NECEC 2005) http://www.ucs.mun.ca/~yrose/Research/Publications/files/2005-HedlundEtAl-SyllAlign.pdf.

  9. Inkelas, S., & Rose, Y. (2007). Positional neutralization: A case study from child language. Language, 83, 707–736.

    Article  Google Scholar 

  10. Kaye, J., & Lowenstamm, J. (1984). De la syllabicité. In Forme sonore du langage (pp. 123–161). Paris: Hermann.

    Google Scholar 

  11. Keogh, E. (2008). Indexing and mining time series data. In: S. Shekhar & H. Xiong (Eds.), Encyclopedia of GIS (pp. 493–497). New York: Springer.

    Chapter  Google Scholar 

  12. Kondrak, G. (2003). Phonetic alignment and similarity. Computers in the Humanities, 37, 273–291.

    Article  Google Scholar 

  13. Ladefoged, P., & Maddieson, I. (1996). The sounds of the world’s languages. Cambridge, MA: Blackwell.

    Google Scholar 

  14. Maddocks, K. (2005). An effective algorithm for the alignment of target and actual syllables for the study of language acquisition. B.Sc.h. thesis. Memorial University of Newfoundland.

    Google Scholar 

  15. Mitsa, T. (2010). Temporal data mining. Boca Raton, FL: Chapman and Hall/CRC.

    Book  MATH  Google Scholar 

  16. Niedermeier, R. (2006). Invitation to fixed-parameter algorithms. Cambridge/New York: Oxford University Press.

    Book  MATH  Google Scholar 

  17. Roddick, J. F., & Spiliopoulou, M. (2002). A survey of temporal knowledge discovery paradigms and methods. IEEE Transactions on Knowledge and Data Engineering, 14, 750–767.

    Article  Google Scholar 

  18. Rose, Y. (2000). Headedness and prosodic licensing in the L1 acquisition of phonology. Ph.D. dissertation. McGill University.

    Google Scholar 

  19. Rose, Y., MacWhinney, B., Byrne, R., Hedlund, G. J., Maddocks, K., O’Brien, P., & Wareham, T. (2006). Introducing phon: A software solution for the study of phonological Acquisition. In Proceedings of the 30th Boston University Conference on Language Development (pp. 489–500). Somerville, MA: Cascadilla Press.

    Google Scholar 

  20. Sankoff, D., & Kruskal, J. B. (Eds.). (1983). Time warps, string edits, and macromolecules: The theory and practice of string comparison. Reading, MA: Addison-Wesley.

    Google Scholar 

  21. Selkirk, E. (1982). The syllable. In The structure of phonological representation (pp 337–385). Dordrecht: Foris.

    Google Scholar 

  22. Selkirk, E. (1986). On derived domains in sentence phonology. Phonology, 3, 371–405.

    Article  Google Scholar 

Download references

Acknowledgements

We would like thank the co-organisers of the original ACL workshop (namely, Afra Alishahi, Thierry Poibeau, Anna Korhonen and Aline Villavicencio) for their help and support through all the steps that brought us to this publication and Carla Peddle for assistance in preparing the final version presented here. We are also grateful to two anonymous reviewers for their useful feedback. Current development of Phon and PhonBank is supported by the National Institute of Health. Earlier development of Phon was funded by grants from National Science Foundation, Canada Fund for Innovation, Social Sciences and Humanities Research Council of Canada, Petro-Canada Fund for Young Innovators, and the Office of the Vice-President (Research) and the Faculty of Arts at Memorial University of Newfoundland. TW would also like to acknowledge support provided through NSERC Discovery Grant 228104.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Yvan Rose .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2013 Springer-Verlag Berlin Heidelberg

About this chapter

Cite this chapter

Rose, Y., Hedlund, G.J., Byrne, R., Wareham, T., MacWhinney, B. (2013). Phon: A Computational Basis for Phonological Database Building and Model Testing. In: Villavicencio, A., Poibeau, T., Korhonen, A., Alishahi, A. (eds) Cognitive Aspects of Computational Language Acquisition. Theory and Applications of Natural Language Processing. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-31863-4_2

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-31863-4_2

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-31862-7

  • Online ISBN: 978-3-642-31863-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics