Skip to main content

Impact of Irregular Pronunciation on Phonetic Segmentation of Nijmegen Corpus of Casual Czech

  • Conference paper
Text, Speech and Dialogue (TSD 2014)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 8655))

Included in the following conference series:

  • 1561 Accesses

Abstract

This paper describes the pilot study of phonetic segmentation applied to Nijmegen Corpus of Casual Czech (NCCCz). This corpus contains informal speech of strong spontaneous nature which influences the character of produced speech at various levels. This work is the part of wider research related to the analysis of pronunciation reduction in such informal speech. We present the analysis of the accuracy of phonetic segmentation when canonical or reduced pronunciation is used. The achieved accuracy of realized phonetic segmentation provides information about general accuracy of proper acoustic modelling which is supposed to be applied in spontaneous speech recognition. As a byproduct of presented spontaneous speech segmentation, this paper also describes the created lexicon with canonical pronunciations of words in NCCCz, a tool supporting pronunciation check of lexicon items, and finally also a minidatabase of selected utterances from NCCCz manually labelled on phonetic level suitable for evaluation purposes.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Similar content being viewed by others

References

  1. Kohler, K.J.: Segmental reduction in connected speech in German: Phonological facts and phonetic explanations. In: Hardcastle, W.J., Marchal, A. (eds.) Speech Production and Speech Modelling, pp. 69–92. Kluwer Academic Publishers (1990)

    Google Scholar 

  2. Ernestus, M.: Voice assimilation and segment reduction in Dutch: A corpus-based study of the phonology-phonetics interface. LOT, Utrecht (2000)

    Google Scholar 

  3. Johnson, K.: Massive reduction in conversational American English. In: Yoneyama, K., Maekawa, K. (eds.) Proc. of the 10th International Symposium on Spontaneous Speech: Data and Analysis, Tokyo, Japan, pp. 29–54 (2004)

    Google Scholar 

  4. Hinton, G., et al.: Deep neural networks for acoustic modeling in speech recognition. Signal Processing Magazine, 82–97 (2012)

    Google Scholar 

  5. Vaněk, J., Psutka, J.V.: Gender-dependent acoustic models fusion developed for automatic subtitling of parliament meetings broadcasted by the Czech TV. In: Sojka, P., Horák, A., Kopeček, I., Pala, K. (eds.) TSD 2010. LNCS (LNAI), vol. 6231, pp. 431–438. Springer, Heidelberg (2010)

    Chapter  Google Scholar 

  6. Ernestus, M., Kočková-Amortová, L., Pollak, P.: The Nijmegen Corpus of Casual Czech. In: LREC 2014, Reykjavik, Iceland, May 26-31 (2014)

    Google Scholar 

  7. Pollak, P., Černocký, J.: Czech SPEECON adult database. Technical report (November 2003), http://www.speechdat.org/speecon

  8. Pollak, P., Černocký, J., et al.: Speechdat(E) – Eastern European telephone speech databases. In: Proc of XLDB, Athens, Greece (2000)

    Google Scholar 

  9. Siemund, R., Höge, H., Kunzmann, S., Marasek, K.: SPEECON – Speech data for consumer devices. In: Proc. of the LREC 2000, Athens, Greece (2000)

    Google Scholar 

  10. Hanzl, V., Pollak, P.: Tool for Czech Pronunciation Generation Combining Fixed Rules with Pronunciation Lexicon and Lexicon Management Tool. In: Proc. of LREC 2002, Las Palmas de Gran Canaria, Spain, pp. 1264–1269 (2002)

    Google Scholar 

  11. Cmejla, R., et al.: Bayesian changepoint detection for the automatic assessment of fluency and articulatory disorders. Speech Communication, 178–189 (2013)

    Google Scholar 

  12. Mizera, P., Pollak, P.: Accuracy of HMM-based phonetic segmentation using monophone or triphone acoustic model. In: Proc. of Applied Electronics, Pilsen, Czech Republic (2013)

    Google Scholar 

  13. Schwarz, P.: Phoneme recognition based on long temporal context. PhD Thesis, Brno University of Technology (2009)

    Google Scholar 

  14. Povey, D., Ghoshal, A., et al.: The Kaldi Speech Recognition Toolkit. In: Proc. of ASRU, Hawaii, USA (2011)

    Google Scholar 

  15. Pollak, P., Volin, J., Skarnitzl, R.: Phone Segmentation Tool with Integrated Pronunciation Lexicon and Czech Phonetically Labelled Reference Database. In: Proc of LREC, Marrakech, Morocco (2008)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2014 Springer International Publishing Switzerland

About this paper

Cite this paper

Mizera, P., Pollak, P., Kolman, A., Ernestus, M. (2014). Impact of Irregular Pronunciation on Phonetic Segmentation of Nijmegen Corpus of Casual Czech. In: Sojka, P., Horák, A., Kopeček, I., Pala, K. (eds) Text, Speech and Dialogue. TSD 2014. Lecture Notes in Computer Science(), vol 8655. Springer, Cham. https://doi.org/10.1007/978-3-319-10816-2_60

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-10816-2_60

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-10815-5

  • Online ISBN: 978-3-319-10816-2

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics