Skip to main content

Application of Zero-Frequency Filtering for Vowel Onset Point Detection

  • Conference paper
Mining Intelligence and Knowledge Exploration

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 8891))

Abstract

Vowel onset points in speech signals, are the instances where the voicing of the vowels begin. These points serve as important landmarks for the analysis as well as synthesis of speech signals. These landmarks help to identify the information about the behaviour of transition of several different sounds into and out of the vowel regions. In this paper, we propose a new method to identify vowel onset points for a speech signal using the zero frequency filtered (ZFF) speech signal and its frequency spectrum. The ZFF signal is obtained by passing the speech signal through a resonator with central frequency as 0 Hz. Therefore, ZFF signal essentially contains the low pass components of a given speech signal. Vowels are mostly characterized by the significant energy content in the relatively low frequency bands. Significant improvement in VOP detection performance is observed using proposed method compared to existing methods.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Rao, K.S., Vuppala, A.K.: Non-uniform time scale modification using instants of significant excitation and vowel onset points. Elsevier Speech Communication 55(6), 745–756 (2013)

    Article  Google Scholar 

  2. Prasanna, S.R.M., Reddy, B.V.S., Krishnamoorthy, P.: Vowel onset point detection using source, spectral peaks, and modulation spectrum energies. IEEE Trans. on Audio, Speech, and Language Processing 17(4), 556–565 (2009)

    Article  Google Scholar 

  3. Prasanna, S.R.M., Gangashetty, S.V., Yegnanarayana, B.: Significance of vowel onset point for speech analysis. In: Proc. of Int. Conf. Signal Processing and Communications, Bangalore, India, pp. 81–88 (2001)

    Google Scholar 

  4. Vuppala, A.K., Rao, K.S., Chakrabarti, S.: Improved consonant-vowel recognition for low bit-rate coded speech. Wiley International Journal of Adaptive control and Signal processing 26(4), 333–349 (2012)

    Article  Google Scholar 

  5. Gangashetty, S.V., Sekhar, C.C., Yegnanarayana, B.: Detection of vowel onset points in continuous speech using autoassociative neural network models. In: Proc. Int. Conf. Spoken Language Processing, Jeju Island, Korea, pp. 401–410 (2004)

    Google Scholar 

  6. Vuppala, A.K., Rao, K.S., Chakrabarti, S.: Spotting and recognition of consonant-vowel units from continuous speech using accurate vowel onset points. Springer Circuits, Systems and Signal Processing 31(4), 1459–1474 (2012)

    Article  Google Scholar 

  7. Rao, K.S., Yegnanarayana, B.: Duration modification using glottal closure instants and vowel onset points. Speech Communication 51, 1263–1269 (2009)

    Article  Google Scholar 

  8. Vuppala, A.K., Rao, K.S.: Speaker identification under background noise using features extracted from steady vowel regions. Wiley International Journal of Adaptive control and Signal processing 29(9), 781–792 (2013)

    Article  Google Scholar 

  9. Vuppala, A.K., Yadav, J., Rao, K.S., Chakrabarti, S.: Vowel onset point detection for low bit rate coded speech. IEEE Transactions on Audio, Speech and Language Processing 20(6), 1894–1903 (2012)

    Article  Google Scholar 

  10. Hermes, D.J.: Vowel onset detection. J. Acoust. Soc. Amer. 87, 866–873 (1990)

    Article  Google Scholar 

  11. Wang, J.-F., Wu, C.H., Chang, S.H., Lee, J.Y.: A hierarchical neural network based C/V segmentation algorithm for Mandarin speech recognition. IEEE Trans. on Signal Processing 39(9), 2141–2146 (1991)

    Article  Google Scholar 

  12. Wang, J.-H., Chen, S.-H.: A C/V segmentation algorithm for Mandarin speech using wavelet transforms. In: Proc. IEEE Int. Conf. Acoust., Speech, Signal Processing, Phoenix, Arizona, pp. 1261–1264 (1999)

    Google Scholar 

  13. Gangashetty, S.V., Sekhar, C.C., Yegnanarayana, B.: Extraction of fixed dimension patterns from varying duration segments of consonant-vowel utterances. In: Proc. of IEEE ICISIP, pp. 159–164 (2004)

    Google Scholar 

  14. Prasanna, S.R.M., Yegnanarayana, B.: Detection of vowel onset point events using excitation source information. In: Proc. of Interspeech, Lisbon, Portugal, pp. 1133–1136 (2005)

    Google Scholar 

  15. Murty, K.S.R., Yegnanarayana, B.: Epoch extraction from speech signals. IEEE Trans. on Audio, Speech, and Language Processing 16(8), 1602–1613 (2008)

    Article  Google Scholar 

  16. Garofolo, J.S., Lamel, L.F., Fisher, W.M., Fiscus, J.G., Pallett, D.S., Dahlgren, N.L., Zue, V.: TIMIT acoustic-phonetic continuous speech corpus linguistic data consortium. In: Proc. of IEEE ICISIP, Philadelphia, PA (1993)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2014 Springer International Publishing Switzerland

About this paper

Cite this paper

Vuppala, A.K. (2014). Application of Zero-Frequency Filtering for Vowel Onset Point Detection. In: Prasath, R., O’Reilly, P., Kathirvalavakumar, T. (eds) Mining Intelligence and Knowledge Exploration. Lecture Notes in Computer Science(), vol 8891. Springer, Cham. https://doi.org/10.1007/978-3-319-13817-6_18

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-13817-6_18

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-13816-9

  • Online ISBN: 978-3-319-13817-6

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics