Skip to main content

Application of Slope Filtering to Robust Spectral Envelope Extraction for Speech/Speaker Recognition

  • Conference paper
Human Language Technology. Challenges of the Information Society (LTC 2007)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 5603))

Included in the following conference series:

  • 659 Accesses

Abstract

This paper describes a method for speech feature extraction using morphological signal processing based on the so-called “slope transformation”. The proposed approach has been used to extract the signal upper spectral envelope. Results of experiments of the automatic speech recognition (ASR) and automatic speaker identification (ASI), which were undertaken to check the performance of the presented method, have shown some evident improvements of the effectiveness of recognition of isolated words, especially for women voices. The benefits of using slope transformation was also observed in speaker identification experiment.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Bakis, R.: Continuous speech recognition via centisecond acoustic states. Acoustical Society of America Journal 59, 97–+ (1976)

    Article  Google Scholar 

  2. Davis, S., Mermelstein, P.: Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences. IEEE Transactions on Acoustics, Speech, and Signal Processing [see also IEEE Transactions on Signal Processing] 28(4), 357–366 (1980)

    Article  Google Scholar 

  3. Dempster, A., Laird, N., Rubin, D.: Maximum likelihood from incomplete data via the em algorithm. Journal of the Royal Statistical Society 39(1), 1–38 (1977)

    MathSciNet  MATH  Google Scholar 

  4. Garofolo, J.S., et al: TIMIT Acoustic-Phonetic Continuous Speech Corpus. Linguistic Data Consortium, Philadelphia (1993)

    Google Scholar 

  5. Grocholewski, S.: Statystyczne podstawy systemu ARM dla jêzyka polskiego. Wyd. Politechniki Poznañskiej (2001)

    Google Scholar 

  6. Gu, L., Rose, K.: Perceptual harmonic cepstral coefficients for speech recognition in noisy environment. In: Proc. ICASSP (2001)

    Google Scholar 

  7. Maragos, P.: Slope transforms: theory and application to nonlinear signal processing. IEEE Transactions on Signal Processing 43(4), 864–877 (1995)

    Article  Google Scholar 

  8. Marciniak, T., Rochowniak, R., Dabrowski, A.: Detection of endpoints of isolated words using slope transformation. In: Proc. MIXDES, pp. 655–659 (2006)

    Google Scholar 

  9. Meyer, A.: Zastosowanie transformacji zafalowaniowej do odszumiania sygnaw audio i poprawy zrozumiaoci mowy. PhD thesis, Poznan University of Technology (2005)

    Google Scholar 

  10. Odell, J., Ollason, D., Woodland, P., Young, S., Jansen, J.: The HTK Book for HTK V2.0. Cambridge University Press, Cambridge (1995)

    Google Scholar 

  11. Yapanel, U.H., Hansen, J.H.L.: A new perceptually motivated mvdr-based acoustic front-end (pmvdr) for robust automatic speech recognition. Speech Communication 50(2), 142–152 (2008)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2009 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Drgas, S., Dabrowski, A. (2009). Application of Slope Filtering to Robust Spectral Envelope Extraction for Speech/Speaker Recognition. In: Vetulani, Z., Uszkoreit, H. (eds) Human Language Technology. Challenges of the Information Society. LTC 2007. Lecture Notes in Computer Science(), vol 5603. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-04235-5_2

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-04235-5_2

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-04234-8

  • Online ISBN: 978-3-642-04235-5

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics