Application of Slope Filtering to Robust Spectral Envelope Extraction for Speech/Speaker Recognition

Drgas, Szymon; Dabrowski, Adam

doi:10.1007/978-3-642-04235-5_2

Szymon Drgas²¹ &
Adam Dabrowski²¹

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 5603))

Included in the following conference series:

Language and Technology Conference

688 Accesses

Abstract

This paper describes a method for speech feature extraction using morphological signal processing based on the so-called “slope transformation”. The proposed approach has been used to extract the signal upper spectral envelope. Results of experiments of the automatic speech recognition (ASR) and automatic speaker identification (ASI), which were undertaken to check the performance of the presented method, have shown some evident improvements of the effectiveness of recognition of isolated words, especially for women voices. The benefits of using slope transformation was also observed in speaker identification experiment.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Spectral Analysis for Automatic Speech Recognition and Enhancement

Iterative Thresholding-Based Spectral Subtraction Algorithm for Speech Enhancement

Enhancing speech intelligibility in reverberant spaces by a speech features distributions dependent pre-processing

Article 19 July 2018

References

Bakis, R.: Continuous speech recognition via centisecond acoustic states. Acoustical Society of America Journal 59, 97–+ (1976)
Article Google Scholar
Davis, S., Mermelstein, P.: Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences. IEEE Transactions on Acoustics, Speech, and Signal Processing [see also IEEE Transactions on Signal Processing] 28(4), 357–366 (1980)
Article Google Scholar
Dempster, A., Laird, N., Rubin, D.: Maximum likelihood from incomplete data via the em algorithm. Journal of the Royal Statistical Society 39(1), 1–38 (1977)
MathSciNet MATH Google Scholar
Garofolo, J.S., et al: TIMIT Acoustic-Phonetic Continuous Speech Corpus. Linguistic Data Consortium, Philadelphia (1993)
Google Scholar
Grocholewski, S.: Statystyczne podstawy systemu ARM dla jêzyka polskiego. Wyd. Politechniki Poznañskiej (2001)
Google Scholar
Gu, L., Rose, K.: Perceptual harmonic cepstral coefficients for speech recognition in noisy environment. In: Proc. ICASSP (2001)
Google Scholar
Maragos, P.: Slope transforms: theory and application to nonlinear signal processing. IEEE Transactions on Signal Processing 43(4), 864–877 (1995)
Article Google Scholar
Marciniak, T., Rochowniak, R., Dabrowski, A.: Detection of endpoints of isolated words using slope transformation. In: Proc. MIXDES, pp. 655–659 (2006)
Google Scholar
Meyer, A.: Zastosowanie transformacji zafalowaniowej do odszumiania sygnaw audio i poprawy zrozumiaoci mowy. PhD thesis, Poznan University of Technology (2005)
Google Scholar
Odell, J., Ollason, D., Woodland, P., Young, S., Jansen, J.: The HTK Book for HTK V2.0. Cambridge University Press, Cambridge (1995)
Google Scholar
Yapanel, U.H., Hansen, J.H.L.: A new perceptually motivated mvdr-based acoustic front-end (pmvdr) for robust automatic speech recognition. Speech Communication 50(2), 142–152 (2008)
Article Google Scholar

Download references

Author information

Authors and Affiliations

Chair of Control and Systems Engineering, Poznan University of Technology, ul. Piotrowo 3a, Poznan, Poland
Szymon Drgas & Adam Dabrowski

Authors

Szymon Drgas
View author publications
You can also search for this author in PubMed Google Scholar
Adam Dabrowski
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Faculty of Mathematics and Computer Science, Adam Mickiewicz University in Poznań, ul. Umultowska 87, P.O. Box, 61614, Poznań, Poland
Zygmunt Vetulani
Language Technology Lab, German Research Center for Artificial Intelligence (DFKI), Campus D 3 1, Stuhlsatzenhausweg 3, D-66123, Saarbrücken, Germany
Hans Uszkoreit

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Drgas, S., Dabrowski, A. (2009). Application of Slope Filtering to Robust Spectral Envelope Extraction for Speech/Speaker Recognition. In: Vetulani, Z., Uszkoreit, H. (eds) Human Language Technology. Challenges of the Information Society. LTC 2007. Lecture Notes in Computer Science(), vol 5603. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-04235-5_2

Download citation

DOI: https://doi.org/10.1007/978-3-642-04235-5_2
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-04234-8
Online ISBN: 978-3-642-04235-5
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics