Abstract
This paper describes a method for speech feature extraction using morphological signal processing based on the so-called “slope transformation”. The proposed approach is used to extract the upper spectral envelope of the signal. Automatic speech recognition (ASR) and automatic speaker identification (ASI) experiments, undertaken to check the performance of the presented method, showed evident improvements in the recognition of isolated words, especially for female voices. Benefits of using the slope transformation were also observed in the speaker identification experiment.
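To make the idea of an upper spectral envelope concrete, the following is a minimal sketch and not the authors' slope filter. It assumes a simpler, related morphological operation: a flat closing (dilation followed by erosion) applied to the log-magnitude spectrum of one frame. The function names, the synthetic test frame, and the `half_width` value are illustrative assumptions that do not come from the paper.

```python
import numpy as np


def dilate(x, half_width):
    """Flat dilation: running maximum over 2*half_width + 1 samples."""
    padded = np.pad(x, half_width, mode="edge")
    windows = np.lib.stride_tricks.sliding_window_view(padded, 2 * half_width + 1)
    return windows.max(axis=1)


def erode(x, half_width):
    """Flat erosion: running minimum over 2*half_width + 1 samples."""
    padded = np.pad(x, half_width, mode="edge")
    windows = np.lib.stride_tricks.sliding_window_view(padded, 2 * half_width + 1)
    return windows.min(axis=1)


def upper_envelope(log_spectrum, half_width=8):
    """Morphological closing (dilation, then erosion).

    Closing never falls below its input, so the result rides on top of
    the spectral peaks. This is an assumed stand-in for the slope-based
    filter described in the paper, not a reimplementation of it.
    """
    return erode(dilate(log_spectrum, half_width), half_width)


# Synthetic voiced-like frame (assumption): ten harmonics of 200 Hz at 8 kHz.
fs = 8000
t = np.arange(512) / fs
frame = sum(np.sin(2 * np.pi * 200 * k * t) / k for k in range(1, 11))
frame = frame * np.hanning(512)

log_spectrum = np.log(np.abs(np.fft.rfft(frame)) + 1e-10)
envelope = upper_envelope(log_spectrum, half_width=8)
```

The window of 17 bins is chosen here only so that it spans more than one harmonic spacing (about 13 bins for this synthetic frame), which lets the closing bridge the valleys between harmonics. A real front end would need to tune this width to the expected pitch range.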
Copyright information
© 2009 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Drgas, S., Dabrowski, A. (2009). Application of Slope Filtering to Robust Spectral Envelope Extraction for Speech/Speaker Recognition. In: Vetulani, Z., Uszkoreit, H. (eds) Human Language Technology. Challenges of the Information Society. LTC 2007. Lecture Notes in Computer Science, vol 5603. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-04235-5_2
DOI: https://doi.org/10.1007/978-3-642-04235-5_2
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-04234-8
Online ISBN: 978-3-642-04235-5
eBook Packages: Computer Science (R0)