Blind Estimation of Spectral Standard Deviation from Room Impulse Response for Reverberation Level Recognition Based on Linear Prediction

Ma, Sai; Xie, Xi

doi:10.1007/978-981-10-4211-9_23

Sai Ma¹² &
Xi Xie¹²

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 685))

Included in the following conference series:

International Forum of Digital TV and Wireless Multimedia Communication

915 Accesses
2 Citations

Abstract

Reverberation is an important factor affecting speech quality and intelligibility, Reverberation Time (RT) and Direct-to-Reverberant Ratio (DRR) are the primary parameters for reverberation strength judgement, spectral standard deviation (SSD) from room impulse response (RIR) and DRR exist as monotonic relationships to some extent which means that SSD can also be an indicator of reverberation characteristics. We propose a blind estimation of spectral standard deviation (BESSD) that is obtained directly from reverberant speech signals. Experiments prove BESSD can be used as an index for male reverberation level recognition.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Blind speech dereverberation using sparse decomposition and multi-channel linear prediction

Article 15 July 2019

Early reflection detection using autocorrelation to improve robustness of speaker verification in reverberant conditions

Article 11 October 2019

Combination of MVDR beamforming and single-channel spectral processing for enhancing noisy and reverberant speech

Article Open access 23 July 2015

References

Allen, J.B., Berkley, D.A.: Image method for efficiently simulating small-room acoustics. J. Acoust. Soc. Am. 65(4), 943–950 (1979). Acoustical Society of America
Article Google Scholar
Jeub, M., Nelke, C., Beaugeant, C., Vary, P.: Blind estimation of the coherent-to-diffuse energy ratio from noisy speech signals. In: 2011 19th European Signal Processing Conference, pp. 1347–1351. IEEE (2011)
Google Scholar
Lehmann, E.A., Johansson, A.M., Nordholm, S.: Reverberation-time prediction method for room impulse responses simulated with the image-source model. In: 2007 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, pp. 159–162. IEEE (2007)
Google Scholar
Ikram, M.Z., Morgan, D.R.: A Multiresolution approach to blind separation of speech signals in a reverberant environment. In: Proceedings of the 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2001), vol. 5, pp. 2757–2760. IEEE (2001)
Google Scholar
Radlovic, B.D., Williamson, R.C., Kennedy, R.A.: Equalization in an acoustic reverberant environment: robustness results. IEEE Trans. Speech Audio Process. 8(3), 311–319 (2000). IEEE
Article Google Scholar
Lehmann, E.A., Johansson, A.M.: Particle filter with integrated voice activity detection for acoustic source tracking. EURASIP J. Adv. Sig. Process. 2007(1), 1–11 (2006). Springer
Article Google Scholar
Aarabi, P., Shi, G.: Phase-based dual-microphone robust speech enhancement. IEEE Trans. Syst. Man Cybern. Part B (Cybern.) 34(4), 1763–1773 (2004). IEEE
Article Google Scholar
Palomäki, K.J., Brown, G.J., Wang, D.: A binaural processor for missing data speech recognition in the presence of noise and small-room reverberation. Speech Commun. 43(4), 361–378 (2004). Elsevier
Article Google Scholar
Joyce, W.B.: Sabine’s reverberation time and ergodic auditoriums. J. Acoust. Soc. Am. 58(3), 643–655 (1975). Acoustical Society of America
Article Google Scholar
Eaton, J., Moore, A.H., Naylor, P.A., Skoglund, J.: Direct-to-reverberant ratio estimation using a null-steered beamformer. In: 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 46–50. IEEE (2015)
Google Scholar
Falk, T.H., Chan, W.-Y.: Temporal dynamics for blind measurement of room acoustical parameters. IEEE Trans. Instrum. Meas. 59(4), 978–989 (2010). IEEE
Article Google Scholar
Gillespie, B.W., Malvar, H.S., Florêncio, D.A.F.: Speech dereverberation via maximum-kurtosis subband adaptive filtering. In: Proceedings of the 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2001), vol. 6, pp. 3701–3704. IEEE (2001)
Google Scholar
Gaubitch, N.D., Ward, D.B., Naylor, P.A.: Statistical analysis of the autoregressive modeling of reverberant speech. J. Acoust. Soc. Am. 120(6), 4031–4039 (2006). Acoustical Society of America
Article Google Scholar
Yegnanarayana, B., Murthy, P.S.: Enhancement of reverberant speech using LP residual signal. IEEE Trans. Speech Audio Process. 8(3), 267–281 (2000). IEEE
Article Google Scholar
Vaidyanathan, P.P.: The theory of linear prediction. Synth. Lect. Sig. Process. 2(1), 1–184 (2007). Morgan & Claypool Publishers
Google Scholar
Peterson, G.E., Barney, H.L.: Control methods used in a study of the vowels. J. Acoust. Soc. Am. 24(2), 175–184 (1952). Acoustical Society of America
Article Google Scholar
Habets, E.A.P., Gannot, S., Cohen, I.: Late reverberant spectral variance estimation based on a statistical model. IEEE Sig. Process. Lett. 16(9), 770–773 (2009). IEEE
Article Google Scholar
Jetzt, J.J.: Critical distance measurement of rooms from the sound energy spectral response. J. Acoust. Soc. Am. 65(5), 1204–1211 (1979). Acoustical Society of America
Article Google Scholar

Download references

Acknowledgments

The authors thank Prof. W.-Y. Chan for his guidance, assistance and support for this research.

Author information

Authors and Affiliations

Communication University of China, Beijing, China
Sai Ma & Xi Xie

Authors

Sai Ma
View author publications
You can also search for this author in PubMed Google Scholar
Xi Xie
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding authors

Correspondence to Sai Ma or Xi Xie .

Editor information

Editors and Affiliations

Shanghai Jiao Tong University, Shanghai, China
Xiaokang Yang
Shanghai Jiao Tong University, Shanghai, China
Guangtao Zhai

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Ma, S., Xie, X. (2017). Blind Estimation of Spectral Standard Deviation from Room Impulse Response for Reverberation Level Recognition Based on Linear Prediction. In: Yang, X., Zhai, G. (eds) Digital TV and Wireless Multimedia Communication. IFTC 2016. Communications in Computer and Information Science, vol 685. Springer, Singapore. https://doi.org/10.1007/978-981-10-4211-9_23

Download citation

DOI: https://doi.org/10.1007/978-981-10-4211-9_23
Published: 12 March 2017
Publisher Name: Springer, Singapore
Print ISBN: 978-981-10-4210-2
Online ISBN: 978-981-10-4211-9
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Blind Estimation of Spectral Standard Deviation from Room Impulse Response for Reverberation Level Recognition Based on Linear Prediction

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

Blind speech dereverberation using sparse decomposition and multi-channel linear prediction

Early reflection detection using autocorrelation to improve robustness of speaker verification in reverberant conditions

Combination of MVDR beamforming and single-channel spectral processing for enhancing noisy and reverberant speech

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding authors

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Subscribe and save

Buy Now

Navigation

Blind Estimation of Spectral Standard Deviation from Room Impulse Response for Reverberation Level Recognition Based on Linear Prediction

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

Blind speech dereverberation using sparse decomposition and multi-channel linear prediction

Early reflection detection using autocorrelation to improve robustness of speaker verification in reverberant conditions

Combination of MVDR beamforming and single-channel spectral processing for enhancing noisy and reverberant speech

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding authors

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation