Inverse Filtering for Speech Dereverberation Without the Use of Room Acoustics Information

Miyoshi, Masato; Delcroix, Marc; Kinoshita, Keisuke; Yoshioka, Takuya; Nakatani, Tomohiro; Hikichi, Takafumi

doi:10.1007/978-1-84996-056-4_9

Part of the book series: Signals and Communication Technology (SCT)

Abstract

This chapter discusses multi-microphone inverse filtering that does not use a priori information about room acoustics, such as the room impulse responses between the target speaker and the microphones. A major problem with this type of processing is that excessive equalization of the speech characteristics degrades the recovered speech. To overcome this problem, several approaches have been studied based on a multichannel linear prediction framework, since the framework may be able to perform speech dereverberation as well as noise attenuation. Here, we first discuss the relationship between optimal filtering and linear prediction. Then, we review our four approaches, which differ in how they treat the statistical properties of a speech signal.
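
The excessive equalization mentioned in the abstract can be illustrated with a small numerical sketch. It is not one of the chapter's four approaches; it is plain least-squares multichannel linear prediction under assumed conditions: two noiseless microphones, hypothetical four-tap impulse responses `h`, and a first-order autoregressive signal as a crude stand-in for speech. The function names (`mclp_residual`, `reverberate`, `abs_corr`) are illustrative only.

```python
import numpy as np

def mclp_residual(x, order):
    """Predict channel 0 from the past `order` samples of every channel
    (least squares) and return the prediction residual."""
    n_ch, n = x.shape
    # Column (m, k) holds x[m, t - k] for t = order .. n - 1.
    past = np.stack([x[m, order - k:n - k]
                     for m in range(n_ch)
                     for k in range(1, order + 1)], axis=1)
    target = x[0, order:]
    coeffs, *_ = np.linalg.lstsq(past, target, rcond=None)
    return target - past @ coeffs

def reverberate(s, responses):
    """Convolve one source with each impulse response, truncated to len(s)."""
    return np.stack([np.convolve(s, h)[:len(s)] for h in responses])

def abs_corr(a, b):
    """Magnitude of the correlation coefficient between two signals."""
    return abs(np.corrcoef(a, b)[0, 1])

rng = np.random.default_rng(0)
n, order = 20000, 3  # order = taps - 1 suffices for two channels here

# Hypothetical impulse responses (assumed, not taken from the chapter).
h = [np.array([1.0, 0.6, -0.3, 0.2]),
     np.array([0.8, -0.5, 0.4, 0.1])]

# Case 1: white source. The residual tracks the source itself.
white = rng.standard_normal(n)
res_white = mclp_residual(reverberate(white, h), order)
corr_white = abs_corr(res_white, white[order:])

# Case 2: colored source (AR(1) driven by a white innovation).
innovation = rng.standard_normal(n)
colored = np.zeros(n)
colored[0] = innovation[0]
for t in range(1, n):
    colored[t] = 0.9 * colored[t - 1] + innovation[t]
res_colored = mclp_residual(reverberate(colored, h), order)
corr_innovation = abs_corr(res_colored, innovation[order:])
corr_source = abs_corr(res_colored, colored[order:])

print("white source, residual vs source:      ", round(corr_white, 3))
print("colored source, residual vs innovation:", round(corr_innovation, 3))
print("colored source, residual vs source:    ", round(corr_source, 3))
```

In this toy setting the prediction residual follows a white source closely, but for the colored source it follows the white innovation rather than the source: the predictor removes the source's own correlation along with the reverberation. That whitening is the kind of over-equalization the abstract refers to.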


Author information

Authors and Affiliations

NTT Communication Science Laboratories, Seika-cho Soraku-gun, Kyoto, Japan
Masato Miyoshi, Marc Delcroix, Keisuke Kinoshita, Takuya Yoshioka, Tomohiro Nakatani & Takafumi Hikichi

Editor information

Editors and Affiliations

Department of Electrical and Electronic Engineering, Imperial College London, Exhibition Road, SW7 2AZ, London, UK
Patrick A. Naylor & Nikolay D. Gaubitch

About this chapter

Cite this chapter

Miyoshi, M., Delcroix, M., Kinoshita, K., Yoshioka, T., Nakatani, T., Hikichi, T. (2010). Inverse Filtering for Speech Dereverberation Without the Use of Room Acoustics Information. In: Naylor, P., Gaubitch, N. (eds) Speech Dereverberation. Signals and Communication Technology. Springer, London. https://doi.org/10.1007/978-1-84996-056-4_9

DOI: https://doi.org/10.1007/978-1-84996-056-4_9
Publisher Name: Springer, London
Print ISBN: 978-1-84996-055-7
Online ISBN: 978-1-84996-056-4
eBook Packages: Engineering, Engineering (R0)
