Speech deconvolution as an inverse problem

International Journal of Speech Technology

Abstract

This paper addresses the problem of speech deconvolution, which arises in limited-bandwidth speech communication systems such as telephone systems. Three solutions are presented. The first uses a Linear Minimum Mean Square Error (LMMSE) approach, and the assumptions needed to reduce its computational complexity are stated. The second uses inverse filter deconvolution. The third uses regularization theory. The common thread is that all three treat speech deconvolution as an inverse problem built on the speech degradation model. Simulation results show that these solutions perform well on the speech deconvolution problem.
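
The three approaches named in the abstract can be illustrated with a minimal sketch. The abstract does not give the paper's formulation, so everything below is an assumption: a known degradation filter `h`, additive noise, circular convolution, and the textbook frequency-domain forms of each estimator (the LMMSE estimate reduces to a Wiener filter only under stationarity, and the flat power spectra used in the demo are a further simplification). The function names and parameters are hypothetical and chosen for illustration only.

```python
import numpy as np


def degrade(x, h, noise_std, rng):
    """Assumed degradation model: y = h (*) x + n, using circular convolution."""
    n = len(x)
    H = np.fft.rfft(h, n)                      # frequency response of the filter
    y = np.fft.irfft(np.fft.rfft(x) * H, n)    # band-limited signal
    return y + noise_std * rng.standard_normal(n), H


def inverse_filter(y, H, eps=1e-3):
    """Pseudo-inverse filter: divide by H only where |H| exceeds eps."""
    Y = np.fft.rfft(y)
    mask = np.abs(H) > eps
    X = np.zeros_like(Y)
    X[mask] = Y[mask] / H[mask]
    return np.fft.irfft(X, len(y))


def wiener_lmmse(y, H, signal_psd, noise_psd):
    """LMMSE estimate under a stationarity assumption (Wiener filter)."""
    Y = np.fft.rfft(y)
    G = np.conj(H) * signal_psd / (np.abs(H) ** 2 * signal_psd + noise_psd)
    return np.fft.irfft(G * Y, len(y))


def tikhonov(y, H, lam):
    """Regularized solution with a second-difference smoothness operator C."""
    n = len(y)
    Y = np.fft.rfft(y)
    C = np.fft.rfft([1.0, -2.0, 1.0], n)
    G = np.conj(H) / (np.abs(H) ** 2 + lam * np.abs(C) ** 2)
    return np.fft.irfft(G * Y, n)


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    t = np.arange(1024)
    # Two sinusoids stand in for a speech frame (illustrative data only).
    x = np.sin(2 * np.pi * 0.01 * t) + 0.5 * np.sin(2 * np.pi * 0.05 * t)
    h = np.array([0.6, 0.3, 0.1])              # hypothetical low-pass channel
    noise_std = 0.01
    y, H = degrade(x, h, noise_std, rng)

    estimates = {
        "inverse filter": inverse_filter(y, H),
        "LMMSE (Wiener)": wiener_lmmse(y, H, np.var(x), noise_std ** 2),
        "regularized": tikhonov(y, H, lam=1e-3),
    }
    for name, x_hat in estimates.items():
        print(name, "MSE:", np.mean((x_hat - x) ** 2))
```

In this sketch the three estimators differ only in how they stabilize the division by `H`: the inverse filter skips bins where `|H|` is small, the Wiener form weights each bin by the assumed signal-to-noise ratio, and the regularized form penalizes roughness through `lam`. With no noise and `lam = 0`, all three reduce to exact inversion whenever `H` has no zeros.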

Author information

Correspondence to Fathi E. Abd El-samie.

About this article

Cite this article

Abd El-Fattah, M.A., Dessouky, M.I., Diab, S.M. et al. Speech deconvolution as an inverse problem. Int J Speech Technol 14, 273–284 (2011). https://doi.org/10.1007/s10772-011-9102-8

  • DOI: https://doi.org/10.1007/s10772-011-9102-8
