Abstract
The acoustic environment is typically composed of multiple simultaneous events. A remarkable achievement of the auditory system is its ability to disentangle the acoustic mixture and group the sound energy that originates from the same event or source. This process of auditory organization is referred to as auditory scene analysis. The cocktail party problem, or segregation of speech from interfering sounds, has proven to be extremely challenging from the computational standpoint.
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Wang, D. (2006). Cocktail Party Processing. In: Sichman, J.S., Coelho, H., Rezende, S.O. (eds) Advances in Artificial Intelligence - IBERAMIA-SBIA 2006. Lecture Notes in Computer Science, vol 4140. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11874850_4
DOI: https://doi.org/10.1007/11874850_4
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-45462-5
Online ISBN: 978-3-540-45464-9
eBook Packages: Computer Science, Computer Science (R0)