Abstract
In multiple speaker environments such as teleconferences we observe a loss of intelligibility, particularly if the sound is monaural in nature. In this study, we exploit the "Cocktail Party Effect", where a person can isolate one sound above all others using sound localization and gender cues. To improve clarity of speech, each speaker is assigned a direction using Head Related Transfer Functions (HRTFs) which creates an auditory map of multiple conversations. A mixture of male and female voices is used to improve comprehension.
We see 6% improvement in cognition while using a male voice in a female dominated environment and 16% improvement in the reverse case. An improvement of 41% is observed while using sound localization with varying elevations. Finally, the improvement in cognition jumps to 71% when both elevations and azimuths are varied. Compared to our previous study, where only azimuths were used, we observe that combining both the azimuths and elevations gives us better results (57% vs. 71%).
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
M. Joshi, K. Kotakonda, N. Gupta, and A. Barreto, “Improving Intelligibilty of Teleconferences Using Binaural Sounds”, REV 2009.
K. J. Faller II, A. Barreto, N. Gupta and N. Rishe, “Performance Comparison of Two Identification Methods for Analysis of Head Related Impulse Responses”, CISSE 2005.
Wenzel,E.M.,Arruda,M.,Kistler,D.J.and Wightman, F.L., “Localization Using Non-individualized Head-Related Transfer Functions,” J.Acoust.Soc.Amer.,Vol.94,111-123, 1993.
Begault, D. R., “A head-up auditory display for TCAS advisories.” Human Factors, 35, 707-717, 1993.
AuSIM, Inc., “HeadZap: AuSIM3D HRTF Measurement System Manual”. AuSIM, Inc., 4962 El Camino Real, Suite 101, Los Altos, CA 94022, 2000.
N. Gupta, A. Barreto and C. Ordonez,“Spectral Modification of Head-Related Transfer Functions for Improved Virtual Sound Spatialization", Proceedings of the 2002 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2002), May 13-17, 2002.
Cherry, E. C., “Some experiments on the recognition of speech, with one and with two ears.” Journal of Acoustical Society of America 25(5), 975—979, 1953.
http://www.samsontech.com/products/productpage.cfm?prodID=1810
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2010 Springer Science+Business Media B.V.
About this paper
Cite this paper
Joshi, M., Iyer, M., Gupta, N., Barreto, A. (2010). Effect of Gender and Sound Spatialization on Speech Intelligibility in Multiple Speaker Environment. In: Sobh, T., Elleithy, K. (eds) Innovations in Computing Sciences and Software Engineering. Springer, Dordrecht. https://doi.org/10.1007/978-90-481-9112-3_94
Download citation
DOI: https://doi.org/10.1007/978-90-481-9112-3_94
Published:
Publisher Name: Springer, Dordrecht
Print ISBN: 978-90-481-9111-6
Online ISBN: 978-90-481-9112-3
eBook Packages: Computer ScienceComputer Science (R0)