ABSTRACT
The rapid development of 3D film stimulates the requirement for 3D audio. Current 3D audio systems mainly focus on the performance of directional sound image. Multichannel techniques extract binaural cues from channels to represent the directional information with less information about sound distance, which results in the degradation of distance perception quality due to the perceptual difference of direction and distance perception. The distance information is the key to distinct 3D audio from 2D audio. We focus on the auditory distance perception in 3D audio coding. The auditory distance estimation model is established based on the auditory perception of the human ear and imported to the multichannel 3D audio coding system to conduct coding and reproduction of the sound from different directions and distances. Experimental results verified the performance of proposed 3D audio coding based on distance perception.
- m23748_Use cases and possible material for 3D AudioGoogle Scholar
- K. Hamasaki, T. Nishiguchi, R. Okumura, Y. Nakayama and A. Ando, A 22.2 multi-channel sound system for ultrahigh-definition TV(UHDTV), SMPTE Motion Imaging J., Vol. 117, No. 3, pp. 40--49, 2008Google Scholar
- Hu et al.: "Perceptual characteristic and compression research in 3D audio technology", 9th International Symposium on Computer Music Modelling and Retrieval (CMMR 2012), 19-22 June 2012, Queen Mary University of LondonGoogle Scholar
- ISO/IEC JTC1/SC29/WG11/N11865, Coding of Moving Pictures and Audio, http://mpeg.chiariglione.org/workingdocuments.php.Google Scholar
- Mershon D H, King L E. Intensity and reverberation as factors in the auditory perception of egocentric distance. Attention, Perception, & Psychophysics, 1975, 18(6): 409--415.Google Scholar
- P. Zahorik, D. S. Brungart, and A. W. Bronkhorst. Auditory Distance Perception in Humans: A Summary of Past and Present Research. Acta Acustica united with Acustica, 2005. May/June(91): 409--420.Google Scholar
- Blauert J. Spatial Hearing-Revised Edition: The Psychophysics of Human Sound Localization. MIT press, 1996.Google Scholar
- KopcoN, Shinn-Cunningham B G. Effect of stimulus spectrum on distance perception for nearby sources. The Journal of the Acoustical Society of America, 2011, 130(3): 1530.Google ScholarCross Ref
- Baumgarte F, Faller C. Binaural cue coding-part I: Psychoacoustic fundamentals and design principles. Speech and Audio Processing, IEEE Transactions on, 2003, 11(6): 509--519.Google Scholar
- Faller C, Baumgarte F. Binaural cue coding-Part II: Schemes and applications. Speech and Audio Processing, IEEE Transactions on, 2003, 11(6): 520--531.Google Scholar
- ITU-R Recommendation. BS.1534-1. Method for the Subjective Assessment of Intermediate Sound Quality (MUSHRA). International Telecommunications Union, Geneva, 2001.Google Scholar
- Yan-Chen Lu; Cooke, M., "Binaural Estimation of Sound Source Distance via the Direct-to-Reverberant Energy Ratio for Static and Moving Sources," Audio, Speech, and Language Processing, IEEE Transactions on, vol. 18, no. 7, pp. 1793, 1805, Sept. 2010 Google ScholarDigital Library
Index Terms
- 3D Audio Coding Based on Distance Perception
Recommendations
Auditory distance perception in an acoustic pipe
In a study of auditory distance perception, we investigated the effects of exaggeration the acoustic cue of reverberation where the intensity of sound did not vary noticeably. The set of stimuli was obtained by moving a sound source inside a 10.2-m long ...
Distance perception of a virtual sound source synthesized near the listener position
This paper reports on the challenges faced in attempts to synthesize virtual sound sources near the listener position due to the differences between sound fields of real and virtual sound sources, especially if virtual sources reproduced by a line array ...
Theoretical analysis of linearly constrained multi-channel wiener filtering algorithms for combined noise reduction and binaural cue preservation in binaural hearing aids
Besides noise reduction, an important objective of binaural speech enhancement algorithms is the preservation of the binaural cues of all sound sources. For the desired speech source and the interfering sources, e.g., competing speakers, this can be ...
Comments