ABSTRACT
Binaural cue coding (BCC) was introduced as a compact representation of multi-channel audio. It uses binaural cue parameters to capture the spatial image of multi-channel audio, and it has recently been standardized within MPEG under the name "MPEG Surround." In this paper, we propose a sound source location cue coding (SSLCC) system that compresses multi-channel audio for narrow-bandwidth transmission environments. To improve on the compression of conventional BCC, the SSLCC system uses virtual source location information (VSLI) as its spatial cue parameter instead of the inter-channel level difference (ICLD) used in BCC. The SSLCC system also adopts enhanced pre- and post-processing algorithms to improve perceptual sound quality. Objective and subjective assessment results show that the proposed SSLCC system performs better than the conventional BCC system.
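The difference between a level-difference cue and a location cue can be illustrated with a minimal sketch. The abstract does not describe how VSLI is estimated, so the code below is an assumption: it maps the sub-band powers of one channel pair to a single angle using a constant-power panning law, and compares that with an ICLD in dB. All function names (`band_power`, `icld_db`, `pan_angle`, `gains_from_angle`) are hypothetical and are not taken from the paper.

```python
import math


def band_power(samples):
    """Mean power of one sub-band of one channel (plain list of floats)."""
    return sum(x * x for x in samples) / len(samples)


def icld_db(p1, p2, eps=1e-12):
    """Inter-channel level difference in dB between two band powers.

    eps avoids log(0) and division by zero; the value is unbounded
    in principle when one channel is silent.
    """
    return 10.0 * math.log10((p2 + eps) / (p1 + eps))


def pan_angle(p1, p2):
    """Angle in [0, pi/2] for a channel pair.

    Assumes constant-power panning: g1 = cos(a), g2 = sin(a),
    so tan(a) = sqrt(p2 / p1). This is an illustrative stand-in
    for a location cue, not the paper's estimator.
    """
    return math.atan2(math.sqrt(p2), math.sqrt(p1))


def gains_from_angle(angle):
    """Recover the unit-power gain pair from the angle."""
    return math.cos(angle), math.sin(angle)


if __name__ == "__main__":
    left = band_power([0.8, -0.8, 0.8, -0.8])
    right = band_power([0.4, -0.4, 0.4, -0.4])
    print("ICLD (dB):", icld_db(left, right))
    print("angle (rad):", pan_angle(left, right))
    print("gains:", gains_from_angle(pan_angle(left, right)))
```

Under this assumption, the angle always lies in a bounded range, whereas the level difference grows without bound as one channel approaches silence. A bounded parameter is easier to quantize uniformly, which is one plausible reason a location cue could be more compact than a level-difference cue.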
Index Terms
- Sound source location cue coding system for compact representation of multi-channel audio