ABSTRACT
Spatial audio rendering employs Head Related Transfer Functions (HRTFs) for a realistic reproduction of the sound field. This requires upsampling of the HRTF. Given its popularity for the task of upsampling, a deep learning (DL) based upsampler can appear to be an attractive solution for the said problem. We, however, argue that it is more meaningful to rely on explicit system modeling, and not depend exclusively on DL based data fitting for the said problem.
- Corey I Cheng and Gregory H Wakefield. 2001. introduction to head-related transfer functions (hrtfs): representations of hrtfs in time, frequency, and space. journal of the audio engineering society 49, 4 (april 2001), 231–249.Google Scholar
- Grady Kestler, Shahrokh Yadegari, and David Nahamoo. 2019. Head related impulse response interpolation and extrapolation using Deep Belief Networks. In ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, Brighton, UK, 266–270.Google ScholarCross Ref
- Devansh Zurale, Shahrokh Yadegari, and Shlomo Dubnov. 2022. Deep HRTF Encoding & Interpolation: Exploring Spatial Correlations using Convolutional Neural Networks. In "". Zenodo, Saint-Etienne (France) Zenodo, 350–357.Google Scholar
Recommendations
HRTF Estimation in the Wild
UIST '23: Proceedings of the 36th Annual ACM Symposium on User Interface Software and TechnologyHead Related Transfer Functions (HRTFs) play a crucial role in creating immersive spatial audio experiences. However, HRTFs differ significantly from person to person, and traditional methods for estimating personalized HRTFs are expensive, time-...
Real-Time Demonstration of Personal Audio and 3D Audio Rendering Using Line Array Systems
MultiMedia ModelingAbstractControl of sound fields using array loudspeakers has been attempted in many practical areas, such as 3D audio, active noise control, and personal audio. In this work, we demonstrate two real-time sound field control systems involving a line array ...
Rendering localized spatial audio in a virtual auditory space
High-quality virtual audio scene rendering is required for emerging virtual and augmented reality applications, perceptual user interfaces, and sonification of data. We describe algorithms for creation of virtual auditory spaces by rendering cues that ...
Comments