extended-abstract

Upsampling of Personalized HRTF for Spatial Audio Rendering: Why Deep Learning is Problematic?

Authors:
Basant Pandagre
Electrical Engineering, Indian Institute of Technology Jodhpur, India
ORCID: 0000-0002-5089-6584

Manish Narwaria
Electrical Engineering, Indian Institute of Technology Jodhpur, India
ORCID: 0000-0001-7789-5322

CODS-COMAD '23: Proceedings of the 6th Joint International Conference on Data Science & Management of Data (10th ACM IKDD CODS and 28th COMAD), January 2023, Pages 274–275. https://doi.org/10.1145/3570991.3570999

Published: 04 January 2023

ABSTRACT

Spatial audio rendering employs head-related transfer functions (HRTFs) to reproduce the sound field realistically. This requires upsampling the HRTF. Because deep learning (DL) is popular for upsampling tasks, a DL-based upsampler can appear to be an attractive solution. We argue, however, that it is more meaningful to rely on explicit system modeling than to depend exclusively on DL-based data fitting.
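
To make the upsampling task concrete, the following is a minimal sketch of a simple non-learned baseline: per-frequency linear interpolation of HRTF magnitudes across azimuth. It is not the authors' method, and the abstract does not specify one. The function name `upsample_hrtf_azimuth`, the 30-degree and 5-degree grids, and the random stand-in data are all assumptions made for illustration.

```python
import numpy as np


def upsample_hrtf_azimuth(az_sparse_deg, mag_sparse_db, az_dense_deg):
    """Linearly interpolate HRTF magnitudes (dB) across azimuth, per frequency bin.

    az_sparse_deg : (D,) measured azimuths in degrees
    mag_sparse_db : (D, F) magnitudes in dB, one row per measured azimuth
    az_dense_deg  : (Q,) azimuths at which to estimate the HRTF
    Returns an array of shape (Q, F).
    """
    az_sparse_deg = np.asarray(az_sparse_deg, dtype=float)
    mag_sparse_db = np.asarray(mag_sparse_db, dtype=float)
    az_dense_deg = np.asarray(az_dense_deg, dtype=float)

    out = np.empty((az_dense_deg.size, mag_sparse_db.shape[1]))
    for k in range(mag_sparse_db.shape[1]):
        # period=360 makes the interpolation wrap around the circle,
        # so 345 degrees is interpolated between 330 and 0 degrees.
        out[:, k] = np.interp(
            az_dense_deg, az_sparse_deg, mag_sparse_db[:, k], period=360.0
        )
    return out


# Hypothetical stand-in data: random values, NOT a measured HRTF set.
az_sparse = np.arange(0, 360, 30)   # assumed sparse grid: every 30 degrees
az_dense = np.arange(0, 360, 5)     # assumed dense grid: every 5 degrees
rng = np.random.default_rng(0)
mag_sparse = rng.normal(size=(az_sparse.size, 4))  # 4 frequency bins

mag_dense = upsample_hrtf_azimuth(az_sparse, mag_sparse, az_dense)
print(mag_dense.shape)
```

A baseline like this reproduces the measured directions exactly and fills the gaps with straight lines. It encodes no acoustic model of the listener, which is the kind of explicit system modeling the abstract argues for.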

Published in

CODS-COMAD '23: Proceedings of the 6th Joint International Conference on Data Science & Management of Data (10th ACM IKDD CODS and 28th COMAD)
January 2023
357 pages
ISBN:9781450397971
DOI:10.1145/3570991

Copyright © 2023 Owner/Author
Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.
Publisher
Association for Computing Machinery
New York, NY, United States
Author Tags
Head Related Transfer Function
Interpolation
Spatial Audio
Qualifiers
- extended-abstract
- Research
- Refereed limited
Acceptance Rates
Overall Acceptance Rate: 197 of 680 submissions, 29%