research-article

Individualization Of Head Related Transfer Function Based On PCA And RBF Network

Authors:
Wei Chen

School of Software, Henan Polytechnic University, China

School of Software, Henan Polytechnic University, China

0000-0002-7041-7210
View Profile

,
Hongxu Zhang

School of Computer Science and Technology, Henan Polytechnic University, China

School of Computer Science and Technology, Henan Polytechnic University, China

0000-0003-2549-3131
View Profile

,
Jinxia Yu

School of Computer Science and Technology, Henan Polytechnic University, China

School of Computer Science and Technology, Henan Polytechnic University, China

0000-0002-5079-7906
View Profile

,
Fen Luo

School of Software, Henan Polytechnic University, China

School of Software, Henan Polytechnic University, China

0000-0002-6439-2993
View Profile

ICNCC '22: Proceedings of the 2022 11th International Conference on Networks, Communication and ComputingDecember 2022Pages 109–116https://doi.org/10.1145/3579895.3579912

Published:04 April 2023Publication History

ICNCC '22: Proceedings of the 2022 11th International Conference on Networks, Communication and Computing

Pages 109–116

ABSTRACT

Head-Related Transfer Function (HRTF) describes the acoustic reflection and diffraction effect caused by the influence of the human body (head, torso, etc.) in the transmission of sound waves to the human ear. In Virtual Reality(VR) / Augmented Reality(AR), HRTF is often used to generate virtual 3D audio due to its ability to recreate perceptions of natural sound scenes realistically. However, HRTF varies from person to person due to the differences in anthropometric features. Using non-individualized HRTF to produce 3D sounds may lead to hearing localization bias in users. Therefore, how to obtain individualized HRTF is a hot topic in the field of VR / AR. This paper proposes an effective method to establish the relationship between anthropometric features and HRTF. At first, a method based on multimodal principal component analysis is proposed for the representation of HRTF models with low dimensions. Then a nonlinear mapping representation model between the low-dimensional features of HRTF and anthropometric features is established using Radial Basis Function Neural Network (RBFNN). Objective experiments show that the proposed HRTF Individualization method can reduce the spectral distortion as low as 4.48 dB. The subjective listening experiments based on the principal sagittal plane show that the individualized HRTF obtained using this method can effectively improve the accuracy of subjective listening (about 33%).

References

MIDDLEBROOKS, J.C., 1999. Virtual localization improved by scaling nonindividualized external-ear transfer functions in frequency. The Journal of the Acoustical Society of America 106, 3, 1493-1510. https://doi.org/10.1121/1.427147Google ScholarCross Ref
ZENG, X.-Y., WANG, S.-G., and GAO, L.-P., 2010. A hybrid algorithm for selecting head-related transfer function based on similarity of anthropometric structures. Journal of Sound and Vibration 329, 19, 4093-4106. https://doi.org/10.1016/j.jsv.2010.03.031Google ScholarCross Ref
ZOTKIN, D., HWANG, J., DURAISWAINI, R., and DAVIS, L.S., 2003. HRTF personalization using anthropometric measurements. In 2003 IEEE workshop on applications of signal processing to audio and acoustics (IEEE Cat. No. 03TH8684) Ieee, 157-160. https://doi.org/10.1109/ASPAA.2003.1285855Google ScholarCross Ref
MIDDLEBROOKS, J.C., 1999. Individual differences in external-ear transfer functions reduced by scaling in frequency. The Journal of the Acoustical Society of America 106, 3, 1480-1492. https://doi.org/10.1121/1.427176Google ScholarCross Ref
ALGAZI, V.R., DUDA, R.O., DURAISWAMI, R., GUMEROV, N.A., and TANG, Z., 2002. Approximating the head-related transfer function using simple geometric models of the head and torso. The Journal of the Acoustical Society of America 112, 5, 2053-2064. https://doi.org/10.1121/1.1508780Google ScholarCross Ref
KREUZER, W., MAJDAK, P., and CHEN, Z., 2009. Fast multipole boundary element method to calculate head-related transfer functions for a wide frequency range. The Journal of the Acoustical Society of America 126, 3, 1280-1290. https://doi.org/10.1121/1.3177264Google ScholarCross Ref
MESHRAM, A., MEHRA, R., YANG, H., DUNN, E., FRANM, J.-M., and MANOCHA, D., 2014. P-HRTF: Efficient personalized HRTF computation for high-fidelity spatial sound. In 2014 IEEE International Symposium on Mixed and Augmented Reality (ISMAR) IEEE, 53-61. https://doi.org/10.1109/ISMAR.2014.6948409Google ScholarCross Ref
SHIN, K.H. and PARK, Y., 2008. Enhanced vertical perception through head-related impulse response customization based on pinna response tuning in the median plane. IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences 91, 1, 345-356. https://doi.org/10.1109/ISMAR.2014.6948409Google ScholarDigital Library
FINK, K.J. and RAY, L., 2015. Individualization of head related transfer functions using principal component analysis. Applied Acoustics 87, 162-173. https://doi.org/10.1016/j.apacoust.2014.07.005Google ScholarCross Ref
YAMAMOTO, K. and IGARASHI, T., 2017. Fully perceptual-based 3D spatial sound individualization with an adaptive variational autoencoder. ACM Transactions on Graphics (TOG) 36, 6, 1-13. https://doi.org/10.1145/3130800.3130838Google ScholarDigital Library
ZHANG, M., GE, Z., LIU, T., WU, X., and QU, T., 2020. Modeling of individual HRTFs based on spatial principal component analysis. IEEE/ACM Transactions on Audio, Speech, and Language Processing 28, 785-797. https://doi.org/10.1109/TASLP.2020.2967539Google ScholarDigital Library
MENG, L., WANG, X., CHEN, W., AI, C., and HU, R., 2018. Individualization of head related transfer functions based on radial basis function neural network. In 2018 IEEE International Conference on Multimedia and Expo (ICME) IEEE, 1-6. https://doi.org/10.1109/ICME.2018.8486494Google ScholarCross Ref
IIDA, K. and ISHII, Y., 2018. Effects of adding a spectral peak generated by the second pinna resonance to a parametric model of head-related transfer functions on upper median plane sound localization. Applied Acoustics 129, 239-247. https://doi.org/10.1016/j.apacoust.2017.08.001Google ScholarCross Ref
CHEN, W., HU, R., WANG, X., YANG, C., and MENG, L., 2018. Individualization of head related impulse responses using division analysis. China Communications 15, 5, 92-103. https://doi.org/10.1109/CC.2018.8387989Google ScholarCross Ref
CIPIC, H., 2004. Database Files, Release 1.2, September 23. https://www.sofaconventions.org/data/database/cipic/Google Scholar
ALGAZI, V.R., DUDA, R.O., THOMPSON, D.M., and AVENDANO, C., 2001. The cipic hrtf database. In Proceedings of the 2001 IEEE Workshop on the Applications of Signal Processing to Audio and Acoustics (Cat. No. 01TH8575) IEEE, 99-102. https://doi.org/10.1109/ASPAA.2001.969552Google ScholarCross Ref
NISHINO, T., INOUE, N., TAKEDA, K., and ITAKURA, F., 2007. Estimation of HRTFs on the horizontal plane using physical features. Applied Acoustics 68, 8, 897-908. https://doi.org/10.1016/j.apacoust.2006.12.010Google ScholarCross Ref
HUGENG, H., WAHAB, W., and GUNAWAN, D., 2011. The effectiveness of chosen partial anthropometric measurements in individualizing head-related transfer functions on median plane. ITB J. ICT 5, 1, 35-56. https://doi.org/10.5614/itbj.ict.2011.5.1.3Google ScholarCross Ref
ROTHBUCHER, M., DURKOVIC, M., SHEN, H., and DIEPOLD, K., 2010. HRTF customization using multiway array analysis. In 2010 18th European Signal Processing Conference IEEE, 229-233. https://doi.org/10.5281/zenodo.41936Google ScholarCross Ref
KUHN, A., ROTHBUCHER, M., and DIEPOLD, K., 2014. HRTF Customization by Regression. Lehrstuhl für Datenverarbeitung.Google Scholar
HU, H.-M., ZHOU, L., MA, H., and WU, Z.-Y., 2008. Head-related transfer function personalization based on partial least square regression. 电子与信息学报 30, 1, 154-158. https://doi.org/10.3724/SP.J.1146.2007.00146Google ScholarCross Ref
HUANG, Q. and LI, L., 2014. Modeling individual HRTF tensor using high-order partial least squares. EURASIP Journal on Advances in Signal Processing 2014, 1, 1-14. https://doi.org/10.1186/1687-6180-2014-58Google ScholarCross Ref
GRINDLAY, G. and VASILESCU, M.A.O., 2007. A multilinear (tensor) framework for HRTF analysis and synthesis. In 2007 IEEE International Conference on Acoustics, Speech and Signal Processing-ICASSP'07 IEEE, I-161-I-164. https://doi.org/10.1109/ICASSP.2007.366641Google ScholarCross Ref
WERSENYI, G., 2003. Localization in a HRTF-based minimum audible angle listening test on a 2D sound screen for GUIB applications. In Audio Engineering Society Convention 115 Audio Engineering Society. http://www.aes.org/e-lib/browse.cfm?elib=12408Google Scholar

Index Terms

Individualization Of Head Related Transfer Function Based On PCA And RBF Network

Recommendations

Admissible Range for Individualization of Head-Related Transfer Function in Median Plane
IIH-MSP '13: Proceedings of the 2013 Ninth International Conference on Intelligent Information Hiding and Multimedia Signal Processing

Individualization of the head-related transfer function (HRTF) to each listener is important for the listener to localize sound sources accurately. If an inappropriate HRTF is used, the sound localization is incorrect and there is a lower sense of ...
Read More
Interaural time difference individualization in HRTF by scaling through anthropometric parameters
Abstract
Head-related transfer function (HRTF) individualization can improve the perception of binaural sound. The interaural time difference (ITD) of the HRTF is a relevant cue for sound localization, especially in azimuth. Therefore, individualization of ...
Read More
Individualization of head-related transfer function for three-dimensional virtual auditory display: a review
ICVR'07: Proceedings of the 2nd international conference on Virtual reality

This paper sums up the previous research on Head-Related Transfer Function (HRTF) individualization for three-dimensional (3D) virtual auditory display. Papers which involve experiment research, theoretical computation research, and applications of 3D ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in

ICNCC '22: Proceedings of the 2022 11th International Conference on Networks, Communication and Computing
December 2022
365 pages
ISBN:9781450398039
DOI:10.1145/3579895

Copyright © 2022 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 4 April 2023
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
HRTF
Individualization
PCA
RBFNN
Qualifiers
- research-article
- Research
- Refereed limited
Conference
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 0
  Total Citations
  View Citations
- 34
  Total Downloads
- Downloads (Last 12 months)34
- Downloads (Last 6 weeks)2
Other Metrics
View Author Metrics
Cited By
This publication has not been cited yet

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

HTML Format

View this article in HTML Format .

View HTML Format

Individualization Of Head Related Transfer Function Based On PCA And RBF Network

ICNCC '22: Proceedings of the 2022 11th International Conference on Networks, Communication and Computing

ABSTRACT

References

Cited By

Index Terms

Recommendations

Admissible Range for Individualization of Head-Related Transfer Function in Median Plane

Interaural time difference individualization in HRTF by scaling through anthropometric parameters

Individualization of head-related transfer function for three-dimensional virtual auditory display: a review

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Funding Sources

Other Metrics

Article Metrics

Other Metrics

Cited By

PDF Format

eReader

Digital Edition

HTML Format

Caption

Individualization Of Head Related Transfer Function Based On PCA And RBF Network

ICNCC '22: Proceedings of the 2022 11th International Conference on Networks, Communication and Computing

ABSTRACT

References

Cited By

Index Terms

Recommendations

Admissible Range for Individualization of Head-Related Transfer Function in Median Plane

Interaural time difference individualization in HRTF by scaling through anthropometric parameters

Individualization of head-related transfer function for three-dimensional virtual auditory display: a review

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Funding Sources

Article Metrics

Other Metrics

PDF Format

eReader

Digital Edition

HTML Format

Share this Publication link

Share on Social Media