DOI: 10.1145/3579895.3579912
research-article

Individualization Of Head Related Transfer Function Based On PCA And RBF Network

Published: 04 April 2023

ABSTRACT

The head-related transfer function (HRTF) describes the acoustic reflection and diffraction that the human body (head, torso, etc.) imposes on sound waves travelling to the ear. In virtual reality (VR) and augmented reality (AR), HRTFs are often used to generate virtual 3D audio because they can realistically recreate the perception of natural sound scenes. However, HRTFs vary from person to person because of differences in anthropometric features, and rendering 3D sound with a non-individualized HRTF may bias the listener's sound localization. Obtaining individualized HRTFs is therefore an active topic in VR/AR research. This paper proposes a method for establishing the relationship between anthropometric features and the HRTF. First, a method based on multimodal principal component analysis (PCA) is proposed to represent HRTFs in a low-dimensional space. Then, a radial basis function neural network (RBFNN) is used to establish a nonlinear mapping between anthropometric features and the low-dimensional HRTF features. Objective experiments show that the proposed HRTF individualization method can reduce the spectral distortion to as low as 4.48 dB. Subjective listening experiments on the principal sagittal plane show that the individualized HRTFs obtained with this method can effectively improve the accuracy of subjective listening (about 33%).
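
To make the pipeline in the abstract more concrete, the following is a minimal sketch of its general shape, not the authors' implementation. It assumes plain PCA (computed via SVD) in place of the paper's multimodal PCA, a Gaussian RBF network with one center per training subject, and the common definition of spectral distortion as the root-mean-square difference of log-magnitude spectra in dB. All function names, array sizes, the kernel width, and the synthetic data are hypothetical and chosen only for illustration.

```python
import numpy as np

def fit_pca(X, k):
    """Return (mean, components) with the top-k principal directions of X (rows = subjects)."""
    mean = X.mean(axis=0)
    # Rows of Vt are orthonormal principal directions of the centered data.
    _, _, Vt = np.linalg.svd(X - mean, full_matrices=False)
    return mean, Vt[:k]

def rbf_design(A, centers, width):
    """Gaussian RBF activations of inputs A (m x d) against centers (n x d)."""
    d2 = ((A[:, None, :] - centers[None, :, :]) ** 2).sum(axis=-1)
    return np.exp(-d2 / (2.0 * width ** 2))

def fit_rbf(A, W, width, ridge=1e-6):
    """Solve for output-layer weights, using every training input as a center (assumption)."""
    Phi = rbf_design(A, A, width)
    # A small ridge term keeps the symmetric kernel system well conditioned.
    return np.linalg.solve(Phi + ridge * np.eye(len(A)), W)

def spectral_distortion(H_true_db, H_pred_db):
    """RMS difference between log-magnitude spectra (dB), one value per row."""
    return np.sqrt(np.mean((H_true_db - H_pred_db) ** 2, axis=-1))

# Synthetic stand-in data (hypothetical sizes): 30 subjects, 8 anthropometric
# features, 64 frequency bins of log-magnitude HRTF for a single direction.
rng = np.random.default_rng(0)
n_subjects, n_anthro, n_bins, k, width = 30, 8, 64, 5, 3.0
anthro = rng.normal(size=(n_subjects, n_anthro))
mix = rng.normal(size=(n_anthro, n_bins))
hrtf_db = 10.0 * np.tanh(anthro @ mix / 3.0)

train, test = slice(0, 25), slice(25, 30)

# Step 1: low-dimensional representation of the training HRTFs.
mean, comps = fit_pca(hrtf_db[train], k)
weights_train = (hrtf_db[train] - mean) @ comps.T

# Step 2: nonlinear mapping from anthropometric features to PCA weights.
out_w = fit_rbf(anthro[train], weights_train, width)

# Step 3: predict weights for unseen subjects and reconstruct their HRTFs.
pred_weights = rbf_design(anthro[test], anthro[train], width) @ out_w
pred_db = pred_weights @ comps + mean

sd = spectral_distortion(hrtf_db[test], pred_db)
print("per-subject spectral distortion (dB):", np.round(sd, 2))
```

Because the data here are random, the printed distortion values say nothing about the 4.48 dB figure reported in the abstract; the sketch only shows how the three stages (dimensionality reduction, regression from anthropometry, reconstruction and scoring) fit together.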


Published in

ICNCC '22: Proceedings of the 2022 11th International Conference on Networks, Communication and Computing
December 2022
365 pages
ISBN: 9781450398039
DOI: 10.1145/3579895

Copyright © 2022 ACM

Publisher: Association for Computing Machinery, New York, NY, United States


Qualifiers: research-article, Research, Refereed limited