ABSTRACT
Head-Related Transfer Function (HRTF) describes the acoustic reflection and diffraction effect caused by the influence of the human body (head, torso, etc.) in the transmission of sound waves to the human ear. In Virtual Reality(VR) / Augmented Reality(AR), HRTF is often used to generate virtual 3D audio due to its ability to recreate perceptions of natural sound scenes realistically. However, HRTF varies from person to person due to the differences in anthropometric features. Using non-individualized HRTF to produce 3D sounds may lead to hearing localization bias in users. Therefore, how to obtain individualized HRTF is a hot topic in the field of VR / AR. This paper proposes an effective method to establish the relationship between anthropometric features and HRTF. At first, a method based on multimodal principal component analysis is proposed for the representation of HRTF models with low dimensions. Then a nonlinear mapping representation model between the low-dimensional features of HRTF and anthropometric features is established using Radial Basis Function Neural Network (RBFNN). Objective experiments show that the proposed HRTF Individualization method can reduce the spectral distortion as low as 4.48 dB. The subjective listening experiments based on the principal sagittal plane show that the individualized HRTF obtained using this method can effectively improve the accuracy of subjective listening (about 33%).
- MIDDLEBROOKS, J.C., 1999. Virtual localization improved by scaling nonindividualized external-ear transfer functions in frequency. The Journal of the Acoustical Society of America 106, 3, 1493-1510. https://doi.org/10.1121/1.427147Google ScholarCross Ref
- ZENG, X.-Y., WANG, S.-G., and GAO, L.-P., 2010. A hybrid algorithm for selecting head-related transfer function based on similarity of anthropometric structures. Journal of Sound and Vibration 329, 19, 4093-4106. https://doi.org/10.1016/j.jsv.2010.03.031Google ScholarCross Ref
- ZOTKIN, D., HWANG, J., DURAISWAINI, R., and DAVIS, L.S., 2003. HRTF personalization using anthropometric measurements. In 2003 IEEE workshop on applications of signal processing to audio and acoustics (IEEE Cat. No. 03TH8684) Ieee, 157-160. https://doi.org/10.1109/ASPAA.2003.1285855Google ScholarCross Ref
- MIDDLEBROOKS, J.C., 1999. Individual differences in external-ear transfer functions reduced by scaling in frequency. The Journal of the Acoustical Society of America 106, 3, 1480-1492. https://doi.org/10.1121/1.427176Google ScholarCross Ref
- ALGAZI, V.R., DUDA, R.O., DURAISWAMI, R., GUMEROV, N.A., and TANG, Z., 2002. Approximating the head-related transfer function using simple geometric models of the head and torso. The Journal of the Acoustical Society of America 112, 5, 2053-2064. https://doi.org/10.1121/1.1508780Google ScholarCross Ref
- KREUZER, W., MAJDAK, P., and CHEN, Z., 2009. Fast multipole boundary element method to calculate head-related transfer functions for a wide frequency range. The Journal of the Acoustical Society of America 126, 3, 1280-1290. https://doi.org/10.1121/1.3177264Google ScholarCross Ref
- MESHRAM, A., MEHRA, R., YANG, H., DUNN, E., FRANM, J.-M., and MANOCHA, D., 2014. P-HRTF: Efficient personalized HRTF computation for high-fidelity spatial sound. In 2014 IEEE International Symposium on Mixed and Augmented Reality (ISMAR) IEEE, 53-61. https://doi.org/10.1109/ISMAR.2014.6948409Google ScholarCross Ref
- SHIN, K.H. and PARK, Y., 2008. Enhanced vertical perception through head-related impulse response customization based on pinna response tuning in the median plane. IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences 91, 1, 345-356. https://doi.org/10.1109/ISMAR.2014.6948409Google ScholarDigital Library
- FINK, K.J. and RAY, L., 2015. Individualization of head related transfer functions using principal component analysis. Applied Acoustics 87, 162-173. https://doi.org/10.1016/j.apacoust.2014.07.005Google ScholarCross Ref
- YAMAMOTO, K. and IGARASHI, T., 2017. Fully perceptual-based 3D spatial sound individualization with an adaptive variational autoencoder. ACM Transactions on Graphics (TOG) 36, 6, 1-13. https://doi.org/10.1145/3130800.3130838Google ScholarDigital Library
- ZHANG, M., GE, Z., LIU, T., WU, X., and QU, T., 2020. Modeling of individual HRTFs based on spatial principal component analysis. IEEE/ACM Transactions on Audio, Speech, and Language Processing 28, 785-797. https://doi.org/10.1109/TASLP.2020.2967539Google ScholarDigital Library
- MENG, L., WANG, X., CHEN, W., AI, C., and HU, R., 2018. Individualization of head related transfer functions based on radial basis function neural network. In 2018 IEEE International Conference on Multimedia and Expo (ICME) IEEE, 1-6. https://doi.org/10.1109/ICME.2018.8486494Google ScholarCross Ref
- IIDA, K. and ISHII, Y., 2018. Effects of adding a spectral peak generated by the second pinna resonance to a parametric model of head-related transfer functions on upper median plane sound localization. Applied Acoustics 129, 239-247. https://doi.org/10.1016/j.apacoust.2017.08.001Google ScholarCross Ref
- CHEN, W., HU, R., WANG, X., YANG, C., and MENG, L., 2018. Individualization of head related impulse responses using division analysis. China Communications 15, 5, 92-103. https://doi.org/10.1109/CC.2018.8387989Google ScholarCross Ref
- CIPIC, H., 2004. Database Files, Release 1.2, September 23. https://www.sofaconventions.org/data/database/cipic/Google Scholar
- ALGAZI, V.R., DUDA, R.O., THOMPSON, D.M., and AVENDANO, C., 2001. The cipic hrtf database. In Proceedings of the 2001 IEEE Workshop on the Applications of Signal Processing to Audio and Acoustics (Cat. No. 01TH8575) IEEE, 99-102. https://doi.org/10.1109/ASPAA.2001.969552Google ScholarCross Ref
- NISHINO, T., INOUE, N., TAKEDA, K., and ITAKURA, F., 2007. Estimation of HRTFs on the horizontal plane using physical features. Applied Acoustics 68, 8, 897-908. https://doi.org/10.1016/j.apacoust.2006.12.010Google ScholarCross Ref
- HUGENG, H., WAHAB, W., and GUNAWAN, D., 2011. The effectiveness of chosen partial anthropometric measurements in individualizing head-related transfer functions on median plane. ITB J. ICT 5, 1, 35-56. https://doi.org/10.5614/itbj.ict.2011.5.1.3Google ScholarCross Ref
- ROTHBUCHER, M., DURKOVIC, M., SHEN, H., and DIEPOLD, K., 2010. HRTF customization using multiway array analysis. In 2010 18th European Signal Processing Conference IEEE, 229-233. https://doi.org/10.5281/zenodo.41936Google ScholarCross Ref
- KUHN, A., ROTHBUCHER, M., and DIEPOLD, K., 2014. HRTF Customization by Regression. Lehrstuhl für Datenverarbeitung.Google Scholar
- HU, H.-M., ZHOU, L., MA, H., and WU, Z.-Y., 2008. Head-related transfer function personalization based on partial least square regression. 电子与信息学报 30, 1, 154-158. https://doi.org/10.3724/SP.J.1146.2007.00146Google ScholarCross Ref
- HUANG, Q. and LI, L., 2014. Modeling individual HRTF tensor using high-order partial least squares. EURASIP Journal on Advances in Signal Processing 2014, 1, 1-14. https://doi.org/10.1186/1687-6180-2014-58Google ScholarCross Ref
- GRINDLAY, G. and VASILESCU, M.A.O., 2007. A multilinear (tensor) framework for HRTF analysis and synthesis. In 2007 IEEE International Conference on Acoustics, Speech and Signal Processing-ICASSP'07 IEEE, I-161-I-164. https://doi.org/10.1109/ICASSP.2007.366641Google ScholarCross Ref
- WERSENYI, G., 2003. Localization in a HRTF-based minimum audible angle listening test on a 2D sound screen for GUIB applications. In Audio Engineering Society Convention 115 Audio Engineering Society. http://www.aes.org/e-lib/browse.cfm?elib=12408Google Scholar
Index Terms
- Individualization Of Head Related Transfer Function Based On PCA And RBF Network
Recommendations
Admissible Range for Individualization of Head-Related Transfer Function in Median Plane
IIH-MSP '13: Proceedings of the 2013 Ninth International Conference on Intelligent Information Hiding and Multimedia Signal ProcessingIndividualization of the head-related transfer function (HRTF) to each listener is important for the listener to localize sound sources accurately. If an inappropriate HRTF is used, the sound localization is incorrect and there is a lower sense of ...
Interaural time difference individualization in HRTF by scaling through anthropometric parameters
AbstractHead-related transfer function (HRTF) individualization can improve the perception of binaural sound. The interaural time difference (ITD) of the HRTF is a relevant cue for sound localization, especially in azimuth. Therefore, individualization of ...
Individualization of head-related transfer function for three-dimensional virtual auditory display: a review
ICVR'07: Proceedings of the 2nd international conference on Virtual realityThis paper sums up the previous research on Head-Related Transfer Function (HRTF) individualization for three-dimensional (3D) virtual auditory display. Papers which involve experiment research, theoretical computation research, and applications of 3D ...
Comments