Abstract
This paper addresses a novel method for calibrating microphone positions included in a microphone array. The performance of microphone array processing deteriorates due to two factors: (1) differences between predetermined position and actual positions of the microphones. (2) sound source signal overlaps in frequency and time. To solve these problems, we propose a probabilistic generative model of the sound propagation process determined by microphone and sound source positions. The model is defined as the product of three probabilities: (1) prior probability of the microphone positions based on reference positions, (2) prior probability of the sound source spectrum, and (3) conditional probability of the recorded spectrum. Based on the model, an iterative algorithm to calibrate the microphone positions is derived as a solution of the maximum a posteriori estimation. Preliminary experiments through numerical simulation with an 8-ch microphone array revealed that the proposed method accurately estimated the microphone positions when using multiple sound sources. Preliminary experiments through numerical simulation with an 8-ch microphone array suggested the proposed method accurately estimated the microphone positions when using multiple sound sources.
This work was supported by JSPS KAKENHI Grant No. 16H02884, 17K00365, and 19K12017.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
Strictly, \(\textit{\textbf{S}}\) should be marginalized and the MAP estimation of \(p(\textit{\textbf{X}} | \textit{\textbf{Z}})\) should be solved.
References
Févotte, C., Bertin, N., Durrieu, J.L.: Nonnegative matrix factorization with the Itakura-Saito divergence: with application to music analysis. Neural Comput. 21(3), 793–830 (2009)
Miura, H., Yoshida, T., Nakamura, K., Nakadai, K.: SLAM-based online calibration of asynchronous microphone array for robot audition. In: IROS 2011, pp. 524–529 (2011)
Nakadai, K., Nakajima, H., Hasegawa, Y., Tsujino, H.: Sound source separation of moving speakers for robot audition. In: ICASSP 2009, pp. 3685–3688 (2009)
Nakadai, K., Okuno, H.G., Mizumoto, T.: Development, deployment and applications of robot audition open source software HARK. J. Robot. Mechatron. 29(1), 16–25 (2017)
Nakamura, K., Nakadai, K., Asano, F., Hasegawa, Y., Tsujino, H.: Intelligent sound source localization for dynamic environments. In: IROS 2009, pp. 664–669 (2009)
Nishiura, T., Yamada, T., Nakamura, S., Shikano, K.: Localization of multiple sound sources based on a CSP analysis with a microphone array. In: ICASSP 2000, vol. 2, pp. 1053–1056 (2000)
Nugraha, A., Liutkus, A., Vincent, E.: Multichannel audio source separation with deep neural networks. IEEE/ACM Trans. Audio Speech Lang. Process. 24(9), 1652–1664 (2015)
Ono, N., Shibata, K., Kameoka, H.: Self-localization and channel synchronization of smartphone arrays using sound emissions. In: APSIPA ASC 2016, pp. 1–5 (2016)
Raykar, V.C., Kozintsev, I.V., Lienhart, R.: Position calibration of microphones and loudspeakers in distributed computing platforms. IEEE Trans. Speech Audio Process. 13(1), 70–83 (2005)
Smaragdis, P., Févotte, C., Mysore, G.J., Mohammadiha, N., Hoffman, M.: Static and dynamic source separation using nonnegative factorizations: a unified view. IEEE Signal Process. Mag. 31(3), 66–75 (2014)
Su, D., Vidal-Calleja, T., Miro, J.V.: Simultaneous asynchronous microphone array calibration and sound source localisation. In: IROS 2015, pp. 5561–5567 (2015)
Takamichi, S., Mitsui, K., Saito, Y., Koriyama, T., Tanji, N., Saruwatari, H.: JVS corpus: free Japanese multi-speaker voice corpus (2019). arXiv:1908.06248 [cs.SD]
Thrun, S.: Affine structure from sound. In: NIPS 2005, pp. 1353–1360 (2005)
Uhlich, S., Giron, F., Mitsufuji, Y.: Deep neural network based instrument extraction from music. In: ICASSP 2015, pp. 2135–2139 (2015)
Valin, J.M., Rouat, J., Michaud, F.: Enhanced robot audition based on microphone array source separation with post-filter. In: IROS 2004, vol. 3, pp. 2123–2128 (2004)
Zhang, C., Florencio, D., Ba, D.E., Zhang, Z.: Maximum likelihood sound source localization and beamforming for directional microphone arrays in distributed meetings. IEEE Trans. Multimedia 10(3), 538–548 (2008)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Switzerland AG
About this paper
Cite this paper
Dan, K., Itoyama, K., Nishida, K., Nakadai, K. (2020). Calibration of a Microphone Array Based on a Probabilistic Model of Microphone Positions. In: Fujita, H., Fournier-Viger, P., Ali, M., Sasaki, J. (eds) Trends in Artificial Intelligence Theory and Applications. Artificial Intelligence Practices. IEA/AIE 2020. Lecture Notes in Computer Science(), vol 12144. Springer, Cham. https://doi.org/10.1007/978-3-030-55789-8_53
Download citation
DOI: https://doi.org/10.1007/978-3-030-55789-8_53
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-55788-1
Online ISBN: 978-3-030-55789-8
eBook Packages: Computer ScienceComputer Science (R0)