Calibration of a Microphone Array Based on a Probabilistic Model of Microphone Positions

Dan, Katsuhiro; Itoyama, Katsutoshi; Nishida, Kenji; Nakadai, Kazuhiro

doi:10.1007/978-3-030-55789-8_53

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 12144))

Included in the following conference series:

International Conference on Industrial, Engineering and Other Applications of Applied Intelligent Systems

2007 Accesses
4 Citations

Abstract

This paper addresses a novel method for calibrating microphone positions included in a microphone array. The performance of microphone array processing deteriorates due to two factors: (1) differences between predetermined position and actual positions of the microphones. (2) sound source signal overlaps in frequency and time. To solve these problems, we propose a probabilistic generative model of the sound propagation process determined by microphone and sound source positions. The model is defined as the product of three probabilities: (1) prior probability of the microphone positions based on reference positions, (2) prior probability of the sound source spectrum, and (3) conditional probability of the recorded spectrum. Based on the model, an iterative algorithm to calibrate the microphone positions is derived as a solution of the maximum a posteriori estimation. Preliminary experiments through numerical simulation with an 8-ch microphone array revealed that the proposed method accurately estimated the microphone positions when using multiple sound sources. Preliminary experiments through numerical simulation with an 8-ch microphone array suggested the proposed method accurately estimated the microphone positions when using multiple sound sources.

This work was supported by JSPS KAKENHI Grant No. 16H02884, 17K00365, and 19K12017.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 129.00; Price excludes VAT (USA)

Softcover Book: USD 169.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

1.
Strictly, \(\textit{\textbf{S}}\) should be marginalized and the MAP estimation of \(p(\textit{\textbf{X}} | \textit{\textbf{Z}})\) should be solved.

References

Févotte, C., Bertin, N., Durrieu, J.L.: Nonnegative matrix factorization with the Itakura-Saito divergence: with application to music analysis. Neural Comput. 21(3), 793–830 (2009)
Article Google Scholar
Miura, H., Yoshida, T., Nakamura, K., Nakadai, K.: SLAM-based online calibration of asynchronous microphone array for robot audition. In: IROS 2011, pp. 524–529 (2011)
Google Scholar
Nakadai, K., Nakajima, H., Hasegawa, Y., Tsujino, H.: Sound source separation of moving speakers for robot audition. In: ICASSP 2009, pp. 3685–3688 (2009)
Google Scholar
Nakadai, K., Okuno, H.G., Mizumoto, T.: Development, deployment and applications of robot audition open source software HARK. J. Robot. Mechatron. 29(1), 16–25 (2017)
Article Google Scholar
Nakamura, K., Nakadai, K., Asano, F., Hasegawa, Y., Tsujino, H.: Intelligent sound source localization for dynamic environments. In: IROS 2009, pp. 664–669 (2009)
Google Scholar
Nishiura, T., Yamada, T., Nakamura, S., Shikano, K.: Localization of multiple sound sources based on a CSP analysis with a microphone array. In: ICASSP 2000, vol. 2, pp. 1053–1056 (2000)
Google Scholar
Nugraha, A., Liutkus, A., Vincent, E.: Multichannel audio source separation with deep neural networks. IEEE/ACM Trans. Audio Speech Lang. Process. 24(9), 1652–1664 (2015)
Article Google Scholar
Ono, N., Shibata, K., Kameoka, H.: Self-localization and channel synchronization of smartphone arrays using sound emissions. In: APSIPA ASC 2016, pp. 1–5 (2016)
Google Scholar
Raykar, V.C., Kozintsev, I.V., Lienhart, R.: Position calibration of microphones and loudspeakers in distributed computing platforms. IEEE Trans. Speech Audio Process. 13(1), 70–83 (2005)
Article Google Scholar
Smaragdis, P., Févotte, C., Mysore, G.J., Mohammadiha, N., Hoffman, M.: Static and dynamic source separation using nonnegative factorizations: a unified view. IEEE Signal Process. Mag. 31(3), 66–75 (2014)
Article Google Scholar
Su, D., Vidal-Calleja, T., Miro, J.V.: Simultaneous asynchronous microphone array calibration and sound source localisation. In: IROS 2015, pp. 5561–5567 (2015)
Google Scholar
Takamichi, S., Mitsui, K., Saito, Y., Koriyama, T., Tanji, N., Saruwatari, H.: JVS corpus: free Japanese multi-speaker voice corpus (2019). arXiv:1908.06248 [cs.SD]
Thrun, S.: Affine structure from sound. In: NIPS 2005, pp. 1353–1360 (2005)
Google Scholar
Uhlich, S., Giron, F., Mitsufuji, Y.: Deep neural network based instrument extraction from music. In: ICASSP 2015, pp. 2135–2139 (2015)
Google Scholar
Valin, J.M., Rouat, J., Michaud, F.: Enhanced robot audition based on microphone array source separation with post-filter. In: IROS 2004, vol. 3, pp. 2123–2128 (2004)
Google Scholar
Zhang, C., Florencio, D., Ba, D.E., Zhang, Z.: Maximum likelihood sound source localization and beamforming for directional microphone arrays in distributed meetings. IEEE Trans. Multimedia 10(3), 538–548 (2008)
Article Google Scholar

Download references

Author information

Authors and Affiliations

School of Engineering, Tokyo Institute of Technology, 2-12-1, O-okayama, Meguro, Tokyo, 152-8552, Japan
Katsuhiro Dan, Katsutoshi Itoyama, Kenji Nishida & Kazuhiro Nakadai
Honda Research Insititute Japan Co., Ltd., 8-1 Honcho, Wako, Saitama, 351-0114, Japan
Kazuhiro Nakadai

Authors

Katsuhiro Dan
View author publications
You can also search for this author in PubMed Google Scholar
Katsutoshi Itoyama
View author publications
You can also search for this author in PubMed Google Scholar
Kenji Nishida
View author publications
You can also search for this author in PubMed Google Scholar
Kazuhiro Nakadai
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Katsuhiro Dan .

Editor information

Editors and Affiliations

Iwate Prefectural University, Takizawa, Japan
Hamido Fujita
Harbin Institute of Technology (Shenzhen), Shenzhen, China
Philippe Fournier-Viger
Texas State University, San Marcos, TX, USA
Moonis Ali
Iwate Prefectural University, Takizawa, Japan
Jun Sasaki

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Dan, K., Itoyama, K., Nishida, K., Nakadai, K. (2020). Calibration of a Microphone Array Based on a Probabilistic Model of Microphone Positions. In: Fujita, H., Fournier-Viger, P., Ali, M., Sasaki, J. (eds) Trends in Artificial Intelligence Theory and Applications. Artificial Intelligence Practices. IEA/AIE 2020. Lecture Notes in Computer Science(), vol 12144. Springer, Cham. https://doi.org/10.1007/978-3-030-55789-8_53

Download citation

DOI: https://doi.org/10.1007/978-3-030-55789-8_53
Published: 04 September 2020
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-55788-1
Online ISBN: 978-3-030-55789-8
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics