Skip to main content

Calibration of a Microphone Array Based on a Probabilistic Model of Microphone Positions

  • Conference paper
  • First Online:
Trends in Artificial Intelligence Theory and Applications. Artificial Intelligence Practices (IEA/AIE 2020)

Abstract

This paper addresses a novel method for calibrating microphone positions included in a microphone array. The performance of microphone array processing deteriorates due to two factors: (1) differences between predetermined position and actual positions of the microphones. (2) sound source signal overlaps in frequency and time. To solve these problems, we propose a probabilistic generative model of the sound propagation process determined by microphone and sound source positions. The model is defined as the product of three probabilities: (1) prior probability of the microphone positions based on reference positions, (2) prior probability of the sound source spectrum, and (3) conditional probability of the recorded spectrum. Based on the model, an iterative algorithm to calibrate the microphone positions is derived as a solution of the maximum a posteriori estimation. Preliminary experiments through numerical simulation with an 8-ch microphone array revealed that the proposed method accurately estimated the microphone positions when using multiple sound sources. Preliminary experiments through numerical simulation with an 8-ch microphone array suggested the proposed method accurately estimated the microphone positions when using multiple sound sources.

This work was supported by JSPS KAKENHI Grant No. 16H02884, 17K00365, and 19K12017.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 129.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    Strictly, \(\textit{\textbf{S}}\) should be marginalized and the MAP estimation of \(p(\textit{\textbf{X}} | \textit{\textbf{Z}})\) should be solved.

References

  1. Févotte, C., Bertin, N., Durrieu, J.L.: Nonnegative matrix factorization with the Itakura-Saito divergence: with application to music analysis. Neural Comput. 21(3), 793–830 (2009)

    Article  Google Scholar 

  2. Miura, H., Yoshida, T., Nakamura, K., Nakadai, K.: SLAM-based online calibration of asynchronous microphone array for robot audition. In: IROS 2011, pp. 524–529 (2011)

    Google Scholar 

  3. Nakadai, K., Nakajima, H., Hasegawa, Y., Tsujino, H.: Sound source separation of moving speakers for robot audition. In: ICASSP 2009, pp. 3685–3688 (2009)

    Google Scholar 

  4. Nakadai, K., Okuno, H.G., Mizumoto, T.: Development, deployment and applications of robot audition open source software HARK. J. Robot. Mechatron. 29(1), 16–25 (2017)

    Article  Google Scholar 

  5. Nakamura, K., Nakadai, K., Asano, F., Hasegawa, Y., Tsujino, H.: Intelligent sound source localization for dynamic environments. In: IROS 2009, pp. 664–669 (2009)

    Google Scholar 

  6. Nishiura, T., Yamada, T., Nakamura, S., Shikano, K.: Localization of multiple sound sources based on a CSP analysis with a microphone array. In: ICASSP 2000, vol. 2, pp. 1053–1056 (2000)

    Google Scholar 

  7. Nugraha, A., Liutkus, A., Vincent, E.: Multichannel audio source separation with deep neural networks. IEEE/ACM Trans. Audio Speech Lang. Process. 24(9), 1652–1664 (2015)

    Article  Google Scholar 

  8. Ono, N., Shibata, K., Kameoka, H.: Self-localization and channel synchronization of smartphone arrays using sound emissions. In: APSIPA ASC 2016, pp. 1–5 (2016)

    Google Scholar 

  9. Raykar, V.C., Kozintsev, I.V., Lienhart, R.: Position calibration of microphones and loudspeakers in distributed computing platforms. IEEE Trans. Speech Audio Process. 13(1), 70–83 (2005)

    Article  Google Scholar 

  10. Smaragdis, P., Févotte, C., Mysore, G.J., Mohammadiha, N., Hoffman, M.: Static and dynamic source separation using nonnegative factorizations: a unified view. IEEE Signal Process. Mag. 31(3), 66–75 (2014)

    Article  Google Scholar 

  11. Su, D., Vidal-Calleja, T., Miro, J.V.: Simultaneous asynchronous microphone array calibration and sound source localisation. In: IROS 2015, pp. 5561–5567 (2015)

    Google Scholar 

  12. Takamichi, S., Mitsui, K., Saito, Y., Koriyama, T., Tanji, N., Saruwatari, H.: JVS corpus: free Japanese multi-speaker voice corpus (2019). arXiv:1908.06248 [cs.SD]

  13. Thrun, S.: Affine structure from sound. In: NIPS 2005, pp. 1353–1360 (2005)

    Google Scholar 

  14. Uhlich, S., Giron, F., Mitsufuji, Y.: Deep neural network based instrument extraction from music. In: ICASSP 2015, pp. 2135–2139 (2015)

    Google Scholar 

  15. Valin, J.M., Rouat, J., Michaud, F.: Enhanced robot audition based on microphone array source separation with post-filter. In: IROS 2004, vol. 3, pp. 2123–2128 (2004)

    Google Scholar 

  16. Zhang, C., Florencio, D., Ba, D.E., Zhang, Z.: Maximum likelihood sound source localization and beamforming for directional microphone arrays in distributed meetings. IEEE Trans. Multimedia 10(3), 538–548 (2008)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Katsuhiro Dan .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2020 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Dan, K., Itoyama, K., Nishida, K., Nakadai, K. (2020). Calibration of a Microphone Array Based on a Probabilistic Model of Microphone Positions. In: Fujita, H., Fournier-Viger, P., Ali, M., Sasaki, J. (eds) Trends in Artificial Intelligence Theory and Applications. Artificial Intelligence Practices. IEA/AIE 2020. Lecture Notes in Computer Science(), vol 12144. Springer, Cham. https://doi.org/10.1007/978-3-030-55789-8_53

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-55789-8_53

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-55788-1

  • Online ISBN: 978-3-030-55789-8

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics