Abstract
In this work we deal with the fusion of the estimates of independentmicrophone arrays to produce an improved estimate of the Direction of Arrival(DOA) of one moving speaker, as well as localization coordinates of multiple moving speakers based on Time Delay Of Arrivals (TDOA). Our approach (a) fuses measurements from independent arrays, (b) incorporates kinematic information of speakers’ movement by using parallel Kalman filters, and (c) associates observations to specific speakers by using a Probabilistic Data Association (PDA) technique. We demonstrate that a network of arrays combined with statistical fusion techniques provides a consistent and coherent way to reduce uncertainty and ambiguity of measurements. The efficiency of the approach is illustrated on a simulation dealing with beamforming onemoving speaker on an extended basis and localization of two closely spaced moving speakers with crossing trajectories.
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Bar-Shalom, Y., Li, X., Kirubarajan, T.: Estimation with application to tracking and navigation. Wiley, Chichester (2001)
Blackman, S., Popoli, R.: Design and analysis of modern tracking systems. Artech House (1999)
Chen, H., et al.: Multiple Target Tracking with Multiple Finite Resolution Sensors. In: 5th International Conference on Information Fusion (2002)
Aarabi, P., Zaky, S.: Robust sound localization using multi-source audiovisual information fusion. Elsevier, Information Fusion 2, 209–223 (2001)
Beal, M., Attias, H., Jojic, N.: Audio visual sensor fusion with probabilistic Graphical models. In: 7th European Conference on Computer Vision, vol. 1, pp. 736–750
Brandstcin, M.: A Framework for Speech Source Localization Using Sensor Arrays, PhD thesis, Brown University, Providence, RI (1995)
Willner, D., et al.: Kalman filter algorithms for a multi sensor system. In: IEEE Conf. on Decision &Control (1976)
Sturim, D., Brandstein, M., Silverman, H.: Tracking Multiple Talkers using Microphone Array Measurements. In: IEEE Proc. ICASSP, pp. 371–374 (1997)
Mazor, E., Averbuch, A., Bar-Shalom, Y., Dayan, J.: IMM methods in target tracking. IEEE Trans, on Aerospace and Electronics Svstems 34(l), 103–123 (1998)
Johnson, D., Dudgeon, D.: Array Signal Processing: Concepts and Techniques. Prentice Hall, Englewood Cliffs (1993)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2003 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Potamitis, I., Tremoulis, G., Fakotakis, N. (2003). Multi-array Multi-speaker Tracking. In: Matoušek, V., Mautner, P. (eds) Text, Speech and Dialogue. TSD 2003. Lecture Notes in Computer Science(), vol 2807. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-39398-6_29
Download citation
DOI: https://doi.org/10.1007/978-3-540-39398-6_29
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-20024-6
Online ISBN: 978-3-540-39398-6
eBook Packages: Springer Book Archive