Automatic speaker tracking by camera using two‐channel‐based sound source localization
International Journal of Intelligent Computing and Cybernetics
ISSN: 1756-378X
Article publication date: 29 March 2011
Abstract
Purpose
The purpose of this paper is two‐fold. First, to deal with the problem of audio speaker localization and second, to deal with the problem of mobile camera control. The task of speaker localization consists of determining the position of the active speaker and the task of camera control consists of orienting a mobile camera towards that active speaker. These steps represent the main task of speaker tracking, which is the global purpose of the research work.
Design/methodology/approach
In this approach, two‐channel‐based estimation of the speaker position is achieved by comparing the signals received by two cardioids microphones, which are placed the one against the other and separated by a fixed distance. The localization technique presented in this paper is inspired from the human ears, which act as two different sound observation points, enabling humans to estimate the direction of the speaking person with a good precision. Concerning the camera control part, the authors have conceived an automatic system for generating the command signals and controlling the rotation of the mobile camera by a stepper motor.
Findings
The off‐line experiments of speaker tracking by camera have been done in a small meeting room without echo cancelation. Results show the good performances of the proposed localization methods and a correct tracking by camera.
Practical implications
This new technique can be used for the automatic supervision of smart rooms.
Originality/value
The work described in this paper is original, since it uses only two microphones for the speaker localization.
Keywords
Citation
Sayoud, H., Ouamour, S. and Khennouf, S. (2011), "Automatic speaker tracking by camera using two‐channel‐based sound source localization", International Journal of Intelligent Computing and Cybernetics, Vol. 4 No. 1, pp. 40-60. https://doi.org/10.1108/17563781111115787
Publisher
:Emerald Group Publishing Limited
Copyright © 2011, Emerald Group Publishing Limited