To read this content please select one of the options below:

Automatic speaker tracking by camera using two‐channel‐based sound source localization

Halim Sayoud (Faculty of Electronics, USTHB University, Alger, Algeria)
Siham Ouamour (Faculty of Electronics, USTHB University, Alger, Algeria)
Salah Khennouf (Faculty of Electronics, USTHB University, Alger, Algeria)

International Journal of Intelligent Computing and Cybernetics

ISSN: 1756-378X

Article publication date: 29 March 2011

445

Abstract

Purpose

The purpose of this paper is two‐fold. First, to deal with the problem of audio speaker localization and second, to deal with the problem of mobile camera control. The task of speaker localization consists of determining the position of the active speaker and the task of camera control consists of orienting a mobile camera towards that active speaker. These steps represent the main task of speaker tracking, which is the global purpose of the research work.

Design/methodology/approach

In this approach, two‐channel‐based estimation of the speaker position is achieved by comparing the signals received by two cardioids microphones, which are placed the one against the other and separated by a fixed distance. The localization technique presented in this paper is inspired from the human ears, which act as two different sound observation points, enabling humans to estimate the direction of the speaking person with a good precision. Concerning the camera control part, the authors have conceived an automatic system for generating the command signals and controlling the rotation of the mobile camera by a stepper motor.

Findings

The off‐line experiments of speaker tracking by camera have been done in a small meeting room without echo cancelation. Results show the good performances of the proposed localization methods and a correct tracking by camera.

Practical implications

This new technique can be used for the automatic supervision of smart rooms.

Originality/value

The work described in this paper is original, since it uses only two microphones for the speaker localization.

Keywords

Citation

Sayoud, H., Ouamour, S. and Khennouf, S. (2011), "Automatic speaker tracking by camera using two‐channel‐based sound source localization", International Journal of Intelligent Computing and Cybernetics, Vol. 4 No. 1, pp. 40-60. https://doi.org/10.1108/17563781111115787

Publisher

:

Emerald Group Publishing Limited

Copyright © 2011, Emerald Group Publishing Limited

Related articles