Abstract
This paper presents the novel system for automatic camera steering towards the active speakers in the conference room environment. The speech signal captured by the array is analysed simultaneously in time-frequency domain using Short Time Fourier Transform (STFT) and Discrete Wavelet Transform (DWT). This paper analyses and studies the concepts and performance of Beamforming and Subspace methods based on above transform in estimating the direction of arrival (DOA) of speech signals impinging on the array. The estimated DOA is converted into rotational command and inputted to the motor controller unit to rotate the mounted camera motor. The presented system is controlled interactively by graphical user interface (GUI) developed using MATLAB. Experimental results show that DWT based Subspace methods outperform in localizing the speakers.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Mukul, M.K., Prasad, R., Choudhary, M.M., Matsuno, F.: Steering of camera by stepper motor towards active speaker using microphone array. In: SICE Annual Conference, pp. 19–24 (2008)
Gatica-Perez, D., Odobez, J.M., Ba, S., Smith, K., Lathoud, G.: Tracking people in meetings with particles. In: Proceedings of the International Workshop on Image Analysis for Multimedia Interactive Services (WIAMIS) (2005). invited paper: EPFL-CONF-83308
Maeng, J., Williams, E.R.: Automatic voice tracking camera system and method of operation. U.S. Patent 6731334 B1 (2004)
Garg, S., Tiwari, S., Chauhan, S.S., Singh, S., Ahmad, S.: Rotating camera based on speaker voice. Int. J. Adv. Res. Electr. Electron. Instrum. Eng. 2, 1674–1681 (2013)
Cohen, L.: Time-Frequency Analysis. Prentice-Hall, New York (1995)
Time-frequency representation. https://en.wikipedia.org/wiki/Time-frequency_representation/
Sejdic, E., Djurovic, I., Jiang, J.: Time-frequency feature representation using energy concentration: An overview of recent advances. Digit. Signal Proc. 19, 153–183 (2009)
Allen, J.B.: Applications of the short time fourier transform to speech processing and spectral analysis. In: IEEE International Conference on Acoustics, Speech, and Signal Processing ICASSP 1982, vol. 7, pp. 1012–1015 (1982)
Kayhan, A.S., El-Jaroudi, A., Chaparro, L.F.: Evolutionary periodogram for non-stationary signals. IEEE Trans. Signal Process. 42(6), 1527–1536 (1994)
Riley, M.D.: Speech Time-Frequency Representations, vol. 63. Springer, New York (2012)
Barford, P., Kline, J., Plonka, D., Ron, A.: A signal analysis of network anomalies. In: Proceedings of the 2nd ACM SIGCOMM Workshop on Internet Measurement, pp. 71–82 (2002)
Vetterli, M., Herley, C.: Wavelets and filter banks: Theory and design. IEEE Trans. Signal Process. 40(9), 2207–2232 (1992)
Krim, H., Viberg, M.: Two decades of array signal processing research: the parametric approach. Signal Process. Mag. IEEE 13(4), 67–94 (1996)
Hwang, H.K., Aliyazicioglu, Z., Grice, M., Yakovlev, A.: Direction of arrival estimation using a root-MUSIC algorithm. In: Proceedings of the International Multi Conference of Engineers and Computer Scientists, vol. 2 (2008)
Vesa, A.: Direction of the arrival estimation using music and root-music algorithm. In: 18th Telecommunications Forum, pp. 582–585 (2010)
Haardt, M., Zoltowski, M.D., Mathews, C.P., Nossek, J.A.: 2D unitary ESPRIT for efficient 2D parameter estimation. In: International Conference on Acoustics, Speech, and Signal Processing ICASSP 1995, vol. 3, pp. 2096–2099 (1995)
Khmou, Y., Safi, S., Frikel, M.: Comparative study between several direction of arrival estimation methods. J. Telecommun. Inf. Technol. 1, 41–48 (2014)
Capon, J.: High-resolution frequency-wavenumber spectrum analysis. Proc. IEEE 57(8), 1408–1418 (1969)
Ejaz, S., Shafiq, M.A.: Comparison of spectral and subspace algorithms for FM source estimation. Prog. Electromagnet. Res. C 14, 11–21 (2010)
Gu, X., Zhang, Y.: Resolution threshold analysis of MUSIC algorithm in radar imaging. Prog. Electromagnet. Res. B 31, 297–321 (2011)
Schmidt, R.O.: Multiple emitter location and signal parameter estimation. IEEE Trans. Antennas Propag. 34(3), 276–280 (1986)
Godora, L.C.: Application of antenna arrays to mobile communications, beamforming and direction-of-arrival considerations. Proc. IEEE 85, 1195–1245 (1997)
Benesty, J., Chen, J., Huang, Y.: Microphone Array Signal Processing, vol. 1. Springer, Heidelberg (2008)
Anand, A., Mukul, M.K.: Comparison of STFT based direction of arrival estimation techniques for speech signal. In: RTEICT, pp. 356–360, 20–21 May 2016
Anand, A., Mukul, M.K.: Comparative analysis of different direction of arrival estimation techniques. In: RTEICT, pp. 200–205, 20–21 May 2016
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer International Publishing AG
About this paper
Cite this paper
Mukul, M.K. (2018). DWT Based Source Localization Using Microphone Array. In: Abraham, A., Cherukuri, A., Madureira, A., Muda, A. (eds) Proceedings of the Eighth International Conference on Soft Computing and Pattern Recognition (SoCPaR 2016). SoCPaR 2016. Advances in Intelligent Systems and Computing, vol 614. Springer, Cham. https://doi.org/10.1007/978-3-319-60618-7_22
Download citation
DOI: https://doi.org/10.1007/978-3-319-60618-7_22
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-60617-0
Online ISBN: 978-3-319-60618-7
eBook Packages: EngineeringEngineering (R0)