Skip to main content

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 614))

Included in the following conference series:

  • 1274 Accesses

Abstract

This paper presents the novel system for automatic camera steering towards the active speakers in the conference room environment. The speech signal captured by the array is analysed simultaneously in time-frequency domain using Short Time Fourier Transform (STFT) and Discrete Wavelet Transform (DWT). This paper analyses and studies the concepts and performance of Beamforming and Subspace methods based on above transform in estimating the direction of arrival (DOA) of speech signals impinging on the array. The estimated DOA is converted into rotational command and inputted to the motor controller unit to rotate the mounted camera motor. The presented system is controlled interactively by graphical user interface (GUI) developed using MATLAB. Experimental results show that DWT based Subspace methods outperform in localizing the speakers.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Mukul, M.K., Prasad, R., Choudhary, M.M., Matsuno, F.: Steering of camera by stepper motor towards active speaker using microphone array. In: SICE Annual Conference, pp. 19–24 (2008)

    Google Scholar 

  2. Gatica-Perez, D., Odobez, J.M., Ba, S., Smith, K., Lathoud, G.: Tracking people in meetings with particles. In: Proceedings of the International Workshop on Image Analysis for Multimedia Interactive Services (WIAMIS) (2005). invited paper: EPFL-CONF-83308

    Google Scholar 

  3. Maeng, J., Williams, E.R.: Automatic voice tracking camera system and method of operation. U.S. Patent 6731334 B1 (2004)

    Google Scholar 

  4. Garg, S., Tiwari, S., Chauhan, S.S., Singh, S., Ahmad, S.: Rotating camera based on speaker voice. Int. J. Adv. Res. Electr. Electron. Instrum. Eng. 2, 1674–1681 (2013)

    Google Scholar 

  5. Cohen, L.: Time-Frequency Analysis. Prentice-Hall, New York (1995)

    Google Scholar 

  6. Time-frequency representation. https://en.wikipedia.org/wiki/Time-frequency_representation/

  7. Sejdic, E., Djurovic, I., Jiang, J.: Time-frequency feature representation using energy concentration: An overview of recent advances. Digit. Signal Proc. 19, 153–183 (2009)

    Article  Google Scholar 

  8. Allen, J.B.: Applications of the short time fourier transform to speech processing and spectral analysis. In: IEEE International Conference on Acoustics, Speech, and Signal Processing ICASSP 1982, vol. 7, pp. 1012–1015 (1982)

    Google Scholar 

  9. Kayhan, A.S., El-Jaroudi, A., Chaparro, L.F.: Evolutionary periodogram for non-stationary signals. IEEE Trans. Signal Process. 42(6), 1527–1536 (1994)

    Article  Google Scholar 

  10. Riley, M.D.: Speech Time-Frequency Representations, vol. 63. Springer, New York (2012)

    Google Scholar 

  11. Barford, P., Kline, J., Plonka, D., Ron, A.: A signal analysis of network anomalies. In: Proceedings of the 2nd ACM SIGCOMM Workshop on Internet Measurement, pp. 71–82 (2002)

    Google Scholar 

  12. Vetterli, M., Herley, C.: Wavelets and filter banks: Theory and design. IEEE Trans. Signal Process. 40(9), 2207–2232 (1992)

    Article  MATH  Google Scholar 

  13. Krim, H., Viberg, M.: Two decades of array signal processing research: the parametric approach. Signal Process. Mag. IEEE 13(4), 67–94 (1996)

    Article  Google Scholar 

  14. Hwang, H.K., Aliyazicioglu, Z., Grice, M., Yakovlev, A.: Direction of arrival estimation using a root-MUSIC algorithm. In: Proceedings of the International Multi Conference of Engineers and Computer Scientists, vol. 2 (2008)

    Google Scholar 

  15. Vesa, A.: Direction of the arrival estimation using music and root-music algorithm. In: 18th Telecommunications Forum, pp. 582–585 (2010)

    Google Scholar 

  16. Haardt, M., Zoltowski, M.D., Mathews, C.P., Nossek, J.A.: 2D unitary ESPRIT for efficient 2D parameter estimation. In: International Conference on Acoustics, Speech, and Signal Processing ICASSP 1995, vol. 3, pp. 2096–2099 (1995)

    Google Scholar 

  17. Khmou, Y., Safi, S., Frikel, M.: Comparative study between several direction of arrival estimation methods. J. Telecommun. Inf. Technol. 1, 41–48 (2014)

    Google Scholar 

  18. Capon, J.: High-resolution frequency-wavenumber spectrum analysis. Proc. IEEE 57(8), 1408–1418 (1969)

    Article  Google Scholar 

  19. Ejaz, S., Shafiq, M.A.: Comparison of spectral and subspace algorithms for FM source estimation. Prog. Electromagnet. Res. C 14, 11–21 (2010)

    Article  Google Scholar 

  20. Gu, X., Zhang, Y.: Resolution threshold analysis of MUSIC algorithm in radar imaging. Prog. Electromagnet. Res. B 31, 297–321 (2011)

    Article  Google Scholar 

  21. Schmidt, R.O.: Multiple emitter location and signal parameter estimation. IEEE Trans. Antennas Propag. 34(3), 276–280 (1986)

    Article  Google Scholar 

  22. Godora, L.C.: Application of antenna arrays to mobile communications, beamforming and direction-of-arrival considerations. Proc. IEEE 85, 1195–1245 (1997)

    Article  Google Scholar 

  23. Benesty, J., Chen, J., Huang, Y.: Microphone Array Signal Processing, vol. 1. Springer, Heidelberg (2008)

    Google Scholar 

  24. Anand, A., Mukul, M.K.: Comparison of STFT based direction of arrival estimation techniques for speech signal. In: RTEICT, pp. 356–360, 20–21 May 2016

    Google Scholar 

  25. Anand, A., Mukul, M.K.: Comparative analysis of different direction of arrival estimation techniques. In: RTEICT, pp. 200–205, 20–21 May 2016

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Manoj Kumar Mukul .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2018 Springer International Publishing AG

About this paper

Cite this paper

Mukul, M.K. (2018). DWT Based Source Localization Using Microphone Array. In: Abraham, A., Cherukuri, A., Madureira, A., Muda, A. (eds) Proceedings of the Eighth International Conference on Soft Computing and Pattern Recognition (SoCPaR 2016). SoCPaR 2016. Advances in Intelligent Systems and Computing, vol 614. Springer, Cham. https://doi.org/10.1007/978-3-319-60618-7_22

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-60618-7_22

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-60617-0

  • Online ISBN: 978-3-319-60618-7

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics