Skip to main content

One-Channel Separation and Recognition of Mixtures of Environmental Sounds: The Case of Bird-Song Classification in Composite Soundscenes

  • Chapter
New Directions in Intelligent Interactive Multimedia

Part of the book series: Studies in Computational Intelligence ((SCI,volume 142))

  • 942 Accesses

Abstract

In this paper, we address the problem of automatic separations and recognition of bird vocalizations in a mixture of other environmental sounds using a single microphone. We present a novel single-channel audio separation method that trains statistical models of the sources to perform the separation on a feature space of low dimensionality and we lead the separated streams to automatic recognition engines that recognize general sound events. The experimental part tests and evaluates the system on mixtures of bird and insects songs as well as dog barks and other environmental sounds.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 219.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Riede, K.: Acoustic monitoring of Orthoptera and its potential for conservation. Journal of Insect Conservation 2(3-4), 217–223 (1998)

    Article  Google Scholar 

  2. Goldhor, R.: Recognition of Environmental Sounds. In: Proceedings of the ICASSP 1993, vol. 1, pp. 149–152 (1993)

    Google Scholar 

  3. Eronen, A., et al.: Audio-Based Context Recognition. IEEE Transactions on Audio, Speech, and Language Processing 14(1), 321–329 (2006)

    Article  Google Scholar 

  4. Cowling, M., Sitte, R.: Comparison of techniques for environmental sound recognition. Pattern Recognition Letters 24(15), 2895–2907 (2003)

    Article  Google Scholar 

  5. Roweis, S.: One microphone source separation. In: Proc. NIPS, pp. 793–799 (2000)

    Google Scholar 

  6. Kristjansson, T., Attias, H., Hershey, J.: Single microphone source separation using high resolution signal reconstruction. In: Proc. of the ICASSP, pp. 817–820 (2004)

    Google Scholar 

  7. Benaroya, L., Bimbot, F., Gribonval, R.: Audio Source Separation With a Single Sensor. IEEE Trans. on Audio, Speech, and Lang. Proc. 14(1), 191–199 (2006)

    Article  Google Scholar 

  8. Ozerov, A., Philippe, P., Bimbot, F., Gribonval, R.: Adaptation of Bayesian models for single-channel source separation and its application to voice/music separation in popular songs. IEEE Trans on Audio, Speech, and Lang. proc. 15(5), 1564–1578 (2007)

    Article  Google Scholar 

  9. Ellis, D.: Model-Based Scene Analysis. In: Wang, D., Brown, G. (eds.) Computational Auditory Scene Analysis: Principles, Algorithms, and Applications, ch. 4, pp. 115–146. Wiley/IEEE Press (2006)

    Google Scholar 

  10. Virtanen, T.: Monaural sound source separation by non-negative matrix factorization with temporal continuity and sparseness criteria. IEEE Trans on Audio, Speech, and Language processing 15(3), 1066–1074 (2007)

    Article  Google Scholar 

  11. Gales, M., Young, S.: Robust Continuous Speech Recognition using Parallel Model Combination. IEEE Trans on Audio, Speech, and Lang. proc. 4, 352–359 (1996)

    Article  Google Scholar 

  12. Nabney, I.: Netlab: Algorithms for Pattern Recognition. Springer, UK (2002)

    MATH  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

George A. Tsihrintzis Maria Virvou Robert J. Howlett Lakhmi C. Jain

Rights and permissions

Reprints and permissions

Copyright information

© 2008 Springer-Verlag Berlin Heidelberg

About this chapter

Cite this chapter

Potamitis, I. (2008). One-Channel Separation and Recognition of Mixtures of Environmental Sounds: The Case of Bird-Song Classification in Composite Soundscenes. In: Tsihrintzis, G.A., Virvou, M., Howlett, R.J., Jain, L.C. (eds) New Directions in Intelligent Interactive Multimedia. Studies in Computational Intelligence, vol 142. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-68127-4_61

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-68127-4_61

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-68126-7

  • Online ISBN: 978-3-540-68127-4

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics