One-Channel Separation and Recognition of Mixtures of Environmental Sounds: The Case of Bird-Song Classification in Composite Soundscenes

Potamitis, Ilyas

doi:10.1007/978-3-540-68127-4_61

Ilyas Potamitis¹

Part of the book series: Studies in Computational Intelligence ((SCI,volume 142))

942 Accesses

Abstract

In this paper, we address the problem of automatic separations and recognition of bird vocalizations in a mixture of other environmental sounds using a single microphone. We present a novel single-channel audio separation method that trains statistical models of the sources to perform the separation on a feature space of low dimensionality and we lead the separated streams to automatic recognition engines that recognize general sound events. The experimental part tests and evaluates the system on mixtures of bird and insects songs as well as dog barks and other environmental sounds.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 169.00; Price excludes VAT (USA)

Softcover Book: USD 219.99; Price excludes VAT (USA)

Hardcover Book: USD 219.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Riede, K.: Acoustic monitoring of Orthoptera and its potential for conservation. Journal of Insect Conservation 2(3-4), 217–223 (1998)
Article Google Scholar
Goldhor, R.: Recognition of Environmental Sounds. In: Proceedings of the ICASSP 1993, vol. 1, pp. 149–152 (1993)
Google Scholar
Eronen, A., et al.: Audio-Based Context Recognition. IEEE Transactions on Audio, Speech, and Language Processing 14(1), 321–329 (2006)
Article Google Scholar
Cowling, M., Sitte, R.: Comparison of techniques for environmental sound recognition. Pattern Recognition Letters 24(15), 2895–2907 (2003)
Article Google Scholar
Roweis, S.: One microphone source separation. In: Proc. NIPS, pp. 793–799 (2000)
Google Scholar
Kristjansson, T., Attias, H., Hershey, J.: Single microphone source separation using high resolution signal reconstruction. In: Proc. of the ICASSP, pp. 817–820 (2004)
Google Scholar
Benaroya, L., Bimbot, F., Gribonval, R.: Audio Source Separation With a Single Sensor. IEEE Trans. on Audio, Speech, and Lang. Proc. 14(1), 191–199 (2006)
Article Google Scholar
Ozerov, A., Philippe, P., Bimbot, F., Gribonval, R.: Adaptation of Bayesian models for single-channel source separation and its application to voice/music separation in popular songs. IEEE Trans on Audio, Speech, and Lang. proc. 15(5), 1564–1578 (2007)
Article Google Scholar
Ellis, D.: Model-Based Scene Analysis. In: Wang, D., Brown, G. (eds.) Computational Auditory Scene Analysis: Principles, Algorithms, and Applications, ch. 4, pp. 115–146. Wiley/IEEE Press (2006)
Google Scholar
Virtanen, T.: Monaural sound source separation by non-negative matrix factorization with temporal continuity and sparseness criteria. IEEE Trans on Audio, Speech, and Language processing 15(3), 1066–1074 (2007)
Article Google Scholar
Gales, M., Young, S.: Robust Continuous Speech Recognition using Parallel Model Combination. IEEE Trans on Audio, Speech, and Lang. proc. 4, 352–359 (1996)
Article Google Scholar
Nabney, I.: Netlab: Algorithms for Pattern Recognition. Springer, UK (2002)
MATH Google Scholar

Download references

Author information

Authors and Affiliations

Department of Music Technology and Acoustics, Technological Educational Institute of Crete, , Greece
Ilyas Potamitis

Authors

Ilyas Potamitis
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

George A. Tsihrintzis Maria Virvou Robert J. Howlett Lakhmi C. Jain

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Potamitis, I. (2008). One-Channel Separation and Recognition of Mixtures of Environmental Sounds: The Case of Bird-Song Classification in Composite Soundscenes. In: Tsihrintzis, G.A., Virvou, M., Howlett, R.J., Jain, L.C. (eds) New Directions in Intelligent Interactive Multimedia. Studies in Computational Intelligence, vol 142. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-68127-4_61

Download citation

DOI: https://doi.org/10.1007/978-3-540-68127-4_61
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-68126-7
Online ISBN: 978-3-540-68127-4
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics