Abstract
In this paper, we address the problem of automatic separations and recognition of bird vocalizations in a mixture of other environmental sounds using a single microphone. We present a novel single-channel audio separation method that trains statistical models of the sources to perform the separation on a feature space of low dimensionality and we lead the separated streams to automatic recognition engines that recognize general sound events. The experimental part tests and evaluates the system on mixtures of bird and insects songs as well as dog barks and other environmental sounds.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Riede, K.: Acoustic monitoring of Orthoptera and its potential for conservation. Journal of Insect Conservation 2(3-4), 217–223 (1998)
Goldhor, R.: Recognition of Environmental Sounds. In: Proceedings of the ICASSP 1993, vol. 1, pp. 149–152 (1993)
Eronen, A., et al.: Audio-Based Context Recognition. IEEE Transactions on Audio, Speech, and Language Processing 14(1), 321–329 (2006)
Cowling, M., Sitte, R.: Comparison of techniques for environmental sound recognition. Pattern Recognition Letters 24(15), 2895–2907 (2003)
Roweis, S.: One microphone source separation. In: Proc. NIPS, pp. 793–799 (2000)
Kristjansson, T., Attias, H., Hershey, J.: Single microphone source separation using high resolution signal reconstruction. In: Proc. of the ICASSP, pp. 817–820 (2004)
Benaroya, L., Bimbot, F., Gribonval, R.: Audio Source Separation With a Single Sensor. IEEE Trans. on Audio, Speech, and Lang. Proc. 14(1), 191–199 (2006)
Ozerov, A., Philippe, P., Bimbot, F., Gribonval, R.: Adaptation of Bayesian models for single-channel source separation and its application to voice/music separation in popular songs. IEEE Trans on Audio, Speech, and Lang. proc. 15(5), 1564–1578 (2007)
Ellis, D.: Model-Based Scene Analysis. In: Wang, D., Brown, G. (eds.) Computational Auditory Scene Analysis: Principles, Algorithms, and Applications, ch. 4, pp. 115–146. Wiley/IEEE Press (2006)
Virtanen, T.: Monaural sound source separation by non-negative matrix factorization with temporal continuity and sparseness criteria. IEEE Trans on Audio, Speech, and Language processing 15(3), 1066–1074 (2007)
Gales, M., Young, S.: Robust Continuous Speech Recognition using Parallel Model Combination. IEEE Trans on Audio, Speech, and Lang. proc. 4, 352–359 (1996)
Nabney, I.: Netlab: Algorithms for Pattern Recognition. Springer, UK (2002)
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2008 Springer-Verlag Berlin Heidelberg
About this chapter
Cite this chapter
Potamitis, I. (2008). One-Channel Separation and Recognition of Mixtures of Environmental Sounds: The Case of Bird-Song Classification in Composite Soundscenes. In: Tsihrintzis, G.A., Virvou, M., Howlett, R.J., Jain, L.C. (eds) New Directions in Intelligent Interactive Multimedia. Studies in Computational Intelligence, vol 142. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-68127-4_61
Download citation
DOI: https://doi.org/10.1007/978-3-540-68127-4_61
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-68126-7
Online ISBN: 978-3-540-68127-4
eBook Packages: EngineeringEngineering (R0)