Abstract
Blind separation of musical sounds contained in sound mixtures is a challenging task. This is because, in Western music, mixed harmonic sources may be correlated with each other: their harmonic partials may overlap in the frequency domain whenever the signals stand in a harmonic relation. Evaluating the separation results is also problematic, since the energy-based error between the original signals used for mixing and the separated ones does not, in some cases, correspond with perceptual evaluation results. In this paper, four separation algorithms developed by the authors are presented. Musical instrument sound identification based on artificial neural networks is then performed as a means of evaluating the performance of the separation algorithms. Results are discussed and conclusions are drawn.
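The overlap of harmonic partials mentioned above can be illustrated with a short sketch (not from the paper; the function name and tolerance are illustrative assumptions): two harmonic sources whose fundamentals stand in a simple integer ratio share many partial frequencies, which is precisely what makes energy-based separation of such mixtures difficult.

```python
def overlapping_partials(f0_a, f0_b, n_partials=20, tol_hz=1.0):
    """Return the partial frequencies (Hz) of source B that coincide,
    within tol_hz, with a partial of source A. Illustrative sketch:
    both sources are modeled as ideally harmonic (partials at k * f0)."""
    partials_a = [k * f0_a for k in range(1, n_partials + 1)]
    return [k * f0_b
            for k in range(1, n_partials + 1)
            if any(abs(k * f0_b - fa) <= tol_hz for fa in partials_a)]

# A perfect fifth (3:2 ratio), e.g. C4 (261.63 Hz) and G4 (392.445 Hz):
# every even-numbered partial of the upper note lands on a partial
# of the lower note, so the two spectra are heavily entangled.
shared = overlapping_partials(261.63, 261.63 * 1.5)
print(shared)
```

For intervals with less simple frequency ratios the overlap shrinks, which is one reason separation difficulty varies strongly with the musical interval between the mixed sources.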
Dziubinski, M., Dalka, P. & Kostek, B. Estimation of Musical Sound Separation Algorithm Effectiveness Employing Neural Networks. J Intell Inf Syst 24, 133–157 (2005). https://doi.org/10.1007/s10844-005-0320-x