Abstract
Multimedia databases usually store thousands of audio files such as music, speech and other sounds. One of the challenges in modern multimedia system is to classify and retrieve certain kinds of audio from the database. This paper proposes a novel classification algorithm for a content-based audio retrieval. The algorithm, called Gradient-Based Fuzzy c-Means Algorithm with Divergence Measure (GBFCM(DM)), is a neural network-based algorithm which utilizes the Divergence Measure to exploit the statistical nature of the audio data to improve the classification accuracy. Experiment results confirm that the proposed algorithm outperforms 3.025%-5.05% in accuracy in comparison with conventional algorithms such as the k-Means or the Self-Organizing Map.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Wold, E., Blum, T., Keislar, D., Wheaton, J.: Content-based classification, search, and retrieval of audio. IEEE Multimedia, 27–36 (1996)
Saunders, J.: Real time discrimination of broadcast speech/music. In: Proc. Int. Conf. Acoustic, Speech, Signal Processing (ICASSP), pp. 993–996 (1996)
Foote, J.: Content-based retrieval of music and audio. Proc. SPIE, Multimedia Storage and Archiving Systems II, 138–147 (1997)
Li, G., Khokar, A.: Content-based indexing and retrieval of audio data using wavelets. In: Proc. Int. Conf. Multimedia Expo II, pp. 885–888 (2000)
Lu, L., Li, S., Zhang, H.: Content-based audio segmentation using support vector machines. In: Proc. IEEE Int. Conf. Multimedia and Expo (ICME), Tokyo, Japan, pp. 749–752 (2001)
Tzanetakis, G., Cook, P.: Music genre classification of audio signals. IEEE Trans. Speech Audio Process. 10, 293–302 (2002)
Turnbull, D., Elkan, C.: Fast Recognition of Musical Genres Using RBF Networks. IEEE Trans. Knowledge and Data Engineering 17, 580–584 (2005)
Burred, J.J., Lerch, A.: A hierachical approach to automatic music genre classification. In: Proc. 6th Int. Conf. Digital Audio Effect, London, UK (2003)
Malheiro, R., Paiva, R.P., Mendes, A.J., Mendes, T., Cardoso, A.: Classification of recorded classical music using neural networks. In: 4th Int. ICSC Sym. on Engineering of Intelligent Systems (2004)
Park, D.C., Dagher, I.: Gradient Based Fuzzy c-means (GBFCM) Algorithm. In: IEEE Int. Conf. on Neural Networks, ICNN 1994, vol. 3, pp. 1626–1631 (1994)
Tolonen, T., Karjalainen, M.: A computationally efficient multipitch analysis model. IEEE Trans. Speech Audio Processing 8, 708–716 (2000)
Bezdek, J.C.: A convergence theorem for the fuzzy ISODATA clustering algorithms. IEEE Trans. Pattern Anal. Mach. Int 2, 1–8 (1980)
Bezdek, J.C.: Pattern recognition with fuzzy objective function algorithms. Plenum, New York (1981)
Looney, C.: Pattern Recognition Using Neural Networks, pp. 252–254. Oxford University press, New York (1997)
Windham, M.P.: Cluster Validity for the Fuzzy cneans clustering algorithm. IEEE Trans. Pattern Anal. Mach. Int. 4, 357–363 (1982)
Fukunaga, K.: Introduction to Statistical Pattern Recognition, 2nd edn. Academic Press Inc., London (1990)
Yang, C.: Music Database Retrieval Based on Spectral Similarity, Stanford Univ Database Group, Stanford, CA, Tech. Rep. 2001-14 (2001)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Park, DC., Nguyen, DH., Beack, SH., Park, S. (2005). Classification of Audio Signals Using Gradient-Based Fuzzy c-Means Algorithm with Divergence Measure. In: Ho, YS., Kim, H.J. (eds) Advances in Multimedia Information Processing - PCM 2005. PCM 2005. Lecture Notes in Computer Science, vol 3767. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11581772_61
Download citation
DOI: https://doi.org/10.1007/11581772_61
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-30027-4
Online ISBN: 978-3-540-32130-9
eBook Packages: Computer ScienceComputer Science (R0)