Classification of Audio Signals Using Gradient-Based Fuzzy c-Means Algorithm with Divergence Measure

Park, Dong-Chul; Nguyen, Duc-Hoai; Beack, Seung-Hwa; Park, Sancho

doi:10.1007/11581772_61

Dong-Chul Park¹⁸,
Duc-Hoai Nguyen¹⁸,
Seung-Hwa Beack¹⁸ &
…
Sancho Park¹⁹

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 3767))

Included in the following conference series:

Pacific-Rim Conference on Multimedia

1198 Accesses
7 Citations

Abstract

Multimedia databases usually store thousands of audio files such as music, speech and other sounds. One of the challenges in modern multimedia system is to classify and retrieve certain kinds of audio from the database. This paper proposes a novel classification algorithm for a content-based audio retrieval. The algorithm, called Gradient-Based Fuzzy c-Means Algorithm with Divergence Measure (GBFCM(DM)), is a neural network-based algorithm which utilizes the Divergence Measure to exploit the statistical nature of the audio data to improve the classification accuracy. Experiment results confirm that the proposed algorithm outperforms 3.025%-5.05% in accuracy in comparison with conventional algorithms such as the k-Means or the Self-Organizing Map.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Wold, E., Blum, T., Keislar, D., Wheaton, J.: Content-based classification, search, and retrieval of audio. IEEE Multimedia, 27–36 (1996)
Google Scholar
Saunders, J.: Real time discrimination of broadcast speech/music. In: Proc. Int. Conf. Acoustic, Speech, Signal Processing (ICASSP), pp. 993–996 (1996)
Google Scholar
Foote, J.: Content-based retrieval of music and audio. Proc. SPIE, Multimedia Storage and Archiving Systems II, 138–147 (1997)
Google Scholar
Li, G., Khokar, A.: Content-based indexing and retrieval of audio data using wavelets. In: Proc. Int. Conf. Multimedia Expo II, pp. 885–888 (2000)
Google Scholar
Lu, L., Li, S., Zhang, H.: Content-based audio segmentation using support vector machines. In: Proc. IEEE Int. Conf. Multimedia and Expo (ICME), Tokyo, Japan, pp. 749–752 (2001)
Google Scholar
Tzanetakis, G., Cook, P.: Music genre classification of audio signals. IEEE Trans. Speech Audio Process. 10, 293–302 (2002)
Article Google Scholar
Turnbull, D., Elkan, C.: Fast Recognition of Musical Genres Using RBF Networks. IEEE Trans. Knowledge and Data Engineering 17, 580–584 (2005)
Article Google Scholar
Burred, J.J., Lerch, A.: A hierachical approach to automatic music genre classification. In: Proc. 6th Int. Conf. Digital Audio Effect, London, UK (2003)
Google Scholar
Malheiro, R., Paiva, R.P., Mendes, A.J., Mendes, T., Cardoso, A.: Classification of recorded classical music using neural networks. In: 4th Int. ICSC Sym. on Engineering of Intelligent Systems (2004)
Google Scholar
Park, D.C., Dagher, I.: Gradient Based Fuzzy c-means (GBFCM) Algorithm. In: IEEE Int. Conf. on Neural Networks, ICNN 1994, vol. 3, pp. 1626–1631 (1994)
Google Scholar
Tolonen, T., Karjalainen, M.: A computationally efficient multipitch analysis model. IEEE Trans. Speech Audio Processing 8, 708–716 (2000)
Article Google Scholar
Bezdek, J.C.: A convergence theorem for the fuzzy ISODATA clustering algorithms. IEEE Trans. Pattern Anal. Mach. Int 2, 1–8 (1980)
Article MATH Google Scholar
Bezdek, J.C.: Pattern recognition with fuzzy objective function algorithms. Plenum, New York (1981)
MATH Google Scholar
Looney, C.: Pattern Recognition Using Neural Networks, pp. 252–254. Oxford University press, New York (1997)
Google Scholar
Windham, M.P.: Cluster Validity for the Fuzzy cneans clustering algorithm. IEEE Trans. Pattern Anal. Mach. Int. 4, 357–363 (1982)
Article Google Scholar
Fukunaga, K.: Introduction to Statistical Pattern Recognition, 2nd edn. Academic Press Inc., London (1990)
MATH Google Scholar
Yang, C.: Music Database Retrieval Based on Spectral Similarity, Stanford Univ Database Group, Stanford, CA, Tech. Rep. 2001-14 (2001)
Google Scholar

Download references

Author information

Authors and Affiliations

Dept. of Information Engineering, Myong Ji University, Korea
Dong-Chul Park, Duc-Hoai Nguyen & Seung-Hwa Beack
Davan Tech Co., Seongnam, Korea
Sancho Park

Authors

Dong-Chul Park
View author publications
You can also search for this author in PubMed Google Scholar
Duc-Hoai Nguyen
View author publications
You can also search for this author in PubMed Google Scholar
Seung-Hwa Beack
View author publications
You can also search for this author in PubMed Google Scholar
Sancho Park
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Gwangju Institute of Science and Technology (GIST), 1 Oryong-dong Buk-gu, 500-712, Gwangju, Korea
Yo-Sung Ho
Multimedia Security Lab, Korea University, Science Campus, 136-701, Seoul, Korea
Hyoung Joong Kim

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Park, DC., Nguyen, DH., Beack, SH., Park, S. (2005). Classification of Audio Signals Using Gradient-Based Fuzzy c-Means Algorithm with Divergence Measure. In: Ho, YS., Kim, H.J. (eds) Advances in Multimedia Information Processing - PCM 2005. PCM 2005. Lecture Notes in Computer Science, vol 3767. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11581772_61

Download citation

DOI: https://doi.org/10.1007/11581772_61
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-30027-4
Online ISBN: 978-3-540-32130-9
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics