Skip to main content

Classification of Audio Signals Using Gradient-Based Fuzzy c-Means Algorithm with Divergence Measure

  • Conference paper
Advances in Multimedia Information Processing - PCM 2005 (PCM 2005)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 3767))

Included in the following conference series:

Abstract

Multimedia databases usually store thousands of audio files such as music, speech and other sounds. One of the challenges in modern multimedia system is to classify and retrieve certain kinds of audio from the database. This paper proposes a novel classification algorithm for a content-based audio retrieval. The algorithm, called Gradient-Based Fuzzy c-Means Algorithm with Divergence Measure (GBFCM(DM)), is a neural network-based algorithm which utilizes the Divergence Measure to exploit the statistical nature of the audio data to improve the classification accuracy. Experiment results confirm that the proposed algorithm outperforms 3.025%-5.05% in accuracy in comparison with conventional algorithms such as the k-Means or the Self-Organizing Map.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Wold, E., Blum, T., Keislar, D., Wheaton, J.: Content-based classification, search, and retrieval of audio. IEEE Multimedia, 27–36 (1996)

    Google Scholar 

  2. Saunders, J.: Real time discrimination of broadcast speech/music. In: Proc. Int. Conf. Acoustic, Speech, Signal Processing (ICASSP), pp. 993–996 (1996)

    Google Scholar 

  3. Foote, J.: Content-based retrieval of music and audio. Proc. SPIE, Multimedia Storage and Archiving Systems II, 138–147 (1997)

    Google Scholar 

  4. Li, G., Khokar, A.: Content-based indexing and retrieval of audio data using wavelets. In: Proc. Int. Conf. Multimedia Expo II, pp. 885–888 (2000)

    Google Scholar 

  5. Lu, L., Li, S., Zhang, H.: Content-based audio segmentation using support vector machines. In: Proc. IEEE Int. Conf. Multimedia and Expo (ICME), Tokyo, Japan, pp. 749–752 (2001)

    Google Scholar 

  6. Tzanetakis, G., Cook, P.: Music genre classification of audio signals. IEEE Trans. Speech Audio Process. 10, 293–302 (2002)

    Article  Google Scholar 

  7. Turnbull, D., Elkan, C.: Fast Recognition of Musical Genres Using RBF Networks. IEEE Trans. Knowledge and Data Engineering 17, 580–584 (2005)

    Article  Google Scholar 

  8. Burred, J.J., Lerch, A.: A hierachical approach to automatic music genre classification. In: Proc. 6th Int. Conf. Digital Audio Effect, London, UK (2003)

    Google Scholar 

  9. Malheiro, R., Paiva, R.P., Mendes, A.J., Mendes, T., Cardoso, A.: Classification of recorded classical music using neural networks. In: 4th Int. ICSC Sym. on Engineering of Intelligent Systems (2004)

    Google Scholar 

  10. Park, D.C., Dagher, I.: Gradient Based Fuzzy c-means (GBFCM) Algorithm. In: IEEE Int. Conf. on Neural Networks, ICNN 1994, vol. 3, pp. 1626–1631 (1994)

    Google Scholar 

  11. Tolonen, T., Karjalainen, M.: A computationally efficient multipitch analysis model. IEEE Trans. Speech Audio Processing 8, 708–716 (2000)

    Article  Google Scholar 

  12. Bezdek, J.C.: A convergence theorem for the fuzzy ISODATA clustering algorithms. IEEE Trans. Pattern Anal. Mach. Int 2, 1–8 (1980)

    Article  MATH  Google Scholar 

  13. Bezdek, J.C.: Pattern recognition with fuzzy objective function algorithms. Plenum, New York (1981)

    MATH  Google Scholar 

  14. Looney, C.: Pattern Recognition Using Neural Networks, pp. 252–254. Oxford University press, New York (1997)

    Google Scholar 

  15. Windham, M.P.: Cluster Validity for the Fuzzy cneans clustering algorithm. IEEE Trans. Pattern Anal. Mach. Int. 4, 357–363 (1982)

    Article  Google Scholar 

  16. Fukunaga, K.: Introduction to Statistical Pattern Recognition, 2nd edn. Academic Press Inc., London (1990)

    MATH  Google Scholar 

  17. Yang, C.: Music Database Retrieval Based on Spectral Similarity, Stanford Univ Database Group, Stanford, CA, Tech. Rep. 2001-14 (2001)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2005 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Park, DC., Nguyen, DH., Beack, SH., Park, S. (2005). Classification of Audio Signals Using Gradient-Based Fuzzy c-Means Algorithm with Divergence Measure. In: Ho, YS., Kim, H.J. (eds) Advances in Multimedia Information Processing - PCM 2005. PCM 2005. Lecture Notes in Computer Science, vol 3767. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11581772_61

Download citation

  • DOI: https://doi.org/10.1007/11581772_61

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-30027-4

  • Online ISBN: 978-3-540-32130-9

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics