Skip to main content
Log in

Content-based audio classification and retrieval using a fuzzy logic system: towards multimedia search engines

  • Focus
  • Published:
Soft Computing Aims and scope Submit manuscript

Abstract

 In recent years, available audio corpora are rapidly increasing from fast growing Internet and digital libraries. How to classify and retrieve sound files relevant to the user's interest from large databases is crucial for building multimedia web search engines. In this paper, content-based technology has been applied to classify and retrieve audio clips using a fuzzy logic system, which is intuitive due to the fuzzy nature of human perception of audio, especially audio clips with mixed types. Two features selected from various extracted features are used as input to a constructed fuzzy inference system (FIS). The outputs of the FIS are two types of hierarchical audio classes. The membership functions and rules are derived from the distributions of extracted audio features. Speech and music can thus be discriminated by the FIS. Furthermore, female and male speech can be separated by another FIS, whereas percussion can be distinguished from other music instruments. In addition, we can use multiple FISs to form a “fuzzy tree” for retrieval of more types of audio clips. With this approach, we can classify and retrieve generic audios more accurately, using fewer features and less computation time, compared to other existing approaches.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

Author information

Authors and Affiliations

Authors

Rights and permissions

Reprints and permissions

About this article

Cite this article

Liu, M., Wan, C. & Wang, L. Content-based audio classification and retrieval using a fuzzy logic system: towards multimedia search engines. Soft Computing 6, 357–364 (2002). https://doi.org/10.1007/s00500-002-0189-3

Download citation

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00500-002-0189-3

Navigation