Skip to main content
Log in

An overview of audio information retrieval

  • Published:
Multimedia Systems Aims and scope Submit manuscript

Abstract.

The problem of audio information retrieval is familiar to anyone who has returned from vacation to find an answering machine full of messages. While there is not yet an “AltaVista” for the audio data type, many workers are finding ways to automatically locate, index, and browse audio using recent advances in speech recognition and machine listening. This paper reviews the state of the art in audio information retrieval, and presents recent advances in automatic speech recognition, word spotting, speaker and music identification, and audio similarity with a view towards making audio less “opaque”. A special section addresses intelligent interfaces for navigating and browsing audio and multimedia documents, using automatically derived information to go beyond the tape recorder metaphor.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Similar content being viewed by others

Author information

Authors and Affiliations

Authors

Rights and permissions

Reprints and permissions

About this article

Cite this article

Foote, J. An overview of audio information retrieval. Multimedia Systems 7, 2–10 (1999). https://doi.org/10.1007/s005300050106

Download citation

  • Issue Date:

  • DOI: https://doi.org/10.1007/s005300050106

Keywords

Navigation