Concept framework for audio information retrieval: ARF

Li, GuoHui; Wu, DeFeng; Zhang, Jun

doi:10.1007/BF02947127

Published: September 2003

Journal of Computer Science and Technology, Volume 18, pages 667–673 (2003)

Abstract

Most research on content-based retrieval has focused on visual media. However, audio is also an important medium and information carrier from the viewpoint of human auditory perception, so retrieval methods for audio collections are needed. Conventional methods handle audio as an opaque stream, which is not suitable for retrieving information by content. In fact, audio carries rich aural information in the form of speech, music, and sound effects, so it can be retrieved based on its aural content, such as acoustic features, musical melodies, and associated semantics. This paper proposes a concept framework (ARF) for content-based audio retrieval from a systematic perspective; it describes an audio content model, an audio retrieval architecture, and audio query schemes. Audio content is represented by a hierarchical model and a set of formal descriptions spanning the physical, acoustic, and semantic levels, which depict the acoustic features, logical structure, and semantics of audio and audio objects. The architecture, consisting of an audio meta-database and populating and accessing modules, presents a system-structure view of audio information retrieval. The query schemes give generalized approaches and modes by which users convey their audio information needs to audio collections. Finally, an implemented audio retrieval example is used to explain and specify how the components of the proposed ARF are applied.
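
The three parts named in the abstract (a layered content model, a meta-database with populating and accessing modules, and query schemes) can be pictured with a minimal Python sketch. This is only an illustration under stated assumptions and not the authors' implementation: the class and function names are hypothetical, and RMS energy and zero-crossing rate are simple stand-ins for whatever acoustic features ARF actually defines.

```python
import math
from dataclasses import dataclass, field


@dataclass
class AudioObject:
    """One entry in a toy audio meta-database (all field names are hypothetical)."""
    name: str
    samples: list                                  # physical level: raw signal
    features: dict = field(default_factory=dict)   # acoustic level
    labels: set = field(default_factory=set)       # semantic level


def acoustic_features(samples):
    """Two stand-in features (assumed, not from the paper): RMS and zero-crossing rate."""
    rms = math.sqrt(sum(s * s for s in samples) / len(samples))
    crossings = sum(1 for a, b in zip(samples, samples[1:]) if (a < 0) != (b < 0))
    return {"rms": rms, "zcr": crossings / (len(samples) - 1)}


def populate(db, name, samples, labels):
    """Populating module: derive the acoustic level and store the object."""
    db.append(AudioObject(name, samples, acoustic_features(samples), set(labels)))


def query_by_example(db, samples):
    """Accessing module: rank stored objects by feature distance to an example."""
    q = acoustic_features(samples)

    def dist(obj):
        return math.dist([q["rms"], q["zcr"]],
                         [obj.features["rms"], obj.features["zcr"]])

    return [obj.name for obj in sorted(db, key=dist)]


def query_by_keyword(db, keyword):
    """Accessing module: match a keyword against semantic-level labels."""
    return [obj.name for obj in db if keyword in obj.labels]


def tone(cycles, n=100):
    """Synthetic sine wave with the given number of cycles over n samples."""
    return [math.sin(2 * math.pi * cycles * t / n) for t in range(n)]


db = []
populate(db, "low_tone", tone(2), {"music"})
populate(db, "high_tone", tone(20), {"sound effect"})
```

Here `query_by_example(db, tone(18))` ranks `high_tone` ahead of `low_tone` because their zero-crossing rates are closer, while `query_by_keyword(db, "music")` searches only the semantic level. The two functions correspond loosely to the distinction the abstract draws between acoustic and semantic access to audio content.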

Author information

Authors and Affiliations

Department of Management Science and Engineering, National University of Defense Technology, 410073, Changsha, P.R. China
Li GuoHui, Wu DeFeng & Zhang Jun

Corresponding author

Correspondence to Li GuoHui.

Additional information

This research was sponsored by the National Natural Science Foundation of China (NSFC) under Grant No. 60273066.

About this article

Cite this article

Li, G., Wu, D. & Zhang, J. Concept framework for audio information retrieval: ARF. J. Comput. Sci. & Technol. 18, 667–673 (2003). https://doi.org/10.1007/BF02947127

Received: 11 February 2001
Revised: 29 November 2002
Issue Date: September 2003
DOI: https://doi.org/10.1007/BF02947127
