Audio Tag Annotation and Retrieval Using Tag Count Information

Lo, Hung-Yi; Lin, Shou-De; Wang, Hsin-Min

doi:10.1007/978-3-642-17832-0_32

Hung-Yi Lo^21,22,
Shou-De Lin²² &
Hsin-Min Wang²¹

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 6523))

Included in the following conference series:

International Conference on Multimedia Modeling

1416 Accesses
1 Citations

Abstract

Audio tags correspond to keywords that people use to describe different aspects of a music clip, such as the genre, mood, and instrumentation. With the explosive growth of digital music available on the Web, automatic audio tagging, which can be used to annotate unknown music or retrieve desirable music, is becoming increasingly important. This can be achieved by training a binary classifier for each tag based on the labeled music data. However, since social tags are usually assigned by people with different levels of musical knowledge, they inevitably contain noisy information. To address the noisy label problem, we propose a novel method that exploits the tag count information. By treating the tag counts as costs, we model the audio tagging problem as a cost-sensitive classification problem. The results of audio tag annotation and retrieval experiments show that the proposed approach outperforms our previous method, which won the MIREX 2009 audio tagging competition.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 99.00; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Foote, J., Cooper, M.: Media segmentation using self-similarity decomposition. In: SPIE (2003)
Google Scholar
Freund, Y., Schapire, R.E.: A decision-theoretic generalization of on-line learning and an application to boosting. Journal of Computer and System Sciences 55(1) (1997)
Google Scholar
Hoffman, M., Blei, D., Cook, P.: Easy as cba: A simple probabilistic model for tagging music. In: ISMIR (2009)
Google Scholar
Lamere, P.: Social tagging and music information retrieval. Journal of New Music Research 37(2), 101–114 (2008)
Article Google Scholar
Lo, H.Y., Wang, J.C., Wang, H.M.: Homogeneous segmentation and classifier ensemble for audio tag annotation and retrieval. In: ICME (2010)
Google Scholar
Mandel, M.I., Ellis, D.P.W.: A web-based game for collecting music metadata. In: ISMIR (2007)
Google Scholar
Ness, S., Theocharis, A., Tzanetakis, L.G.M., Improving, G.: automatic msic tag annotation using stacked generalization of probabilistic svm outputs. In: ACM MM (2009)
Google Scholar
Sun, Y., Kamel, M.S., Wong, A.K.C., Wang, Y.: Cost-sensitive boosting for classification of imbalanced data. Pattern Recognition 40(12), 3358–3378 (2007)
Article MATH Google Scholar
Tingle, D., Kim, Y., Turnbull, D.: Exploring automatic music annotation with “acoustically-objective” tags. In: MIR (2010)
Google Scholar
Turnbull, D., Barrington, L., Torres, D., Lanckriet, G.: Semantic annotation and retrieval of music and sound effects. IEEE Trans. on Audio, Speech, and Language Processing 16, 467–476 (2008)
Article Google Scholar
Zadrozny, B., Langford, J., Abe, N.: Cost-sensitive learning by cost-proportionate example weighting. In: ICDM (2003)
Google Scholar

Download references

Author information

Authors and Affiliations

Institute of Information Science, Academia Sinica, Taipei, Taiwan
Hung-Yi Lo & Hsin-Min Wang
Department of Computer Science and Information Engineering, National Taiwan University, Taipei, Taiwan
Hung-Yi Lo & Shou-De Lin

Authors

Hung-Yi Lo
View author publications
You can also search for this author in PubMed Google Scholar
Shou-De Lin
View author publications
You can also search for this author in PubMed Google Scholar
Hsin-Min Wang
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

National Taiwan Ocean University, Keelung, Taiwan
Kuo-Tien Lee & Jun-Wei Hsieh &
National Chiao Tung University, Hsinchu, Taiwan
Wen-Hsiang Tsai
Academia Sinica, Taipei, Taiwan
Hong-Yuan Mark Liao
Cornell University, Ithaca, NY, USA
Tsuhan Chen
National Kaohsiung First University of Science and Technology, Kaohsiung, Taiwan
Chien-Cheng Tseng

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Lo, HY., Lin, SD., Wang, HM. (2011). Audio Tag Annotation and Retrieval Using Tag Count Information. In: Lee, KT., Tsai, WH., Liao, HY.M., Chen, T., Hsieh, JW., Tseng, CC. (eds) Advances in Multimedia Modeling. MMM 2011. Lecture Notes in Computer Science, vol 6523. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-17832-0_32

Download citation

DOI: https://doi.org/10.1007/978-3-642-17832-0_32
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-17831-3
Online ISBN: 978-3-642-17832-0
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics