Skip to main content

Audio Tag Annotation and Retrieval Using Tag Count Information

  • Conference paper
Advances in Multimedia Modeling (MMM 2011)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 6523))

Included in the following conference series:

Abstract

Audio tags correspond to keywords that people use to describe different aspects of a music clip, such as the genre, mood, and instrumentation. With the explosive growth of digital music available on the Web, automatic audio tagging, which can be used to annotate unknown music or retrieve desirable music, is becoming increasingly important. This can be achieved by training a binary classifier for each tag based on the labeled music data. However, since social tags are usually assigned by people with different levels of musical knowledge, they inevitably contain noisy information. To address the noisy label problem, we propose a novel method that exploits the tag count information. By treating the tag counts as costs, we model the audio tagging problem as a cost-sensitive classification problem. The results of audio tag annotation and retrieval experiments show that the proposed approach outperforms our previous method, which won the MIREX 2009 audio tagging competition.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 99.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Foote, J., Cooper, M.: Media segmentation using self-similarity decomposition. In: SPIE (2003)

    Google Scholar 

  2. Freund, Y., Schapire, R.E.: A decision-theoretic generalization of on-line learning and an application to boosting. Journal of Computer and System Sciences 55(1) (1997)

    Google Scholar 

  3. Hoffman, M., Blei, D., Cook, P.: Easy as cba: A simple probabilistic model for tagging music. In: ISMIR (2009)

    Google Scholar 

  4. Lamere, P.: Social tagging and music information retrieval. Journal of New Music Research 37(2), 101–114 (2008)

    Article  Google Scholar 

  5. Lo, H.Y., Wang, J.C., Wang, H.M.: Homogeneous segmentation and classifier ensemble for audio tag annotation and retrieval. In: ICME (2010)

    Google Scholar 

  6. Mandel, M.I., Ellis, D.P.W.: A web-based game for collecting music metadata. In: ISMIR (2007)

    Google Scholar 

  7. Ness, S., Theocharis, A., Tzanetakis, L.G.M., Improving, G.: automatic msic tag annotation using stacked generalization of probabilistic svm outputs. In: ACM MM (2009)

    Google Scholar 

  8. Sun, Y., Kamel, M.S., Wong, A.K.C., Wang, Y.: Cost-sensitive boosting for classification of imbalanced data. Pattern Recognition 40(12), 3358–3378 (2007)

    Article  MATH  Google Scholar 

  9. Tingle, D., Kim, Y., Turnbull, D.: Exploring automatic music annotation with “acoustically-objective” tags. In: MIR (2010)

    Google Scholar 

  10. Turnbull, D., Barrington, L., Torres, D., Lanckriet, G.: Semantic annotation and retrieval of music and sound effects. IEEE Trans. on Audio, Speech, and Language Processing 16, 467–476 (2008)

    Article  Google Scholar 

  11. Zadrozny, B., Langford, J., Abe, N.: Cost-sensitive learning by cost-proportionate example weighting. In: ICDM (2003)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2011 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Lo, HY., Lin, SD., Wang, HM. (2011). Audio Tag Annotation and Retrieval Using Tag Count Information. In: Lee, KT., Tsai, WH., Liao, HY.M., Chen, T., Hsieh, JW., Tseng, CC. (eds) Advances in Multimedia Modeling. MMM 2011. Lecture Notes in Computer Science, vol 6523. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-17832-0_32

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-17832-0_32

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-17831-3

  • Online ISBN: 978-3-642-17832-0

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics