Semantic Region Detection in Acoustic Music Signals

Maddage, Namunu Chinthaka; Xu, Changsheng; Shenoy, Arun; Wang, Ye

doi:10.1007/978-3-540-30542-2_108

Part of the book series: Lecture Notes in Computer Science (LNCS, volume 3332)

Included in the following conference series:

Pacific-Rim Conference on Multimedia

Abstract

We propose a novel approach to detecting semantic regions (pure vocals, pure instrumental, and instrumental mixed vocals) in acoustic music signals. The signal is first segmented at the beat level using our proposed rhythm tracking algorithm. For each segment, cepstral coefficients are then extracted on the octave scale to characterize the music content. Finally, a hierarchical classification method detects the semantic regions. Unlike previous methods, our approach draws on music knowledge in both segmenting the signal and detecting its semantic regions. Experimental results show that semantic region detection achieves over 80% accuracy.
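
The three stages named in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. Everything specific in it is an assumption made for illustration: beat times are supplied by the caller instead of coming from the rhythm tracker, the octave bands start at a hypothetical 64 Hz and double up to the Nyquist frequency, the cepstral coefficients are a DCT-II of log band energies, and the two-stage decision order (vocals first, then instruments) with its predicate arguments stands in for trained classifiers that the abstract does not describe.

```python
import numpy as np

def split_at_beats(signal, sr, beat_times):
    """Cut a mono signal into segments bounded by consecutive beat times (seconds)."""
    idx = [int(round(t * sr)) for t in beat_times]
    return [signal[a:b] for a, b in zip(idx[:-1], idx[1:]) if b > a]

def octave_band_edges(f_min, f_max):
    """Band edges that double in frequency: f_min, 2*f_min, ..., capped at f_max."""
    edges = [f_min]
    while edges[-1] * 2 < f_max:
        edges.append(edges[-1] * 2)
    edges.append(f_max)
    return edges

def octave_cepstrum(segment, sr, n_coeffs=5, f_min=64.0):
    """Log octave-band energies followed by a DCT-II (an assumed feature layout)."""
    windowed = segment * np.hanning(len(segment))
    power = np.abs(np.fft.rfft(windowed)) ** 2
    freqs = np.fft.rfftfreq(len(segment), d=1.0 / sr)
    edges = octave_band_edges(f_min, sr / 2.0)
    energies = np.array([
        power[(freqs >= lo) & (freqs < hi)].sum()
        for lo, hi in zip(edges[:-1], edges[1:])
    ])
    log_e = np.log(energies + 1e-12)  # small offset avoids log(0)
    n = len(log_e)
    k = np.arange(n_coeffs)[:, None]
    m = np.arange(n)[None, :]
    basis = np.cos(np.pi * k * (2 * m + 1) / (2 * n))  # unnormalized DCT-II basis
    return basis @ log_e

def classify_hierarchically(features, has_vocals, has_instruments):
    """Two-stage decision; both predicates are hypothetical stand-ins for trained classifiers."""
    if not has_vocals(features):
        return "pure instrumental"
    return "instrumental mixed vocals" if has_instruments(features) else "pure vocals"
```

In use, each beat-length segment from `split_at_beats` would be passed to `octave_cepstrum`, and the resulting feature vector to `classify_hierarchically` together with two classifiers trained on labeled segments.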

Author information

Authors and Affiliations

Institute for Infocomm Research, 21 Heng Mui Keng Terrace, Singapore, 119613
Namunu Chinthaka Maddage & Changsheng Xu
School of Computing, National University of Singapore, Singapore, 117543
Namunu Chinthaka Maddage, Arun Shenoy & Ye Wang

Editor information

Editors and Affiliations

Department of Information and Communication Engineering, The University of Tokyo, 7-3-1 Hongo, Bunkyo-ku, 113-8656, Tokyo, Japan
Kiyoharu Aizawa
Tokyo Research Laboratory, IBM Research, 1623-14 Shimo-tsuruma, 242-0001, Yamato, Kanagawa, Japan
Yuichi Nakamura
National Institute of Informatics, Tokyo, Japan
Shin’ichi Satoh

Cite this paper

Maddage, N.C., Xu, C., Shenoy, A., Wang, Y. (2004). Semantic Region Detection in Acoustic Music Signals. In: Aizawa, K., Nakamura, Y., Satoh, S. (eds) Advances in Multimedia Information Processing - PCM 2004. PCM 2004. Lecture Notes in Computer Science, vol 3332. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-30542-2_108

DOI: https://doi.org/10.1007/978-3-540-30542-2_108
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-23977-2
Online ISBN: 978-3-540-30542-2
eBook Packages: Computer Science (R0)
