Abstract
We propose a method for retrieving songs from song wave data. Both the song wave data and the query wave are transformed into phoneme sequences by labeling the feature of each frame. Applying a search algorithm called Continuous Dynamic Programming (CDP) to these phoneme sequences detects the parts of a song database that are similar to the query song wave. Song retrieval rates reach 78% for four-clause queries searched against the whole database. Differences between queries taken from song wave data and queries taken from speech wave data are also investigated.
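The spotting step described above can be illustrated with a minimal sketch. It assumes that frame-wise labeling has already produced one phoneme symbol per frame, and it uses a plain subsequence edit-distance recurrence with a free start point as a simplified stand-in for CDP. The function name `spot`, the 0/1 local cost, and the toy symbol strings are assumptions made for illustration and do not come from the paper.

```python
def spot(query, stream):
    """Find where `query` best matches inside `stream`.

    Both arguments are sequences of phoneme symbols (hypothetical labels).
    Returns (end_index, cost): the stream position where the best-matching
    stretch ends and its accumulated mismatch cost. This is a simplified
    subsequence alignment, not the exact CDP recurrence of the paper.
    """
    n = len(query)
    # prev[i]: cost of aligning query[:i] before any stream symbol is read,
    # i.e. i query symbols skipped.
    prev = list(range(n + 1))
    best_end, best_cost = -1, float("inf")
    for t, symbol in enumerate(stream):
        # cur[0] = 0 lets a match start at any stream position (spotting).
        cur = [0] * (n + 1)
        for i in range(1, n + 1):
            mismatch = 0 if query[i - 1] == symbol else 1
            cur[i] = min(
                prev[i - 1] + mismatch,  # align query symbol with stream symbol
                prev[i] + 1,             # extra symbol in the stream
                cur[i - 1] + 1,          # query symbol missing from the stream
            )
        # Keep the earliest end position with the lowest total cost.
        if cur[n] < best_cost:
            best_end, best_cost = t, cur[n]
        prev = cur
    return best_end, best_cost


# Toy example: each character stands for one frame-level phoneme label.
print(spot(list("aiueo"), list("kkkaiueokk")))  # → (7, 0)
```

Because the first row of the table is reset to zero at every stream position, the query may begin anywhere in the database sequence, which is what allows similar parts to be detected without segmenting the songs beforehand.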
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Yaguchi, Y., Oka, R. (2005). Song Wave Retrieval Based on Frame-Wise Phoneme Recognition. In: Lee, G.G., Yamada, A., Meng, H., Myaeng, S.H. (eds) Information Retrieval Technology. AIRS 2005. Lecture Notes in Computer Science, vol 3689. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11562382_41
DOI: https://doi.org/10.1007/11562382_41
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-29186-2
Online ISBN: 978-3-540-32001-2
eBook Packages: Computer Science, Computer Science (R0)