
Song Wave Retrieval Based on Frame-Wise Phoneme Recognition

  • Conference paper
Information Retrieval Technology (AIRS 2005)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 3689))


Abstract

We propose a song wave retrieval method. Both the song wave data and the query wave are transformed into phoneme sequences by labeling the feature of each frame with a phoneme. By applying a search algorithm called Continuous Dynamic Programming (CDP) to these phoneme sequences, we detect the parts of a song database that are similar to the query song wave. The song retrieval rate reached 78% for queries of four clauses searched against the whole database. Differences between queries from song wave data and queries from speech wave data are also investigated.
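The spotting step described in the abstract can be illustrated with a minimal sketch. It assumes the frame-wise labeling has already produced one phoneme symbol per frame, and it uses a plain subsequence dynamic-programming recurrence with a 0/1 label mismatch cost. This is a simplification in the spirit of CDP, not the authors' recurrence: the local path shape, the cost, and the names `spot` and `detect` are assumptions made for illustration.

```python
def spot(query, data):
    """Align the whole query to every possible end position in data.

    Returns one (normalized_cost, start, end) tuple per data index `end`,
    where the alignment may start anywhere in data (free start point).
    The 0/1 mismatch cost and the three-way local path are assumptions.
    """
    m, n = len(query), len(data)
    # Row for the first query label: an alignment may begin at any data frame.
    prev_cost = [0 if query[0] == data[j] else 1 for j in range(n)]
    prev_start = list(range(n))
    for i in range(1, m):
        cur_cost = [0] * n
        cur_start = [0] * n
        for j in range(n):
            local = 0 if query[i] == data[j] else 1
            # Query advances while the data frame is reused.
            candidates = [(prev_cost[j], prev_start[j])]
            if j > 0:
                # Both advance (diagonal step).
                candidates.append((prev_cost[j - 1], prev_start[j - 1]))
                # Data advances while the query label is reused.
                candidates.append((cur_cost[j - 1], cur_start[j - 1]))
            best_cost, best_start = min(candidates)
            cur_cost[j] = best_cost + local
            cur_start[j] = best_start
        prev_cost, prev_start = cur_cost, cur_start
    # Normalize by query length so a threshold is comparable across queries.
    return [(prev_cost[j] / m, prev_start[j], j) for j in range(n)]


def detect(results, threshold):
    """Keep the end positions whose normalized cost is within the threshold."""
    return [(start, end) for cost, start, end in results if cost <= threshold]


# Toy example: per-frame labels, with the sung 'a' held for two frames.
query = list("kasa")
data = list("xxkaasaxx")
results = spot(query, data)
print(min(results))          # best-matching section as (cost, start, end)
print(detect(results, 0.0))  # sections that match the query exactly
```

Because the start point is free and a label may be reused across several frames, a query can match a section of the database that is sung more slowly than the query, which is the property that makes this kind of search suitable for spotting.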




Copyright information

© 2005 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Yaguchi, Y., Oka, R. (2005). Song Wave Retrieval Based on Frame-Wise Phoneme Recognition. In: Lee, G.G., Yamada, A., Meng, H., Myaeng, S.H. (eds) Information Retrieval Technology. AIRS 2005. Lecture Notes in Computer Science, vol 3689. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11562382_41


  • DOI: https://doi.org/10.1007/11562382_41

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-29186-2

  • Online ISBN: 978-3-540-32001-2

  • eBook Packages: Computer Science, Computer Science (R0)
