DOI: 10.1145/1007568.1007677
Article

SoundCompass: a practical query-by-humming system; normalization of scalable and shiftable time-series data and effective subsequence generation

Published: 13 June 2004

Abstract

This paper describes SoundCompass, our practical query-by-humming system, which is in use as a karaoke song selection system in Japan. First, we describe the fundamental techniques SoundCompass employs: time-wise normalization of music data, handling of time-scalable and tone-shiftable time-series data, and generation of subsequences for efficient matching. Second, we describe techniques for building effective feature vectors from real music data and matching against them to achieve accurate query-by-humming. Third, we share knowledge gained through months of practical use of SoundCompass. Fourth, we describe the latest version of the SoundCompass system, which incorporates these new techniques and knowledge, and report quantitative evaluations that demonstrate its practicality. The new system provides flexible and accurate similarity retrieval based on k-nearest neighbor searches over multi-dimensional spatial indices built from multi-dimensional feature vectors.
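The abstract names the building blocks without showing how they fit together, so the following is a minimal sketch of the general idea, not the SoundCompass implementation. It assumes melodies are given as lists of MIDI-style pitch numbers, removes tempo dependence by resampling to a fixed length, removes key dependence by subtracting the mean pitch, cuts each song into overlapping subsequences, and ranks them by Euclidean distance. All function names, parameters, and the brute-force search (standing in for the spatial index the abstract mentions) are assumptions made for illustration.

```python
import math


def resample(seq, n):
    """Linearly resample seq to n points (n >= 2) so tempo no longer matters."""
    if len(seq) == 1:
        return [float(seq[0])] * n
    out = []
    for i in range(n):
        pos = i * (len(seq) - 1) / (n - 1)
        lo = int(math.floor(pos))
        hi = min(lo + 1, len(seq) - 1)
        frac = pos - lo
        out.append(seq[lo] * (1 - frac) + seq[hi] * frac)
    return out


def normalize(seq, n=8):
    """Fixed-length, mean-centred feature vector (time-scaled, tone-shifted)."""
    r = resample(seq, n)
    mean = sum(r) / len(r)
    return [x - mean for x in r]


def subsequences(pitches, width, hop):
    """Overlapping windows of a pitch sequence, paired with their start offsets."""
    return [(start, pitches[start:start + width])
            for start in range(0, len(pitches) - width + 1, hop)]


def knn(query, songs, k=3, width=8, hop=1, n=8):
    """Brute-force k-nearest-neighbour search over every song subsequence."""
    q = normalize(query, n)
    hits = []
    for title, pitches in songs.items():
        for start, window in subsequences(pitches, width, hop):
            hits.append((math.dist(q, normalize(window, n)), title, start))
    return sorted(hits)[:k]


# Hypothetical toy database: two short melodies as MIDI note numbers.
songs = {
    "A": [60, 62, 64, 65, 67, 69, 71, 72, 72, 71, 69, 67],
    "B": [60, 60, 67, 67, 69, 69, 67, 65, 65, 64, 64, 62],
}
# A fragment of "B" (offset 2) hummed five semitones too high.
query = [72, 72, 74, 74, 72, 70, 70, 69]
print(knn(query, songs, k=1))
```

Because the query is mean-centred before comparison, the transposed fragment still matches its source window. A production system would replace the linear scan with a multi-dimensional index, as the abstract describes.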


    Published In

    SIGMOD '04: Proceedings of the 2004 ACM SIGMOD international conference on Management of data
    June 2004
    988 pages
    ISBN:1581138598
    DOI:10.1145/1007568
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Qualifiers

    • Article

    Conference

    SIGMOD/PODS04
    Acceptance Rates

    Overall Acceptance Rate 785 of 4,003 submissions, 20%

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)9
    • Downloads (Last 6 weeks)2
    Reflects downloads up to 14 Feb 2025

    Citations

    Cited By

    • (2024) Research and Development of an AI Music Therapist – Toward Digital Therapeutics of Music Therapy. Information Integration and Web Intelligence, pp. 151-166. DOI: 10.1007/978-3-031-78093-6_13. Online publication date: 1-Dec-2024
    • (2016) A time-series phrase correlation calculation system with acoustic signal processing for music media creation. 2016 International Conference on Knowledge Creation and Intelligent Computing (KCIC), pp. 188-193. DOI: 10.1109/KCIC.2016.7883645. Online publication date: Nov-2016
    • (2012) Community Site for Music Therapists Based on the Session Records of Music Therapy. Proceedings of the 2012 15th International Conference on Network-Based Information Systems, pp. 319-325. DOI: 10.1109/NBiS.2012.128. Online publication date: 26-Sep-2012
    • (2010) A novel approach based on fault tolerance and recursive segmentation to query by humming. Proceedings of the 2010 international conference on Advances in computer science and information technology, pp. 544-557. DOI: 10.5555/1875558.1875611. Online publication date: 23-Jun-2010
    • (2010) A Novel Approach Based on Fault Tolerance and Recursive Segmentation to Query by Humming. Advances in Computer Science and Information Technology, pp. 544-557. DOI: 10.1007/978-3-642-13577-4_49. Online publication date: 2010
    • (2008) Scaling and time warping in time series querying. The VLDB Journal — The International Journal on Very Large Data Bases 17:4, pp. 899-921. DOI: 10.1007/s00778-006-0040-z. Online publication date: 1-Jul-2008
    • (2008) User Specific Training of a Music Search Engine. Machine Learning for Multimodal Interaction, pp. 72-83. DOI: 10.1007/978-3-540-78155-4_7. Online publication date: 2008
    • (2007) User specific training of a music search engine. Proceedings of the 4th international conference on Machine learning for multimodal interaction, pp. 72-83. DOI: 10.5555/1787422.1787432. Online publication date: 28-Jun-2007
    • (2005) Scaling and time warping in time series querying. Proceedings of the 31st international conference on Very large data bases, pp. 649-660. DOI: 10.5555/1083592.1083668. Online publication date: 30-Aug-2005
