Skip to main content

Applying Statistical Models and Parametric Distance Measures for Music Similarity Search

  • Conference paper
  • First Online:
Advances in Data Analysis, Data Handling and Business Intelligence

Abstract

Automatic deriving of similarity relations between music pieces is an inherent field of music information retrieval research. Due to the nearly unrestricted amount of musical data, the real-world similarity search algorithms have to be highly efficient and scalable. The possible solution is to represent each music excerpt with a statistical model (ex. Gaussian mixture model) and thus to reduce the computational costs by applying the parametric distance measures between the models. In this paper we discuss the combinations of applying different parametric modelling techniques and distance measures and weigh the benefits of each one against the others.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 129.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  • Anderson, C. (2006). The long tail: Why the future of business is selling less of more. New York: Hyperion.

    Google Scholar 

  • Aucouturier, J.-J., & Pachet, F. (2004). Improving timbre similarity: How high is the sky? Journal of Negative Results in Speech and Audio Sciences, 1(1).

    Google Scholar 

  • Aucouturier, J.-J., Pachet, F., & Sandler, M. (2005, December). The way it sounds: Timbre models for analysis and retrieval of music signals. IEEE Transactions on Multimedia, 7(6).

    Google Scholar 

  • Berenzweig, A., Logan, B., Ellis, D. P. W., & Whitman, B. (2003, October). A large scale evaluation of acoustic and subjective music similarity. In Proceedings of the Fourth IEEE International Conference on Music Information Retrieval (ISMIR 2003). Baltimore, MD, USA.

    Google Scholar 

  • Bogert, B. P., Healy, M. J. R., & Tukey, J. W. (1963). The frequency analysis of time series for echoes: Cepstrum, pseudoautocovariance, cross-cepstrum, and saphe cracking. In M. Rosenblatt (Ed.), Proceedings of the Symposium on Time Series Analysis (pp. 209–243). New York: Wiley.

    Google Scholar 

  • Cohen, W. W., & Fan, W. (2000, May 15–19). Web-collaborative filtering: Recommending music by crawling the web. In Proceedings of the 9th International World Wide Web Conference (WWW9) (pp. 658–698). Amsterdam, The Netherlands.

    Google Scholar 

  • Dempster, N. M., Laird, A. P., & Rubin, D. B. (1977). Maximum likelihood from incomplete data via the EM algorithm. Journal of the Royal Statistical Society Series B, 39, 185–197.

    MathSciNet  Google Scholar 

  • Dittmar, C., Bastuck, C., & Gruhne, M. (2007, December 5–7). Novel mid-level audio features for music similarity. In Proceedings of the International Conference on Music Communication Science (ICoMCS 2007). Sydney, Australia.

    Google Scholar 

  • Dwork, C., Kumar, R., Naor, M., & Sivakumar, D. (2001, May 1–5). Rank aggregation for the web. In Proceedings of the 10th International World Wide Web Conference (WWW10). Hong Kong.

    Google Scholar 

  • Helén, M., & Virtanen, T. (2007). Query by example of audio signals using Euclidean distance between Gaussian mixture models. In Proceedings of IEEE International Conference on Audio, Speech and Signal Processing (ICASSP 2007) (pp. 225–228). Honolulu, USA.

    Google Scholar 

  • Herre, J., Allamanche, J., & Ertel, C. (2003, October 19–22). How similar do songs sound? Towards modeling human perception of musical similarities. In Proceedings of the IEEE Wokshop on Applications of Signal Processing to Audio and Acoustics (WASPAA 2003). Mohonk, NY, USA.

    Google Scholar 

  • Hershey, J. R., & Olsen, P. A. (2007). Approximating the Kullback–Leibler divergence between Gaussian mixture models. In Proceedings of IEEE International Conference on Audio, Speech and Signal Processing (ICASSP 2007) (pp. 317–320). Honolulu, USA.

    Google Scholar 

  • Kim, H.-G., Moreau, N., & Sikora, Th. (2005, October). MPEG-7 audio and beyond: Audio content indexing and retrieval. New York: Wiley.

    Google Scholar 

  • Kullback, S. (1968). Information theory and statistics. Mineola, NY: Dover Publications.

    Google Scholar 

  • Logan, B., & Salomon, A. (2001). A Music similarity function based on signal analysis. In Proceedings of the IEEE International Conference on Multimedia and Expo (ICME 2001). Tokyo, Japan.

    Google Scholar 

  • Moschou, V., Kotti, M., Benetos, E., & Kotropoulos, C. (2007). Systematic comparison of BIC-based speaker segmentation systems. In Proceedings of IEEE 9th Workshop on Multimedia Signal Processing (MMSP 2007). Crete, Greece.

    Google Scholar 

  • Pampalk, E. (2006). Computational models of music similarity and their application in music information retrieval. Ph.D. Thesis, Vienna University of Technology, Vienna, Austria.

    Google Scholar 

  • Rubner, Y., Tomasi, C., & Guibas, L. J. (1998). A metric for distributions with applications to image databases. In Proceedings of the IEEE International Conference on Computer Vision (ICCV98). Bombay, India.

    Google Scholar 

  • Tzanetakis, G., Essl, G., & Cook, P. (2001, October 15–17). Automatic musical genre classification of audio signals. In Proceedings of the 2nd Annual International Symposium on Music Information Retrieval (MUSIC IR 2001). Bloomington, IN, USA.

    Google Scholar 

Download references

Acknowledgements

This work has been partly supported by the PHAROS and the DIVAS projects, funded under the EC IST 6th Framework Program. Furthermore, the work on this publication is supported by grant No. 01QM07017 of the German THESEUS program.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Hanna Lukashevich .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2009 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Lukashevich, H., Dittmar, C., Bastuck, C. (2009). Applying Statistical Models and Parametric Distance Measures for Music Similarity Search. In: Fink, A., Lausen, B., Seidel, W., Ultsch, A. (eds) Advances in Data Analysis, Data Handling and Business Intelligence. Studies in Classification, Data Analysis, and Knowledge Organization. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-01044-6_37

Download citation

Publish with us

Policies and ethics