Applying Statistical Models and Parametric Distance Measures for Music Similarity Search

Lukashevich, Hanna; Dittmar, Christian; Bastuck, Christoph

doi:10.1007/978-3-642-01044-6_37

Part of the book series: Studies in Classification, Data Analysis, and Knowledge Organization (STUDIES CLASS)

Abstract

Automatically deriving similarity relations between pieces of music is an inherent part of music information retrieval research. Because the amount of available musical data is nearly unrestricted, real-world similarity search algorithms have to be highly efficient and scalable. One possible solution is to represent each music excerpt with a statistical model (e.g., a Gaussian mixture model) and thus reduce the computational cost by applying parametric distance measures between the models. In this paper we discuss combinations of different parametric modelling techniques and distance measures and weigh the benefits of each against the others.
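The idea of comparing model parameters instead of raw feature frames can be illustrated with a minimal sketch. It is not the authors' method: it assumes the simplest possible statistical model, a single Gaussian with diagonal covariance per excerpt, and uses the symmetrised Kullback-Leibler divergence, which has a closed form in that case (for full Gaussian mixture models no closed form exists and approximations are needed). The function names and the toy feature vectors are hypothetical.

```python
import math
from statistics import fmean, pvariance


def fit_diag_gaussian(frames):
    """Fit one diagonal-covariance Gaussian to a list of feature vectors.

    Returns (means, variances), one value per feature dimension.
    """
    dims = list(zip(*frames))  # transpose: one tuple per dimension
    means = [fmean(d) for d in dims]
    # Floor the variance so the divergence below stays finite.
    variances = [max(pvariance(d, mu), 1e-9) for d, mu in zip(dims, means)]
    return means, variances


def kl_diag(p, q):
    """Closed-form KL(p || q) for two diagonal-covariance Gaussians."""
    mu_p, var_p = p
    mu_q, var_q = q
    return 0.5 * sum(
        math.log(vq / vp) + (vp + (mp - mq) ** 2) / vq - 1.0
        for mp, vp, mq, vq in zip(mu_p, var_p, mu_q, var_q)
    )


def symmetric_kl(p, q):
    """Symmetrised KL divergence, usable as a dissimilarity score."""
    return kl_diag(p, q) + kl_diag(q, p)


# Toy "excerpts": each is a short list of 2-dimensional feature frames.
excerpt_a = [[0.0, 1.0], [2.0, 3.0]]
excerpt_b = [[1.0, 1.0], [3.0, 3.0]]

model_a = fit_diag_gaussian(excerpt_a)
model_b = fit_diag_gaussian(excerpt_b)
distance = symmetric_kl(model_a, model_b)
```

Once each excerpt is reduced to its model parameters, the cost of one comparison depends only on the feature dimensionality and not on the excerpt length, which is the source of the efficiency gain the abstract refers to.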

Acknowledgements

This work has been partly supported by the PHAROS and the DIVAS projects, funded under the EC IST 6th Framework Program. Furthermore, the work on this publication is supported by grant No. 01QM07017 of the German THESEUS program.

Author information

Authors and Affiliations

Fraunhofer IDMT, Ehrenbergstr. 31, 98693, Ilmenau, Germany
Hanna Lukashevich

Corresponding author

Correspondence to Hanna Lukashevich.

Editor information

Editors and Affiliations

Universität der Bundeswehr, Fak. Wirtschafts-/Sozialwissenschaften, Helmut-Schmidt-Universität, Holstenhofweg 85, Hamburg, 22043, Germany
Andreas Fink
Dept. Mathematical Sciences, University of Essex, Wivenhoe Park, Colchester, CO4 3SQ, United Kingdom
Berthold Lausen
Universität der Bundeswehr, Fak. Wirtschafts-/Sozialwissenschaften, Helmut-Schmidt-Universität, Holstenhofweg 85, Hamburg, 22043, Germany
Wilfried Seidel
FB 12 Mathematik und Informatik, Datenbionik AG, Universität Marburg, Hans-Meerwein-Straße, Marburg, 35032, Germany
Alfred Ultsch

About this paper

Cite this paper

Lukashevich, H., Dittmar, C., Bastuck, C. (2009). Applying Statistical Models and Parametric Distance Measures for Music Similarity Search. In: Fink, A., Lausen, B., Seidel, W., Ultsch, A. (eds) Advances in Data Analysis, Data Handling and Business Intelligence. Studies in Classification, Data Analysis, and Knowledge Organization. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-01044-6_37

DOI: https://doi.org/10.1007/978-3-642-01044-6_37
Published: 31 July 2009
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-01043-9
Online ISBN: 978-3-642-01044-6
eBook Packages: Mathematics and Statistics (R0)