Abstract
Automatic deriving of similarity relations between music pieces is an inherent field of music information retrieval research. Due to the nearly unrestricted amount of musical data, the real-world similarity search algorithms have to be highly efficient and scalable. The possible solution is to represent each music excerpt with a statistical model (ex. Gaussian mixture model) and thus to reduce the computational costs by applying the parametric distance measures between the models. In this paper we discuss the combinations of applying different parametric modelling techniques and distance measures and weigh the benefits of each one against the others.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Anderson, C. (2006). The long tail: Why the future of business is selling less of more. New York: Hyperion.
Aucouturier, J.-J., & Pachet, F. (2004). Improving timbre similarity: How high is the sky? Journal of Negative Results in Speech and Audio Sciences, 1(1).
Aucouturier, J.-J., Pachet, F., & Sandler, M. (2005, December). The way it sounds: Timbre models for analysis and retrieval of music signals. IEEE Transactions on Multimedia, 7(6).
Berenzweig, A., Logan, B., Ellis, D. P. W., & Whitman, B. (2003, October). A large scale evaluation of acoustic and subjective music similarity. In Proceedings of the Fourth IEEE International Conference on Music Information Retrieval (ISMIR 2003). Baltimore, MD, USA.
Bogert, B. P., Healy, M. J. R., & Tukey, J. W. (1963). The frequency analysis of time series for echoes: Cepstrum, pseudoautocovariance, cross-cepstrum, and saphe cracking. In M. Rosenblatt (Ed.), Proceedings of the Symposium on Time Series Analysis (pp. 209–243). New York: Wiley.
Cohen, W. W., & Fan, W. (2000, May 15–19). Web-collaborative filtering: Recommending music by crawling the web. In Proceedings of the 9th International World Wide Web Conference (WWW9) (pp. 658–698). Amsterdam, The Netherlands.
Dempster, N. M., Laird, A. P., & Rubin, D. B. (1977). Maximum likelihood from incomplete data via the EM algorithm. Journal of the Royal Statistical Society Series B, 39, 185–197.
Dittmar, C., Bastuck, C., & Gruhne, M. (2007, December 5–7). Novel mid-level audio features for music similarity. In Proceedings of the International Conference on Music Communication Science (ICoMCS 2007). Sydney, Australia.
Dwork, C., Kumar, R., Naor, M., & Sivakumar, D. (2001, May 1–5). Rank aggregation for the web. In Proceedings of the 10th International World Wide Web Conference (WWW10). Hong Kong.
Helén, M., & Virtanen, T. (2007). Query by example of audio signals using Euclidean distance between Gaussian mixture models. In Proceedings of IEEE International Conference on Audio, Speech and Signal Processing (ICASSP 2007) (pp. 225–228). Honolulu, USA.
Herre, J., Allamanche, J., & Ertel, C. (2003, October 19–22). How similar do songs sound? Towards modeling human perception of musical similarities. In Proceedings of the IEEE Wokshop on Applications of Signal Processing to Audio and Acoustics (WASPAA 2003). Mohonk, NY, USA.
Hershey, J. R., & Olsen, P. A. (2007). Approximating the Kullback–Leibler divergence between Gaussian mixture models. In Proceedings of IEEE International Conference on Audio, Speech and Signal Processing (ICASSP 2007) (pp. 317–320). Honolulu, USA.
Kim, H.-G., Moreau, N., & Sikora, Th. (2005, October). MPEG-7 audio and beyond: Audio content indexing and retrieval. New York: Wiley.
Kullback, S. (1968). Information theory and statistics. Mineola, NY: Dover Publications.
Logan, B., & Salomon, A. (2001). A Music similarity function based on signal analysis. In Proceedings of the IEEE International Conference on Multimedia and Expo (ICME 2001). Tokyo, Japan.
Moschou, V., Kotti, M., Benetos, E., & Kotropoulos, C. (2007). Systematic comparison of BIC-based speaker segmentation systems. In Proceedings of IEEE 9th Workshop on Multimedia Signal Processing (MMSP 2007). Crete, Greece.
Pampalk, E. (2006). Computational models of music similarity and their application in music information retrieval. Ph.D. Thesis, Vienna University of Technology, Vienna, Austria.
Rubner, Y., Tomasi, C., & Guibas, L. J. (1998). A metric for distributions with applications to image databases. In Proceedings of the IEEE International Conference on Computer Vision (ICCV98). Bombay, India.
Tzanetakis, G., Essl, G., & Cook, P. (2001, October 15–17). Automatic musical genre classification of audio signals. In Proceedings of the 2nd Annual International Symposium on Music Information Retrieval (MUSIC IR 2001). Bloomington, IN, USA.
Acknowledgements
This work has been partly supported by the PHAROS and the DIVAS projects, funded under the EC IST 6th Framework Program. Furthermore, the work on this publication is supported by grant No. 01QM07017 of the German THESEUS program.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2009 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Lukashevich, H., Dittmar, C., Bastuck, C. (2009). Applying Statistical Models and Parametric Distance Measures for Music Similarity Search. In: Fink, A., Lausen, B., Seidel, W., Ultsch, A. (eds) Advances in Data Analysis, Data Handling and Business Intelligence. Studies in Classification, Data Analysis, and Knowledge Organization. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-01044-6_37
Download citation
DOI: https://doi.org/10.1007/978-3-642-01044-6_37
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-01043-9
Online ISBN: 978-3-642-01044-6
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)