ABSTRACT
This paper presents an unsupervised method for systematically identifying anomalies in music datasets. The model integrates categorical regression and robust estimation techniques to infer anomalous scores in music clips. When applied to a music genre recognition dataset, the new method is able to detect corrupted, distorted, or mislabeled audio samples based on commonly used features in music information retrieval. The evaluation results show that the algorithm outperforms other anomaly detection methods and is capable of finding problematic samples identified by human experts. The proposed method introduces a preliminary framework for anomaly detection in music data that can serve as a useful tool to improve data integrity in the future.
- C. M. Bishop. Pattern Recognition and Machine Learning (Information Science and Statistics). 2006. Google ScholarDigital Library
- M. M. Breunig, H.-P. Kriegel, R. T. Ng, and J. Sander. Lof: identifying density-based local outliers. SIGMOD Record, 29(2):93--104, May 2000. Google ScholarDigital Library
- V. Chandola, A. Banerjee, and V. Kumar. Anomaly detection: A survey. ACM Computer Survey, 41(3):15:1--15:58, July 2009. Google ScholarDigital Library
- L. Hansen, T. Lehn-Schiøler, K. Petersen, J. Arenas-Garcia, J. Larsen, and S. Jensen. Learning and clean-up in a large scale music database. In European Signal Processing Conference (EUSIPCO), pages 946--950, 2007.Google Scholar
- A. Lerch. An Introduction to Audio Content Analysis: Applications in Signal Processing and Music Informatics. John Wiley and Sons, 2012. Google ScholarCross Ref
- C. Liu. Robit Regression: A Simple Robust Alternative to Logistic and Probit Regression, pages 227--238. John Wiley & Sons, Ltd, 2005.Google Scholar
- S. Ramaswamy, R. Rastogi, and K. Shim. Efficient algorithms for mining outliers from large data sets. SIGMOD Record, 29(2):427--438, May 2000. Google ScholarDigital Library
- M. Schedl, E. Gómez, and J. Urbano. Music Information Retrieval: Recent Developments and Applications, volume 8. 2014. Google ScholarDigital Library
- M. Sordo, O. Celma, M. Blech, and E. Guaus. The Quest for Musical Genres: Do the Experts and the Wisdom of Crowds Agree? In International Symposium on Music Information Retrieval, pages 255--260, 2008.Google Scholar
- B. L. Sturm. An analysis of the GTZAN music genre dataset. In Proceedings of the second international ACM workshop on Music Information Retrieval with user-centered and multimodal strategies, 2012. Google ScholarDigital Library
- B. L. Sturm. The State of the Art Ten Years After a State of the Art: Future Research in Music Information Retrieval. Journal of New Music Research, 2013.Google Scholar
- D. E. Tyler. Robust statistics: Theory and methods. Journal of the American Statistical Association, 103:888--889, 2008.Google ScholarCross Ref
- G. Tzanetakis and P. Cook. Musical genre classification of audio signals. IEEE Transactions on Speech and Audio Processing, 10(5):293--302, 2002.Google ScholarCross Ref
Index Terms
- An Unsupervised Approach to Anomaly Detection in Music Datasets
Recommendations
Computational Analysis of Jazz Music: Estimating Tonality through Chord Progression Distances
CSAE '23: Proceedings of the 7th International Conference on Computer Science and Application EngineeringCurrently, research in music informatics focuses extensively on music theory, particularly on the theoretical systems of Western classical music dating back to the 19th century. However, contemporary popular music genres such as pop, rock, and jazz often ...
Music Key Detection for Musical Audio
MMM '05: Proceedings of the 11th International Multimedia Modelling ConferenceThe key or the scale information of a piece of music provides important clues on its high level musical content, like harmonic and melodic context, which can be useful for music classification, retrieval or further content analysis. Researchers have ...
A Query-by-Singing System for Retrieving Karaoke Music
This paper investigates the problem of retrieving karaoke music using query-by-singing techniques. Unlike regular CD music, where the stereo sound involves two audio channels that usually sound the same, karaoke music encompasses two distinct channels ...
Comments