Abstract:
We present an automated method for unsupervised detection of structural boundaries in musical recordings. The proposed method utilizes a compressed representation of feat...Show MoreMetadata
Abstract:
We present an automated method for unsupervised detection of structural boundaries in musical recordings. The proposed method utilizes a compressed representation of features capturing timbre and chroma, in an 1-D time series derived via PCA. Time delay embedding and multi-scale comparison using the Wald-Wolfowitz statistical test are incorporated in order to calculate a Self Dissimilarity Matrix. A novelty curve is estimated by convolving an appropriate kernel along the main diagonal of the matrix, while the structural boundaries are located on the local maxima of the derived curve. We evaluate the proposed method on a popular dataset, using two different ground truth annotations. We demonstrate that the 1-D compressed representation of features contains enough information in order to detect boundaries with high precision, outperforming several methods from the literature.
Date of Conference: 01-03 July 2013
Date Added to IEEE Xplore: 10 October 2013
Electronic ISBN:978-1-4673-5807-1