Abstract
A method is proposed for classifying music genre for audio retrieval systems using time-delay neural networks. The proposed classification method considers eight types of music genre: Blues, Country, Hard Core, Hard Rock, Jazz, R&B(Soul), Techno, and Trash Metal. The melody between bars in the music is used to distinguish the different genres. The melody pattern is extracted based on the sound of a snare drum, which is used to effectively represent the rhythm periodicity. Classification is based on a time-delay neural network that uses a Fourier transformed vector of the melody as an input pattern. This classification method was used to analyze 80 training data from ten different musical pieces for each genre and a further 40 test data from five additional musical pieces for each genre. The accuracy of the genre classifications that were obtained for the two sets of data was 92.5% and 60%, respectively.
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Wold, E., Blum, T., Keislar, D., Wheaton, J.: Content-based Classification, Search, and Retrieval of Audio. IEEE Multimedia 3(3), 27–36 (1996)
Kosugi, N., Nishihara, Y., Kon’ya, S., Yamamuro, M., Kushima, K.: Music by Humming Using Similarity Retrieval over High Dimensional Feature Vector Space. In: IEEE Pacific Rim Conference on Communications, Computer and Signal Processing, pp. 404–407 (1999)
Blackburn, S., DeRoure, D.: A Tool for Content-based Navigation of Music. Proceeding of the ACM Multimedia, 361–368 (1998)
Liu, C.C., Hsu, J.L., Chen, A.L.P.: An Approximate String Matching Algorithm for Content-based Music Data Retrieval. In: IEEE International Conference on Multimedia Computing and Systems, vol. 1, pp. 451–456 (1999)
Tzanetakis, G., Cook, P.: Musical Genre Classification of Audio Signals. IEEE Transactions on Speech and Audio Processing 10(5), 293–302 (2002)
Pachet, F., Roy, P., Cazaly, D.: A Combinatorial Approach to Content-based Music Selection. In: IEEE International Conference on Multimedia Computing and Systems, vol. 1, pp. 457–462 (1999)
Waible, A., Hanazawa, T., Hinton, G., Shikano, K., Lang, K.J.: Phoneme Recognition Using Time-delay Neural Networks. IEEE Transactions on Acoustics, Speech, and Signal Processing 37, 328–339 (1989)
Logan, B.: Mel Frequency Cepstral Coefficients for Music Modeling. In: International Symposium on Music Information Retrieval (2000)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Lee, JW., Park, SB., Kim, SK. (2006). Music Genre Classification Using a Time-Delay Neural Network. In: Wang, J., Yi, Z., Zurada, J.M., Lu, BL., Yin, H. (eds) Advances in Neural Networks - ISNN 2006. ISNN 2006. Lecture Notes in Computer Science, vol 3972. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11760023_27
Download citation
DOI: https://doi.org/10.1007/11760023_27
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-34437-7
Online ISBN: 978-3-540-34438-4
eBook Packages: Computer ScienceComputer Science (R0)