Abstract
The aim of this paper was to investigate the problem of music data processing and mining in large databases. Tests were performed on a large database that included approximately 30000 audio files divided into 11 classes corresponding to music genres with different cardinalities. Every audio file was described by a 173-element feature vector. To reduce the dimensionality of data the Principal Component Analysis (PCA) with variable value of factors was employed. The tests were conducted in the WEKA application with the use of k-Nearest Neighbors (kNN), Bayesian Network (Net) and Sequential Minimal Optimization (SMO) algorithms. All results were analyzed in terms of the recognition rate and computation time efficiency.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Abdi, H., Williams, L.J.: Principal Component Analysis. Wiley Interdisciplinary Reviews: Computational Statistics 2 (2010)
Chang-Hsing, L., Hwai-San, L., Jau-Ling, S., Kun-Ming, Y.: Automatic Music Genre Classification Based on Modulation Spectral Analysis of Spectral and Cepstral Features. IEEE Transaction on Multimedia 11(4) (2009)
Cichowski, J., Czyżewski, A., Kostek, B.: Analysis of impact of audio modifications on the robustness of watermark for non-blind architecture. Multimedia Tools and Applications (2012), doi:10.1007/s11042-013-1636-0
Haque, M.A., Kim, J.-M.: An analysis of content-based classification of audio signals using a fuzzy c-means algorithm. Multimedia Tools and Applications 63(1), 77–92 (2013)
Kiktova-Vozarikova, E., Juhar, J., Cizmar, A.: Feature selection for acoustic events detection. Multimedia Tools and Applications (2013)
Kostek, B.: Music Information Retrieval in Music Repositories. In: Skowron, A., Suraj, Z. (eds.) Rough Sets and Intelligent Systems. ISRL, vol. 42, pp. 463–489. Springer, Heidelberg (2013)
Kostek, B.: Perception-Based Data Processing in Acoustics, Applications to Music Information Retrieval and Psychophysiology of Hearing. SCI, vol. 3. Springer, Heidelberg (2005)
Kostek, B.: Soft Computing in Acoustics, Applications of Neural Networks, Fuzzy Logic and Rough Sets to Musical Acoustics. STUDFUZZ. Physica Verlag, Heidelberg (1999)
Kostek, B., Czyzewski, A.: Representing Musical Instrument Sounds for their Automatic Classification. J. Audio Eng. Soc. 49, 768–785 (2001)
Kostek, B., Hofmann, P., Kaczmarek, A., Spaleniak, P.: Creating a Reliable Music Discovery and Recommendation System. In: Intelligent Tools for Building a Scientific Information Platform: From Research to Implementation. Springer Verlag (2013)
Kostek, B., Kupryjanow, A., Zwan, P., Jiang, W., Raś, Z.W., Wojnarski, M., Swietlicka, J.: Report of the ISMIS 2011 Contest: Music Information Retrieval. In: Kryszkiewicz, M., Rybinski, H., Skowron, A., Raś, Z.W. (eds.) ISMIS 2011. LNCS, vol. 6804, pp. 715–724. Springer, Heidelberg (2011)
Kotropoulos, C., Benetos, E., Panagakis, E.: Music genre classification: A multilinear approach. In: ISMIR (2008)
Kupryjanow, A., Czyzewski, A.: Real-Time Speech Signal Segmentation Methods. J. Audio Eng. Soc. 61(7/8), 521–534 (2013)
Łopatka, K., Czyżewski, A.: Automatic Regular Voice, Raised Voice, and Scream Recognition Employing Fuzzy Logic. In: 132nd Audio Eng. Soc. Convention, preprint 8636, Budapest, Hungary, April 26-29 (2012)
Mei-Lan, C.: Automatic Music Genre Classification, Graduate Institute of Biomedical Electronics and Bioinformatics, National Taiwan University, Taipei, Taiwan
Mlynek, D., Zoia, G., Scaringella, N.: Automatic Genre Classification of Music Content. IEEE Signal Processing Magazine (2006)
MPEG 7 standard, http://mpeg.chiariglione.org/standards/mpeg-7
Ntalampiras, S., Potamitis, I., Fakotakis, N.: Acoustic Detection of Human Activities in Natural Environments. J. Audio Eng. Soc. 60(9), 686–695 (2012)
Qi, L., Tao, L., Mitsunori, O.: A Comparative Study on Content-Based Music Genre Classification. In: Proc. SIGIR (2003)
Pertusa, A., Rauber, A., Inesta, J., Lidy, T.: Combining audio and symbolic descriptors for music classification from audio. Music Information Retrieval Information Exchange, MIREX (2007)
Ragni, A., Gautier, C., Gales, M.J.F.: Support Vector Machines for Noise Robust ASR, Cambridge University Engineering Department Trumpington St., Cambridge, U.K IEEE ASRU (2009)
Remco, R.: BayesianNet algorithm description, University of Waikato (2007), http://www.cs.waikato.ac.nz/~remco/weka_bn/
Schnitzer, D., Flexer, A., Widmer, G.: A fast audio similarity retrieval method for millions of music tracks. Multimedia Tools and Applications (2012)
Shlens, J.: A Tutorial on Principal Component Analysis, Version 2 (December 10, 2005)
Tzanetakis, G., Cook, P.: Musical Genre Classification of Audio Signals. IEEE Transactions on Speech and Audio Processing 10(5) (July 2002)
Tzacheva, A.A., Bell, K.J.: Music Information Retrieval with Temporal Features and Timbre. In: An, A., Lingras, P., Petty, S., Huang, R. (eds.) AMT 2010. LNCS, vol. 6335, pp. 212–219. Springer, Heidelberg (2010)
Weka Mining Software, http://www.cs.waikato.ac.nz/ml/weka/
Wold, S., Esbensen, K., Geladi, P.: Principal component analysis. Chemometrics and Intelligent Laboratory Systems
Xu, B., Liang, J., Wang, L., Huang, S., Wang, S.: Music Genre Classification Based on Multiple Classifier Fusion, Chinese Academy of Sciences, Beijing, China (2008)
Yao-Chang, H., Shyh-Kang, J.: An Audio Recommendation System Based on Audio Signature Description Scheme in MPEG-7 Audio. In: 2004 IEEE International Conference on Multimedia and Expo, ICME (2004)
Zheng, J., Oussalah, M.: Automatic System for Music Genre Classification University of Birmingham, Electronics, Electrical and Computer Engineering (2006)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer International Publishing Switzerland
About this paper
Cite this paper
Hoffmann, P., Kostek, B. (2014). Music Data Processing and Mining in Large Databases for Active Media. In: Ślȩzak, D., Schaefer, G., Vuong, S.T., Kim, YS. (eds) Active Media Technology. AMT 2014. Lecture Notes in Computer Science, vol 8610. Springer, Cham. https://doi.org/10.1007/978-3-319-09912-5_8
Download citation
DOI: https://doi.org/10.1007/978-3-319-09912-5_8
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-09911-8
Online ISBN: 978-3-319-09912-5
eBook Packages: Computer ScienceComputer Science (R0)