Abstract
Music regional classification, which is an important branch of music automatic classification, aims at classifying folk songs according to different regional style. Chinese folk songs have developed various regional musical styles in the process of its evolution. Regional classification of Chinese folk songs can promote the development of music recommendation systems which recommending proper style of music to users and improve the efficiency of the music retrieval system. However, the accuracy of existing music regional classification systems is not high enough, because most methods do not consider temporal characteristics of music for both features extraction and classification. In this paper, we proposed an approach based on conditional random field (CRF) which can fully take advantage of the temporal characteristics of musical audio features for music regional classification. Considering the continuity, high dimensionality and large size of the audio feature data, we employed two ways to calculate the label sequence of musical audio features in CRF, which are Gaussian Mixture Model (GMM) and Restricted Boltzmann Machine (RBM). The experimental results demonstrated that the proposed method based on CRF-RBM outperforms other existing music regional classifiers with the best accuracy of 84.71% on Chinese folk songs datasets. Besides, when the proposed methods were applied to the Greek folk songs dataset, the CRF-RBM model also performs the best.
Similar content being viewed by others
Notes
A YouTube video of introduction to Chinese folk songs by Linna Gong (in Chinese): https://www.youtube.com/watch?v=HcBqnIHgYdg. The live singing without accompaniment of MoLiHua from southern Jiangsu is the part of the video from 11:53 to 12:19, while MoLiHua from northeastern China is from 12:28 to 13:00.
Musical Folklore Archives Melpo Merlie: http://www.mla.gr/
Thrace and Macedonia: http://epth.sfm.gr/
References
Bassiou N, Kotropoulos C, Papazoglou-Chalikias A (2015) Greek folk music classification into two genres using lyrics and audio via canonical correlation analysis. In: 2015 9th international symposium on image and signal processing and analysis(ISPA), pp 238–243
Byrd RH, Hansen SL, Nocedal J, Singer Y (2014) A stochastic quasi-newton method for large-scale optimization. Siam Journal on Optimization 26(2):1008–1031
Chouzenoux E, Pesquet JC, Repetti A (2014) Variable metric forward–backward algorithm for minimizing the sum of a differentiable function and a convex function. ACM Trans Multimed Comput Commun Appl 162(1):107–132
Conklin D (2013) Multiple viewpoint systems for music classification. Journal of New Music Research 42(1):19–26
Corrêa D C, Rodrigues FA (2016) A survey on symbolic data-based music genre classification. Expert Syst Appl 60:190–210
Du YX (1993) The music dialect area and its divisions of Han folk songs (in Chinese). Chin Music 1:14–16
Fotiadou E, Bassiou N, Kotropoulos C (2016) Greek folk music classification using auditory cortical representations. In: 2016 24th European signal processing conference (EUSIPCO), pp 1133–1137
Fu ZY, Lu GJ, Ting KM, Zhang DS (2011) A survey of audio-based music classification and annotation. IEEE Trans Multimedia 13(1):303–319
Han KH (1989) Folk songs of the Han Chinese: characteristics and classifications. Asian Music 20(2):107–128
Hillewaere R, Manderick B, Conklin D (2009) Global feature versus event models for folk song classification. In: 2009 10th international society for music information retrieval conference, pp 729–734
Hinton GE (2002) Training products of experts by minimizing contrastive divergence. Neural Comput 14(8):1771–1800
Hinton GE (2012) A practical guide to training restricted boltzmann machines. Momentum 9(1):599–619
Hinton GE, Osindero S, Teh YW (2006) A fast learning algorithm for deep belief nets. Neural Comput 18(7):1527–1554
Huang YF, Lin SM, Wu HY, Li YS (2014) Music genre classification based on local feature selection using a self-adaptive harmony search algorithm. Data Knowl Eng 92:60–76
Kawase A (2017) Quantitative analysis of traditional folk songs from Shikoku district. In: 2017 international conference on culture and computing, pp 170–177
Kawase A, Tokosumi A (2010) Regional classification of traditional Japanese folk songs. Kansei Engineering International Journal 10(1):19–27
Kedyte V, Panteli M, Weyde T, Dixon S (2017) Geographical origin prediction of folk music recordings from the United Kingdom. In: 2017 18th international society for music information retrieval conference, pp 23–27
Kereliuk C, Sturm BL, Larsen J (2015) Deep learning and music adversaries. IEEE Trans Multimedia 17(11):2059–2071
Khoo S, Man Z, Cao Z (2012) Automatic Han Chinese folk song classification using the musical feature density map. In: 2012 6th international conference on signal processing and communication systems(ICSPCS), pp 1–9
Khoo S, Man Z, Cao Z, Zheng J (2013) German vs. Austrian folk song classification. In: 2013 8th IEEE conference on industrial electronics and applications(ICIEA), pp 131–136
Lafferty J, Mccallum A, Pereira FC (2001) Conditional random fields: probabilistic models for segmenting and labeling sequence data. In: International conference on machine learning, pp 282–289
Larochelle H, Bengio Y (2008) Classification using discriminative restricted boltzmann machines. In: International conference on machine learning, pp 536–543
Larochelle H, Mandel M, Pascanu R, Bengio Y (2012) Learning algorithms for the classification restricted boltzmann machine. J Mach Learn Res 13(1):643–669
Li J, Ding J, Yang X (2017) The regional style classification of Chinese folk songs based on GMM-CRF model. In: 2017 9th international conference on computer and automation engineering, pp 66–72
Li J, Dong L, Ding J, Yang X (2015) Exploring the general melodic characteristics of XinTianYou folk songs. In: 2015 12th sound and music computing conference, pp 393–399
Li J, Wang Y, Yang X (2016) General characteristics analysis of Chinese folk songs based on layered stabilities detection(LSD) audio segmentation algorithm. In: 2016 42nd international computer music conference(ICMC), pp 16–20
Li J, Wang Y, Yang X (2017) Regional recognition of Chinese folk songs based on LSD audio segmentation algorithm. In: 2017 9th international conference on computer and automation engineering, pp 60–65
Liu Y, Wei L, Liu ZL, Wang P (2008) The feature selection of regional style classification of Chinese folk songs. Acta Electronica Sinica 36(S1):152–156
Liu Y, Xu JP, Wei L, Tian Y (2007) The study of the classification of Chinese folk songs by regional style. In: International conference on semantic computing(ICSC), pp 657–662
Mannepalli K, Sastry PN, Suman M (2015) MFCC-GMM Based accent recognition system for Telugu speech signals. Int J Speech Technol 19(1):87–93
Martel J, Nakashika T, Garcia C, Idrissi K (2013) A combination of hand-crafted and hierarchical high-level learnt feature extraction for music genre classification. In: International conference on artificial neural networks, pp 397–404
Miao J, Qiao JZ (1985) A study of similar color area divisions in Han folk songs(in Chinese). Journal of Central Conservatory of Music 1(1):26–33
Nanni L, Costa YMG, Lucio DR, Silla CN Jr, Brahnam S (2017) Combining visual and acoustic features for audio classification tasks. Pattern Recogn Lett 88:49–56
Panteli M, Benetos E, Dixon S (2016) Learning a feature space for similarity in world music. In: 2016 17th international society for music information retrieval conference, pp 538–544
Rajan R, Murthy HA (2017) Music genre classification by fusion of modified group delay and melodic features. In: 2017 Twenty-third national conference on communications, pp 1–6
Scaringella N, Zoia G, Mlynek D (2006) Automatic genre classification of music content: a survey. IEEE Signal Proc Mag 23(2):133–141
Song H, Sun K, Li B, Liu X (2011) HBS And HFS feature selection methods for Chinese folk music classification. In: IEEE international conference on transportation, mechanical, and electrical engineering, pp 2441–2444
Tzanetakis G, Cook P (2000) Marsyas: a framework for audio analysis. Organised Sound 4(3):169–175
Uzunbas MG, Chen C, Metaxas D (2016) An efficient conditional random field approach for automatic and interactive neuron segmentation. Med Image Anal 27:31–44
Van Der Maaten L, Hinton GE (2012) Visualizing non-metric similarities in multiple maps. Mach Learn 87(1):33–55
Van Der Maaten L (2014) Accelerating t-SNE using tree-based algorithms. J Mach Learn Res 15(1):3221–3245
Wu MJ, Jang JSR (2015) Combining acoustic and multilevel visual features for music genre classification. ACM Trans Multimed Comput Commun Appl 12(1):1–17
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Li, J., Luo, J., Ding, J. et al. Regional classification of Chinese folk songs based on CRF model. Multimed Tools Appl 78, 11563–11584 (2019). https://doi.org/10.1007/s11042-018-6637-6
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-018-6637-6