Abstract
Classification of classical melodic structures by style, composer, genre, period, etc., is a rather complex task. The level of difficulty varies across melodic frameworks. It would be interesting to see how we can impart this ability to a machine. This paper reports an improved pattern matching technique for composer and raga classification using a fuzzy analytical hierarchy process-based approach. The technique makes use of class-specific patterns extracted from a pattern discovery technique known as Structure Induction Algorithm for r superdiagonals and compactness trawler. Further, to represent inexact matches a modified matching technique is proposed to assign weights to the exact matching scores in a probabilistic manner. Subsequently, the weighted scores are fuzzified to quantify the extent of match. Finally, the fuzzy scores are aggregated and classified on the basis of minimum Euclidean distance from an ideal solution in the pattern space. Experiments conducted on datasets containing a wide range of melodies from classical western and classical Indian background show that the proposed technique exhibits consistently better classification success rate compared to the exact n-Gram-based approach and a widely used matching algorithm based on Levenshtein distance.
Similar content being viewed by others
References
Alzahrani SM, Salim N, Abraham A (2012) Understanding plagiarism linguistic patterns, textual features, and detection methods. IEEE Trans Syst Man Cybern Part C (Appl Rev) 42(2):133–149
Bakshi H (2005) 101 Raga-S for the 21st century and beyond: a music lover’s guide to hindustani music. Trafford Publishing, Bloomington
Bimbot F, Pieraccini R, Levin E, Atal B (1995) Variable-length sequence modeling: multigrams. IEEE Signal Proc Lett 2(6):111–113
Cambouropoulos E (1998) Towards a general computational theory of musical structure. PhD thesis, The University of Edinburgh
Cantareira GD, Nonato LG, Paulovich FV (2016) Moshviz: A detail+ overview approach to visualize music elements. IEEE Trans Multimedia 18(11):2238–2246
Chen SF, Rosenfeld R (2000) A survey of smoothing techniques for ME models. IEEE Trans Speech Audio Proc 8(1):37–50
Chen J-H, Chen C-S, Chen Y-S (2003) Fast algorithm for robust template matching with M-estimators. IEEE Trans Signal Proce 51(1):230–243
Chen R, Acs G, Castelluccia C (2012) Differentially private sequential data publication via variable-length n-grams. In: Proceedings of the 2012 ACM conference on computer and communications security, pp 638-649
Collins T (2011) Improved methods for pattern discovery in music, with applications in automated stylistic composition. Doctoral dissertation, The Open University
Collins T, Thurlow J, Laney R,Willis A,Garthwaite P (2010) A comparative evaluation of algorithms for discovering translational patterns in baroque keyboard works. In: Proceedings of the international symposium on music information retrieval, Utrecht, The Netherlands, 9–13 Aug 2010
Collins T, Arzt A, Flossmann S, Widmer G (2013) SIARCT-CFP: improving precision and the discovery of inexact musical patterns in point-set representations. In: Ismir, pp 549–554
Conklin D (2010) Discovery of distinctive patterns in music. Intell Data Anal 14(5):547–554
Conklin D (2015) Chord sequence generation with semiotic patterns. MML 2015:1
Crow D, Smith B (1992) Db_jiabits: Comparing minimal knowledge and knowledge-based approaches to pattern recognition in the domain of user-computer interactions. Ellis Horwood, Amsterdam
Cucerzan S, Brill E (2004) Spelling correction as an iterative process that exploits the collective knowledge of web users. In: EMNLP, vol 4, pp 293–300
Goto, M, et al (2004) Development of the RWC music database. In: Proceedings of the 18th international congress on acoustics (ICA 2004), vol 1, pp 553–556
Grachten M, Arcos J-L, De Mántaras RL (2004) Melodic similarity: looking for a good abstraction level. Representations 2:7
Grachten M, Arcos JL, De Mántaras RL (2005) Melody retrieval using the implication/realization model. MIREX, http://www.music-ir.org/evaluation/mirex-results/article/s/similarity/grachten.pdf
Hillewaere R, Manderick B, Conklin D (2012) String methods for folk tune genre classification. In: 13th ISMIR, vol 2012
Hirsimaki T, Pylkkonen J, Kurimo M (2009) Importance of high-order n-gram models in morph-based speech recognition. IEEE Trans Audio Speech Lang Proc 17(4):724–732
Hsu J-L, Chen AL, Liu C-C (1998) Efficient repeating pattern finding in music databases. In: Proceedings of the seventh international conference on information and knowledge management, pp 281–288
Hussein AS (2015) Arabic document similarity analysis using n-grams and singular value decomposition. In: 2015 IEEE 9th international conference on research challenges in information science (RCIS), pp 445–455
Kaur C, Kumar R (2016) Classification of melodic structures using fuzzified n-gram matching scores. In: 2016 IEEE international conference on fuzzy systems, pp 685–690
Klapuri A (2010) Pattern induction and matching in music signals. In: International symposium on computer music modeling and retrieval, pp 188–204
Levenshtein VI (1966) Binary codes capable of correcting deletions, insertions, and reversals. In: Soviet physics doklady, vol 10, pp 707–710
Li J, Butler-Purry KL, Benner CL, Russell B (2004) Selecting a fuzzy aggregation operator for multicriteria fault location problem. In: IEEE PES power systems conference and exposition, pp 1476–1482
Liao H, Xu Z, Herrera-Viedma E, Herrera F (2017) Hesitant fuzzy linguistic term set and its application in decision making: a state-of-the-art survey. Int J Fuzzy Syst 1–27. https://doi.org/10.1007/s40815-017-0432-9
Liao H, Mi X, Xu Z, Xu J, Herrera F (2018) Intuitionistic fuzzy analytic network process. IEEE Trans Fuzzy Syst. https://doi.org/10.1109/TFUZZ.2017.2788881
Liu W, Liao H (2017) A bibliometric analysis of fuzzy decision research during 1970–2015. Int J Fuzzy Syst 19(1):1–14
LLC C A (2015) The ultimate classical music destination, http://classicalarchives.com
Louboutin C, Meredith D (2016) Using general-purpose compression algorithms for music analysis. J New Music Res 45(1):1–16
Ma X, Wang D, Tejedor J (2016) Similar word model for unfrequent word enhancement in speech recognition. IEEE/ACM Trans Audio Speech Lang Proc 24(10):1819–1830
Marxer R, Purwins H (2016) Unsupervised incremental online learning and prediction of musical audio signals. IEEE/ACM Trans Audio Speech Lang Proc 24(5):863–874
Meredith D (2013) Cosiatec and siateccompress: pattern discovery by geometric compression. In: International society for music information retrieval conference
Meredith D (2015) Music analysis and point-set compression. J New Music Res 44(3):245–270
Meredith D, Lemström K, Wiggins GA (2002) Algorithms for discovering repeated patterns in multidimensional representations of polyphonic music. J New Music Res 31(4):321–345
Mohri M, Pereira F, Riley M (2002) Weighted finite-state transducers in speech recognition. Comput Speech Lang 16(1):69–88
Mongeau M, Sankoff D (1990) Comparison of musical sequences. Comput Hum 24(3):161–175
Montfort M (1985) Ancient traditions-future possibilities: rhythmic training through the traditions of Africa, Bali, and India. Panoramic Press, Phoenix
Müllensiefen D, Frieler K (2004) Optimizing measures of melodic similarity for the exploration of a large folk song database. In: Ismir
Müller M, Konz V, Bogler W, Arifi-Müller V (2011) Saarland music data (SMD). In: Proceedings of the international society for music information retrieval conference (ISMIR): late breaking session
Park AS, Glass JR (2008) Unsupervised pattern discovery in speech. IEEE Trans Audio Speech Lang Proc 16(1):186–197
Pedrycz W, Song M (2011) Analytic hierarchy process (AHP) in group decision making and its optimization with an allocation of information granularity. IEEE Trans Fuzzy Syst 19(3):527–539. https://doi.org/10.1109/TFUZZ.2011.2116029
Picone J, Staples T, Kondo K, Arai N (1999) Kanji-to-Hiragana conversion based on a length-constrained n-gram analysis. IEEE Trans Speech Audio Proc 7(6):685–696
Pingle YP, Bhagwat A (2015) Music therapy and data mining using indian ragas as a supplementary medicine. In: 2015 2nd international conference on computing for sustainable global development (indiacom), pp 347–350
Poliner GE, Ellis DP, Ehmann AF, Gómez E, Streich S, Ong B (2007) Melody transcription from music audio: approaches and evaluation. IEEE Trans Audio Speech Lang Proc 15(4):1247–1256
Pollastri E, Simoncelli G (2001) Classification of melodies by composer with hidden markov models. In: Proceedings of first international conference on web delivering of music, pp 88–95
Ren P, Xu Z, Liao H (2016) Intuitionistic multiplicative analytic hierarchy process in group decision making. Comput Ind Eng 101:513–524
Roark B, Saraclar M, Collins M (2007) Discriminative n-gram language modeling. Comput Speech Lang 21(2):373–392
Rolland P-Y (1999) Discovering patterns in musical sequences. J New Music Res 28(4):334–350
Sahasrabuddhe H, Upadhye R (1992) On the computational model of raga music of India. In: Workshop on AI and music: European conference on AI
Sethy A, Georgiou PG, Ramabhadran B, Narayanan S (2009) An iterative relative entropy minimization-based data selection approach for n-gram model adaptation. IEEE Trans Audio Speech Lang Proc 17(1):13–23
Singha G (2001) Sikh musicology: sri guru granth sahib and hymns of the human spirit. Kanishka Publishers, Delhi
Siu M, Ostendorf M (2000) Variable n-grams and extensions for conversational speech language modeling. IEEE Trans Speech Audio Proc 8(1):63–75
Smith LA, McNab RJ, Witten IH (1998) Sequence-based melodic comparison: a dynamic programming approach. Comput Musicol A Dir Res 11:101–118
Thrasher AR (2016) Qupai in chinese music: melodic models in form and practice. Routledge, London
Wu Y-C, Chen HH (2016) Generation of affective accompaniment in accordance with emotion flow. IEEE/ACM Trans Audio Speech Lang Proc 24(12):2277–2287
Xu H, Ou Z (2016) Scalable discovery of audio fingerprint motifs in broadcast streams with determinantal point process based motif clustering. IEEE/ACM Trans Audio Speech Lang Proc 24(5):978–989
Xu Z, Liao H (2014) Intuitionistic fuzzy analytic hierarchy process. IEEE Trans Fuzzy Syst 22(4):749–761. https://doi.org/10.1109/TFUZZ.2013.2272585
Zonis E (1973) Classical persian music: an introduction. Harvard University Press, Cambridge
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that they have no conflict of interest.
Ethical approval
This article does not contain any studies with human participants performed by any of the authors.
Additional information
Communicated by V. Loia.
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Kaur, C., Kumar, R. A fuzzy hierarchy-based pattern matching technique for melody classification. Soft Comput 23, 7375–7392 (2019). https://doi.org/10.1007/s00500-018-3383-7
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00500-018-3383-7