Skip to main content
Log in

A fuzzy hierarchy-based pattern matching technique for melody classification

  • Methodologies and Application
  • Published:
Soft Computing Aims and scope Submit manuscript

Abstract

Classification of classical melodic structures by style, composer, genre, period, etc., is a rather complex task. The level of difficulty varies across melodic frameworks. It would be interesting to see how we can impart this ability to a machine. This paper reports an improved pattern matching technique for composer and raga classification using a fuzzy analytical hierarchy process-based approach. The technique makes use of class-specific patterns extracted from a pattern discovery technique known as Structure Induction Algorithm for r superdiagonals and compactness trawler. Further, to represent inexact matches a modified matching technique is proposed to assign weights to the exact matching scores in a probabilistic manner. Subsequently, the weighted scores are fuzzified to quantify the extent of match. Finally, the fuzzy scores are aggregated and classified on the basis of minimum Euclidean distance from an ideal solution in the pattern space. Experiments conducted on datasets containing a wide range of melodies from classical western and classical Indian background show that the proposed technique exhibits consistently better classification success rate compared to the exact n-Gram-based approach and a widely used matching algorithm based on Levenshtein distance.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12

Similar content being viewed by others

References

  • Alzahrani SM, Salim N, Abraham A (2012) Understanding plagiarism linguistic patterns, textual features, and detection methods. IEEE Trans Syst Man Cybern Part C (Appl Rev) 42(2):133–149

    Article  Google Scholar 

  • Bakshi H (2005) 101 Raga-S for the 21st century and beyond: a music lover’s guide to hindustani music. Trafford Publishing, Bloomington

    Google Scholar 

  • Bimbot F, Pieraccini R, Levin E, Atal B (1995) Variable-length sequence modeling: multigrams. IEEE Signal Proc Lett 2(6):111–113

    Article  Google Scholar 

  • Cambouropoulos E (1998) Towards a general computational theory of musical structure. PhD thesis, The University of Edinburgh

  • Cantareira GD, Nonato LG, Paulovich FV (2016) Moshviz: A detail+ overview approach to visualize music elements. IEEE Trans Multimedia 18(11):2238–2246

  • Chen SF, Rosenfeld R (2000) A survey of smoothing techniques for ME models. IEEE Trans Speech Audio Proc 8(1):37–50

    Article  Google Scholar 

  • Chen J-H, Chen C-S, Chen Y-S (2003) Fast algorithm for robust template matching with M-estimators. IEEE Trans Signal Proce 51(1):230–243

    Article  MathSciNet  MATH  Google Scholar 

  • Chen R, Acs G, Castelluccia C (2012) Differentially private sequential data publication via variable-length n-grams. In: Proceedings of the 2012 ACM conference on computer and communications security, pp 638-649

  • Collins T (2011) Improved methods for pattern discovery in music, with applications in automated stylistic composition. Doctoral dissertation, The Open University

  • Collins T, Thurlow J, Laney R,Willis A,Garthwaite P (2010) A comparative evaluation of algorithms for discovering translational patterns in baroque keyboard works. In: Proceedings of the international symposium on music information retrieval, Utrecht, The Netherlands, 9–13 Aug 2010

  • Collins T, Arzt A, Flossmann S, Widmer G (2013) SIARCT-CFP: improving precision and the discovery of inexact musical patterns in point-set representations. In: Ismir, pp 549–554

  • Conklin D (2010) Discovery of distinctive patterns in music. Intell Data Anal 14(5):547–554

    Article  Google Scholar 

  • Conklin D (2015) Chord sequence generation with semiotic patterns. MML 2015:1

    Google Scholar 

  • Crow D, Smith B (1992) Db_jiabits: Comparing minimal knowledge and knowledge-based approaches to pattern recognition in the domain of user-computer interactions. Ellis Horwood, Amsterdam

    Google Scholar 

  • Cucerzan S, Brill E (2004) Spelling correction as an iterative process that exploits the collective knowledge of web users. In: EMNLP, vol 4, pp 293–300

  • Goto, M, et al (2004) Development of the RWC music database. In: Proceedings of the 18th international congress on acoustics (ICA 2004), vol 1, pp 553–556

  • Grachten M, Arcos J-L, De Mántaras RL (2004) Melodic similarity: looking for a good abstraction level. Representations 2:7

    Google Scholar 

  • Grachten M, Arcos JL, De Mántaras RL (2005) Melody retrieval using the implication/realization model. MIREX, http://www.music-ir.org/evaluation/mirex-results/article/s/similarity/grachten.pdf

  • Hillewaere R, Manderick B, Conklin D (2012) String methods for folk tune genre classification. In: 13th ISMIR, vol 2012

  • Hirsimaki T, Pylkkonen J, Kurimo M (2009) Importance of high-order n-gram models in morph-based speech recognition. IEEE Trans Audio Speech Lang Proc 17(4):724–732

    Article  Google Scholar 

  • Hsu J-L, Chen AL, Liu C-C (1998) Efficient repeating pattern finding in music databases. In: Proceedings of the seventh international conference on information and knowledge management, pp 281–288

  • Hussein AS (2015) Arabic document similarity analysis using n-grams and singular value decomposition. In: 2015 IEEE 9th international conference on research challenges in information science (RCIS), pp 445–455

  • Kaur C, Kumar R (2016) Classification of melodic structures using fuzzified n-gram matching scores. In: 2016 IEEE international conference on fuzzy systems, pp 685–690

  • Klapuri A (2010) Pattern induction and matching in music signals. In: International symposium on computer music modeling and retrieval, pp 188–204

  • Levenshtein VI (1966) Binary codes capable of correcting deletions, insertions, and reversals. In: Soviet physics doklady, vol 10, pp 707–710

  • Li J, Butler-Purry KL, Benner CL, Russell B (2004) Selecting a fuzzy aggregation operator for multicriteria fault location problem. In: IEEE PES power systems conference and exposition, pp 1476–1482

  • Liao H, Xu Z, Herrera-Viedma E, Herrera F (2017) Hesitant fuzzy linguistic term set and its application in decision making: a state-of-the-art survey. Int J Fuzzy Syst 1–27. https://doi.org/10.1007/s40815-017-0432-9

  • Liao H, Mi X, Xu Z, Xu J, Herrera F (2018) Intuitionistic fuzzy analytic network process. IEEE Trans Fuzzy Syst. https://doi.org/10.1109/TFUZZ.2017.2788881

  • Liu W, Liao H (2017) A bibliometric analysis of fuzzy decision research during 1970–2015. Int J Fuzzy Syst 19(1):1–14

    Article  Google Scholar 

  • LLC C A (2015) The ultimate classical music destination, http://classicalarchives.com

  • Louboutin C, Meredith D (2016) Using general-purpose compression algorithms for music analysis. J New Music Res 45(1):1–16

    Article  Google Scholar 

  • Ma X, Wang D, Tejedor J (2016) Similar word model for unfrequent word enhancement in speech recognition. IEEE/ACM Trans Audio Speech Lang Proc 24(10):1819–1830

    Article  Google Scholar 

  • Marxer R, Purwins H (2016) Unsupervised incremental online learning and prediction of musical audio signals. IEEE/ACM Trans Audio Speech Lang Proc 24(5):863–874

    Article  Google Scholar 

  • Meredith D (2013) Cosiatec and siateccompress: pattern discovery by geometric compression. In: International society for music information retrieval conference

  • Meredith D (2015) Music analysis and point-set compression. J New Music Res 44(3):245–270

    Article  Google Scholar 

  • Meredith D, Lemström K, Wiggins GA (2002) Algorithms for discovering repeated patterns in multidimensional representations of polyphonic music. J New Music Res 31(4):321–345

    Article  Google Scholar 

  • Mohri M, Pereira F, Riley M (2002) Weighted finite-state transducers in speech recognition. Comput Speech Lang 16(1):69–88

    Article  Google Scholar 

  • Mongeau M, Sankoff D (1990) Comparison of musical sequences. Comput Hum 24(3):161–175

    Article  Google Scholar 

  • Montfort M (1985) Ancient traditions-future possibilities: rhythmic training through the traditions of Africa, Bali, and India. Panoramic Press, Phoenix

    Google Scholar 

  • Müllensiefen D, Frieler K (2004) Optimizing measures of melodic similarity for the exploration of a large folk song database. In: Ismir

  • Müller M, Konz V, Bogler W, Arifi-Müller V (2011) Saarland music data (SMD). In: Proceedings of the international society for music information retrieval conference (ISMIR): late breaking session

  • Park AS, Glass JR (2008) Unsupervised pattern discovery in speech. IEEE Trans Audio Speech Lang Proc 16(1):186–197

    Article  Google Scholar 

  • Pedrycz W, Song M (2011) Analytic hierarchy process (AHP) in group decision making and its optimization with an allocation of information granularity. IEEE Trans Fuzzy Syst 19(3):527–539. https://doi.org/10.1109/TFUZZ.2011.2116029

    Article  Google Scholar 

  • Picone J, Staples T, Kondo K, Arai N (1999) Kanji-to-Hiragana conversion based on a length-constrained n-gram analysis. IEEE Trans Speech Audio Proc 7(6):685–696

    Article  Google Scholar 

  • Pingle YP, Bhagwat A (2015) Music therapy and data mining using indian ragas as a supplementary medicine. In: 2015 2nd international conference on computing for sustainable global development (indiacom), pp 347–350

  • Poliner GE, Ellis DP, Ehmann AF, Gómez E, Streich S, Ong B (2007) Melody transcription from music audio: approaches and evaluation. IEEE Trans Audio Speech Lang Proc 15(4):1247–1256

    Article  Google Scholar 

  • Pollastri E, Simoncelli G (2001) Classification of melodies by composer with hidden markov models. In: Proceedings of first international conference on web delivering of music, pp 88–95

  • Ren P, Xu Z, Liao H (2016) Intuitionistic multiplicative analytic hierarchy process in group decision making. Comput Ind Eng 101:513–524

    Article  Google Scholar 

  • Roark B, Saraclar M, Collins M (2007) Discriminative n-gram language modeling. Comput Speech Lang 21(2):373–392

    Article  Google Scholar 

  • Rolland P-Y (1999) Discovering patterns in musical sequences. J New Music Res 28(4):334–350

    Article  MathSciNet  Google Scholar 

  • Sahasrabuddhe H, Upadhye R (1992) On the computational model of raga music of India. In: Workshop on AI and music: European conference on AI

  • Sethy A, Georgiou PG, Ramabhadran B, Narayanan S (2009) An iterative relative entropy minimization-based data selection approach for n-gram model adaptation. IEEE Trans Audio Speech Lang Proc 17(1):13–23

    Article  Google Scholar 

  • Singha G (2001) Sikh musicology: sri guru granth sahib and hymns of the human spirit. Kanishka Publishers, Delhi

    Google Scholar 

  • Siu M, Ostendorf M (2000) Variable n-grams and extensions for conversational speech language modeling. IEEE Trans Speech Audio Proc 8(1):63–75

    Article  Google Scholar 

  • Smith LA, McNab RJ, Witten IH (1998) Sequence-based melodic comparison: a dynamic programming approach. Comput Musicol A Dir Res 11:101–118

  • Thrasher AR (2016) Qupai in chinese music: melodic models in form and practice. Routledge, London

    Book  Google Scholar 

  • Wu Y-C, Chen HH (2016) Generation of affective accompaniment in accordance with emotion flow. IEEE/ACM Trans Audio Speech Lang Proc 24(12):2277–2287

    Article  Google Scholar 

  • Xu H, Ou Z (2016) Scalable discovery of audio fingerprint motifs in broadcast streams with determinantal point process based motif clustering. IEEE/ACM Trans Audio Speech Lang Proc 24(5):978–989

    Article  Google Scholar 

  • Xu Z, Liao H (2014) Intuitionistic fuzzy analytic hierarchy process. IEEE Trans Fuzzy Syst 22(4):749–761. https://doi.org/10.1109/TFUZZ.2013.2272585

    Article  Google Scholar 

  • Zonis E (1973) Classical persian music: an introduction. Harvard University Press, Cambridge

    Book  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Chandanpreet Kaur.

Ethics declarations

Conflict of interest

The authors declare that they have no conflict of interest.

Ethical approval

This article does not contain any studies with human participants performed by any of the authors.

Additional information

Communicated by V. Loia.

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Kaur, C., Kumar, R. A fuzzy hierarchy-based pattern matching technique for melody classification. Soft Comput 23, 7375–7392 (2019). https://doi.org/10.1007/s00500-018-3383-7

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00500-018-3383-7

Keywords

Navigation