Abstract
The production of traditional music is too complicated, consuming a lot of financial and human resources. Therefore, this paper aims to use artificial intelligence (AI) for songwriting and to explore the development and application of the Generative Adversarial Network (GAN) in smart music. An improved GAN-based Multi-Track Music (MTM)-GAN is established. The model is validated with the generation of 5 different music tracks for bass, drums, guitar, piano, and strings. The verification results are compared with the music generated by the existing Multi-Track Sequential GAN (MuseGAN) index evaluation method. The results show that many music clips generated by the MTM-GAN model are smooth and have a certain artistic aesthetic effect. Through the comparison of the two convergence curves of MuseGAN and MTM-GAN, when the penalty term is increased, the MTM-GAN of Consistency Term (CT) converges faster, and the training process is more stable. The numerical space of the parameter distribution obtained by the MTM-GAN-based music segment test is significantly smaller than that of MuseGAN. The probability of MTM-GAN overfitting is small. 62.8% of music listeners cannot distinguish the generated melody from the real melody. Therefore, the proposed model has the advantages of a more stable, more realistic, and faster fitting speed in music generation, indicating that the music generation method is effective.















Similar content being viewed by others
Data availability
The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.
References
Al-Janabi S, Alkaim A, Al-Janabi E et al (2021) Intelligent forecaster of concentrations (PM2. 5, PM10, NO2, CO, O3, SO2) caused air pollution (IFCsAP). Neural Comput Appl 33(21):14199–14229
Al-Janabi S, Alkaim AF, Adel Z (2020) An innovative synthesis of deep learning techniques (DcapsNet & DCOM) for generation electrical renewable energy from wind energy. Soft Comput 24(14):10943–10962
Al-Janabi S, Alkaim AF (2020) A nifty collaborative analysis to predicting a novel tool (DRFLLS) for missing values estimation. Soft Comput 24(1):555–569
Al-Janabi S, Mohammad M, Al-Sultan A (2020) A new method for prediction of air pollution based on intelligent computation. Soft Comput 24(1):661–680
Al-Janabi S, Alwan E. (2017) Soft Mathematical System To Solve Black Box Problem Through Development the FARB Based On Hyperbolic And Polynomial Functions[C]//2017 10th International Conference On Developments In Esystems Engineering (DeSE). IEEE pp 37–42.
Alkaim AF, Al_Janabi S. (2019) Multi Objectives Optimization To Gas Flaring Reduction From Oil Production[C]//International Conference On Big Data and Networks Technologies. Springer Cham pp 117–139.
Al-Janabi S, Al-Shourbaji I, Shojafar M, et al. (2017) Mobile Cloud Computing: Challenges And Future Research Directions[C]//2017 10th International Conference On Developments In Esystems Engineering (DeSE). IEEE pp 62–67.
SH Ali 2013Ali SH. (2013) Novel Approach For Generating The Key Of Stream Cipher System Using Random Forest Data Mining Algorithm[C]//2013 Sixth International Conference On Developments In Esystems Engineering. IEEE pp 259-269.
Al-Janabi S, Rawat S, Patel A et al (2015) Design and evaluation of a hybrid system for detection and prediction of faults in electrical transformers. Int J Electr Power Energy Syst 67:324–335
Wen X (2021) Using deep learning approach and IoT architecture to build the intelligent music recommendation system. Soft Comput 25(4):3087–3096
Siphocly NNJ, El-Horbaty ESM (2019) Intelligent technique for automating the conversion between major and minor melodies. Future Comput Inform J 4(2):2
Paolizzo F, Johnson CG (2020) Creative autonomy in a simple interactive music system. J New Music Res 49(2):115–125
Rahate A, Walambe R, Ramanna S et al (2022) Multimodal co-learning: challenges, applications with datasets, recent advances and future directions. Inform Fus 81:203–239
Keerti G, Vaishnavi AN, Mukherjee P et al (2022) Attentional networks for music generation. Multimed Tools Appl 81(4):5179–5189
Rizvi SKJ, Azad MA, Fraz MM (2021) Spectrum of advancements and developments in multidisciplinary domains for generative adversarial networks (GANs). Arch Comput Method Eng 28(7):4503–4521
Hazra T, Anjaria K (2022) Applications of game theory in deep learning: a survey. Multimed Tools Appl 81(6):8963–8994
Supiarza H, Sarbeni I (2021) Teaching and learning music in digital era: creating keroncong music for gen z students through interpreting poetry. Harm J Arts Res Edu 21(1):123–139
Chang CY, Chen YP (2020) AntsOMG: a framework aiming to automate creativity and intelligent behavior with a showcase on cantus firmus composition and style development. Electronics 9(8):1212
Yu Y, Srivastava A, Canales S (2021) Conditional lstm-gan for melody generation from lyrics. ACM Trans Multimed Comput Commun Appl (TOMM) 17(1):1–20
Wu J, Hu C, Wang Y et al (2019) A hierarchical recurrent neural network for symbolic melody generation. IEEE Trans Cybern 50(6):2749–2757
Wang N, Xu H, Xu F et al (2021) The algorithmic composition for music copyright protection under deep learning and blockchain. Appl Soft Comput 112:107763
Siphocly NN, Salem ABM, El-Horabty ESM (2021) Applications of computational intelligence in computer music composition. Int J Intell Comput Inf Sci 21(1):59–67
Zheng Y (2020) The use of deep learning algorithm and digital media art in all-media intelligent electronic music system. PLoS ONE 15(10):e0240492
Jin C, Tie Y, Bai Y et al (2020) A style-specific music composition neural network. Neural Process L 52:1893–1912
Ali S H. (2012) Miner for OACCR: Case Of Medical Data Analysis In Knowledge Discovery[C]//2012 6th International Conference On Sciences Of Electronics, Technologies Of Information And Telecommunications (SETIT). IEEE pp 962–975.
Al-Janabi S, Salman AH (2021) Sensitive integration of multilevel optimization model in human activity recognition for smartphone and smartwatch applications. B data min anal 4(2):124–138
Carnovalini F, Rodà A (2020) Computational creativity and music generation systems: an introduction to the state of the art. Front Artifi Intell 3:14
Tabuena AC (2020) Chord-interval, direct-familiarization, musical instrument digital interface, circle of fifths, and functions as basic piano accompaniment transposition techniques. Int J Res Publ 66(1):1–11
Paszke A, Gross S, Massa F et al (2019) Pytorch: an imperative style, high-performance deep learning library. Adv Neural Inf Process Syst 32:8026–8037
Karras T, Aittala M, Laine S et al (2021) Alias-free generative adversarial networks. Adv Neural Inf Process Syst 34:852–863
Hadimlioglu IA, King SA (2018) Automated musical transitions through rule-based synthesis using musical properties. Entertain Comput 28:59–67
Zhang H, Xu T, Li H et al (2018) Stackgan++: realistic image synthesis with stacked generative adversarial networks. IEEE Trans Pattern Anal Mach Intell 41(8):1947–1962
De Persis C, Grammatico S (2019) Distributed averaging integral nash equilibrium seeking on networks. Automatica 110:108–548
Park SW, Huh JH, Kim JC (2020) BEGAN v3: avoiding mode collapse in GANs using variational inference. Electronics 9(4):6–88
Barzilay N, Shalev TB, Giryes R (2021) MISS GAN: a multi-illustrator style generative adversarial network for image to illustration translation. Pattern Recogn Lett 151:140–147
Wu X, Wang C, Lei Q (2020) Transformer-xl based music generation with multiple sequences of time-valued notes. arXiv Prepr arXiv 13:1–1
Song G, Wang Z, Han F et al (2020) Music auto-tagging using scattering transform and convolutional neural network with self-attention. Appl Soft Comput 96:106–702
Leikin A (2017) Not set in stone: mikhail pletnev’s rewrite of scriabin’s piano concerto. Perform Pract Rev 22(1):1–29
Zeng Z, Xiong Y, Guo W et al (2020) ERgene: python library for screening endogenous reference genes. Sci Rep 10(1):12–56
Noor TH, Zeadally S, Alfazi A et al (2018) Mobile cloud computing: challenges and future research directions. J Netw Comput Appl 115:70–85
Heo YJ, Kim BG, Roy PP (2021) Frontal face generation algorithm from multi-view images based on generative adversarial network. J Multimed Inf Syst 8(2):85–92
Ye X, Du J, Ye Y (2022) MasterplanGAN: facilitating the smart rendering of urban master plans via generative adversarial networks. Environ Plan B urban Anal City Sci 49(3):794–814
Mohammed DY, Al-Karawi KA, Duncan PJ et al (2019) Overlapped music segmentation using a new effective feature and random forests. Int J Artifi Intell 8(2):181–189
Liu Y (2021) Improved generative adversarial network and its application in image oil painting style transfer. Image Vis Comput 105:104087
Briot JP (2021) From artificial neural networks to deep learning for music generation: history, concepts and trends. Neural Comput Appl 33(1):39–65
Grekow J, Dimitrova-Grekow T (2021) Monophonic music generation with a given emotion using conditional variational autoencoder. IEEE Access 9:129088–129101
Acknowledgements
The authors acknowledge the help from the university colleagues.
Funding
This work was supported by the diversified reform of music and ear teaching in ordinary higher music majors (Grant no. 2017 Xiangcheng Institute Fa No.120.33).
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
All authors declare that they have no conflict of interest.
Ethical approval
This article does not contain any studies with human participants or animals performed by any of the authors.
Informed consent
Informed consent was obtained from all individual participants included in the study.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Liu, W. Literature survey of multi-track music generation model based on generative confrontation network in intelligent composition. J Supercomput 79, 6560–6582 (2023). https://doi.org/10.1007/s11227-022-04914-5
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11227-022-04914-5