Literature survey of multi-track music generation model based on generative confrontation network in intelligent composition

Liu, Weiming

doi:10.1007/s11227-022-04914-5

Literature survey of multi-track music generation model based on generative confrontation network in intelligent composition

Published: 08 November 2022

Volume 79, pages 6560–6582, (2023)
Cite this article

The Journal of Supercomputing Aims and scope Submit manuscript

Weiming Liu¹

1182 Accesses
Explore all metrics

Abstract

The production of traditional music is too complicated, consuming a lot of financial and human resources. Therefore, this paper aims to use artificial intelligence (AI) for songwriting and to explore the development and application of the Generative Adversarial Network (GAN) in smart music. An improved GAN-based Multi-Track Music (MTM)-GAN is established. The model is validated with the generation of 5 different music tracks for bass, drums, guitar, piano, and strings. The verification results are compared with the music generated by the existing Multi-Track Sequential GAN (MuseGAN) index evaluation method. The results show that many music clips generated by the MTM-GAN model are smooth and have a certain artistic aesthetic effect. Through the comparison of the two convergence curves of MuseGAN and MTM-GAN, when the penalty term is increased, the MTM-GAN of Consistency Term (CT) converges faster, and the training process is more stable. The numerical space of the parameter distribution obtained by the MTM-GAN-based music segment test is significantly smaller than that of MuseGAN. The probability of MTM-GAN overfitting is small. 62.8% of music listeners cannot distinguish the generated melody from the real melody. Therefore, the proposed model has the advantages of a more stable, more realistic, and faster fitting speed in music generation, indicating that the music generation method is effective.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Linear Transformer-GAN: A Novel Architecture to Symbolic Music Generation

A Music Generation Model Based on Generative Adversarial Networks with Bayesian Optimization

Komposer V2: A Hybrid Approach to Intelligent Musical Composition Based on Generative Adversarial Networks with a Variational Autoencoder

Data availability

The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.

References

Al-Janabi S, Alkaim A, Al-Janabi E et al (2021) Intelligent forecaster of concentrations (PM2. 5, PM10, NO2, CO, O3, SO2) caused air pollution (IFCsAP). Neural Comput Appl 33(21):14199–14229
Google Scholar
Al-Janabi S, Alkaim AF, Adel Z (2020) An innovative synthesis of deep learning techniques (DcapsNet & DCOM) for generation electrical renewable energy from wind energy. Soft Comput 24(14):10943–10962
Google Scholar
Al-Janabi S, Alkaim AF (2020) A nifty collaborative analysis to predicting a novel tool (DRFLLS) for missing values estimation. Soft Comput 24(1):555–569
Google Scholar
Al-Janabi S, Mohammad M, Al-Sultan A (2020) A new method for prediction of air pollution based on intelligent computation. Soft Comput 24(1):661–680
Google Scholar
Al-Janabi S, Alwan E. (2017) Soft Mathematical System To Solve Black Box Problem Through Development the FARB Based On Hyperbolic And Polynomial Functions[C]//2017 10th International Conference On Developments In Esystems Engineering (DeSE). IEEE pp 37–42.
Alkaim AF, Al_Janabi S. (2019) Multi Objectives Optimization To Gas Flaring Reduction From Oil Production[C]//International Conference On Big Data and Networks Technologies. Springer Cham pp 117–139.
Al-Janabi S, Al-Shourbaji I, Shojafar M, et al. (2017) Mobile Cloud Computing: Challenges And Future Research Directions[C]//2017 10th International Conference On Developments In Esystems Engineering (DeSE). IEEE pp 62–67.
SH Ali 2013Ali SH. (2013) Novel Approach For Generating The Key Of Stream Cipher System Using Random Forest Data Mining Algorithm[C]//2013 Sixth International Conference On Developments In Esystems Engineering. IEEE pp 259-269.
Al-Janabi S, Rawat S, Patel A et al (2015) Design and evaluation of a hybrid system for detection and prediction of faults in electrical transformers. Int J Electr Power Energy Syst 67:324–335
Google Scholar
Wen X (2021) Using deep learning approach and IoT architecture to build the intelligent music recommendation system. Soft Comput 25(4):3087–3096
Google Scholar
Siphocly NNJ, El-Horbaty ESM (2019) Intelligent technique for automating the conversion between major and minor melodies. Future Comput Inform J 4(2):2
Google Scholar
Paolizzo F, Johnson CG (2020) Creative autonomy in a simple interactive music system. J New Music Res 49(2):115–125
Google Scholar
Rahate A, Walambe R, Ramanna S et al (2022) Multimodal co-learning: challenges, applications with datasets, recent advances and future directions. Inform Fus 81:203–239
Google Scholar
Keerti G, Vaishnavi AN, Mukherjee P et al (2022) Attentional networks for music generation. Multimed Tools Appl 81(4):5179–5189
Google Scholar
Rizvi SKJ, Azad MA, Fraz MM (2021) Spectrum of advancements and developments in multidisciplinary domains for generative adversarial networks (GANs). Arch Comput Method Eng 28(7):4503–4521
Google Scholar
Hazra T, Anjaria K (2022) Applications of game theory in deep learning: a survey. Multimed Tools Appl 81(6):8963–8994
Google Scholar
Supiarza H, Sarbeni I (2021) Teaching and learning music in digital era: creating keroncong music for gen z students through interpreting poetry. Harm J Arts Res Edu 21(1):123–139
Google Scholar
Chang CY, Chen YP (2020) AntsOMG: a framework aiming to automate creativity and intelligent behavior with a showcase on cantus firmus composition and style development. Electronics 9(8):1212
Google Scholar
Yu Y, Srivastava A, Canales S (2021) Conditional lstm-gan for melody generation from lyrics. ACM Trans Multimed Comput Commun Appl (TOMM) 17(1):1–20
Google Scholar
Wu J, Hu C, Wang Y et al (2019) A hierarchical recurrent neural network for symbolic melody generation. IEEE Trans Cybern 50(6):2749–2757
Google Scholar
Wang N, Xu H, Xu F et al (2021) The algorithmic composition for music copyright protection under deep learning and blockchain. Appl Soft Comput 112:107763
Google Scholar
Siphocly NN, Salem ABM, El-Horabty ESM (2021) Applications of computational intelligence in computer music composition. Int J Intell Comput Inf Sci 21(1):59–67
Google Scholar
Zheng Y (2020) The use of deep learning algorithm and digital media art in all-media intelligent electronic music system. PLoS ONE 15(10):e0240492
Google Scholar
Jin C, Tie Y, Bai Y et al (2020) A style-specific music composition neural network. Neural Process L 52:1893–1912
Google Scholar
Ali S H. (2012) Miner for OACCR: Case Of Medical Data Analysis In Knowledge Discovery[C]//2012 6th International Conference On Sciences Of Electronics, Technologies Of Information And Telecommunications (SETIT). IEEE pp 962–975.
Al-Janabi S, Salman AH (2021) Sensitive integration of multilevel optimization model in human activity recognition for smartphone and smartwatch applications. B data min anal 4(2):124–138
Google Scholar
Carnovalini F, Rodà A (2020) Computational creativity and music generation systems: an introduction to the state of the art. Front Artifi Intell 3:14
Google Scholar
Tabuena AC (2020) Chord-interval, direct-familiarization, musical instrument digital interface, circle of fifths, and functions as basic piano accompaniment transposition techniques. Int J Res Publ 66(1):1–11
Google Scholar
Paszke A, Gross S, Massa F et al (2019) Pytorch: an imperative style, high-performance deep learning library. Adv Neural Inf Process Syst 32:8026–8037
Google Scholar
Karras T, Aittala M, Laine S et al (2021) Alias-free generative adversarial networks. Adv Neural Inf Process Syst 34:852–863
Google Scholar
Hadimlioglu IA, King SA (2018) Automated musical transitions through rule-based synthesis using musical properties. Entertain Comput 28:59–67
Google Scholar
Zhang H, Xu T, Li H et al (2018) Stackgan++: realistic image synthesis with stacked generative adversarial networks. IEEE Trans Pattern Anal Mach Intell 41(8):1947–1962
Google Scholar
De Persis C, Grammatico S (2019) Distributed averaging integral nash equilibrium seeking on networks. Automatica 110:108–548
MathSciNet MATH Google Scholar
Park SW, Huh JH, Kim JC (2020) BEGAN v3: avoiding mode collapse in GANs using variational inference. Electronics 9(4):6–88
Google Scholar
Barzilay N, Shalev TB, Giryes R (2021) MISS GAN: a multi-illustrator style generative adversarial network for image to illustration translation. Pattern Recogn Lett 151:140–147
Google Scholar
Wu X, Wang C, Lei Q (2020) Transformer-xl based music generation with multiple sequences of time-valued notes. arXiv Prepr arXiv 13:1–1
Google Scholar
Song G, Wang Z, Han F et al (2020) Music auto-tagging using scattering transform and convolutional neural network with self-attention. Appl Soft Comput 96:106–702
Google Scholar
Leikin A (2017) Not set in stone: mikhail pletnev’s rewrite of scriabin’s piano concerto. Perform Pract Rev 22(1):1–29
Google Scholar
Zeng Z, Xiong Y, Guo W et al (2020) ERgene: python library for screening endogenous reference genes. Sci Rep 10(1):12–56
Google Scholar
Noor TH, Zeadally S, Alfazi A et al (2018) Mobile cloud computing: challenges and future research directions. J Netw Comput Appl 115:70–85
Google Scholar
Heo YJ, Kim BG, Roy PP (2021) Frontal face generation algorithm from multi-view images based on generative adversarial network. J Multimed Inf Syst 8(2):85–92
Google Scholar
Ye X, Du J, Ye Y (2022) MasterplanGAN: facilitating the smart rendering of urban master plans via generative adversarial networks. Environ Plan B urban Anal City Sci 49(3):794–814
Google Scholar
Mohammed DY, Al-Karawi KA, Duncan PJ et al (2019) Overlapped music segmentation using a new effective feature and random forests. Int J Artifi Intell 8(2):181–189
Google Scholar
Liu Y (2021) Improved generative adversarial network and its application in image oil painting style transfer. Image Vis Comput 105:104087
Google Scholar
Briot JP (2021) From artificial neural networks to deep learning for music generation: history, concepts and trends. Neural Comput Appl 33(1):39–65
MathSciNet Google Scholar
Grekow J, Dimitrova-Grekow T (2021) Monophonic music generation with a given emotion using conditional variational autoencoder. IEEE Access 9:129088–129101
Google Scholar

Download references

Acknowledgements

The authors acknowledge the help from the university colleagues.

Funding

This work was supported by the diversified reform of music and ear teaching in ordinary higher music majors (Grant no. 2017 Xiangcheng Institute Fa No.120.33).

Author information

Authors and Affiliations

College of Art, Hunan City University, Yiyang, 413000, China
Weiming Liu

Authors

Weiming Liu
View author publications
You can also search for this author inPubMed Google Scholar

Corresponding author

Correspondence to Weiming Liu.

Ethics declarations

Conflict of interest

All authors declare that they have no conflict of interest.

Ethical approval

This article does not contain any studies with human participants or animals performed by any of the authors.

Informed consent

Informed consent was obtained from all individual participants included in the study.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Cite this article

Liu, W. Literature survey of multi-track music generation model based on generative confrontation network in intelligent composition. J Supercomput 79, 6560–6582 (2023). https://doi.org/10.1007/s11227-022-04914-5

Download citation

Accepted: 22 October 2022
Published: 08 November 2022
Issue Date: April 2023
DOI: https://doi.org/10.1007/s11227-022-04914-5

Keywords

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Literature survey of multi-track music generation model based on generative confrontation network in intelligent composition

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

Linear Transformer-GAN: A Novel Architecture to Symbolic Music Generation

A Music Generation Model Based on Generative Adversarial Networks with Bayesian Optimization

Komposer V2: A Hybrid Approach to Intelligent Musical Composition Based on Generative Adversarial Networks with a Variational Autoencoder

Data availability

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Ethical approval

Informed consent

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Subscribe and save

Buy Now