
The Application of Neural Networks in Guitar Style Transformation

Published: 28 December 2024

Abstract

This paper explores audio style transfer using deep learning models and spectral analysis. Raw audio signals are transformed into the frequency domain with the Short-Time Fourier Transform (STFT), from which spatial features crucial for audio synthesis are extracted. Central to the methodology are two neural networks, RandomCNN and EfficientNetV2-S, used as feature extractors on spectrograms; both are adapted to preserve essential spectral characteristics while improving computational efficiency. A content loss, the mean squared error between feature maps, keeps the generated audio faithful to the source, while style is transferred via Gram matrices, which capture temporal correlations between feature channels and imbue the generated audio with the stylistic attributes of a reference recording. To reconstruct audio from the generated magnitude spectrograms, we employ the Fast Griffin-Lim Algorithm (FGLA), which iteratively estimates phase information from magnitude data to produce high-quality audio outputs. Experimental validation on the GuitarSet dataset across diverse musical genres, including Funk and Bossa Nova, demonstrates the efficacy of the approach: spectrographic analysis of the generated outputs confirms that content integrity is preserved and stylistic elements of the reference audio are faithfully adopted. This research investigates the capabilities of neural networks in audio synthesis, with promising applications in music production and artistic expression through nuanced style fusion and high-fidelity audio reproduction.
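The loss formulation the abstract describes — a mean-squared-error content loss in feature space plus a Gram-matrix style loss computed over features from a random, untrained convolutional layer (the RandomCNN idea) — can be sketched as follows. This is an illustrative NumPy reconstruction, not the authors' implementation; the window length, hop size, filter count, and filter width are assumptions chosen for the example.

```python
import numpy as np

rng = np.random.default_rng(0)

def stft_mag(x, n_fft=512, hop=128):
    """Magnitude STFT: frame, window, real FFT -> array of shape (freq, time)."""
    win = np.hanning(n_fft)
    frames = [x[i:i + n_fft] * win for i in range(0, len(x) - n_fft + 1, hop)]
    return np.abs(np.fft.rfft(np.stack(frames), axis=1)).T

def make_random_filters(n_freq, n_filters=16, width=3):
    """Untrained 2-D conv filters spanning all frequency bins (RandomCNN idea)."""
    return rng.standard_normal((n_filters, n_freq, width)) * 0.01

def features(spec, w):
    """One conv layer + ReLU over the spectrogram -> (n_filters, time')."""
    n_filters, _, width = w.shape
    t = spec.shape[1]
    out = np.array([[(w[k] * spec[:, i:i + width]).sum()
                     for i in range(t - width + 1)]
                    for k in range(n_filters)])
    return np.maximum(out, 0.0)

def content_loss(f_gen, f_content):
    """Mean squared error between feature maps (fidelity to the source)."""
    return float(np.mean((f_gen - f_content) ** 2))

def gram(f):
    """Temporal correlations between feature channels."""
    return f @ f.T / f.shape[1]

def style_loss(f_gen, f_style):
    """Mean squared error between Gram matrices (stylistic similarity)."""
    return float(np.mean((gram(f_gen) - gram(f_style)) ** 2))

# Toy usage: two short tones stand in for content and style clips.
# The same random filters must be applied to both so the losses are comparable.
t = np.arange(4096) / 22050.0
content = np.sin(2 * np.pi * 440 * t)
style = np.sin(2 * np.pi * 220 * t)
spec_c, spec_s = stft_mag(content), stft_mag(style)
w = make_random_filters(spec_c.shape[0])
fc, fs = features(spec_c, w), features(spec_s, w)
print("content loss:", content_loss(fc, fc), "style loss:", style_loss(fc, fs))
```

In the full method, a generated spectrogram would be optimized to minimize a weighted sum of these two losses, and the result passed to FGLA for phase recovery; here the losses are only evaluated, not optimized.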



Published In

HPCCT '24: Proceedings of the 2024 8th High Performance Computing and Cluster Technologies Conference
July 2024
55 pages
ISBN:9798400716881
DOI:10.1145/3705956

Publisher

Association for Computing Machinery

New York, NY, United States


Author Tags

  1. EfficientNetV2-S
  2. RandomCNN
  3. style transform

Qualifiers

  • Research-article

Conference

HPCCT 2024
