Abstract
Transformers and variational autoencoders (VAEs) have been extensively
employed for symbolic (e.g., MIDI) music generation. While the former
excel at modeling long sequences, the latter allow users to exert control
over specific parts (e.g., bars) of the music to be generated. In this
paper, we are interested in bringing the two together to construct a single
model that exhibits both strengths. The task is split into two steps. First,
we equip Transformer decoders with the ability to accept segment-level,
time-varying conditions during sequence generation. Subsequently, we combine
the resulting in-attention decoder with a Transformer encoder, and train the
combined MuseMorphose model with the VAE objective to achieve style transfer
of long pop piano pieces, in which users can specify desired musical
attributes, including rhythmic intensity and polyphony (i.e., harmonic
fullness), down to the bar level. Experiments show that MuseMorphose
outperforms recurrent neural network (RNN) based baselines on numerous
widely used metrics for style transfer tasks.