Create a pop music Transformer

lewtun · November 9, 2021, 7:30pm

Please read the topic category description to understand what this is all about

Description

If you treat music notation as a form of “text”, you can use language modelling to generate new songs! The goal of this project is to explore how well Transformers perform at music modelling.

Model(s)

None that we could find on the Hub, but see here for some pretrained music Transformers

Datasets

None that we could find on the Hugging Face Hub, but see here for some ideas

Challenges

This task probably involves pretraining a Transformer, which can potentially take multiple days using the GPU resources provided by AWS. An interesting alternative would be to see whether one can integrate an existing pretrained model within Transformers and use that to either fine-tune.

Desired project outcomes

Create a Streamlit or Gradio app on Spaces that people can remix famous songs with newly generated ones
Don’t forget to push all your models and datasets to the Hub so others can build on them!

Additional resources

Discord channel

To chat and organise with other people interested in this project, head over to our Discord and:

Follow the instructions on the #join-course channel
Join the #pop-music-transformer channel

Just make sure you comment here to indicate that you’ll be contributing to this project

psistolar · November 16, 2021, 10:58pm

This one seems pretty difficult, but too interesting to pass up. I’ll try working on it.

lewtun · November 17, 2021, 11:31am

Hey @psistolar ! Yes this is a somewhat challenging project and one that might not be possible in the timeframe if you adopt the pretraining approach. Let me think about how we can simplify it and I’ll post back here

In the meantime, there’s a new Discord channel (see topic description) in case you and others need it

Topic		Replies	Views
How to Train a Generative Pre-training Transformer Beginners	0	137	May 26, 2024
Further pre-train language model in transformers like BERT Models	3	1108	March 27, 2022
Is there any music vocals/voice-to-text model? Beginners	0	1023	July 19, 2023
Best model for music generation Models	3	1744	December 31, 2024
Tutorial: Implementing Transformer from Scratch - A Step-by-Step Guide Show and Tell	5	4689	May 1, 2025