Fine-Tune GPT2 for French/Belgian Rap

GPT2 for French Rap

For this project, I propose to take a GPT2 model pretrained on French text and fine-tune it to generate French/Belgian rap lyrics.

Model

A GPT2 model pretrained on French text can be found here: antoiloui/belgpt2 · Hugging Face
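As a quick sanity check before any fine-tuning, the pretrained checkpoint can be loaded and sampled from. The sketch below is only an illustration: it assumes the `transformers` and `torch` packages are installed and uses the PyTorch GPT2 classes rather than Flax. The sampling values and the helper name `generate_bars` are my own placeholders, not part of the proposal.

```python
# Hypothetical sampling settings; tune them to taste.
SAMPLING = {"do_sample": True, "top_p": 0.95, "top_k": 50}


def generate_bars(prompt, max_new_tokens=60, model_name="antoiloui/belgpt2"):
    """Sample a continuation of `prompt` from the pretrained French GPT2 model."""
    # Imported lazily so that defining this helper does not require the libraries.
    import torch
    from transformers import GPT2LMHeadModel, GPT2Tokenizer

    tokenizer = GPT2Tokenizer.from_pretrained(model_name)
    model = GPT2LMHeadModel.from_pretrained(model_name)
    model.eval()

    inputs = tokenizer(prompt, return_tensors="pt")
    with torch.no_grad():
        output_ids = model.generate(
            **inputs,
            max_new_tokens=max_new_tokens,
            # Assumption: reuse the end-of-text id for padding to avoid a warning.
            pad_token_id=tokenizer.eos_token_id,
            **SAMPLING,
        )
    # output_ids has shape (1, sequence_length); decode the single sample.
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)
```

Calling `generate_bars("Dans ma ville")` downloads the checkpoint on first use and returns the prompt followed by sampled text. Before fine-tuning, the output should read as generic French rather than rap.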

Datasets

The model can be fine-tuned on a variety of publicly available French and Belgian rap lyrics. The team can choose whatever lyrics they like.
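One possible way to turn a collection of lyrics into training data is to split by song and write plain-text files that a language-modeling script can read. This is a minimal sketch using only the standard library. The function names, the 10% validation share, and the `<|endoftext|>` separator are assumptions on my part, and the separator should match whatever end-of-text token the chosen tokenizer actually uses.

```python
import random
from pathlib import Path


def split_songs(songs, valid_fraction=0.1, seed=0):
    """Shuffle songs and split them into (train, valid), keeping each song whole."""
    songs = list(songs)
    random.Random(seed).shuffle(songs)
    # Hold out at least one song when there is more than one to choose from.
    n_valid = max(1, int(len(songs) * valid_fraction)) if len(songs) > 1 else 0
    return songs[n_valid:], songs[:n_valid]


def write_corpus(songs, path, separator="<|endoftext|>"):
    """Write the songs to one text file, with a separator line between songs."""
    # Assumption: `separator` is a placeholder for the tokenizer's end-of-text token.
    text = f"\n{separator}\n".join(song.strip() for song in songs)
    Path(path).write_text(text + "\n", encoding="utf-8")
```

Splitting by whole song rather than by line keeps verses of the same track from leaking between the training and validation files.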

Available training scripts

A training script to fine-tune a GPT2 model in Flax is available here

(Optional) Desired project outcome

The desired project output is a GPT2 model that can drop bars in the style of French/Belgian rap. Ideally the model is able to generate a whole rap song.

(Optional) Challenges

It might be difficult to find enough data and to clean it into a consistent format.
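To make the homogenization step more concrete, here is a rough sketch of the kind of cleanup that scraped lyrics tend to need. It assumes section markers appear on their own line in square brackets (for example `[Couplet 1]` or `[Refrain]`), which depends on the source site, and the helper names are hypothetical.

```python
import re
import unicodedata

# Assumed convention: annotation lines such as "[Couplet 1]" or "[Refrain]".
SECTION_TAG = re.compile(r"^\s*\[[^\]]*\]\s*$")


def clean_lyrics(raw):
    """Normalize one song: Unicode form, apostrophes, section tags, whitespace."""
    text = unicodedata.normalize("NFC", raw)
    # Map the typographic apostrophe and the no-break space to plain ASCII.
    text = text.replace("\u2019", "'").replace("\u00a0", " ")
    lines = []
    for line in text.splitlines():
        if SECTION_TAG.match(line):
            continue  # drop annotation lines
        line = re.sub(r"\s+", " ", line).strip()
        # Keep at most one blank line in a row (a verse break).
        if line or (lines and lines[-1]):
            lines.append(line)
    return "\n".join(lines).strip()


def deduplicate(songs):
    """Return cleaned songs, dropping any whose case-folded text was already seen."""
    seen, unique = set(), []
    for song in songs:
        cleaned = clean_lyrics(song)
        key = cleaned.casefold()
        if key and key not in seen:
            seen.add(key)
            unique.append(cleaned)
    return unique
```

Exact-match deduplication only catches identical copies. Near-duplicates, such as the same song transcribed by two different sites, would need a fuzzier comparison.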

(Optional) Links to read upon

There are some nice articles on GPT2 models that can rap: