Fine-Tune GPT2 for German Rap

GPT2 for German Rap

For this project, I propose to take a GPT2 model pretrained on German text and fine-tune it to rap in German.

Model

A GPT2 model pretrained on German text can be found here: dbmdz/german-gpt2 · Hugging Face

Datasets

The model can be fine-tuned on a variety of publicly available lyrics of rap songs in German. The team can choose whatever lyrics they like.
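As a rough sketch of how collected lyrics might be turned into training files, the snippet below splits a list of songs into a training and a validation set and writes each as JSON Lines, one song per line. The function names, the `text` field name, and the 10% validation share are assumptions made for illustration; whichever training script the team uses may expect a different layout.

```python
import json
import random
from pathlib import Path


def split_songs(songs, val_fraction=0.1, seed=0):
    """Shuffle songs deterministically and split them into (train, validation)."""
    shuffled = list(songs)
    random.Random(seed).shuffle(shuffled)
    # Hold out at least one song for validation when there is more than one song
    n_val = max(1, int(len(shuffled) * val_fraction)) if len(shuffled) > 1 else 0
    return shuffled[n_val:], shuffled[:n_val]


def write_jsonl(songs, path):
    """Write one JSON object per line, lyrics under a "text" key (assumed field name)."""
    with Path(path).open("w", encoding="utf-8") as f:
        for song in songs:
            # ensure_ascii=False keeps umlauts and ß readable in the file
            f.write(json.dumps({"text": song}, ensure_ascii=False) + "\n")
```

For example, `train, val = split_songs(all_songs)` followed by `write_jsonl(train, "train.jsonl")` and `write_jsonl(val, "validation.jsonl")` would produce two files (the file names are placeholders). Splitting by song rather than by line keeps verses of the same track from leaking into both sets.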

Available training scripts

A training script to fine-tune a GPT2 model in Flax is available here

(Optional) Desired project outcome

The desired project output is a GPT2 model that can drop bars in German. Ideally the model is able to generate a whole rap song.
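To make the target a bit more concrete, here is a minimal sketch of sampling a verse with the `transformers` text-generation pipeline. The helper names, the sampling values (`top_p`, `temperature`, `max_new_tokens`), and the example prompt are assumptions to experiment with, not settings from this proposal. Before fine-tuning, the base model will produce generic German text rather than rap.

```python
def load_generator(model_name="dbmdz/german-gpt2"):
    """Build a text-generation pipeline; downloads the model on first use."""
    # Imported lazily so the helpers can be defined without transformers installed
    from transformers import pipeline
    return pipeline("text-generation", model=model_name)


def generate_verse(generator, prompt, max_new_tokens=120):
    """Sample one continuation of `prompt` and return it as a string."""
    outputs = generator(
        prompt,
        max_new_tokens=max_new_tokens,
        do_sample=True,   # sample instead of greedy decoding, for more variety
        top_p=0.95,       # nucleus sampling threshold (assumed value)
        temperature=0.9,  # assumed value; worth tuning by ear
    )
    # The pipeline returns a list of dicts; the text includes the prompt
    return outputs[0]["generated_text"]
```

A call such as `generate_verse(load_generator(), "Ich komm' in den Club")` would return the prompt plus a sampled continuation. After fine-tuning, the path of a saved checkpoint directory could be passed as `model_name` instead.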

(Optional) Challenges

It might be difficult to find enough data and to homogenize it well.
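One possible way to homogenize lyrics from different sources is sketched below: normalize Unicode and quotation marks, drop section markers, collapse whitespace, and remove exact duplicates. The assumption that sections are annotated as bracketed lines such as `[Hook]` is mine, as are the function names; real lyric sources will likely need additional rules.

```python
import re
import unicodedata

# Lines consisting only of a bracketed marker, e.g. "[Hook]" (assumed annotation style)
SECTION_TAG = re.compile(r"^\s*\[[^\]]*\]\s*$")
# Map typographic quotes and apostrophes to plain ASCII ones
QUOTES = str.maketrans({"’": "'", "‘": "'", "“": '"', "”": '"', "„": '"'})


def clean_lyrics(raw):
    """Normalize one song: Unicode form, quotes, whitespace, and section tags."""
    text = unicodedata.normalize("NFC", raw).translate(QUOTES)
    lines = []
    for line in text.splitlines():
        if SECTION_TAG.match(line):
            continue  # drop "[Hook]"-style markers
        line = re.sub(r"\s+", " ", line).strip()
        # Keep at most one blank line in a row so verse breaks survive
        if line or (lines and lines[-1]):
            lines.append(line)
    return "\n".join(lines).strip()


def deduplicate(songs):
    """Clean every song and drop those whose lowercased text was already seen."""
    seen, unique = set(), []
    for song in songs:
        cleaned = clean_lyrics(song)
        key = cleaned.lower()
        if key and key not in seen:
            seen.add(key)
            unique.append(cleaned)
    return unique
```

Exact-match deduplication only catches identical copies after cleaning; the same song transcribed slightly differently on two sites would still pass through.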

(Optional) Links to read up on

There are some nice articles on GPT2 models that can rap:
