Generate your own TV shows

:wave: Please read the topic category description to understand what this is all about

Description

Fine-tune an autoregressive Transformer model on the transcripts of your favourite TV show to generate new episodes!

Model(s)

A decoder-based Transformer model would be well suited for this task. You can find these models under the Text Generation models filter on the Hub, and the following would be a good place to start from:

  • GPT-2
  • GPT-Neo

Datasets

You can usually find the transcripts for TV shows with a quick Google search. For example:

  • here are the transcripts for the popular Rick and Morty series.
  • here are the transcripts for the The Simpsons series

Challenges

The transcripts are unlikely to come in a ready-to-use format for language modeling, so some data wrangling will be needed.

Desired project outcomes

  • Create a Streamlit or Gradio app on :hugs: Spaces that allows people to generate their own TV scripts from an input prompt.
  • Don’t forget to push all your models and datasets to the Hub so others can build on them!

Additional resources

  • Check out @merve’s cool Space on French story generation to get an idea about creating a Space for this project.
1 Like