How much will this cost?

Hello. I have a dataset that contains approximately 100 million sentences (2 billion tokens) not in English. I want to implement a model for generating text. Which is better to choose GPT-2 or Llama 3? Approximately with what parameters server needed and how long will it take to train such a model? (so I can count the costs)