Long Text generation

saied · May 3, 2021, 5:52pm

Hi,
I have some questions about different models for text generation, and I would be thrilled to hear your answers.
Right now, I’m working on a text generation task for non-English languages (for now, Arabic and Persian).
First, which architecture is best for this task? So far, I’ve tested GPT-2, BERT, and AWD-LSTM, and I got the best results with GPT-2, although due to limited resources I had to train the small model. Are there any alternatives that I should consider? And which one do you think works best for text generation?

Second, for long text generation, which criteria should I consider? How much do the dataset size and the model’s context size affect this? And does the model architecture play a significant role?

I know there are lots of questions, but I really need the help of practitioners like you.

Thanks.
