1. I want to continue training a T5 model in Hugging Face on my own corpus (about a specific domain).
2. Then I want to fine-tune this model for text generation.
I am worried that there will be a conflict between the two steps.
So, is this possible to do?
I’m not sure what you mean: training a model on your own corpus is fine-tuning the model. Make sure you choose a model well suited to the task you want it to perform. For text generation you should start with a causal language model (filter by text generation on the models page). Then make sure your corpus is well formatted so you don’t get anything you don’t want in your generations. The hardest part is managing the server resources for training.
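If you go the causal-LM route, a minimal sketch of continuing training on a plain-text domain corpus with the Trainer API could look like this (the checkpoint name, file name, and hyperparameters below are placeholders, not recommendations):

```python
# Sketch: continue training a causal LM (e.g. gpt2) on a domain corpus.
# "domain_corpus.txt" is a hypothetical plain-text file, one document per line.
from datasets import load_dataset
from transformers import (AutoModelForCausalLM, AutoTokenizer,
                          DataCollatorForLanguageModeling, Trainer,
                          TrainingArguments)

model_name = "gpt2"  # any causal-LM checkpoint works here
tokenizer = AutoTokenizer.from_pretrained(model_name)
tokenizer.pad_token = tokenizer.eos_token  # gpt2 has no pad token by default
model = AutoModelForCausalLM.from_pretrained(model_name)

dataset = load_dataset("text", data_files={"train": "domain_corpus.txt"})

def tokenize(batch):
    return tokenizer(batch["text"], truncation=True, max_length=512)

tokenized = dataset.map(tokenize, batched=True, remove_columns=["text"])

# mlm=False gives the standard left-to-right language-modeling objective
collator = DataCollatorForLanguageModeling(tokenizer, mlm=False)

args = TrainingArguments(
    output_dir="domain-lm",
    per_device_train_batch_size=4,
    num_train_epochs=1,
)

trainer = Trainer(
    model=model,
    args=args,
    train_dataset=tokenized["train"],
    data_collator=collator,
)
trainer.train()
```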
Yes, it is possible, and it is pretty easy now with the tooling from the HF team.
Here are some resources, and I would also search for more on YouTube or the HF blogs.
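For the T5 two-step setup specifically, here is a rough sketch of the second step: fine-tuning a previously domain-adapted checkpoint for generation with Seq2SeqTrainer. The checkpoint path, dataset file, and column names are hypothetical, so adapt them to your data:

```python
# Sketch: fine-tune a domain-adapted T5 checkpoint for text generation.
# Assumes the first (continued-training) step saved a model at "./t5-domain"
# and a CSV with "source" and "target" text columns exists.
from datasets import load_dataset
from transformers import (AutoModelForSeq2SeqLM, AutoTokenizer,
                          DataCollatorForSeq2Seq, Seq2SeqTrainer,
                          Seq2SeqTrainingArguments)

checkpoint = "./t5-domain"  # output of the continued-training step
tokenizer = AutoTokenizer.from_pretrained(checkpoint)
model = AutoModelForSeq2SeqLM.from_pretrained(checkpoint)

dataset = load_dataset("csv", data_files={"train": "generation_pairs.csv"})

def preprocess(batch):
    # Tokenize inputs and targets for the text-to-text objective
    model_inputs = tokenizer(batch["source"], truncation=True, max_length=512)
    labels = tokenizer(text_target=batch["target"], truncation=True, max_length=128)
    model_inputs["labels"] = labels["input_ids"]
    return model_inputs

tokenized = dataset.map(preprocess, batched=True,
                        remove_columns=dataset["train"].column_names)

collator = DataCollatorForSeq2Seq(tokenizer, model=model)

args = Seq2SeqTrainingArguments(
    output_dir="t5-domain-generation",
    per_device_train_batch_size=4,
    num_train_epochs=3,
)

trainer = Seq2SeqTrainer(
    model=model,
    args=args,
    train_dataset=tokenized["train"],
    data_collator=collator,
    tokenizer=tokenizer,
)
trainer.train()
```

Since the second step starts from the checkpoint produced by the first, there is no conflict between them; the fine-tuning simply continues from the domain-adapted weights.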