Fine-tuning T5 Model on a Book for Unsupervised Learning

I’m working on a project where I aim to fine-tune a T5 model or any other encoder-decoder model using an unsupervised learning approach to transfer knowledge from specific books.

My main goal is to train a model that becomes an expert on the book’s topic. However, I’m uncertain about the specific fine-tuning process to follow and which approach would yield the best results.

Here are my specific questions:

1. Given that I’m using an encoder-decoder model, which fine-tuning pipeline should I choose?
2. I’ve heard that Masked Language Modeling (MLM) works well for teaching encoder models new knowledge. Is MLM suitable for an encoder-decoder architecture like T5, or should I consider other methods, such as T5’s own span-corruption objective? (I’ve sketched roughly what I have in mind below.)
3. Are there any potential pitfalls I should be aware of when fine-tuning on a single, specific book?
4. What suggestions do you have for optimizing the fine-tuning process so that the model really becomes an expert on the book’s topic?
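
To make question 2 concrete, here is a rough sketch of the pipeline I’ve been imagining: chunk the book, apply T5-style span corruption (replacing spans with sentinel tokens), and fine-tune on that denoising objective with the Hugging Face transformers library. The checkpoint name, `book.txt` path, and all hyperparameters are placeholders, and I’m not at all sure this is the right objective, which is exactly what I’m asking about:

```python
# Rough sketch (not a tested recipe): T5-style span corruption on a book.
# Assumes Hugging Face transformers + PyTorch; "book.txt", the checkpoint
# name, and the hyperparameters below are placeholders.
import random

import torch
from torch.utils.data import DataLoader, Dataset
from transformers import T5ForConditionalGeneration, T5TokenizerFast

MODEL_NAME = "t5-small"   # placeholder checkpoint
BOOK_PATH = "book.txt"    # placeholder path to the plain-text book
CHUNK_WORDS = 200         # words per training example
MASK_PROB = 0.15          # fraction of words to corrupt (T5 pretraining default)
SPAN_LEN = 3              # corrupted-span length (T5 pretraining mean)

tokenizer = T5TokenizerFast.from_pretrained(MODEL_NAME)
model = T5ForConditionalGeneration.from_pretrained(MODEL_NAME)


def corrupt(words):
    """T5-style span corruption: replace word spans with <extra_id_i> sentinels
    in the input; the target lists each sentinel followed by the hidden words."""
    n_spans = max(1, int(len(words) * MASK_PROB / SPAN_LEN))
    starts = sorted(random.sample(range(len(words) - SPAN_LEN), n_spans))
    inp, tgt, pos, sid = [], [], 0, 0
    for s in starts:
        if s < pos:                      # skip spans overlapping a previous one
            continue
        inp += words[pos:s] + [f"<extra_id_{sid}>"]
        tgt += [f"<extra_id_{sid}>"] + words[s:s + SPAN_LEN]
        pos, sid = s + SPAN_LEN, sid + 1
    inp += words[pos:]
    return " ".join(inp), " ".join(tgt)


class BookDataset(Dataset):
    def __init__(self, path):
        words = open(path, encoding="utf-8").read().split()
        # fixed-size word chunks; the short tail chunk is dropped for simplicity
        self.chunks = [words[i:i + CHUNK_WORDS]
                       for i in range(0, len(words) - CHUNK_WORDS + 1, CHUNK_WORDS)]

    def __len__(self):
        return len(self.chunks)

    def __getitem__(self, idx):
        src, tgt = corrupt(self.chunks[idx])
        x = tokenizer(src, truncation=True, max_length=512,
                      padding="max_length", return_tensors="pt")
        y = tokenizer(tgt, truncation=True, max_length=128,
                      padding="max_length", return_tensors="pt")
        labels = y.input_ids.squeeze(0)
        labels[labels == tokenizer.pad_token_id] = -100  # ignore padding in loss
        return {"input_ids": x.input_ids.squeeze(0),
                "attention_mask": x.attention_mask.squeeze(0),
                "labels": labels}


loader = DataLoader(BookDataset(BOOK_PATH), batch_size=8, shuffle=True)
optimizer = torch.optim.AdamW(model.parameters(), lr=1e-4)
model.train()
for epoch in range(3):
    for batch in loader:
        loss = model(**batch).loss
        loss.backward()
        optimizer.step()
        optimizer.zero_grad()
```

The span-corruption objective above is just my stand-in for MLM, since that is how T5 itself was pretrained; I don’t know whether it’s actually the best way to transfer a book’s knowledge into the model.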

Thanks for any advice!
