Dear authors of the RAG model,
I know I can fine-tune RAG on a single example, as in the following code:
import torch
from transformers import RagRetriever, RagSequenceForGeneration, RagTokenizer

device = "cuda" if torch.cuda.is_available() else "cpu"

# rag_example_args, passages_path, index_path, and cache_dir are set up as in the custom-knowledge-dataset example
retriever = RagRetriever.from_pretrained(rag_example_args.rag_model_name, index_name="custom", passages_path=passages_path, index_path=index_path)
model = RagSequenceForGeneration.from_pretrained(rag_example_args.rag_model_name, retriever=retriever, cache_dir=cache_dir).to(device)
tokenizer = RagTokenizer.from_pretrained(rag_example_args.rag_model_name, cache_dir=cache_dir)

inputs = tokenizer("How many people live in Paris?", return_tensors="pt")
with tokenizer.as_target_tokenizer():
    targets = tokenizer("In Paris, there are 10 million people.", return_tensors="pt")

input_ids = inputs["input_ids"].to(device)
labels = targets["input_ids"].to(device)
outputs = model(input_ids=input_ids, labels=labels)
However, this is for a single sentence.
How can I fine-tune with a mini-batch of QA samples?
Could you give an example?
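For reference, here is a rough, untested sketch of what I imagine batched fine-tuning might look like. I am assuming the tokenizer accepts a list of strings with padding=True and that the padded labels can be passed to the model as-is; please correct me if that is wrong:

questions = ["How many people live in Paris?", "Where is the Eiffel Tower located?"]
answers = ["In Paris, there are 10 million people.", "The Eiffel Tower is located in Paris."]

# tokenize the whole mini-batch at once, padding to the longest sequence
inputs = tokenizer(questions, return_tensors="pt", padding=True, truncation=True)
with tokenizer.as_target_tokenizer():
    targets = tokenizer(answers, return_tensors="pt", padding=True, truncation=True)

input_ids = inputs["input_ids"].to(device)
attention_mask = inputs["attention_mask"].to(device)
labels = targets["input_ids"].to(device)

# forward pass over the batch; not sure whether padding tokens in the labels need special handling
outputs = model(input_ids=input_ids, attention_mask=attention_mask, labels=labels)
loss = outputs.loss
loss.backward()

Is something like this the intended approach, or should I be using a DataLoader with a collate function instead?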
Thank you very much!
@patrickvonplaten @lhoestq