How to pass multiple datasets into Trainer for Knowledge distillation in NMT

I am trying to apply knowledge distillation to a domain adaptation problem in NMT. I understand that to create my custom loss function I need to subclass the Trainer class and override the compute_loss function. Since I'm following sequence-level distillation, I need to pass data from 6 different domains, calculate the individual loss per domain, and then calculate the global loss. It would be wonderful if someone could point me to any resource that shows how to pass multiple datasets into the Trainer class.
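One common way to approach this (a sketch, not the official Trainer API for multiple datasets) is to tag each example with a domain id, concatenate the 6 datasets into one, and split the batch by domain inside the loss computation. The helper below shows only that splitting logic with toy tensors; `NUM_DOMAINS`, `per_domain_loss`, and the random data are all illustrative stand-ins, not part of any library.

```python
import torch
import torch.nn.functional as F

NUM_DOMAINS = 6  # illustrative; matches the 6 domains described above

def per_domain_loss(logits, labels, domain_ids, num_domains=NUM_DOMAINS):
    """Cross-entropy split by domain id, then averaged into a global loss.

    Inside a subclassed Trainer, the same logic could live in an
    overridden compute_loss, reading domain_ids from the batch.
    """
    losses = []
    for d in range(num_domains):
        mask = domain_ids == d
        if mask.any():  # skip domains absent from this batch
            losses.append(F.cross_entropy(logits[mask], labels[mask]))
    return torch.stack(losses).mean(), losses

# Toy usage with random tensors standing in for translation batches.
torch.manual_seed(0)
vocab = 10
logits = torch.randn(24, vocab)
labels = torch.randint(0, vocab, (24,))
domain_ids = torch.randint(0, NUM_DOMAINS, (24,))

global_loss, domain_losses = per_domain_loss(logits, labels, domain_ids)
print(global_loss.item(), len(domain_losses))
```

The same idea carries over to sequence-level distillation by replacing the gold labels with the teacher's decoded outputs per domain.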

Did you get any idea for this? @velmen

Actually, I have stopped using the Trainer class as it is very hard to customize for the task I described. I am currently working on a solution by writing a custom PyTorch loop to train the models.
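A custom loop for this setup could draw one batch per domain each step, compute the per-domain losses, and backpropagate their mean as the global loss. This is only a minimal sketch of that pattern: the tiny linear "model", the 3 toy dataloaders (instead of 6), and the MSE loss are placeholders for the real NMT student model and distillation loss.

```python
import torch
from torch import nn
from torch.utils.data import DataLoader, TensorDataset

torch.manual_seed(0)
model = nn.Linear(8, 1)                       # placeholder student model
optimizer = torch.optim.Adam(model.parameters(), lr=1e-2)
loss_fn = nn.MSELoss()                        # placeholder for the KD loss

# One DataLoader per domain (3 toy domains here instead of 6).
loaders = [
    DataLoader(TensorDataset(torch.randn(16, 8), torch.randn(16, 1)),
               batch_size=4)
    for _ in range(3)
]

# zip(*loaders) yields one batch from every domain per step and stops
# when the shortest loader is exhausted.
for step, batches in enumerate(zip(*loaders)):
    domain_losses = [loss_fn(model(x), y) for x, y in batches]
    global_loss = torch.stack(domain_losses).mean()
    optimizer.zero_grad()
    global_loss.backward()
    optimizer.step()

print(round(global_loss.item(), 4))
```

If the domain datasets have very different sizes, `itertools.cycle` on the shorter loaders (or weighted sampling) is a common alternative to plain `zip`.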

I have done some research on this but I'm stuck here. If you don't mind, could we work on it together?