Resources for using custom models with trainer

gabeorlanski · March 29, 2021, 1:00pm

@lewtun to add on through more testing, I now get the warning:

Some weights of TagPredictionModel were not initialized from the model checkpoint at ./experiments/checkpoint-40 and are newly initialized: [‘.encoder.shared.weight’, ‘.encoder.encoder.embed_tokens.weight’, ‘.encoder.encoder.embed_positions.weight’, ‘.encoder.encoder.layers.0.self_attn.k_proj.weight’…(Cut for length)
You should probably TRAIN this model on a down-stream task to be able to use it for predictions and inference.

EDIT: Also, from rerunning evaluation on the validation set after training ends I am almost certain that it is not saving because the eval loss is different than during the training loop

Topic		Replies	Views
Subclassing a pretrained model for a new objective 🤗Transformers	8	3558	August 10, 2022
Using Huggingface Trainer for custom models Beginners	5	4425	May 29, 2023
Custom PretrainedModel does not loaded correctly Beginners	0	279	November 17, 2022
Registering custom model and config to AutoModel and AutoConfig Models	1	917	November 7, 2023
Save custom transformer as PreTrainedModel Intermediate	1	936	September 7, 2021

Resources for using custom models with trainer

Related topics