Adapting the Replit model for training with transformers

I am trying to fine-tune the Replit code model (replit/replit-code-v1-3b · Hugging Face) using the same procedure as alpaca-lora, aiming to make the model instruction-following.
However, when I try to use the model with transformers.Trainer, it turns out the Replit model's forward function does not support the `labels` and `inputs_embeds` arguments, which I believe Trainer relies on to compute the loss and update the weights: ReplitLM/ at 1a421995c10c880379f52e26b5ba495c87bac3e2 · replit/ReplitLM · GitHub
I'm wondering if there is a guide, or specific steps I need to follow, to modify the ReplitLM code base so that it supports training with the Trainer class.
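In case it helps frame the question: my understanding is that Trainer only needs the model's forward to accept a `labels` argument and return a loss. Below is a minimal sketch of the kind of wrapper I have in mind, assuming the base model's forward returns raw logits; the `CausalLMWithLoss` class name and the exact forward signature are my own hypothetical additions, not anything from the ReplitLM repo.

```python
import torch
import torch.nn.functional as F


class CausalLMWithLoss(torch.nn.Module):
    """Hypothetical wrapper that adds `labels` support to a causal LM
    whose forward only returns logits, so transformers.Trainer can use it."""

    def __init__(self, base_model):
        super().__init__()
        self.base_model = base_model

    def forward(self, input_ids, attention_mask=None, labels=None, **kwargs):
        # Assumption: the wrapped model returns logits of shape
        # (batch, seq_len, vocab_size).
        logits = self.base_model(input_ids, attention_mask=attention_mask, **kwargs)
        loss = None
        if labels is not None:
            # Standard next-token shift: position t predicts token t+1.
            shift_logits = logits[:, :-1, :].contiguous()
            shift_labels = labels[:, 1:].contiguous()
            loss = F.cross_entropy(
                shift_logits.view(-1, shift_logits.size(-1)),
                shift_labels.view(-1),
                ignore_index=-100,  # Trainer's data collators pad labels with -100
            )
        # Trainer accepts a dict output and reads the "loss" key.
        return {"loss": loss, "logits": logits}
```

If that is roughly the right shape of the fix, is the recommended path to subclass the model like this, or to patch the forward method in the ReplitLM source directly?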