Didn’t see this issue anywhere else while googling, so posting here; tried loading an 8-bit SFTT model as part of a TRL-based reinforcement learning pipeline. Returned the error as stated in the title:
RuntimeError: "normal_kernel_cpu" not implemented for 'Char'
Model config is as follows, with the loading line throwing the error:
config = PPOConfig(
model_name="drive/MyDrive/fg_proj/lr_pipeline/sftt_models/feb7_2024_8bit",
learning_rate=1.41e-5,
log_with="tensorboard",#"wandb",
project_kwargs={"logging_dir": "run_logs_107_3"}
)
model = AutoModelForCausalLMWithValueHead.from_pretrained(config.model_name)
My hunch is this related to loading in 8-bit, since the full-bit and 4-bit models I tried did not throw this error. Any ideas as to what I’d be missing for this to occur would be helpful.