LLama2 Finetuning giving Error mat1 and mat2 shapes cannot be multiplied (4096x5120 and 1x2560)

Also I am able to replicate this in Jupyter notebook . But using SFT Trainer I am able to train .

able to train with 13B model?

1 Like