Conversion from base to Instruct

I am trying to instruct fine tune Llama 3.1 8B base. Tried ms-swift and unsloth. Spent hours in training and now sometimes it can stop with the correct eos token (<|eot_id|>). But sometimes it continues generation. Do I have to do more training or am I doing something wrong?

1 Like