I should also add that I’m relatively new to this, so there may be another parameter stopping the run early.
Just testing the limits of what I can train locally using PEFT and AutoTrain on Llama-2-7b-hf (https://www.youtube.com/watch?v=3fsn19OI_C8). I’m aware my machine can only really handle a batch size of 2, so I want to do a complete run (1 epoch) with a training set of 300 instruction/input/response examples and check the change in performance, if any. Then I’ll play around with the other parameters, knowing batch size is my hardware limit, while tracking any performance gain or loss.
Any ideas why my run ‘Finished’ at epoch=0.09 using the following params?
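One thing worth checking: in Hugging Face Trainer-based stacks (which AutoTrain builds on), a non-default `max_steps` setting overrides `num_train_epochs` and ends the run early. A quick back-of-envelope sketch, assuming the 300-example / batch-size-2 setup above with no gradient accumulation (the step cap of 13 is hypothetical, just chosen to reproduce the observed epoch fraction):

```python
# Sanity check: how far into an epoch a run gets if a step limit cuts it short.
# Assumes 300 training examples and per-device batch size 2 (no gradient
# accumulation); max_steps=13 is a hypothetical leftover step cap.
dataset_size = 300
batch_size = 2
steps_per_epoch = dataset_size // batch_size  # 150 optimizer steps per epoch
max_steps = 13                                # hypothetical step cap
epoch_reached = max_steps / steps_per_epoch
print(f"run would stop at epoch={epoch_reached:.2f}")  # prints epoch=0.09
```

So a stale step limit (or a very small `max_steps` default in a config template) would make a run report “Finished” at a fraction like 0.09 even though `num_train_epochs` was set to 1.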