Can I pretrain LLaMA from scratch?

I have built a small ‘Llama’ model (~1M parameters) by shrinking the config, and I am trying to train it on my own data. However, the model performs terribly even on a small dataset (it cannot overfit), and I don’t know why. Here is the link to my problem: Failed to train Llama model
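For context, this is roughly what my setup looks like (a minimal sketch, not my exact script — the config values and learning rate here are placeholders, and I’m assuming the standard `transformers` `LlamaConfig`/`LlamaForCausalLM` API):

```python
# Sketch: build a tiny Llama from a shrunken config and try to
# overfit one small random batch as a sanity check.
# All hyperparameter values below are illustrative placeholders.
import torch
from transformers import LlamaConfig, LlamaForCausalLM

torch.manual_seed(0)

config = LlamaConfig(
    vocab_size=1000,
    hidden_size=64,
    intermediate_size=256,
    num_hidden_layers=2,
    num_attention_heads=4,
    max_position_embeddings=128,
)
model = LlamaForCausalLM(config)

# One fixed random batch; for causal LM training the labels are the
# input_ids themselves (the model shifts them internally).
input_ids = torch.randint(0, config.vocab_size, (2, 16))
optimizer = torch.optim.AdamW(model.parameters(), lr=1e-3)

losses = []
for step in range(200):
    out = model(input_ids=input_ids, labels=input_ids)
    out.loss.backward()
    optimizer.step()
    optimizer.zero_grad()
    losses.append(float(out.loss))
```

My expectation is that a model this small should be able to drive the loss down on a single fixed batch; in my actual run the loss barely moves.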
If possible, could you share a small demo of how to pretrain it correctly?