Train huggingface

Nghengu123 · November 8, 2023, 8:48am

I’m a beginner at fine-tuning models. I followed some YouTube videos step by step, but when I try to run AutoTrain on the Llama 2 model I get this error.

AutoTrain command: !autotrain llm --train --project_name [Name] --model TinyPixel/Llama-2-7B-bf16-sharded --data_path [MyPath] --use_peft --use_int4 --learning_rate 2e-4 --train_batch_size 2 --num_train_epochs 3 --trainer sft --model_max_length 2048 --push_to_hub --repo_id [MyPath] --token [mytoken] --block_size 2048
Please help me!

realshyfox · November 10, 2023, 9:11am

Hello Nghengu123
I’m not a pro myself, and I’m struggling with some issues too.
From what I can see, this looks like a dataset formatting issue.
Try a stripped-down version of the original dataset to see whether it works with your model.
As far as I remember, you need to create a dataset in the Guanaco format.
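
For illustration, here is a minimal sketch of what a Guanaco-style dataset could look like and how to cut it down to a few rows for a quick test. It assumes the layout used by the public openassistant-guanaco dataset, where each row is one string in a single `text` column with `### Human:` and `### Assistant:` markers. The column names `question` and `answer`, the sample rows, and the helper function names are hypothetical, so replace them with whatever your own dataset uses.

```python
import csv
import io


def to_guanaco_text(question: str, answer: str) -> str:
    """Join one question/answer pair into a single Guanaco-style string."""
    return f"### Human: {question.strip()}### Assistant: {answer.strip()}"


def convert_rows(rows, question_col, answer_col, limit=None):
    """Build a list of {"text": ...} records.

    `limit` keeps only the first N rows, which gives a stripped-down
    dataset for a quick test run.
    """
    records = []
    for index, row in enumerate(rows):
        if limit is not None and index >= limit:
            break
        records.append(
            {"text": to_guanaco_text(row[question_col], row[answer_col])}
        )
    return records


def write_train_csv(records, fileobj):
    """Write the records as a CSV with a single 'text' column."""
    writer = csv.DictWriter(fileobj, fieldnames=["text"])
    writer.writeheader()
    writer.writerows(records)


# Hypothetical sample rows; in practice these would come from your own file,
# for example via csv.DictReader.
sample_rows = [
    {"question": "What is PEFT?", "answer": "Parameter-efficient fine-tuning."},
    {"question": "What is LoRA?", "answer": "A low-rank adapter method."},
    {"question": "What is SFT?", "answer": "Supervised fine-tuning."},
]

records = convert_rows(sample_rows, "question", "answer", limit=2)

# An in-memory buffer is used here; swap it for
# open("train.csv", "w", newline="", encoding="utf-8") to write a real file.
buffer = io.StringIO()
write_train_csv(records, buffer)
csv_text = buffer.getvalue()
```

If I understand AutoTrain correctly, the resulting file would be saved as train.csv inside the folder passed to --data_path, and the trainer would read the `text` column by default, but please check this against the AutoTrain documentation for the version you have installed.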

Nghengu123 · November 10, 2023, 12:17pm

Can you help me? What is the Guanaco format? It’s new to me and I don’t know anything about it. How can I strip down the dataset? Also, which column do you think is causing the issue?
Here are the columns in my dataset:
[Image: screenshot of the dataset columns]
