Iâm trying to fine tune a model, but no matter what data I use, it always result in timeout while the model is loading, so I turned to using the data format on LLM Finetuning.
Base Model: meta-llama/Meta-Llama-3-8B-Instruct
Parameter:
{
âauto_find_batch_sizeâ: âfalseâ,
âchat_templateâ: âzephyrâ,
âdisable_gradient_checkpointingâ: âfalseâ,
âevaluation_strategyâ: âepochâ,
âmerge_adapterâ: âfalseâ,
âmixed_precisionâ: âfp16â,
âoptimizerâ: âadamw_torchâ,
âpeftâ: âtrueâ,
âpaddingâ: ârightâ,
âquantizationâ: âint4â,
âschedulerâ: âlinearâ,
âuse_flash_attention_2â: âfalseâ,
âbatch_sizeâ: â2â,
âblock_sizeâ: â1024â,
âmax_completion_lengthâ: â128â,
âdpo_betaâ: â0.1â,
âepochsâ: â3â,
âgradient_accumulationâ: â4â,
âlrâ: â0.00003â,
âlogging_stepsâ: â-1â,
âlora_alphaâ: â32â,
âlora_dropoutâ: â0.05â,
âlora_râ: â16â,
âmax_grad_normâ: â1â,
âmodel_max_lengthâ: â2048â,
âmax_prompt_lengthâ: â128â,
âsave_total_limitâ: â1â,
âseedâ: â42â,
âwarmup_ratioâ: â0.1â,
âweight_decayâ: â0â,
âmodel_refâ: âmeta-llama/Meta-Llama-3-8B-Instructâ,
âtarget_modulesâ: âall-linearâ
}
Error Log:
ERROR | 2024-06-11 21:22:27 | autotrain.trainers.common:wrapper:120 - train has failed due to an exception: Traceback (most recent call last):
File âC:\Users\James\anaconda3\envs\autotrain\lib\site-packages\autotrain\trainers\common.pyâ, line 117, in wrapper
return func(*args, **kwargs)
File âC:\Users\James\anaconda3\envs\autotrain\lib\site-packages\autotrain\trainers\clm_main_.pyâ, line 38, in train
train_dpo(config)
File âC:\Users\James\anaconda3\envs\autotrain\lib\site-packages\autotrain\trainers\clm\train_clm_dpo.pyâ, line 19, in train
train_data, valid_data = utils.process_data_with_chat_template(config, tokenizer, train_data, valid_data)
File âC:\Users\James\anaconda3\envs\autotrain\lib\site-packages\autotrain\trainers\clm\utils.pyâ, line 448, in process_data_with_chat_template
train_data = train_data.map(
File âC:\Users\James\anaconda3\envs\autotrain\lib\site-packages\datasets\arrow_dataset.pyâ, line 602, in wrapper
out: Union[âDatasetâ, âDatasetDictâ] = func(self, *args, **kwargs)
File âC:\Users\James\anaconda3\envs\autotrain\lib\site-packages\datasets\arrow_dataset.pyâ, line 567, in wrapper
out: Union[âDatasetâ, âDatasetDictâ] = func(self, *args, **kwargs)
File âC:\Users\James\anaconda3\envs\autotrain\lib\site-packages\datasets\arrow_dataset.pyâ, line 3156, in map
for rank, done, content in Dataset._map_single(**dataset_kwargs):
File âC:\Users\James\anaconda3\envs\autotrain\lib\site-packages\datasets\arrow_dataset.pyâ, line 3517, in _map_single
example = apply_function_on_filtered_inputs(example, i, offset=offset)
File âC:\Users\James\anaconda3\envs\autotrain\lib\site-packages\datasets\arrow_dataset.pyâ, line 3416, in apply_function_on_filtered_inputs
processed_inputs = function(*fn_args, *additional_args, **fn_kwargs)
File âC:\Users\James\anaconda3\envs\autotrain\lib\site-packages\autotrain\trainers\clm\utils.pyâ, line 268, in apply_chat_template
example[âchosenâ] = ast.literal_eval(example[âchosenâ])
File âC:\Users\James\anaconda3\envs\autotrain\lib\ast.pyâ, line 64, in literal_eval
node_or_string = parse(node_or_string.lstrip(" \t"), mode=âevalâ)
File âC:\Users\James\anaconda3\envs\autotrain\lib\ast.pyâ, line 50, in parse
return compile(source, filename, mode, flags,
File ââ, line 1
hi nice to meet you
^^^^
SyntaxError: invalid syntax
ERROR | 2024-06-11 21:22:27 | autotrain.trainers.common:wrapper:121 - invalid syntax (, line 1)