Hi everyone!
After a few tests, I keep getting an error when merging the adapter into the model.
I successfully trained on a simple dataset (CSV) based on a Mistral model. When I then tried to merge the adapter with the model, it failed in the GUI with the message: Connection errored out.
I then tried adjusting the parameters directly and setting merge_adapter to true.
The training is OK, but I get an error that seems to come from the merge step.
Below are the parameters and the output log with the error.
Did I miss something? Any ideas would be welcome.
Thanks a lot
----Parameters----
{
"block_size": 1024,
"model_max_length": 2048,
"padding": "right",
"use_flash_attention_2": false,
"disable_gradient_checkpointing": false,
"logging_steps": -1,
"evaluation_strategy": "epoch",
"save_total_limit": 1,
"save_strategy": "epoch",
"auto_find_batch_size": false,
"mixed_precision": "fp16",
"lr": 0.00003,
"epochs": 3,
"batch_size": 2,
"warmup_ratio": 0.1,
"gradient_accumulation": 1,
"optimizer": "adamw_torch",
"scheduler": "linear",
"weight_decay": 0,
"max_grad_norm": 1,
"seed": 42,
"apply_chat_template": false,
"quantization": "int4",
"target_modules": "",
"merge_adapter": true,
"peft": true,
"lora_r": 16,
"lora_alpha": 32,
"lora_dropout": 0.05
}
----Output logs with error-----
File "/app/env/lib/python3.10/site-packages/accelerate/commands/launch.py", line 643, in simple_launcher
raise subprocess.CalledProcessError(returncode=process.returncode, cmd=cmd)
subprocess.CalledProcessError: Command '['/app/env/bin/python', '-m', 'autotrain.trainers.clm', '--training_config', 'autotrain-test/training_params.json']' died with <Signals.SIGKILL: 9>.
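
For anyone reading the log: the last line is the wording Python's subprocess module uses when a child process is terminated by a signal rather than exiting with an error code. A minimal sketch, using only the standard library and a placeholder command (an assumption for illustration, not the real AutoTrain invocation), reproduces the same kind of message on a POSIX system:

```python
import signal
import subprocess

# Placeholder command for illustration only; not the real AutoTrain call.
cmd = ["python", "-m", "some.module"]

# On POSIX, subprocess reports a child killed by signal N as returncode -N.
err = subprocess.CalledProcessError(returncode=-signal.SIGKILL, cmd=cmd)

# Recover the signal from the negative return code.
killed_by = signal.Signals(-err.returncode)
message = str(err)

print(killed_by.name)
print(message)
```

SIGKILL is sent from outside the Python process (for example by the kernel or the container runtime), which would explain why there is no Python traceback from the trainer itself. One common cause is running out of memory, though the log alone does not confirm that this is what happened here.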