Unrecognized arguments are being passed to autotrain

I get the error “unrecognized arguments: -m autotrain.trainers.clm -m autotrain.trainers.clm -m autotrain.trainers.clm -m autotrain.trainers.clm -m autotrain.trainers.clm”, but I’m not passing these arguments myself, so I’m not sure why they’re being added.


Below are the logs

Saving the dataset (0/1 shards):   0%|          | 0/5 [00:00<?, ? examples/s]
Saving the dataset (1/1 shards): 100%|██████████| 5/5 [00:00<00:00, 2246.79 examples/s]
Saving the dataset (1/1 shards): 100%|██████████| 5/5 [00:00<00:00, 2090.05 examples/s]

Saving the dataset (0/1 shards):   0%|          | 0/5 [00:00<?, ? examples/s]
Saving the dataset (1/1 shards): 100%|██████████| 5/5 [00:00<00:00, 3331.46 examples/s]
Saving the dataset (1/1 shards): 100%|██████████| 5/5 [00:00<00:00, 3118.44 examples/s]
INFO     | 2024-10-05 09:16:04 | autotrain.backends.local:create:8 - Starting local training...
WARNING  | 2024-10-05 09:16:04 | autotrain.commands:get_accelerate_command:52 - No GPU found. Forcing training on CPU. This will be super slow!
INFO     | 2024-10-05 09:16:04 | autotrain.commands:launch_command:501 - ['accelerate', 'launch', '--cpu', '-m', 'autotrain.trainers.clm', '--training_config', 'autotrain-n1u1h-xdrby/training_params.json', '-m', 'autotrain.trainers.clm', '--training_config', 'autotrain-eidqd-jofqp/training_params.json', '-m', 'autotrain.trainers.clm', '--training_config', 'autotrain-eidqd-jofqp/training_params.json', '-m', 'autotrain.trainers.clm', '--training_config', 'autotrain-eidqd-jofqp/training_params.json', '-m', 'autotrain.trainers.clm', '--training_config', 'autotrain-eidqd-jofqp/training_params.json', '-m', 'autotrain.trainers.clm', '--training_config', 'autotrain-q94v3-zunen/training_params.json']
INFO     | 2024-10-05 09:16:04 | autotrain.commands:launch_command:502 - {'model': 'meta-llama/Llama-3.2-1B-Instruct', 'project_name': 'autotrain-q94v3-zunen', 'data_path': 'autotrain-q94v3-zunen/autotrain-data', 'train_split': 'train', 'valid_split': None, 'add_eos_token': True, 'block_size': 1024, 'model_max_length': 2048, 'padding': 'right', 'trainer': 'sft', 'use_flash_attention_2': False, 'log': 'tensorboard', 'disable_gradient_checkpointing': False, 'logging_steps': -1, 'eval_strategy': 'epoch', 'save_total_limit': 1, 'auto_find_batch_size': False, 'mixed_precision': 'fp16', 'lr': 3e-05, 'epochs': 3, 'batch_size': 2, 'warmup_ratio': 0.1, 'gradient_accumulation': 4, 'optimizer': 'adamw_torch', 'scheduler': 'linear', 'weight_decay': 0.0, 'max_grad_norm': 1.0, 'seed': 42, 'chat_template': 'none', 'quantization': 'int4', 'target_modules': 'all-linear', 'merge_adapter': False, 'peft': True, 'lora_r': 16, 'lora_alpha': 32, 'lora_dropout': 0.05, 'model_ref': None, 'dpo_beta': 0.1, 'max_prompt_length': 128, 'max_completion_length': None, 'prompt_text_column': 'autotrain_prompt', 'text_column': 'autotrain_text', 'rejected_text_column': 'autotrain_rejected_text', 'push_to_hub': True, 'username': 'MyKPopWishList', 'token': '*****', 'unsloth': False, 'distributed_backend': None}
INFO     | 2024-10-05 09:16:04 | autotrain.backends.local:create:13 - Training PID: 163
INFO:     10.16.26.177:30252 - "POST /ui/create_project HTTP/1.1" 200 OK
INFO:     10.16.14.103:24187 - "GET /ui/accelerators HTTP/1.1" 200 OK
INFO:     10.16.14.103:24187 - "GET /ui/is_model_training HTTP/1.1" 200 OK
The following values were not passed to `accelerate launch` and had defaults used instead:
	`--num_processes` was set to a value of `0`
	`--num_machines` was set to a value of `1`
	`--mixed_precision` was set to a value of `'no'`
	`--dynamo_backend` was set to a value of `'no'`
To avoid this warning pass in values for each of the problematic parameters or run `accelerate config`.
usage: __main__.py [-h] --training_config TRAINING_CONFIG
__main__.py: error: unrecognized arguments: -m autotrain.trainers.clm -m autotrain.trainers.clm -m autotrain.trainers.clm -m autotrain.trainers.clm -m autotrain.trainers.clm
Traceback (most recent call last):
  File "/app/env/bin/accelerate", line 8, in <module>
    sys.exit(main())
  File "/app/env/lib/python3.10/site-packages/accelerate/commands/accelerate_cli.py", line 48, in main
    args.func(args)
  File "/app/env/lib/python3.10/site-packages/accelerate/commands/launch.py", line 1174, in launch_command
    simple_launcher(args)
  File "/app/env/lib/python3.10/site-packages/accelerate/commands/launch.py", line 769, in simple_launcher
    raise subprocess.CalledProcessError(returncode=process.returncode, cmd=cmd)
subprocess.CalledProcessError: Command '['/app/env/bin/python', '-m', 'autotrain.trainers.clm', '--training_config', 'autotrain-n1u1h-xdrby/training_params.json', '-m', 'autotrain.trainers.clm', '--training_config', 'autotrain-eidqd-jofqp/training_params.json', '-m', 'autotrain.trainers.clm', '--training_config', 'autotrain-eidqd-jofqp/training_params.json', '-m', 'autotrain.trainers.clm', '--training_config', 'autotrain-eidqd-jofqp/training_params.json', '-m', 'autotrain.trainers.clm', '--training_config', 'autotrain-eidqd-jofqp/training_params.json', '-m', 'autotrain.trainers.clm', '--training_config', 'autotrain-q94v3-zunen/training_params.json']' returned non-zero exit status 2.
INFO     | 2024-10-05 09:16:10 | autotrain.app.utils:get_running_jobs:26 - Killing PID: 163
INFO     | 2024-10-05 09:16:10 | autotrain.app.utils:kill_process_by_pid:52 - Sent SIGTERM to process with PID 163
INFO:     10.16.26.177:23917 - "GET /ui/is_model_training HTTP/1.1" 200 OK
INFO:     10.16.14.103:13482 - "GET /ui/accelerators HTTP/1.1" 200 OK

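@abhishek I searched and could not find another report of this error. It looks like the official GUI is building the launch command incorrectly: the logged command repeats the -m autotrain.trainers.clm --training_config pair several times, with config paths from other projects (autotrain-n1u1h-xdrby, autotrain-eidqd-jofqp) ahead of the current one (autotrain-q94v3-zunen).
Maybe a version mismatch or a spec change?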
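
For what it's worth, here is a guess at the mechanism. It is not checked against the autotrain source. A command list that is shared between calls and extended in place would grow with every project created in the same session, which matches the repeated pairs in the log. The sketch below is a hypothetical stand-in: `_BASE_CMD`, `build_command_buggy`, `build_command_fixed` and `leftover_args` are names made up for illustration and are not autotrain functions. Only `argparse` and the argument strings from the log are real.

```python
import argparse

# Hypothetical stand-in for a launcher; NOT autotrain's actual code.
# A module-level list that every call shares.
_BASE_CMD = ["accelerate", "launch", "--cpu"]


def build_command_buggy(config_path):
    # Bug: extend() mutates the shared list, so arguments from earlier
    # calls are still present on the next call.
    _BASE_CMD.extend(
        ["-m", "autotrain.trainers.clm", "--training_config", config_path]
    )
    return _BASE_CMD


def build_command_fixed(config_path):
    # Build a fresh list on every call so nothing leaks between runs.
    return [
        "accelerate", "launch", "--cpu",
        "-m", "autotrain.trainers.clm",
        "--training_config", config_path,
    ]


def leftover_args(cmd):
    # Mimic what the trainer module sees: the interpreter consumes the
    # first "-m <module>", and the trainer's parser only accepts
    # --training_config (as in the usage line from the log).
    parser = argparse.ArgumentParser()
    parser.add_argument("--training_config", required=True)
    trainer_argv = cmd[cmd.index("-m") + 2:]
    _, unknown = parser.parse_known_args(trainer_argv)
    return unknown
```

Calling `build_command_buggy` twice and passing the result to `leftover_args` leaves one extra `-m autotrain.trainers.clm` pair unparsed, while a command from `build_command_fixed` leaves nothing. If something like this is happening, restarting the Space (so the process starts with a clean state) may work around it until it is fixed, but that is also an assumption.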