Hi again,
I tried that: I updated the Dockerfile and then reset the Space. It did not ask me to log in, but when I ran the Extractive Question Answering example again I got the same errors. They are shown below, and I will also attach a screenshot of the AutoTrain run page. I tried the process several times with the Facebook RoBERTa base model and the Google BERT uncased model, with the same result each time. I am running in the free CPU mode, so I tried it with and without mixed precision, but that did not make any difference. It seems it is still having load_metric issues. Here is the error:
INFO | 2025-06-01 12:23:09 | autotrain.app.utils:kill_process_by_pid:90 - Sent SIGTERM to process with PID 85
INFO | 2025-06-01 12:23:09 | autotrain.app.utils:get_running_jobs:40 - Killing PID: 85
subprocess.CalledProcessError: Command '['/app/env/bin/python', '-m', 'autotrain.trainers.extractive_question_answering', '--training_config', 'autotrain-xyh3v-g4vqg/training_params.json']' returned non-zero exit status 1.
raise subprocess.CalledProcessError(returncode=process.returncode, cmd=cmd)
File "/app/env/lib/python3.10/site-packages/accelerate/commands/launch.py", line 763, in simple_launcher
simple_launcher(args)
File "/app/env/lib/python3.10/site-packages/accelerate/commands/launch.py", line 1168, in launch_command
args.func(args)
File "/app/env/lib/python3.10/site-packages/accelerate/commands/accelerate_cli.py", line 48, in main
sys.exit(main())
File "/app/env/bin/accelerate", line 8, in <module>
Traceback (most recent call last):
ImportError: cannot import name 'load_metric' from 'datasets' (/app/env/lib/python3.10/site-packages/datasets/__init__.py)
from datasets import load_metric
File "/app/env/lib/python3.10/site-packages/autotrain/trainers/extractive_question_answering/utils.py", line 6, in <module>
from autotrain.trainers.extractive_question_answering import utils
File "/app/env/lib/python3.10/site-packages/autotrain/trainers/extractive_question_answering/__main__.py", line 30, in <module>
exec(code, run_globals)
File "/app/env/lib/python3.10/runpy.py", line 86, in _run_code
return _run_code(code, main_globals, None,
File "/app/env/lib/python3.10/runpy.py", line 196, in _run_module_as_main
Traceback (most recent call last):
To avoid this warning pass in values for each of the problematic parameters or run accelerate config
.
--dynamo_backend
was set to a value of 'no'
--mixed_precision
was set to a value of 'no'
--num_machines
was set to a value of 1
--num_processes
was set to a value of 0
The following values were not passed to accelerate launch
and had defaults used instead:
INFO | 2025-06-01 12:22:50 | autotrain.backends.local:create:25 - Training PID: 85
INFO | 2025-06-01 12:22:50 | autotrain.commands:launch_command:515 - {'data_path': 'lhoestq/squad', 'model': 'FacebookAI/roberta-base', 'lr': 5e-05, 'epochs': 3, 'max_seq_length': 512, 'max_doc_stride': 128, 'batch_size': 8, 'warmup_ratio': 0.1, 'gradient_accumulation': 1, 'optimizer': 'adamw_torch', 'scheduler': 'linear', 'weight_decay': 0.0, 'max_grad_norm': 1.0, 'seed': 42, 'train_split': 'train', 'valid_split': 'validation', 'text_column': 'context', 'question_column': 'question', 'answer_column': 'answers', 'logging_steps': -1, 'project_name': 'autotrain-xyh3v-g4vqg', 'auto_find_batch_size': False, 'mixed_precision': 'none', 'save_total_limit': 1, 'token': '*****', 'push_to_hub': True, 'eval_strategy': 'epoch', 'username': 'ianmd', 'log': 'tensorboard', 'early_stopping_patience': 5, 'early_stopping_threshold': 0.01}
INFO | 2025-06-01 12:22:50 | autotrain.commands:launch_command:514 - ['accelerate', 'launch', '--cpu', '-m', 'autotrain.trainers.extractive_question_answering', '--training_config', 'autotrain-xyh3v-g4vqg/training_params.json']
INFO | 2025-06-01 12:22:50 | autotrain.backends.local:create:20 - Starting local training…
INFO | 2025-06-01 12:22:50 | autotrain.app.ui_routes:handle_form:540 - hardware: local-ui
INFO | 2025-06-01 12:21:07 | autotrain.app.ui_routes:fetch_params:415 - Task: extractive-qa
INFO | 2025-06-01 12:20:34 | autotrain.app.ui_routes:fetch_params:415 - Task: llm:sft
INFO: 10.16.27.22:33965 - “GET /?logs=container&__sign=eyJhbGciOiJFZERTQSJ9.eyJyZWFkIjp0cnVlLCJwZXJtaXNzaW9ucyI6eyJyZXBvLmNvbnRlbnQucmVhZCI6dHJ1ZX0sIm9uQmVoYWxmT2YiOnsia2luZCI6InVzZXIiLCJfaWQiOiI2N2VlNTdmZDM1NDdmODIzMTAyNTI5M2MiLCJ1c2VyIjoiaWFubWQiLCJzZXNzaW9uSWQiOiI2ODNjNDU0MzUwYWM5ODI5Y2Y4NzE4ZWMifSwiaWF0IjoxNzQ4NzgwNDMzLCJzdWIiOiIvc3BhY2VzL2lhbm1kL2F1dG90cmFpbi10ZXN0aW5nIiwiZXhwIjoxNzQ4ODY2ODMzLCJpc3MiOiJodHRwczovL2h1Z2dpbmdmYWNlLmNvIn0.nXNp7G-6ybFRqCpo4InnYryulxIOTgqe7GRo1346CK8kK6aHX4f775QfirCyuqGLt3hMRRfd5Gd7WjaL2i-RDQ HTTP/1.1” 307 Temporary Redirect
INFO: 10.16.44.224:9661 - “GET /?logs=container&__sign=eyJhbGciOiJFZERTQSJ9.eyJyZWFkIjp0cnVlLCJwZXJtaXNzaW9ucyI6eyJyZXBvLmNvbnRlbnQucmVhZCI6dHJ1ZX0sIm9uQmVoYWxmT2YiOnsia2luZCI6InVzZXIiLCJfaWQiOiI2N2VlNTdmZDM1NDdmODIzMTAyNTI5M2MiLCJ1c2VyIjoiaWFubWQiLCJzZXNzaW9uSWQiOiI2ODNjNDU0MzUwYWM5ODI5Y2Y4NzE4ZWMifSwiaWF0IjoxNzQ4NzgwNDMzLCJzdWIiOiIvc3BhY2VzL2lhbm1kL2F1dG90cmFpbi10ZXN0aW5nIiwiZXhwIjoxNzQ4ODY2ODMzLCJpc3MiOiJodHRwczovL2h1Z2dpbmdmYWNlLmNvIn0.nXNp7G-6ybFRqCpo4InnYryulxIOTgqe7GRo1346CK8kK6aHX4f775QfirCyuqGLt3hMRRfd5Gd7WjaL2i-RDQ HTTP/1.1” 307 Temporary Redirect
INFO: Uvicorn running on http://0.0.0.0:7860 (Press CTRL+C to quit)
INFO: Application startup complete.
INFO: Waiting for application startup.
INFO: Started server process [61]
INFO | 2025-06-01 12:20:32 | autotrain.app.app:<module>:24 - AutoTrain started successfully
INFO | 2025-06-01 12:20:32 | autotrain.app.app:<module>:23 - AutoTrain version: 0.8.36
INFO | 2025-06-01 12:20:32 | autotrain.app.app:<module>:13 - Starting AutoTrain…
INFO | 2025-06-01 12:20:32 | autotrain.app.ui_routes:<module>:315 - AutoTrain started successfully
INFO | 2025-06-01 12:20:29 | autotrain.app.ui_routes:<module>:31 - Starting AutoTrain…
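In case it helps narrow this down, here is a minimal sketch of a check that could be run in the Space's Python environment to see whether the installed datasets package still exposes load_metric. As far as I understand, load_metric was removed from datasets in the 3.0 release in favour of the separate evaluate package, so I am assuming the image pulled in a newer datasets than this AutoTrain version expects. The helper name has_attr_in_module is my own, just for illustration.

```python
import importlib
import importlib.util


def has_attr_in_module(module_name: str, attr: str) -> bool:
    """Return True if `module_name` is importable and exposes `attr`."""
    # find_spec returns None when a top-level module is not installed.
    if importlib.util.find_spec(module_name) is None:
        return False
    module = importlib.import_module(module_name)
    return hasattr(module, attr)


if __name__ == "__main__":
    # Assumption: the failing import in the traceback is `from datasets import load_metric`.
    print("datasets.load_metric available:", has_attr_in_module("datasets", "load_metric"))
    # `evaluate.load` is the replacement I understand newer code is meant to use.
    print("evaluate.load available:", has_attr_in_module("evaluate", "load"))
```

If the first line prints False, that would match the ImportError above and point to a version mismatch rather than anything in my training settings.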
Thanks! ian