Push To Hub (Cloud Train) not working!

devparagiri · July 6, 2025, 6:20pm

I am trying to finetune a small language model on A10 Small and directly push it to hub. Now, the actual finetuning takes place in the custom space created by HF but it FAILS to push the model to the hub.

After some debugging, I noticed that it creates the model repo as soon as the finetune job is created and tries to make it once again when it pushes it to the repo leading to a 409. I’ve attached the exact code below.

import os

from autotrain.params import LLMTrainingParams
from autotrain.project import AutoTrainProject

from dotenv import load_dotenv


load_dotenv()
print(f"HF_USERNAME: {os.environ.get('HF_USERNAME')}")
print(f"HF_TOKEN: {os.environ.get('HF_TOKEN')}")

params = LLMTrainingParams(
    model="microsoft/Phi-4-mini-instruct",
    data_path="HuggingFaceH4/no_robots",
    chat_template="tokenizer",
    text_column="messages",
    train_split="train",
    trainer="sft",
    epochs=1,
    batch_size=1,
    lr=1e-5,
    peft=True,
    quantization="int4",
    target_modules="all-linear",
    padding="right",
    optimizer="paged_adamw_8bit",
    scheduler="cosine",
    gradient_accumulation=8,
    mixed_precision="bf16",
    merge_adapter=True,
    project_name="test-train-3-v3",
    push_to_hub=True,
    username=os.environ.get("HF_USERNAME"),
    token=os.environ.get("HF_TOKEN"),
)


backend = "spaces-a10g-small"
project = AutoTrainProject(params=params, backend=backend, process=True)
project.create()

It would be of great help if someone can guide me with this!

John6666 · July 7, 2025, 4:02am

If updating the library does not fix the problem, it may be a recurrence of a past bug.

pip install --upgrade autotrain-advanced

github.com/huggingface/autotrain-advanced

'You already created this model repo' issue with non-existing repos

opened 10:13AM - 16 Aug 23 UTC

closed 06:33AM - 17 Aug 23 UTC

Dbg-atlas

I ran autotrain with the following command: `autotrain llm --train --project_…name 'my-llm' --model meta-llama/Llama-2-7b-chat-hf --data_path db-atlas/fire_train --text_column text --use_peft --use_int4 --learning_rate 2e-4 --train_batch_size 10 --num_train_epochs 5 --trainer sft --push_to_hub --repo_id db-atlas/fire7b_0816` The model finished training but it failed while pushing to hub, saying that the repo for the model already existed. This is an error, as no repo with that name existed prior to running the command. After pushing to huggingface fails, the repo has been created but there is only a .gitattributes file in it. ``` 2023-08-16 09:57:36.426 | INFO | autotrain.trainers.utils:merge_adapter:161 - Saving target model... 2023-08-16 09:57:36.505 | INFO | autotrain.trainers.utils:merge_adapter:161 - Saving target model... 2023-08-16 09:57:36.693 | INFO | autotrain.trainers.utils:merge_adapter:161 - Saving target model... 2023-08-16 09:57:36.718 | INFO | autotrain.trainers.utils:merge_adapter:161 - Saving target model... 2023-08-16 09:57:49.732 | INFO | __main__:train:315 - Pushing model to hub... 2023-08-16 09:57:50.156 | INFO | __main__:train:315 - Pushing model to hub... 2023-08-16 09:57:50.199 | INFO | __main__:train:315 - Pushing model to hub... Traceback (most recent call last): File "/usr/local/lib/python3.10/dist-packages/huggingface_hub/utils/_errors.py", line 261, in hf_raise_for_status response.raise_for_status() File "/usr/local/lib/python3.10/dist-packages/requests/models.py", line 1021, in raise_for_status raise HTTPError(http_error_msg, response=self) requests.exceptions.HTTPError: 409 Client Error: Conflict for url: https://huggingface.co/api/repos/create The above exception was the direct cause of the following exception: Traceback (most recent call last): File "/usr/lib/python3.10/runpy.py", line 196, in _run_module_as_main return _run_code(code, main_globals, None, File "/usr/lib/python3.10/runpy.py", line 86, in _run_code exec(code, run_globals) File "/usr/local/lib/python3.10/dist-packages/autotrain/trainers/clm.py", line 325, in <module> train(config) File "/usr/local/lib/python3.10/dist-packages/autotrain/trainers/clm.py", line 317, in train api.create_repo(repo_id=config.repo_id, repo_type="model") File "/usr/local/lib/python3.10/dist-packages/huggingface_hub/utils/_validators.py", line 118, in _inner_fn return fn(*args, **kwargs) File "/usr/local/lib/python3.10/dist-packages/huggingface_hub/hf_api.py", line 2308, in create_repo hf_raise_for_status(r) File "/usr/local/lib/python3.10/dist-packages/huggingface_hub/utils/_errors.py", line 303, in hf_raise_for_status raise HfHubHTTPError(str(e), response=response) from e huggingface_hub.utils._errors.HfHubHTTPError: 409 Client Error: Conflict for url: https://huggingface.co/api/repos/create (Request ID: Root=1-64dc9d9e-4074d9e135352e28319225d1;50a7ec5f-cadb-4575-b41e-4bc4ade1ee47) You already created this model repo Traceback (most recent call last): File "/usr/local/lib/python3.10/dist-packages/huggingface_hub/utils/_errors.py", line 261, in hf_raise_for_status response.raise_for_status() File "/usr/local/lib/python3.10/dist-packages/requests/models.py", line 1021, in raise_for_status raise HTTPError(http_error_msg, response=self) requests.exceptions.HTTPError: 409 Client Error: Conflict for url: https://huggingface.co/api/repos/create The above exception was the direct cause of the following exception: Traceback (most recent call last): File "/usr/lib/python3.10/runpy.py", line 196, in _run_module_as_main return _run_code(code, main_globals, None, File "/usr/lib/python3.10/runpy.py", line 86, in _run_code exec(code, run_globals) File "/usr/local/lib/python3.10/dist-packages/autotrain/trainers/clm.py", line 325, in <module> train(config) File "/usr/local/lib/python3.10/dist-packages/autotrain/trainers/clm.py", line 317, in train api.create_repo(repo_id=config.repo_id, repo_type="model") File "/usr/local/lib/python3.10/dist-packages/huggingface_hub/utils/_validators.py", line 118, in _inner_fn return fn(*args, **kwargs) File "/usr/local/lib/python3.10/dist-packages/huggingface_hub/hf_api.py", line 2308, in create_repo hf_raise_for_status(r) File "/usr/local/lib/python3.10/dist-packages/huggingface_hub/utils/_errors.py", line 303, in hf_raise_for_status raise HfHubHTTPError(str(e), response=response) from e huggingface_hub.utils._errors.HfHubHTTPError: 409 Client Error: Conflict for url: https://huggingface.co/api/repos/create (Request ID: Root=1-64dc9d9e-130414935b46c6d475762dd8;16c642ae-526f-4409-9dd8-10578b87b53a) You already created this model repo 2023-08-16 09:57:50.810 | INFO | __main__:train:315 - Pushing model to hub... Traceback (most recent call last): File "/usr/local/lib/python3.10/dist-packages/huggingface_hub/utils/_errors.py", line 261, in hf_raise_for_status response.raise_for_status() File "/usr/local/lib/python3.10/dist-packages/requests/models.py", line 1021, in raise_for_status raise HTTPError(http_error_msg, response=self) requests.exceptions.HTTPError: 409 Client Error: Conflict for url: https://huggingface.co/api/repos/create The above exception was the direct cause of the following exception: Traceback (most recent call last): File "/usr/lib/python3.10/runpy.py", line 196, in _run_module_as_main return _run_code(code, main_globals, None, File "/usr/lib/python3.10/runpy.py", line 86, in _run_code exec(code, run_globals) File "/usr/local/lib/python3.10/dist-packages/autotrain/trainers/clm.py", line 325, in <module> train(config) File "/usr/local/lib/python3.10/dist-packages/autotrain/trainers/clm.py", line 317, in train api.create_repo(repo_id=config.repo_id, repo_type="model") File "/usr/local/lib/python3.10/dist-packages/huggingface_hub/utils/_validators.py", line 118, in _inner_fn return fn(*args, **kwargs) File "/usr/local/lib/python3.10/dist-packages/huggingface_hub/hf_api.py", line 2308, in create_repo hf_raise_for_status(r) File "/usr/local/lib/python3.10/dist-packages/huggingface_hub/utils/_errors.py", line 303, in hf_raise_for_status raise HfHubHTTPError(str(e), response=response) from e huggingface_hub.utils._errors.HfHubHTTPError: 409 Client Error: Conflict for url: https://huggingface.co/api/repos/create (Request ID: Root=1-64dc9d9e-60d659b1310a77ce2f0ee904;633a51c3-c837-46ba-9cfa-d183cdfde4f9) You already created this model repo ```

devparagiri · July 8, 2025, 2:56am

Thank you. Note that I train the slm on their cloud gpu not locally.

Not sure if that might have anything to do. Will try it out!

Topic		Replies	Views
Using autotrain locally model is not pushed to hub 🤗AutoTrain	12	1899	October 4, 2024
Model.save_pretrained is not saving .bin files! model.push_to_hub is not pushing my model in my HuggingFace directory! What am I missing? Help Beginners	11	4085	February 25, 2025
Trainer.push_to_hub is taking lot of time, is this expected behaviour? Beginners	7	1947	October 18, 2023
Pushing Model through CLI Beginners	0	291	August 16, 2023
Pushing model to hub 🤗Hub	0	1068	March 17, 2023

Push To Hub (Cloud Train) not working!

Related topics