Using autotrain locally model is not pushed to hub

JvManger · September 27, 2023, 11:53am

Using autotrain locally the used model is downloaded and stored in .cache\huggingface\hub\modelxyz\ as an ashampoo iso file.
The training works but when “push_to_hub” the pytorch_model_xxx.bin files are missing and the config.json as well. What did I do wrong?

abhishek · September 27, 2023, 12:07pm

you saved only the adapter. use --merge-adapter arg to save the full model, else merge them manually after training

JvManger · September 27, 2023, 1:04pm

Thank you! I will try !

JvManger · September 27, 2023, 3:06pm

stuck while loading “checkpoint shards” and end without any error message. Nothing was uploaded.

RomanAdi1234 · October 29, 2023, 5:24pm

@JvManger , I am also facing the issue like yours . While training it is working but --push_to_hub is saying that Token must be specified for push to hub . I am successfully logged in to hugging face by my cli virtual env .
@abhishek , Is there any other parameters I am missing where the token should be explicitly mention in the command ?

abhishek · October 29, 2023, 6:55pm

token needs to be specified by --token argument

alfraser · January 18, 2024, 3:47pm

Hi - just piggybacking on this thread as I think it is relevant. Beginners question

I am successfully using autotrain locally to fine-tune a llama 2 chat model and I am merging the resulting adapter. The one thing I cannot seem to work out is the correct usage of the parameters to get it to push to the hub.

I have logged in via the huggingface-cli with a token which has write access. I am then setting the following parameters which I think are relevant in the invocation of autotrain:

project-name (set to “ft-test”. Does this only affect the local training or the hub push too?)
push-to-hub (enabled to do the push)
token (passed the same token as I am logged in as, which has write access)
username (set to “alfraser” which is my HF user)
repo-id (set to “ft-test” which is an empty “model” when I log into the hub)

The error I get is “You don’t have the rights to create a model under this namespace” (image below). I feel like I am either missing or mis-using an argument, or maybe I need to set up the model on the hub differently to allow this to work.

Any ideas gratefully received as I’m going to soon have “QWERTY” imprinted on my forehead from banging my head on the desk

abhishek · January 18, 2024, 4:08pm

repo id needs to be of the format username/repo_name. you dont need to use repo_id if you provide username and project_name. in this case repo_id will be username/project_name. make sure there are no repos in your user with name same as project_name

alfraser · January 18, 2024, 4:24pm

Thanks for the super quick reply. I am trying that now. Maybe just a follow up question which occurs to me from your reply. Is there a way to have it overwrite an existing model (i.e. if I want to do an update but keep the model name the same) or should I think about using differing model versions/names if I change, and then repoint my client to use the new model? Thanks Al

alfraser · January 18, 2024, 5:42pm

That is definitely past the previous error now and looks like it’s pushing fine. Thanks a million for the help (my head is QWERTY-free now )

AsadCognify · October 3, 2024, 10:13am

Can you please share your way to invoking autotrain training. I cannot seem to set the push_to_hub value to true.
Finetuning starts but the launch command in logs has push_to_hub set to false.
I am using curl command for sending the request and running autotrain locally.

AsadCognify · October 3, 2024, 10:15am

I am using the curl command as used in the sample in docs: AutoTrain API

But I cannot seem to set the push_to_hub to true. The container logs on hugging face space show
{‘data_path’: ‘AsadCognify/embedding-training-dataset’, ‘model’: ‘sentence-transformers/multi-qa-MiniLM-L6-cos-v1’, ‘lr’: 3e-05, ‘epochs’: 1, ‘max_seq_length’: 4096, ‘batch_size’: 8, ‘warmup_ratio’: 0.1, ‘gradient_accumulation’: 1, ‘optimizer’: ‘adamw_hf’, ‘scheduler’: ‘cosine’, ‘weight_decay’: 0.0, ‘max_grad_norm’: 1.0, ‘seed’: 42, ‘train_split’: ‘train’, ‘valid_split’: None, ‘logging_steps’: -1, ‘project_name’: ‘my-autotrain-api-st-model-7’, ‘auto_find_batch_size’: False, ‘mixed_precision’: ‘fp16’, ‘save_total_limit’: 1, ‘token’: ‘*****’, ‘push_to_hub’: False, ‘eval_strategy’: ‘epoch’, ‘username’: ‘AsadCognify’, ‘log’: ‘tensorboard’, ‘early_stopping_patience’: 5, ‘early_stopping_threshold’: 0.01, ‘trainer’: ‘qa’, ‘sentence1_column’: ‘queries’, ‘sentence2_column’: ‘docs’, ‘sentence3_column’: ‘sentence3’, ‘target_column’: ‘target’}

abhishek · October 4, 2024, 8:20am

do you put push_to_hubas true in curl params?

Topic		Replies	Views
Pushing model to hub 🤗Hub	0	1068	March 17, 2023
Pushing Model through CLI Beginners	0	291	August 16, 2023
Model.save_pretrained is not saving .bin files! model.push_to_hub is not pushing my model in my HuggingFace directory! What am I missing? Help Beginners	11	4080	February 25, 2025
Problem with push_to_hub 🤗Transformers	7	7140	January 28, 2023
How to push extra files to Hugginface model hub? 🤗Hub	1	573	November 28, 2022

Using autotrain locally model is not pushed to hub

Related topics