RepositoryNotFoundError: 404 Client Error

akshat3492 · August 31, 2023, 3:19am

Hi everyone, I am new to NLP and to working with Hugging Face. I am working on a text summarization project and trying to fine-tune a model. Below is the code I wrote, but I am getting an error that I have not been able to solve. Any leads would be appreciated.

from huggingface_hub import notebook_login

notebook_login()

I passed my Hugging Face token here…

from transformers import Seq2SeqTrainingArguments

batch_size = 8
num_train_epochs = 8
# Show the training loss with every epoch
logging_steps = len(tokenized_datasets["article"]) // batch_size
model_name = model_checkpoint.split("/")[-1]

args = Seq2SeqTrainingArguments(
    output_dir="https://huggingface.co/username/mT5",
    evaluation_strategy="epoch",
    learning_rate=5.6e-5,
    per_device_train_batch_size=batch_size,
    per_device_eval_batch_size=batch_size,
    weight_decay=0.01,
    save_total_limit=3,
    num_train_epochs=num_train_epochs,
    predict_with_generate=True,
    logging_steps=logging_steps,
    push_to_hub=True
)

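import numpy as np
from nltk.tokenize import sent_tokenize

# `rouge` is assumed to be the ROUGE metric object loaded earlier in the notebook
def compute_metrics(eval_pred):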
    predictions, labels = eval_pred
    # Decode generated summaries into text
    decoded_preds = tokenizer.batch_decode(predictions, skip_special_tokens=True)
    # Replace -100 in the labels as we can't decode them
    labels = np.where(labels != -100, labels, tokenizer.pad_token_id)
    # Decode reference summaries into text
    decoded_labels = tokenizer.batch_decode(labels, skip_special_tokens=True)
    # ROUGE expects a newline after each sentence
    decoded_preds = ["\n".join(sent_tokenize(pred.strip())) for pred in decoded_preds]
    decoded_labels = ["\n".join(sent_tokenize(label.strip())) for label in decoded_labels]
    # Compute ROUGE scores
    result = rouge.compute(
        predictions=decoded_preds, references=decoded_labels, use_stemmer=True
    )
    # Extract the median scores
    result = {key: value.mid.fmeasure * 100 for key, value in result.items()}
    return {k: round(v, 4) for k, v in result.items()}

from transformers import DataCollatorForSeq2Seq

data_collator = DataCollatorForSeq2Seq(tokenizer, model=model)

from transformers import Seq2SeqTrainer

trainer = Seq2SeqTrainer(
    model=model,
    args=args,
    train_dataset=tokenized_datasets["article"],
    eval_dataset=test_tokenized_datasets["article"],
    data_collator=data_collator,
    tokenizer=tokenizer,
    compute_metrics=compute_metrics,
)

Here is the final error I am getting…

RepositoryNotFoundError: 404 Client Error. (Request ID: Root=1-64f004f3-5195d9d41d468e89023d924f;a7ed83d8-a09f-494e-ae69-8623b7517abb)

Repository Not Found for url: https://huggingface.co/api/models/mT5.
Please make sure you specified the correct repo_id and repo_type.
If you are trying to access a private or gated repo, make sure you are authenticated.

vpkprasanna · August 31, 2023, 5:57am

One issue I found is in the output_dir argument of Seq2SeqTrainingArguments: it should be a local path, not a remote one. You cannot use a remote path (such as a Hub URL) here.

This output directory is where the model checkpoints and other files are saved.
See the docs for TrainingArguments.
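
To make the local-versus-remote distinction concrete, here is a minimal sketch that splits the Hub URL from the question into a local folder name and a Hub repo id. The helper `split_hub_url` is hypothetical (it is not part of transformers), and passing the repo id through `hub_model_id` is one possible way to name the target repo, not necessarily the only one.

```python
from urllib.parse import urlparse


def split_hub_url(url_or_id: str) -> tuple[str, str]:
    """Return (local_output_dir, hub_model_id) for a Hub URL or a repo id.

    Hypothetical helper for illustration only; not part of transformers.
    """
    parsed = urlparse(url_or_id)
    # For a full URL keep only the path ("/username/mT5");
    # otherwise treat the input as a repo id already.
    path = parsed.path if parsed.scheme in ("http", "https") else url_or_id
    repo_id = path.strip("/")
    if not repo_id:
        raise ValueError(f"No repository name found in {url_or_id!r}")
    # Use the last path component as the local checkpoint folder name.
    local_dir = repo_id.split("/")[-1]
    return local_dir, repo_id


output_dir, hub_model_id = split_hub_url("https://huggingface.co/username/mT5")
print(output_dir, hub_model_id)  # mT5 username/mT5

# These values would then replace the URL in the training arguments, e.g.:
# args = Seq2SeqTrainingArguments(
#     output_dir=output_dir,      # local folder for checkpoints
#     hub_model_id=hub_model_id,  # Hub repo to push to
#     push_to_hub=True,
#     # remaining arguments unchanged from the question
# )
```

With the URL as output_dir, the repo name seems to be taken from the last path component only, which would explain why the error mentions https://huggingface.co/api/models/mT5 with no username in it.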

akshat3492 · August 31, 2023, 7:50pm

Thanks for the suggestion. Along with the fix you mentioned, another issue was that my access token had only read access, whereas pushing to the Hub requires a token with write access.
