Warm-started encoder-decoder models (Bert2Gpt2 and Bert2Bert)

Hi,

Looking at the files of Ayham/roberta_gpt2_summarization_cnn_dailymail (main branch):

It indeed looks like only the weights (pytorch_model.bin) and model configuration (config.json) are uploaded, but not the tokenizer files.

You can upload the tokenizer files programmatically using the huggingface_hub library. First, make sure you have git-lfs installed and that you are logged into your Hugging Face account. In Colab, this can be done as follows:

!sudo apt-get install git-lfs
!git config --global user.email "your email"
!git config --global user.name "your username"
!huggingface-cli login

Next, you can do the following:

from transformers import RobertaTokenizer
from huggingface_hub import Repository

# clone the existing model repo into a local directory
repo_url = "https://huggingface.co/Ayham/roberta_gpt2_summarization_cnn_dailymail"
repo = Repository(local_dir="tokenizer_files", # note that this directory must not exist already
                  clone_from=repo_url,
                  git_user="Niels Rogge",
                  git_email="niels.rogge1@gmail.com",
                  use_auth_token=True,
)

# save the roberta-base tokenizer files into the cloned repo
tokenizer = RobertaTokenizer.from_pretrained("roberta-base")
tokenizer.save_pretrained("tokenizer_files")

# commit and push the tokenizer files to the Hub
repo.push_to_hub(commit_message="Upload tokenizer files")
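
Once the push has finished, you can verify the upload by loading the tokenizer straight from the Hub, for example:

from transformers import RobertaTokenizer

# should now find the tokenizer files in the model repo
tokenizer = RobertaTokenizer.from_pretrained("Ayham/roberta_gpt2_summarization_cnn_dailymail")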

Note that the Trainer can actually push all files (weights, config and tokenizer) to the hub for you automatically during/after training, as seen here.
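
For example, here is a minimal sketch of that setup. It assumes you already have a preprocessed train_dataset for CNN/DailyMail; the warm-starting and token settings are only illustrative:

from transformers import EncoderDecoderModel, RobertaTokenizer, TrainingArguments, Trainer

tokenizer = RobertaTokenizer.from_pretrained("roberta-base")

# warm-start an encoder-decoder model from roberta-base (encoder) and gpt2 (decoder)
model = EncoderDecoderModel.from_encoder_decoder_pretrained("roberta-base", "gpt2")
# training an EncoderDecoderModel needs these set; the exact choice depends on your setup
model.config.decoder_start_token_id = tokenizer.bos_token_id
model.config.pad_token_id = tokenizer.pad_token_id

training_args = TrainingArguments(
    output_dir="roberta_gpt2_summarization_cnn_dailymail",  # also used as the Hub repo name
    push_to_hub=True,  # upload checkpoints (weights, config, tokenizer files) to the Hub
)

trainer = Trainer(
    model=model,
    args=training_args,
    train_dataset=train_dataset,  # assumed: your preprocessed CNN/DailyMail dataset
    tokenizer=tokenizer,          # passing the tokenizer makes the Trainer upload its files too
)

trainer.train()
trainer.push_to_hub()  # final push of the trained model and tokenizer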
