I’m trying to reproduce the RoBERTa pre-training results using the run_mlm.py script provided by Hugging Face Transformers. However, I’m confused about how exactly the script should be called. Here is my invocation:
python transformers/examples/pytorch/language-modeling/run_mlm.py \
--config_name roberta-base \
--tokenizer_name roberta-base \
--dataset_name wikitext,bookcorpus,ccnews,openwebtext,stories \
--dataset_config_name wikitext-2-raw-v1 \
--per_device_train_batch_size 8 \
--per_device_eval_batch_size 8 \
--do_train \
--do_eval \
--output_dir ./crate/ckpt \
--overwrite_output_dir
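
For context on why I’m unsure: as far as I can tell, run_mlm.py forwards --dataset_name (together with --dataset_config_name) to a single datasets.load_dataset() call, so a comma-separated list of corpora like the one above doesn’t seem supported. The workaround I’ve been considering is to concatenate the corpora offline with the datasets library and feed the result to the script via --train_file. Here is a minimal sketch of what I mean; the Hub dataset ids are my assumption (I only show two of the five RoBERTa corpora, and the exact ids for ccnews/stories may differ):

# Sketch: build one combined pre-training corpus offline, assuming
# these corpora exist on the Hugging Face Hub under these ids.
from datasets import load_dataset, concatenate_datasets

wiki = load_dataset("wikitext", "wikitext-103-raw-v1", split="train")
books = load_dataset("bookcorpus", split="train")  # id is my assumption

# Keep only the shared "text" column so the schemas line up.
wiki = wiki.remove_columns([c for c in wiki.column_names if c != "text"])
books = books.remove_columns([c for c in books.column_names if c != "text"])

combined = concatenate_datasets([wiki, books])
# Export to JSON lines; run_mlm.py can consume this via --train_file.
combined.to_json("./combined_corpus.json")

One would then call run_mlm.py with --train_file ./combined_corpus.json in place of --dataset_name/--dataset_config_name. I’m not sure this is the intended way, though.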
Has anyone already successfully reproduced RoBERTa this way? Looking forward to any help!