How are pretrained models trained?


How are pre-trained models obtained?
The library always downloads a pre-trained model from the server, but how is that model originally trained? Is it also trained using the transformers library?

In most, if not all, built-in cases (e.g. bert-base-cased), the original paper implementations are ported to the transformers architecture. (You can have a look at the conversion scripts, e.g.,

User models (e.g. username/mymodel-uncased) may have been trained in other ways and ported to the transformers architecture manually with custom scripts, or they may have been trained using the transformers library directly.

Thanks for the reply.
How can I train my own model from scratch using transformers?

You can have a look here: