Use this topic to ask your questions to Lewis Tunstall during his talk: Simple Training with the Transformers Trainer
Is it possible to share the link to the notebook Lewis is working on here?
You suggested swapping bert-base for MiniLM as the baseline pre-trained model. Is there an analogous recommendation for multilingual models? Which checkpoint should be the go-to?
I added it to the first post.
During the technical glitch, Lewis explained the weighted loss. Can he show it again?
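For anyone who missed that segment: class-weighted cross-entropy scales each example's loss by a per-class weight, so under-represented classes contribute more to the gradient. Here is a minimal sketch of the arithmetic in plain Python (the weights and logits are made up for illustration; in a Transformers training loop you would typically subclass `Trainer`, override `compute_loss`, and pass a `weight` tensor to `torch.nn.CrossEntropyLoss` instead):

```python
import math

def weighted_cross_entropy(logits, label, class_weights):
    """Cross-entropy for one example, scaled by the weight of its true class."""
    # Softmax over the logits to get class probabilities.
    exps = [math.exp(x) for x in logits]
    probs = [e / sum(exps) for e in exps]
    # Negative log-likelihood of the true class, scaled by its weight.
    return -class_weights[label] * math.log(probs[label])

# With uniform logits the unweighted loss is log(num_classes);
# a weight of 2.0 on class 1 doubles that example's contribution.
loss = weighted_cross_entropy([0.0, 0.0], label=1, class_weights=[1.0, 2.0])
# ≈ 2 * log(2) ≈ 1.386
```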
Is there a page in the docs / chapter in the course / an external resource with an overview of different loss functions and their advantages/disadvantages?
How can you use the dataset/trainer/pipeline approach with your own data?
Does the framework a model was originally trained in (TensorFlow vs. PyTorch) affect its performance or cause any issues?
This is answered at 25:00 in the main stream.
Like Sylvain and Lewis mentioned, this would be a great resource to have. I'd love to collaborate on this!
This is shown at 27:05 in the main stream.
This is answered at 28:37 in the main stream.
This is answered at 29:37 in the main stream.