Use this topic to ask your questions to Matthew Carrigan during his talk: New TensorFlow Features for Transformers and Datasets.
Could the notebook shown in the video be linked?
Sure thing, here’s a Colab link!
For something like a zero-shot model, how does TFAutoModelForSequenceClassification change?
This is answered at 50:06 in the main stream.
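Not a substitute for the answer in the stream, but here is a rough sketch of how I understand it: zero-shot classification typically reuses an NLI checkpoint, so the class itself doesn't change, you still load it with TFAutoModelForSequenceClassification and feed it (text, candidate-label hypothesis) pairs. The checkpoint name below is just an assumption; any NLI-style model with TensorFlow weights should work.

```python
import tensorflow as tf
from transformers import AutoTokenizer, TFAutoModelForSequenceClassification

# Assumed NLI checkpoint; swap in any NLI model that ships TensorFlow weights.
checkpoint = "roberta-large-mnli"
tokenizer = AutoTokenizer.from_pretrained(checkpoint)
model = TFAutoModelForSequenceClassification.from_pretrained(checkpoint)

premise = "The new TensorFlow features make training much simpler."
hypothesis = "This text is about machine learning."  # candidate label phrased as a hypothesis

inputs = tokenizer(premise, hypothesis, return_tensors="tf")
logits = model(**inputs).logits           # scores over the NLI labels
probs = tf.nn.softmax(logits, axis=-1)    # check model.config.id2label for the label order
print(probs)
```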
I understand that padding enables batching data points together, but too many padding tokens make the computation very slow (?). What is the attention_mask for? If I understand correctly, it masks the padding tokens so that the model does not pay attention to them - but do padding tokens still slow down training even when they are masked? I haven't fully understood the purpose of padding and attention_mask, or their impact on speed.
This is answered at 55:00 on the main stream.
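For anyone reading along before watching the recording, here is a minimal sketch (my own example, not from the talk) of what padding and attention_mask look like after tokenization; bert-base-uncased is just an arbitrary choice. The mask stops the model from attending to pad positions, but those positions are still part of the tensor and still get computed over, which is why keeping padding to a minimum helps speed.

```python
from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")

batch = tokenizer(
    ["A short sentence.", "A much longer sentence that needs quite a bit more room."],
    padding=True,          # pad shorter sequences up to the longest one in the batch
    return_tensors="tf",
)

print(batch["input_ids"])       # padded token ids (the pad token id is 0 for BERT)
print(batch["attention_mask"])  # 1 for real tokens, 0 for padding positions
```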
Hi all, just noticed the Colab notebook didn’t have permissions set. It should be accessible now!