Adding an attention mask to MLM training

Hi all, I’m using the MLM example script (sentence-transformers/examples/unsupervised_learning/MLM at master · UKPLab/sentence-transformers · GitHub) to carry out some unsupervised training.

It returns this warning: ‘We strongly recommend passing in an attention_mask since your input_ids may be padded. See Troubleshoot’

I’m a little unfamiliar with this process, but my guess is that the warning appears because the rows in my training text file all vary in length, so the batches get padded.

How would I adapt the script to pass in an attention mask? Thanks!
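
From reading the transformers docs, I think the fix is roughly the sketch below: if the tokenizer is called with padding enabled, it returns an attention_mask alongside input_ids, and passing the whole batch to the model forwards that mask. Is this the right direction? (The model name and sentences here are just placeholders for my setup, not what the script actually uses.)

```python
# A minimal sketch, assuming the warning comes from padded input_ids being
# passed without an attention_mask. "bert-base-uncased" and the sentences
# below are placeholders.
from transformers import AutoModelForMaskedLM, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
model = AutoModelForMaskedLM.from_pretrained("bert-base-uncased")

sentences = [
    "A short line.",
    "A much longer line that forces the shorter ones in the batch to be padded.",
]

# padding=True pads every sequence in the batch to the same length and makes
# the tokenizer return an attention_mask alongside input_ids.
batch = tokenizer(sentences, padding=True, truncation=True, return_tensors="pt")

# Unpacking the batch forwards the attention_mask too, so padded positions
# are ignored and the warning should go away.
outputs = model(**batch)
```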

To add to this: the process is killed shortly after that message appears, and no config.json file is created in the sentence-transformers output folder.