How to manually add noise to embeddings for RoBERTa?

Hi Everybody!

I am trying to add some noise to the embeddings of my RoBERTa model, but I don't seem to succeed in doing so. I also have another interesting question that I have not been able to solve myself.

Why Noise?
I’m trying to explore how malicious noise can affect model training and how we could design countermeasures, for example in a wireless system.

What I did

  • I created a CustomRobertaModel
  • In the forward pass, I manually get the embedding weights and add some noise to them
  • Everything else is left as is

This is my class:

import torch
from transformers import RobertaForSequenceClassification, RobertaModel
from transformers.modeling_outputs import SequenceClassifierOutput


class CustomRobertaModel(RobertaForSequenceClassification):
    """
    Custom RoBERTa Model class that exposes embeddings, attentions, and head.
    """
    def __init__(self, config):
        super().__init__(config)
        self.roberta_base = RobertaModel(config)
        self.head = self.classifier

    def forward(self, 
                input_ids, 
                attention_mask=None, 
                token_type_ids=None, 
                position_ids=None, 
                head_mask=None, 
                labels=None):

        with torch.no_grad():
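            # Fresh Gaussian noise (mean 2, std 2) with the same shape as the word-embedding matrix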
            weight = self.roberta_base.embeddings.word_embeddings.weight
            noise = torch.normal(2, 2, size=weight.size()).to(weight.device)
            noisy_weights = weight + noise
            self.roberta_base.embeddings.word_embeddings.weight.data = noisy_weights.data

        # with torch.no_grad():
        #     self.roberta_base.embeddings.word_embeddings.weight.data.zero_()
        
        # Getting encoder outputs
        encoder_outputs = self.roberta(
            input_ids=input_ids,  
            attention_mask=attention_mask,
            token_type_ids=token_type_ids,
            position_ids=position_ids,
            head_mask=head_mask
        )

        with torch.no_grad():
            self.roberta_base.embeddings.word_embeddings.weight.data.zero_()

        sequence_output = encoder_outputs[0]
        
        # Passing through the classifier (head)
        logits = self.head(sequence_output)
        
        # Compute loss if labels are provided
        if labels is not None:
            loss = torch.nn.CrossEntropyLoss()(logits, labels)
            return SequenceClassifierOutput(loss=loss, logits=logits)
        else:
            return logits

I initialize from the pretrained roberta-base checkpoint and train as in any standard tutorial, roughly like the sketch below.
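
For context, this is roughly my training setup (a minimal sketch; the output directory, the hyperparameters, and train_dataset / eval_dataset are placeholders, not my exact configuration):

from transformers import RobertaConfig, RobertaTokenizer, Trainer, TrainingArguments

config = RobertaConfig.from_pretrained("roberta-base", num_labels=2)
model = CustomRobertaModel.from_pretrained("roberta-base", config=config)
tokenizer = RobertaTokenizer.from_pretrained("roberta-base")

# train_dataset / eval_dataset are assumed to be already tokenized datasets (placeholders here)
training_args = TrainingArguments(
    output_dir="out",
    num_train_epochs=3,
    per_device_train_batch_size=16,
)
trainer = Trainer(
    model=model,
    args=training_args,
    train_dataset=train_dataset,
    eval_dataset=eval_dataset,
)
trainer.train()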

Outcomes and Observations

  • There is absolutely no change in the training and validation results in each epoch
  • Even setting the embedding weights to zero, as in the uncommented part above, is not effective and does absolutely nothing
  • Interestingly enough, the documentation states that passing input_ids or the corresponding inputs_embeds should be equivalent (see the snippet right after this list). However, when I pass inputs_embeds instead of input_ids (without any noise), my training and validation results are worse, and I suspect the model is learning more slowly or not at all (what? and why?)
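
For reference, this is the kind of input_ids vs. inputs_embeds equivalence I mean, shown on a plain RobertaModel rather than my custom class (a minimal sketch with a toy sentence, assuming no padding and eval mode):

import torch
from transformers import RobertaModel, RobertaTokenizer

model = RobertaModel.from_pretrained("roberta-base")  # from_pretrained returns the model in eval mode
tokenizer = RobertaTokenizer.from_pretrained("roberta-base")
inputs = tokenizer("Hello world", return_tensors="pt")

# Variant 1: let the model look up the word embeddings itself
out_ids = model(input_ids=inputs["input_ids"], attention_mask=inputs["attention_mask"])

# Variant 2: look up the word embeddings manually and pass them as inputs_embeds
embeds = model.embeddings.word_embeddings(inputs["input_ids"])
out_embeds = model(inputs_embeds=embeds, attention_mask=inputs["attention_mask"])

# With no padding and dropout disabled, the two should match
print(torch.allclose(out_ids.last_hidden_state, out_embeds.last_hidden_state))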

Questions

  • Is this the right way to do it? I want fresh noise on every forward pass, as if a jammer were sitting there and adding noise to the embeddings (roughly like the sketch after this list)
  • What causes the performance degradation when I use inputs_embeds instead of input_ids?
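
To make the intent concrete, this is the kind of per-batch injection I have in mind inside forward, in a subclass of RobertaForSequenceClassification (so self.roberta and self.classifier come from the parent). It is only a sketch of my intent, not something I have verified; noise_mean and noise_std are placeholder values matching my current experiment:

def forward(self, input_ids, attention_mask=None, labels=None):
    noise_mean, noise_std = 2.0, 2.0  # placeholder values

    # Look up the clean token embeddings for this batch
    inputs_embeds = self.roberta.embeddings.word_embeddings(input_ids)

    # "Jammer": fresh Gaussian noise on every forward pass, applied only to the
    # activations of this batch (the weight matrix itself stays untouched)
    noise = torch.randn_like(inputs_embeds) * noise_std + noise_mean
    noisy_embeds = inputs_embeds + noise

    outputs = self.roberta(inputs_embeds=noisy_embeds, attention_mask=attention_mask)
    logits = self.classifier(outputs[0])

    if labels is not None:
        loss = torch.nn.CrossEntropyLoss()(logits, labels)
        return SequenceClassifierOutput(loss=loss, logits=logits)
    return logits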

Thank you so much! Looking forward to more insights!

EDIT:

I have looked into the RoBERTa implementation and essentially copied the steps for computing the embeddings before they are passed into the encoder. However, even though I have copied the exact steps, the results are still different.

Here’s the updated code:

from transformers.models.roberta.modeling_roberta import create_position_ids_from_input_ids


class CustomRobertaModel(RobertaForSequenceClassification):
    """
    Custom RoBERTa Model class that exposes embeddings, attentions, and head.
    """
    def __init__(self, config):
        super().__init__(config)
        self.roberta_base = RobertaModel(config)
        self.head = self.classifier
    def forward(self, 
                input_ids, 
                attention_mask=None, 
                token_type_ids=None, 
                position_ids=None, 
                head_mask=None, 
                labels=None,
                inputs_embeds=None,
                past_key_values_length=0):
        # If no input embeddings are provided, compute them
        if inputs_embeds is None:
            inputs_embeds = self.roberta_base.embeddings.word_embeddings(input_ids)
        # Token Type Embeddings
        if token_type_ids is None:
            if hasattr(self.roberta_base.embeddings, "token_type_ids"):
                buffered_token_type_ids = self.roberta_base.embeddings.token_type_ids[:, :input_ids.size(1)]
                buffered_token_type_ids_expanded = buffered_token_type_ids.expand(input_ids.size(0), input_ids.size(1))
                token_type_ids = buffered_token_type_ids_expanded
            else:
                token_type_ids = torch.zeros(input_ids.size(), dtype=torch.long, device=self.roberta_base.embeddings.position_ids.device)
        token_type_embeddings = self.roberta_base.embeddings.token_type_embeddings(token_type_ids)
        
        # Positional Embeddings
        if position_ids is None:
            if input_ids is not None:
                position_ids = create_position_ids_from_input_ids(input_ids, self.roberta_base.embeddings.padding_idx, past_key_values_length)
            else:
                position_ids = self.roberta_base.embeddings.create_position_ids_from_inputs_embeds(inputs_embeds)
        position_embeddings = self.roberta_base.embeddings.position_embeddings(position_ids)
        # Combine the embeddings
        embeddings = inputs_embeds + token_type_embeddings
        if self.roberta_base.embeddings.position_embedding_type == "absolute":
            embeddings += position_embeddings
            
        # Layer Normalization and Dropout
        embeddings = self.roberta_base.embeddings.LayerNorm(embeddings)
        embeddings = self.roberta_base.embeddings.dropout(embeddings)
        # Getting encoder outputs
        encoder_outputs = self.roberta_base(
            inputs_embeds=embeddings,  
            attention_mask=attention_mask,
            position_ids=position_ids,
            token_type_ids=token_type_ids,
            head_mask=head_mask
        )
        sequence_output = encoder_outputs[0]
        
        # Passing through the classifier (head)
        logits = self.head(sequence_output)
        
        # Compute loss if labels are provided
        if labels is not None:
            loss = torch.nn.CrossEntropyLoss()(logits, labels)
            return SequenceClassifierOutput(loss=loss, logits=logits)
        else:
            return logits

This yields this output: tensor([[-0.0939, -0.3949]], grad_fn=<AddmmBackward0>)
though it should be: tensor([[-0.0952, -0.3919]], grad_fn=<AddmmBackward0>)

The difference might not seem like much, but my model's performance is worse. Also, there should be a way to replicate the results exactly!
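
For reference, this is how I sanity-check the manual embedding computation in isolation against the embeddings module itself (a sketch with a placeholder sentence); I would expect my custom forward to match the stock model in the same way:

import torch
from transformers import RobertaModel, RobertaTokenizer
from transformers.models.roberta.modeling_roberta import create_position_ids_from_input_ids

model = RobertaModel.from_pretrained("roberta-base").eval()
tokenizer = RobertaTokenizer.from_pretrained("roberta-base")
input_ids = tokenizer("some example sentence", return_tensors="pt")["input_ids"]

with torch.no_grad():
    # Reference: the embeddings module does everything internally
    ref = model.embeddings(input_ids=input_ids)

    # Manual: replicate the same steps by hand
    words = model.embeddings.word_embeddings(input_ids)
    position_ids = create_position_ids_from_input_ids(input_ids, model.embeddings.padding_idx)
    positions = model.embeddings.position_embeddings(position_ids)
    token_types = model.embeddings.token_type_embeddings(torch.zeros_like(input_ids))
    manual = model.embeddings.LayerNorm(words + token_types + positions)
    manual = model.embeddings.dropout(manual)

print(torch.allclose(ref, manual))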