Hello,
Is there a way to directly control the input to each layer of LongformerForMultipleChoice, similar to indexing GPT-2's blocks via `GPT2.transformer.h[]`?

I tried `best_model_longformer.longformer.encoder.layer[layer_index](input_hidden_state_for_layer)`, but it raises this error:
```
Traceback (most recent call last):
  File "SEED_125_V20_15_LONGFORMER.py", line 426, in <module>
    main_function('/home/ec2-user/G1G2.txt','/home/ec2-user/G1G2_answer_num.txt', num_iter)
  File "SEED_125_V20_15_LONGFORMER.py", line 388, in main_function
    best_model_longformer)
  File "SEED_125_V20_15_LONGFORMER.py", line 205, in fill_MC_loss_accuracy_tensor
    best_model_longformer.longformer.encoder.layer[j](input_hidden_state)
  File "/home/ec2-user/anaconda3/lib/python3.7/site-packages/torch/nn/modules/module.py", line 722, in _call_impl
    result = self.forward(*input, **kwargs)
  File "/home/ec2-user/anaconda3/lib/python3.7/site-packages/transformers/modeling_longformer.py", line 852, in forward
    output_attentions=output_attentions,
  File "/home/ec2-user/anaconda3/lib/python3.7/site-packages/torch/nn/modules/module.py", line 722, in _call_impl
    result = self.forward(*input, **kwargs)
  File "/home/ec2-user/anaconda3/lib/python3.7/site-packages/transformers/modeling_longformer.py", line 796, in forward
    output_attentions,
  File "/home/ec2-user/anaconda3/lib/python3.7/site-packages/torch/nn/modules/module.py", line 722, in _call_impl
    result = self.forward(*input, **kwargs)
  File "/home/ec2-user/anaconda3/lib/python3.7/site-packages/transformers/modeling_longformer.py", line 241, in forward
    attention_mask = attention_mask.squeeze(dim=2).squeeze(dim=1)
AttributeError: 'NoneType' object has no attribute 'squeeze'
```
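From reading `modeling_longformer.py`, the failure seems to be that `LongformerSelfAttention` calls `.squeeze()` on the attention mask, which is `None` when a layer is called with only a hidden state. So, unlike GPT-2's blocks, each Longformer layer apparently needs the extended attention mask that `LongformerModel.forward` normally builds and passes down. Below is a minimal sketch of what I think should work; the checkpoint name, sequence length, and example input are placeholders, and the layer signature may differ across transformers versions (this matches the version in my traceback, where each layer receives the 4-d extended mask):

```python
import torch
from transformers import LongformerTokenizer, LongformerForMultipleChoice

# Placeholder checkpoint; in my case this would be best_model_longformer.
model = LongformerForMultipleChoice.from_pretrained("allenai/longformer-base-4096")
tokenizer = LongformerTokenizer.from_pretrained("allenai/longformer-base-4096")
model.eval()

# Pad to a multiple of config.attention_window (512 for this checkpoint),
# since Longformer's sliding-window attention requires it.
enc = tokenizer("An example sentence.", return_tensors="pt",
                padding="max_length", max_length=512)
input_ids, attention_mask = enc["input_ids"], enc["attention_mask"]

with torch.no_grad():
    # Rebuild the (batch, 1, 1, seq_len) extended mask that
    # LongformerModel.forward normally hands to every encoder layer:
    # 0 where tokens may be attended to, -10000 at padding positions.
    extended_mask = model.longformer.get_extended_attention_mask(
        attention_mask, input_ids.shape, input_ids.device
    )

    hidden_state = model.longformer.embeddings(input_ids=input_ids)
    for layer in model.longformer.encoder.layer:
        # Each layer returns a tuple; element 0 is the new hidden state.
        hidden_state = layer(hidden_state, attention_mask=extended_mask)[0]
```

Note that `LongformerForMultipleChoice` flattens its `(batch, num_choices, seq_len)` inputs to `(batch * num_choices, seq_len)` before calling `self.longformer`, so hidden states fed to the layers should use that flattened shape. If I only need to inspect (rather than replace) per-layer inputs, I suppose calling the full model with `output_hidden_states=True`, or registering forward hooks on `encoder.layer[j]`, would avoid reimplementing the masking logic. Does this look right?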
Thank you!