Hi, when trying to fine-tune TFLongformer with TFTrainer, I got this error:
InvalidArgumentError: 2 root error(s) found.
(0) INVALID_ARGUMENT: Incompatible shapes: [2,1024,12,514] vs. [2,1024,12,513]
[[node while/gradients/while/tf_longformer_for_sequence_classification/longformer/encoder/layer_._0/attention/self/SelectV2_4_grad/BroadcastGradientArgs_1
(defined at /usr/local/lib/python3.7/dist-packages/transformers/trainer_tf.py:633)
]]
[[while/LoopCond/_568/_14]]
(1) INVALID_ARGUMENT: Incompatible shapes: [2,1024,12,514] vs. [2,1024,12,513]
[[node while/gradients/while/tf_longformer_for_sequence_classification/longformer/encoder/layer_._0/attention/self/SelectV2_4_grad/BroadcastGradientArgs_1
(defined at /usr/local/lib/python3.7/dist-packages/transformers/trainer_tf.py:633)
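For context, Longformer internally pads inputs so the sequence length is a multiple of the attention window (512 for `allenai/longformer-base-4096`), and 1024 already satisfies that, so the off-by-one (514 vs. 513) seems to come from inside the attention computation rather than from input padding. A minimal sketch of that padding rule (simplified from the `_pad_to_window_size` logic in the Hugging Face implementation; treat the exact behavior as an assumption, not a verbatim copy):

```python
def pad_to_window_size(seq_len: int, attention_window: int = 512) -> int:
    """Return the sequence length after padding up to a multiple of attention_window."""
    # Amount of padding needed to reach the next multiple (0 if already aligned).
    padding_len = (attention_window - seq_len % attention_window) % attention_window
    return seq_len + padding_len

print(pad_to_window_size(1024))  # -> 1024 (already a multiple of 512, no padding)
print(pad_to_window_size(1000))  # -> 1024 (padded up by 24 tokens)
```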
This is my training configuration:
training_args = TFTrainingArguments(
    output_dir='./results',
    num_train_epochs=3,
    per_device_train_batch_size=2,
    gradient_accumulation_steps=32,
    per_device_eval_batch_size=16,
    logging_steps=1,
)
with training_args.strategy.scope():
    model = TFLongformerForSequenceClassification.from_pretrained(
        'allenai/longformer-base-4096',
        num_labels=5,
        return_dict=True,
        problem_type="single_label_classification",
    )

trainer = TFTrainer(model=model, args=training_args, train_dataset=train_dataset, eval_dataset=test_dataset)
Someone else reported the same error when using the TensorFlow version of Longformer.