InvalidArgumentError: Exception encountered when calling layer 'encoder' (type TFMarianEncoder)

KigenCHESS · April 24, 2023, 6:34pm

I am fine-tuning a translation model which translates English to Swahili with my own datasets. Everything goes well during training but when I try to use the pre-trained model i run into the
following error. Any help will be highly appreciated.
If the problem is with my data please provide a step-by-step way in which I could solve it because am a beginner. Thanks

InvalidArgumentError Traceback (most recent call last)
in <cell line: 5>()
3 # Replace this with your own checkpoint
4 model_checkpoint = “KigenCHESS/new-en-to-sw”
----> 5 translaton = pipeline(“translation”, model=model_checkpoint)

10 frames
/usr/local/lib/python3.9/dist-packages/transformers/models/marian/modeling_tf_marian.py in call(self, input_ids, inputs_embeds, attention_mask, head_mask, output_attentions, output_hidden_states, return_dict, training)
789 ),
790 )
→ 791 inputs_embeds = self.embed_tokens(input_ids) * self.embed_scale
792
793 embed_pos = self.embed_positions(input_shape)

InvalidArgumentError: Exception encountered when calling layer ‘encoder’ (type TFMarianEncoder).

cannot compute Mul as input #1(zero-based) was expected to be a half tensor but is a float tensor [Op:Mul]

Call arguments received by layer ‘encoder’ (type TFMarianEncoder):
• input_ids=tf.Tensor(shape=(3, 5), dtype=int32)
• inputs_embeds=None
• attention_mask=tf.Tensor(shape=(3, 5), dtype=int32)
• head_mask=None
• output_attentions=False
• output_hidden_states=False
• return_dict=True
• training=False

mohdyaser · April 26, 2023, 1:56pm

Same problem, replying to get notified

KigenCHESS · April 26, 2023, 2:24pm

I found the solution by commenting out this code tf.keras.mixed_precision.set_global_policy(“mixed_float16”)

mohdyaser · April 26, 2023, 3:57pm

that worked for me too, thanks a lot!

ayalaall · February 4, 2024, 6:30am

Hi,
I’m having the same issue. In which part of the code did you comment out tf.keras.mixed_precision.set_global_policy(“mixed_float16”) ?
Thanks.

Topic		Replies	Views
InvalidArgumentError when passing finetuned model's checkpoint to pipeline Beginners	1	457	September 28, 2022
InvalidArgumentError with vit-base-patch16-224 model? Models	7	2008	April 20, 2024
InvalidArgumentError when training Segformer 🤗Transformers	0	310	March 24, 2024
Issue with "input_ids must be smaller than the embedding layer's input dimension (got 50257 >= 50257)" Beginners	1	1175	June 22, 2023
Error loading model via from_pretrained 🤗Transformers	0	1847	May 5, 2022

InvalidArgumentError: Exception encountered when calling layer 'encoder' (type TFMarianEncoder)

Related topics