Hello Huggingfacers
I am trying to run inference with a Hugging Face MarianMT model in 16-bit precision by calling `half()`. It reduces memory usage considerably, which is what I am looking for, but I don't know what to expect in terms of translation accuracy after this change.
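To get a feel for what `half()` trades away, here is a small stdlib-only sketch (it uses Python's `struct` rather than the model itself, so it is an illustration of fp16 storage, not of MarianMT): fp16 uses 2 bytes per value instead of 4, but keeps only ~10 mantissa bits, so each stored value picks up a small relative rounding error.

```python
import struct

def to_fp16_and_back(x):
    """Round-trip a float through IEEE 754 half precision ('e' format)."""
    return struct.unpack('e', struct.pack('e', x))[0]

# fp16 takes 2 bytes per value vs 4 for fp32: that's the memory halving.
print(struct.calcsize('e'), struct.calcsize('f'))  # 2 4

# fp16 keeps ~10 mantissa bits (~3 decimal digits), so values
# acquire a small relative error on conversion:
x = 0.1234567
y = to_fp16_and_back(x)
print(abs(x - y) / x)  # on the order of 1e-4
```

In practice this per-weight rounding usually shifts translation quality only slightly, but the exact impact is model-dependent, which is why measuring BLEU before and after is the safe way to check.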
I am not aware of a built-in method to compute the BLEU score of a given model; I would probably need a translation dataset and evaluate against it to answer the question myself.
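For reference, BLEU itself is simple enough to sketch in plain Python; this is a minimal, unsmoothed sentence-level version (for real evaluation a library such as sacrebleu is the usual choice, this is only to show what the metric computes):

```python
import math
from collections import Counter

def bleu(candidate, reference, max_n=4):
    """Minimal sentence-level BLEU, no smoothing: geometric mean of
    1..max_n n-gram precisions times a brevity penalty."""
    cand, ref = candidate.split(), reference.split()
    precisions = []
    for n in range(1, max_n + 1):
        cand_ngrams = Counter(tuple(cand[i:i + n]) for i in range(len(cand) - n + 1))
        ref_ngrams = Counter(tuple(ref[i:i + n]) for i in range(len(ref) - n + 1))
        overlap = sum((cand_ngrams & ref_ngrams).values())  # clipped matches
        if overlap == 0:
            return 0.0  # unsmoothed: any empty precision zeroes the score
        precisions.append(overlap / max(sum(cand_ngrams.values()), 1))
    # Brevity penalty: punish candidates shorter than the reference.
    bp = 1.0 if len(cand) > len(ref) else math.exp(1 - len(ref) / max(len(cand), 1))
    return bp * math.exp(sum(math.log(p) for p in precisions) / max_n)

print(bleu("the cat sat on the mat", "the cat sat on the mat"))  # 1.0
print(bleu("a b c", "x y z"))  # 0.0
```

To compare fp32 vs fp16 on a real model, one would translate a held-out test set with each variant and score the outputs against the references with corpus-level BLEU.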
So, does anyone have insights on this?