The Category of Model-based Translation Evaluation Methods

ywan · March 18, 2022, 2:33am

Hi there!

Recently I want to find whether there is an existing or related model category that suits the use of model-based metric/quality estimation(QE) methods, e.g. COMET/TransQuest.

The model architecture mainly contains two parts: a fine-tuned pretrained language model like BERT/XLM-R, and a designed multi-layer perceptron (MLP). The final output of metric/QE model is a single scalar value.

I noticed that BLEURT applies BERTForSequenceClassification model as initialization. However, I find that the implementation only contains one linear layer inside MLP module. For some approaches like COMET, this module may contains several linear modules, and activations are applied between any adjacent two of them.

Anyone got a clue? Thanks!

Topic		Replies	Views
Evaluation metrics for BERT-like LMs Research	4	4612	December 6, 2024
How to correctly evaluate a Masked Language Model? 🤗Transformers	3	4386	August 11, 2023
Getting the MLM accuracy for the BERT model I am training from scratch Beginners	7	5354	October 5, 2023
[new model] FSMT has been released + 9 models ported 🤗Transformers	3	1146	September 25, 2020
Quantization of Transformers model 🤗Transformers	0	75	May 29, 2024

The Category of Model-based Translation Evaluation Methods

Related topics