"probability/confidence" measurement of DONUT on s_rvlcdip (document classification task)

tsabar · January 15, 2023, 8:20pm

is it possible to get the “confidence”/“logit”/“probability” measure of the DONUT model on the task of document type classification task and not just the final “class/token”?
I’ve seen that the last layer of the model is Linear:

          (3): MBartDecoderLayer(
            (self_attn): MBartAttention(
              (k_proj): Linear(in_features=1024, out_features=1024, bias=True)
              (v_proj): Linear(in_features=1024, out_features=1024, bias=True)
              (q_proj): Linear(in_features=1024, out_features=1024, bias=True)
              (out_proj): Linear(in_features=1024, out_features=1024, bias=True)
            )
            (activation_fn): GELUActivation()
            (self_attn_layer_norm): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
            (encoder_attn): MBartAttention(
              (k_proj): Linear(in_features=1024, out_features=1024, bias=True)
              (v_proj): Linear(in_features=1024, out_features=1024, bias=True)
              (q_proj): Linear(in_features=1024, out_features=1024, bias=True)
              (out_proj): Linear(in_features=1024, out_features=1024, bias=True)
            )
            (encoder_attn_layer_norm): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
            (fc1): Linear(in_features=1024, out_features=4096, bias=True)
            (fc2): Linear(in_features=4096, out_features=1024, bias=True)
            (final_layer_norm): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
          )
        )
        (layernorm_embedding): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
        (layer_norm): LayerNorm((1024,), eps=1e-05, elementwise_affine=True)
      )
    )
    (lm_head): Linear(in_features=1024, out_features=57525, bias=False)
  )
)

Cognitus-Stuti · November 10, 2023, 10:08am

Has anyone found a method to do the same. If yes please share here

Topic		Replies	Views
Creating custom Donut model Models	0	716	March 16, 2023
Different model performance after saving and loading Donut model 🤗Transformers	1	354	July 6, 2024
Donut fine tuning question 🤗Optimum	0	1630	October 16, 2023
Donut base-sized model, pre-trained only for a new language tutorial Models	2	1051	February 19, 2023
Token classification probability and scoring 🤗Transformers	0	749	November 23, 2020

"probability/confidence" measurement of DONUT on s_rvlcdip (document classification task)

Related topics