How to build a custom question-answering head?

merve · March 29, 2022, 8:24am

Hello

For Transformers models, it's best to use the model's built-in loss. For a more native Keras implementation, and an explanation of why it works this way, you can check out this tutorial. In short, you can just call compile() without passing a loss:

As a convenience, all Transformers models come with a default loss which matches their output head, although you’re of course free to use your own. Because the built-in loss is computed internally during the forward pass, when using it you may find that some Keras metrics misbehave or give unexpected outputs.
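To make "a default loss which matches their output head" a bit more concrete for question answering, here is a minimal pure-Python sketch of the kind of loss a span-extraction head usually trains with: cross-entropy on the start logits and on the end logits, averaged. This is an illustration under that assumption, not the library's actual code; the names cross_entropy and qa_span_loss are hypothetical, and the logits are made-up toy values.

```python
import math


def cross_entropy(logits, target):
    """Cross-entropy of one example: -log softmax(logits)[target]."""
    # Subtract the max before exponentiating for numerical stability.
    m = max(logits)
    log_sum_exp = m + math.log(sum(math.exp(x - m) for x in logits))
    return log_sum_exp - logits[target]


def qa_span_loss(start_logits, end_logits, start_position, end_position):
    """Hypothetical QA-head loss: mean of the start and end cross-entropies.

    start_logits / end_logits hold one score per token in the sequence;
    start_position / end_position are the gold token indices of the answer.
    """
    start_loss = cross_entropy(start_logits, start_position)
    end_loss = cross_entropy(end_logits, end_position)
    return (start_loss + end_loss) / 2.0


# Toy 4-token sequence whose answer spans tokens 1..2 (made-up numbers).
start_logits = [0.1, 2.0, -1.0, 0.3]
end_logits = [-0.5, 0.2, 1.5, 0.0]
loss = qa_span_loss(start_logits, end_logits, start_position=1, end_position=2)
print(f"span loss: {loss:.4f}")
```

If you do write your own head, a loss with this shape is what you would pass to compile() in place of the built-in one. The labels would then need to be supplied as Keras targets rather than inside the model's input dictionary.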
