How does BERT actually answer questions?

theudster · March 10, 2021, 3:30pm

have been trying to understand how the BERT model works. Specifically, I am trying to understand how it picks up up answers to questions on a given passage. I have tried following thisblog post and whilst It has given me a nice intuition, I would like to better understand what is happening under the hood.

From my understanding, the question and paragraph are tokenised separately and then go through the transformer model. Then, the dot product between the ‘transformed’ tokens and a START/END token is taken, with the higher result giving you that start and Finnish of the answer.

What I would like to understand, what happens to the tokens in this “transformation” (i.e feedforward through the model) that makes it possible to take a dot product and therefore indicate if a word is a START/END.

lewtun · March 11, 2021, 8:30pm

Hi @theudster, you can find a detailed tutorial on question-answering with transformers here: https://colab.research.google.com/github/huggingface/notebooks/blob/master/examples/question_answering.ipynb

Topic		Replies	Views
Question about BERT for qa Beginners	0	594	June 30, 2022
SQuAD with BERT tokenizer: Mismatch between span and token boundaries Models	0	505	November 12, 2021
How to analyze ROCstories with `BertForQuestionAnswering`? 🤗Transformers	1	287	November 5, 2020
How can I put multiple questions in the same context at once using Question-Answering technique (i'm using BERT)? Beginners	2	1466	October 1, 2021
Q & A Model Robustness for concluding periods Models	2	364	January 25, 2021

How does BERT actually answer questions?

Related topics