I’ve fine-tuned this Question-Answering model to fit a specific business use case we have (identifying the name of a company from a piece of text). When it comes to inference, I’ve found, as @sgugger explains very clearly in this notebook, that the best answer isn’t always the one with the highest start and end logits, since the highest-scoring combination can produce an answer that is too long or too short (just one character).
So when predicting with this model locally, I wrote a return_best_combination function that finds the most practical answer using the list of logit scores, roughly as sketched below.
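For illustration, this is a simplified sketch of that idea (the real version also maps token indices back to character offsets in the original text):

import numpy as np

def return_best_combination(start_logits, end_logits, max_answer_len=30, top_k=1):
    """Pick the highest-scoring valid (start, end) span.

    A span is valid if start <= end and its length is at most max_answer_len,
    which filters out the too-long / one-character answers mentioned above.
    """
    start_logits = np.asarray(start_logits)
    end_logits = np.asarray(end_logits)

    # Score every candidate span as start_logit + end_logit.
    scores = start_logits[:, None] + end_logits[None, :]

    candidates = []
    for start in range(len(start_logits)):
        for end in range(start, min(start + max_answer_len, len(end_logits))):
            candidates.append((float(scores[start, end]), start, end))

    # Highest score first; return the top_k best spans.
    candidates.sort(key=lambda c: c[0], reverse=True)
    return candidates[:top_k]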
When I used this model through the SageMaker API, I realised it returns just a single answer with a score assigned to it. I wanted to check how this answer is produced (happy to just be pointed to the source code if it’s available) and whether it’s possible to return the n most likely answers instead of just one.
Understood, that’s very helpful, thank you. Looking at the code, it seems that max_answer_len, handle_impossible_answer & topk together get me exactly what I’m looking for, so that’s perfect! No need for my own inference.py script. Thank you!
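For anyone landing here later, calling the question-answering pipeline locally with those parameters looks roughly like this (a sketch; swap in your own fine-tuned model, and note the argument is topk on older transformers releases and top_k on newer ones):

from transformers import pipeline

# Placeholder SQuAD model; replace with your own fine-tuned checkpoint.
qa = pipeline("question-answering", model="distilbert-base-cased-distilled-squad")

answers = qa(
    question="Which company is mentioned?",
    context="The contract was signed by Acme Corp in 2021.",
    top_k=3,                       # return the 3 best spans instead of one
    max_answer_len=15,             # discard spans that are too long
    handle_impossible_answer=True, # allow an empty answer if nothing fits
)
print(answers)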
Can you please elaborate on how you fixed it? It would be really helpful, as I am dealing with the same problem right now.
This is the model I am trying to get multiple inferences from, for which SageMaker is returning just one:
hub = {
    'HF_MODEL_ID': 'valhalla/t5-base-qa-qg-hl',
    'HF_TASK': 'text2text-generation'
}
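For context, that hub config is passed to HuggingFaceModel roughly like this (a sketch; the role and container versions are placeholders that depend on your setup):

import sagemaker
from sagemaker.huggingface import HuggingFaceModel

role = sagemaker.get_execution_role()  # assumes this runs inside SageMaker

huggingface_model = HuggingFaceModel(
    env=hub,                      # HF_MODEL_ID / HF_TASK from above
    role=role,
    transformers_version="4.26",  # example versions; use ones your toolkit supports
    pytorch_version="1.13",
    py_version="py39",
)

predictor = huggingface_model.deploy(
    initial_instance_count=1,
    instance_type="ml.m5.xlarge",
)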
Hey @casafurix! You can use the top_k parameter to get more than one answer. If you set it to 5, for example, you’ll get the top 5 answers ranked by score.
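If the endpoint is running the question-answering task, the request looks roughly like this (a sketch; the parameters dict is forwarded to the underlying pipeline call):

# `predictor` is the HuggingFacePredictor returned by huggingface_model.deploy(...) above.
data = {
    "inputs": {
        "question": "Which company signed the contract?",
        "context": "The contract was signed by Acme Corp in 2021.",
    },
    "parameters": {"top_k": 5},  # return the 5 best answers instead of one
}

result = predictor.predict(data)
print(result)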