Question answering run time

I am using the question answering model, and the run time is about 14 seconds per data point. My project of ~7000 text samples took around 26 hours.

What is the best run time I can expect? Will this improve in the near future?

Can you provide some more information?
Are you running on a GPU, are you using a pipeline, and which model are you using?
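
It would also help to have a per-sample timing measured on a small batch, so the numbers can be compared across setups. Here is a minimal sketch using only the Python standard library. `answer_fn` is a hypothetical placeholder for whatever call you use to answer one sample, and `hours_for` and `time_per_sample` are illustrative names, not part of any library:

```python
import time


def hours_for(seconds_per_sample, total_count):
    """Extrapolate total run time in hours from a per-sample latency."""
    return seconds_per_sample * total_count / 3600


def time_per_sample(answer_fn, samples):
    """Average wall-clock seconds answer_fn takes per sample.

    answer_fn is a placeholder (assumption): pass in your own QA call.
    """
    start = time.perf_counter()
    for sample in samples:
        answer_fn(sample)
    elapsed = time.perf_counter() - start
    return elapsed / len(samples)
```

With the figures from your post, `hours_for(14, 7000)` comes to roughly 27 hours, which matches the run time you reported. Running `time_per_sample` on 20 or so samples before and after a change (for example, moving to a GPU) should show quickly whether the change helps.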