This Q and A dataset that I created might serve as an example. I utilized 3 tags “question:”, “context:” and “answer:”. Along with an anchor point tag “|”. This way it avoids creating noise in the data and provides a pattern the bot can recognize.
I can setup a link with the training script, vocab and dataset if you’d like. Could save you a lot of time.