Transformers for small datasets?

I’m there, but it’s too technical to answer…
It’s a subject that someone might write a paper or an article on.