I am looking for an equivalent of Transformer
in flax
and I thought that maybe there would be one in transformers
but couldn’t find it.
I haven’t seen a lot of “base models” except maybe resnets.
I am looking for an equivalent of Transformer
in flax
and I thought that maybe there would be one in transformers
but couldn’t find it.
I haven’t seen a lot of “base models” except maybe resnets.