I am looking for an equivalent of Transformer in flax and I thought that maybe there would be one in transformers but couldn’t find it.
I haven’t seen a lot of “base models” except maybe resnets.
I am looking for an equivalent of Transformer in flax and I thought that maybe there would be one in transformers but couldn’t find it.
I haven’t seen a lot of “base models” except maybe resnets.