That only works for models that are transformer native and not nn.Module/pytorch native, sadly.
2 Likes
That only works for models that are transformer native and not nn.Module/pytorch native, sadly.