Dimension error when trying to use a Neuron-compiled HF model on Inferentia

Also, with Inferentia you should run 4 workers, which should give you almost 4x the throughput. In the example we created, 1 NeuronCore is assigned to each worker.
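As a rough sketch of the one-core-per-worker idea (not the exact code from the example), each worker process could be pinned to its own core through the Neuron runtime's `NEURON_RT_VISIBLE_CORES` environment variable before the model is loaded. The helper name `worker_env` and the default of 4 workers are assumptions for illustration, and whether your runtime version honors this variable is something to check against your installed Neuron SDK.

```python
def worker_env(worker_index: int, num_workers: int = 4) -> dict:
    """Build environment overrides that pin one worker to one NeuronCore.

    Hypothetical helper: worker i gets core i, so 4 workers cover cores 0..3.
    """
    if not 0 <= worker_index < num_workers:
        raise ValueError(f"worker_index must be in [0, {num_workers})")
    # Assumption: the Neuron runtime reads NEURON_RT_VISIBLE_CORES at model load.
    return {"NEURON_RT_VISIBLE_CORES": str(worker_index)}


if __name__ == "__main__":
    # Show the core assignment each of the 4 workers would receive.
    for i in range(4):
        print(i, worker_env(i))
```

Each worker would apply its mapping (for example with `os.environ.update(worker_env(i))`) before loading the compiled model, so that no two workers compete for the same core.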
