Also, with Inferentia you should have 4 workers, which means you should get almost 4x the throughput. In the example we created, 1 Neuron core is assigned to 1 worker.
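To make the "almost 4x" concrete, here is a rough back-of-the-envelope sketch. The function name `estimated_throughput` and the `efficiency` factor are hypothetical, for illustration only: the 0.95 default is an assumed placeholder for scaling overhead, not a measured value, so benchmark your own endpoint for real numbers.

```python
def estimated_throughput(per_worker_rps: float, workers: int = 4,
                         efficiency: float = 0.95) -> float:
    """Estimate aggregate requests/second with one Neuron core per worker.

    per_worker_rps: throughput you measured for a single worker.
    workers: number of workers (4 in the example discussed here).
    efficiency: ASSUMED scaling factor in (0, 1]; 1.0 means perfect
        linear scaling. The 0.95 default is illustrative, not measured.
    """
    if workers < 1:
        raise ValueError("workers must be at least 1")
    if not 0 < efficiency <= 1:
        raise ValueError("efficiency must be in (0, 1]")
    return per_worker_rps * workers * efficiency


if __name__ == "__main__":
    # Hypothetical input: 100 requests/second measured on one worker.
    print(estimated_throughput(100.0))
```

With perfect scaling (`efficiency=1.0`) this gives exactly 4x the single-worker number; anything below 1.0 models the "almost".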