I want to use tensor parallelism during model inference, but it appears to be supported only for training. How can I customize tensor parallelism so that it also works for inference?