I’ve seen similar issues with deployment failures related to GPU availability. From what you’re describing, it seems like the GPU instance may not be available when the model tries to deploy, which causes the 502 error. One possible solution is to try selecting a different instance type or region during deployment to ensure that there are available GPU resources at the time of deployment. Also, double check if there’s any region specific resource limitation that might be causing the issue.
1 Like