https://api-inference.huggingface.co/models/HuggingFaceH4/zephyr-7b-beta/v1/chat/completions
I think this format’s endpoint is either old or has been retained for compatibility reasons. Let’s change it to the new implementation.