Connection Error on Inference Endpoint for Bart-Large-Cnn

ujjawal0911 · May 30, 2023, 5:03pm

Hey so I had this code which I was using to make an inference request to the huggingface server. It used to work fine but just now it started giving this error

def bart_inference(text: str, percent: int) -> str:
        total_length = len(text.split(" "))
        min_length = (total_length * percent) // 100

        headers = {"Authorization": f"Bearer {HF_API_KEY}"}
        url = "https://api-inference.huggingface.co/models/google/pegasus-xsum"

        data = json.dumps(
            {
                "inputs": text,
                "parameters": {
                    "min_length": min_length,
                    "max_length": (min_length + 60),
                    "do_sample": False,
                },
            }
        )
        print("dafs")

        response = requests.request("POST", url, headers=headers, data=data)
        print(response.status_code)
        print(response.json())
        return json.loads(response.content.decode("utf-8"))

The error:-

{'error': "Can't load tokenizer using from_pretrained, please update its configuration: Connection error, and we cannot find the requested files in the cached path. Please try again or make sure your Internet connection is on."}

Topic		Replies	Views
Serverless Inference API error on new model Inference Endpoints on the Hub	5	346	September 9, 2024
HTTPError: 504 Server Error: Gateway Time-out for url: https://huggingface.co/api/models/facebook/bart-large-cnn 🤗Transformers	0	1291	January 17, 2022
Inference API down? Beginners	12	1634	June 8, 2024
Huggingface_hub.client giving error on list_deployed_models() Intermediate	2	85	March 3, 2025
Python HF Not Working Beginners	1	29	July 1, 2025

Connection Error on Inference Endpoint for Bart-Large-Cnn

Related topics