504 Timeout when using flan-ul2

Realmacaroni · August 4, 2024, 12:47am

I constently got this error when I’m trying to use flan-ul2:
"An error occurred: 504 Server Error: Gateway Timeout for url: https://api-inference.huggingface.co/models/google/flan-ul2 (Request ID: ZpqtQA6VFVaMqfN6hJmKe)

Model google/flan-ul2 time out"

Can anyone help with this?
My code is :import os
from langchain.llms import HuggingFaceHub
from langchain.chains import ConversationChain
from langchain.chains.conversation.memory import ConversationBufferMemory

Hugging Face API token

os.environ[“HUGGINGFACEHUB_API_TOKEN”] = “”

flan_ul2 = HuggingFaceHub(
repo_id=“google/flan-ul2”,
model_kwargs={“temperature”:0.5, “max_new_tokens”:128}
)

创建内存管理器

memory = ConversationBufferMemory()

创建对话链

conversation = ConversationChain(
llm=flan_ul2,
verbose=True,
memory=memory
)

try:
response = conversation.predict(input=“Hi there! I am Sam”)
print(response)

response = conversation.predict(input="How are you today?")
print(response)

response = conversation.predict(input="Can you help me with some customer support?")
print(response)

response = conversation.predict(input="My TV is broken. Can you help fix it?")
print(response)

except Exception as e:
print(f"An error occurred: {e}")

Topic		Replies	Views
504 Timeout when using flan-ul2 Beginners	0	11	August 4, 2024
Gateway timeout - google/flan-t5-xxl Beginners	0	15	August 4, 2024
Inference API model timeout (Flan-UL2) Inference Endpoints on the Hub	1	885	May 26, 2023
502 Bad Gateway Error for Flan-UL2 model Inference Endpoints on the Hub	2	556	June 27, 2023
HF Inference API: 503/504 Server Error Inference Endpoints on the Hub	1	227	April 1, 2025

504 Timeout when using flan-ul2

Hugging Face API token

创建内存管理器

创建对话链

Related topics