Detailed parameters not working in BLOOM-176B

uccollab · September 22, 2022, 8:48am

Good morning, I paid for a Community Pro subscription to get faster inference times with BLOOM. However, the “use_gpu” option is ignored when calling the API. “num_return_sequences” and some other parameters seem to be ignored as well. This is a problem because I can’t generate different outputs for a given input, no matter what I try.

Following is a snippet from the Python code I’m running:

import requests

TOKEN = "Bearer ###my token###"
headers = {"Authorization": TOKEN}
API_URL = "https://api-inference.huggingface.co/models/bigscience/bloom"

def query(payload):
    # POST the JSON payload to the hosted Inference API and decode the reply
    response = requests.post(API_URL, headers=headers, json=payload)
    return response.json()

prompt = "Once upon a time"  # placeholder; the actual prompt was not shown in the post

payload = {
    "inputs": prompt,
    "parameters": {
        "max_new_tokens": 40,
        "temperature": 1.0,
        "do_sample": True,
        "return_full_text": False,  # does not work
        "num_return_sequences": 5,  # does not work
        "repetition_penalty": 100.0,
    },
    "options": {
        "use_gpu": True,  # does not work
        "use_cache": True,
        "wait_for_model": True,
    },
}

output_text = query(payload)
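
A possible workaround for the identical outputs, offered as a sketch rather than a confirmed fix: the snippet above sets "use_cache" to True, and the Inference API documentation describes that option as a cache layer for requests it has already seen, so repeated calls with the same input may return a cached result. The version below turns the cache off and sends one request per desired sample instead of relying on "num_return_sequences". The helper names `build_payload` and `sample_n` are made up for illustration, and the response shape (a list of dicts with a "generated_text" key) is an assumption that has not been checked against the live BLOOM endpoint.

```python
from typing import Callable, List


def build_payload(prompt: str, max_new_tokens: int = 40) -> dict:
    """Build one sampling request with the API-side cache disabled."""
    return {
        "inputs": prompt,
        "parameters": {
            "max_new_tokens": max_new_tokens,
            "temperature": 1.0,
            "do_sample": True,
        },
        "options": {
            "use_cache": False,  # ask for a fresh generation on every call
            "wait_for_model": True,
        },
    }


def sample_n(prompt: str, n: int, post: Callable[[dict], object]) -> List[str]:
    """Send n separate requests through `post` and collect the generated texts.

    `post` is any function that takes a payload dict and returns the decoded
    JSON response, for example the `query` function from the snippet above.
    """
    outputs = []
    for _ in range(n):
        result = post(build_payload(prompt))
        # Assumed response shape: [{"generated_text": "..."}]
        if isinstance(result, list) and result and "generated_text" in result[0]:
            outputs.append(result[0]["generated_text"])
        else:
            # Surface error payloads instead of silently dropping them
            raise RuntimeError(f"Unexpected response: {result!r}")
    return outputs
```

With the `query` function defined earlier, `sample_n(prompt, 5, query)` would issue five independent requests. This costs five API calls instead of one, so it is only a stopgap while "num_return_sequences" is not honoured.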

uccollab · October 7, 2022, 4:14pm

UP. Is anyone from HF able to help?
