BLOOM outputs only a few tokens

Hi, I am trying to run inference on BLOOM using the Inference API. It only produces a few tokens (at most 1-3 sentences), even if I set "min_length" very high, for instance "min_length": 1024. How can I generate more tokens with BLOOM? Is there a limitation around this, or am I passing the parameters incorrectly?

This is what my entry looks like:


```python
import requests

# Inference API endpoint for BLOOM
API_URL = "https://api-inference.huggingface.co/models/bigscience/bloom"
HF_TOKEN = "api_org_XXXXXXX"

headers = {"Authorization": f"Bearer {HF_TOKEN}"}

prompt_text = """Text:
Some prompt text is here because"""

json_ = {
    "inputs": prompt_text,
    "parameters": {
        "top_p": 1.0,
        "temperature": 0.72,
        "min_length": 1024,
        "return_full_text": False,
    },
    "options": {"use_cache": False},
}

response = requests.post(API_URL, headers=headers, json=json_)
output = response.json()
```
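For reference, the text-generation task of the Inference API nests sampling controls under a "parameters" key, and the amount of generated text is controlled by "max_new_tokens" rather than "min_length". Below is a minimal payload sketch; the 250-token value is an assumption about the cap the hosted BLOOM endpoint enforces per call, so treat it (and the endpoint URL) as unverified:

```python
# Sketch: build a text-generation payload that asks for more output tokens.
# ASSUMPTIONS: the endpoint URL and the 250-token value are not confirmed
# here; check the Inference API docs for the current per-request limit.

API_URL = "https://api-inference.huggingface.co/models/bigscience/bloom"

def build_payload(prompt, new_tokens=250):
    return {
        "inputs": prompt,
        "parameters": {
            "max_new_tokens": new_tokens,  # governs output length
            "top_p": 1.0,
            "temperature": 0.72,
            "return_full_text": False,
        },
        "options": {"use_cache": False},
    }

# For outputs longer than one call allows, loop: append each response's
# "generated_text" to the prompt and request again, e.g.
#   out = requests.post(API_URL, headers=headers, json=build_payload(prompt)).json()
#   prompt += out[0]["generated_text"]
```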