Thanks for your reply! I checked again, and it still generates only one word for me. I’m sending the following JSON in my request to https://api-inference.huggingface.co/models/EleutherAI/gpt-j-6B:
{
  "inputs": "Nowadays, many users would like to upgrade old hard drive to SSD with Windows installed, or reinstall Windows 10 on SSD afterward. The faster boot speed and reading & writing speed make it known as a better boot drive.\r\n\r\nTo be specifc, after installing the operating system on a new SSD, you’ll find the computer boots up faster, and runs smoothly even with muliple programs in the background.\r\n\r\nSo let’s",
  "parameters": {
    "return_full_text": false,
    "max_new_tokens": 100,
    "temperature": 0.8
  },
  "options": {
    "use_cache": false
  }
}
And this is the response I get back:
HTTP/1.1 200 OK
date: Wed, 29 Sep 2021 06:56:12 GMT,Wed, 29 Sep 2021 06:56:21 GMT
server: istio-envoy
x-compute-time: 0.1596
x-compute-type: gpu
access-control-expose-headers: x-compute-type, x-compute-time
x-compute-characters: 406
content-length: 27
content-type: application/json
x-envoy-upstream-service-time: 177
[{"generated_text":" get"}]
Am I doing something wrong on my side?
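
In case it helps to reproduce this, here is a minimal Python sketch of the same request. It only rebuilds the JSON body shown above and posts it to the same URL. The helper names (`build_payload`, `build_request`, `query`) are made up for illustration, and the `Authorization: Bearer` header with an optional token is an assumption on my part, since it is not part of the request I pasted.

```python
import json
import urllib.request

# Same endpoint as in the request above.
API_URL = "https://api-inference.huggingface.co/models/EleutherAI/gpt-j-6B"


def build_payload(prompt, max_new_tokens=100, temperature=0.8):
    """Build the same JSON body as the request shown above."""
    return {
        "inputs": prompt,
        "parameters": {
            "return_full_text": False,
            "max_new_tokens": max_new_tokens,
            "temperature": temperature,
        },
        "options": {"use_cache": False},
    }


def build_request(payload, api_token=None):
    """Wrap the payload in a POST request (hypothetical helper)."""
    headers = {"Content-Type": "application/json"}
    if api_token:
        # Assumption: a token is passed as a Bearer header; not shown in the post.
        headers["Authorization"] = f"Bearer {api_token}"
    body = json.dumps(payload).encode("utf-8")
    return urllib.request.Request(API_URL, data=body, headers=headers, method="POST")


def query(payload, api_token=None):
    """Send the request and return the decoded JSON response."""
    with urllib.request.urlopen(build_request(payload, api_token)) as resp:
        return json.loads(resp.read().decode("utf-8"))


# Example usage (performs a real network call, so it is left commented out):
# result = query(build_payload("So let's"))
# print(result)  # a list of dicts with a "generated_text" key
```

Printing the length of `generated_text` from the result makes it easy to see whether more than one word comes back when `max_new_tokens` is changed.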