Accelerated Inference for GPT-J using JavaScript

DLiebman · April 2, 2022, 4:44pm

Hi. This is my first post on the forum. I am interested in the transformers topic, and specifically Eleuther’s GPT-J.

I am trying to use Accelerated Inference. I have created an api key, but I suspect that I need to somehow associate the api key with the Eleuther GPT-J model. Either that or there’s something else in my code that does not work with the model I’m interested in. Possibly the model is not working right on the server side.

I created a project, and doing so got an API key via the web site, but I think it’s not associated with the right model. I am only signed up for the free tier right now. If the model works out, I’d be interested in a higher tier.

Right now, using JavaScript, I am able to query the model, but it returns what seems to be only one or two tokens. The response is fast, and that’s encouraging, but I cannot get more than that one or two tokens. I am using the API key as the Bearer token in the POST request. My code is on the messy side, but a link is below to the file where I use fetch. My fetch calls are similar to the JavaScript example on the Hugging Face model’s page.

github.com

radiodee1/electron-gpt/blob/main/src/js/controller/MainGPT.js

import { details } from "../model/Details.js";
import { chat } from "../model/ChatDict.js";
import { blacklist } from "../model/Blacklist.js";

var temp = 0;
var response_text = "";

export function setApiKey(engine, key) {
  console.log(key, "key at main");
  details[engine]["api_key"] = key;
}

export async function filter(line) {
  var yy = line;
  //chat.status_message = "Waiting";

  for(var i = 0; i < blacklist.length; i ++) {
    if (yy.includes(blacklist[i])) {
      console.log("blacklist:", yy);
      return "I don't understand.";

This file has been truncated.

Thank you for your time.

DLiebman · April 3, 2022, 1:12pm

TLDR: I’m only getting one token with each request. Does anyone have Javascript that works in this situation?
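
One possible cause, offered as a sketch rather than a confirmed fix: for text-generation models, the hosted Inference API accepts an optional `parameters` object in the JSON body, and if `max_new_tokens` is not set the default output can be very short. The example below builds a POST request that sets it explicitly. The endpoint URL, the model id `EleutherAI/gpt-j-6B`, the helper names `buildRequest` and `queryGptJ`, and the value 50 are all assumptions for illustration and are not taken from the linked file. As far as the documented request format goes, the model is selected by the URL path, so the API key itself should not need to be associated with a particular model.

```javascript
// Sketch only: the endpoint URL, model id, and parameter values are assumptions.
const API_URL = "https://api-inference.huggingface.co/models/EleutherAI/gpt-j-6B";

// Hypothetical helper: builds the fetch() arguments separately so they can be
// inspected (or logged) without making a network call.
function buildRequest(apiKey, prompt, maxNewTokens = 50) {
  return {
    url: API_URL,
    init: {
      method: "POST",
      headers: {
        Authorization: `Bearer ${apiKey}`,
        "Content-Type": "application/json",
      },
      body: JSON.stringify({
        inputs: prompt,
        // Ask for more than the default number of generated tokens and
        // return only the continuation, not the prompt plus continuation.
        parameters: { max_new_tokens: maxNewTokens, return_full_text: false },
        // Wait for the model to load instead of failing while it is loading.
        options: { wait_for_model: true },
      }),
    },
  };
}

// Hypothetical helper: sends the request and extracts the generated text.
async function queryGptJ(apiKey, prompt, maxNewTokens = 50) {
  const { url, init } = buildRequest(apiKey, prompt, maxNewTokens);
  const response = await fetch(url, init);
  if (!response.ok) {
    throw new Error(`Inference API returned ${response.status}: ${await response.text()}`);
  }
  const data = await response.json();
  // Text-generation responses are expected to look like [{ generated_text: "..." }];
  // anything else is returned unchanged so it can be inspected.
  return Array.isArray(data) && data[0] ? data[0].generated_text : data;
}
```

A call might look like `queryGptJ(myKey, "Hello, my name is", 60).then(console.log)`, where `myKey` stands in for the real API key. Logging `buildRequest(...).init.body` first is a quick way to check that the `parameters` object is actually being sent.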
