Controlling bos, eos, etc in api-inference

cristivlad · April 22, 2021, 7:59pm

Is there a way to control the beginning of sentence, end of sentence tokens through the inference api? I could not find it in the documentation.

Narsil · May 4, 2021, 3:18pm

Hi @cristivlad ,

Currently there is no way to override those within the API.
We are adding and end_sequence parameter to enable stopping the generation when using prompt-like generation (for GPT-Neo for instance).

For BOS, what did you want to do with it ?

Cheers,
Nicolas

louis030195 · January 12, 2022, 8:33pm

I tried end_sequence and indeed works, I would suggest updating the doc, thanks in any case!

    data = json.dumps(
        {
            "inputs": "1+1=2\n2+2=",
            "parameters": {
                "max_length": 50,
                "num_return_sequences": 1,
                "return_text": False,
                "return_full_text": False,
                "do_sample": True,
                "top_k": 50,
                "top_p": 0.95,
                "end_sequence": "\n"
            },
            "options": {
                "wait_for_model": True,
            },
        }
    )

Topic		Replies	Views
Closed end text generation 🤗Transformers	0	451	January 3, 2023
Add BOS and EOS when encoding a sentence 🤗Tokenizers	2	14536	August 22, 2022
GPT-2 special tokens Models	2	1947	February 20, 2024
How to insert a end-sequence Beginners	4	2825	March 22, 2022
How does GPT decide to stop generating sentences without EOS token? 🤗Transformers	13	24179	August 19, 2024

Controlling bos, eos, etc in api-inference

Related topics