TextGeneration Inference Model

Ahmkhan · September 21, 2023, 9:50pm

Hi,

Is there a way to calculate the time taken to generate the first token with a TGI model? Does TGI provide this metric by default in the generated response output, or is there some other way to capture it?

Narsil · September 22, 2023, 8:11am

This is the prefill time, which you can see either in the Prometheus metrics or in text-generation-benchmark. Cheers.
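If a client-side number is also useful, here is a minimal sketch that times how long a token stream takes to yield its first item. It is not taken from the TGI docs: `time_to_first_item` and `fake_token_stream` are hypothetical names, and the fake stream only stands in for a real streaming response. A time measured this way includes network latency and any queueing, so it will not match the server-side prefill time exactly.

```python
import time
from typing import Iterable, Iterator, Tuple, TypeVar

T = TypeVar("T")


def time_to_first_item(stream: Iterable[T]) -> Tuple[float, T, Iterator[T]]:
    """Return (seconds until the first item, the first item, the remaining iterator).

    Hypothetical helper: works with any lazily evaluated iterable of tokens.
    """
    start = time.perf_counter()
    iterator = iter(stream)
    first = next(iterator)  # blocks until the first token is produced
    elapsed = time.perf_counter() - start
    return elapsed, first, iterator


def fake_token_stream(delay_s: float = 0.05) -> Iterator[str]:
    """Stand-in for a streaming response (assumption, not a TGI API)."""
    time.sleep(delay_s)  # simulates the wait before the first token
    yield "Hello"
    yield " world"


if __name__ == "__main__":
    ttft, first, rest = time_to_first_item(fake_token_stream())
    print(f"time to first token: {ttft:.3f}s, first token: {first!r}")
    print("remaining tokens:", list(rest))
```

The timer only captures the full wait if the stream is lazy, i.e. the request is actually sent when iteration starts. If your client sends the request as soon as the stream object is created, start the timer before creating it instead.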

Ahmkhan · September 22, 2023, 4:03pm

What is text-generation-benchmark? Is it a separate repo?
