Why is BERT not in TGI?

I want to run the BERT model. Why is it not listed in text-generation-inference's Supported Models and Hardware?


BERT is an encoder-only Transformer, suited to discriminative tasks such as classification. TGI is built for decoder-only Transformers, which handle generative tasks by producing tokens one at a time (autoregressively).
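The distinction can be sketched with a toy example (plain Python stand-ins, not real model code): an encoder-style classifier makes one pass over the whole input and emits a single label, while a decoder-style generator runs a loop in which each new token depends on everything emitted so far. That loop is what TGI is built to serve efficiently.

```python
def classify(tokens):
    """Encoder-style inference: one pass over the full input, one label out.
    A trivial counting rule stands in for a real BERT classifier."""
    return "positive" if tokens.count("good") > tokens.count("bad") else "negative"

def generate(prompt, steps):
    """Decoder-style inference: a loop that appends one token per step,
    where each step sees all previously emitted tokens."""
    tokens = list(prompt)
    for _ in range(steps):
        # Stand-in for a language model's next-token prediction.
        next_token = f"tok{len(tokens)}"
        tokens.append(next_token)
    return tokens

print(classify(["this", "is", "good"]))   # one-shot: "positive"
print(generate(["hello"], 3))             # iterative: grows one token at a time
```

TGI's optimizations (continuous batching, KV caching, token streaming) all target that generation loop, which is why a single-pass encoder model like BERT is out of scope.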

If you want to optimize BERT for production, I’d recommend taking a look at the ONNX export support available in the Optimum library: Convert Transformers to ONNX with Hugging Face Optimum.