PEFT + Inference

Is there a way to get PEFT to work with inference endpoints?

Ideally, we should be able to support multiple PEFT models with a common inference endpoint for the base model.


Any updates here?

You could configure a custom handler that lets you specify code to load the base model and its adapters; see Create custom Inference Handler.
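As a rough sketch of what such a handler could look like: a `handler.py` that reads the adapter's `adapter_config.json` to find the base model, loads the base with transformers, and applies the PEFT adapter on top. The base-model resolution and generation plumbing here are my assumptions, not an official template.

```python
# handler.py -- sketch of a custom Inference Endpoints handler that applies
# a PEFT (LoRA) adapter on top of its base model.
from typing import Any, Dict, List


class EndpointHandler:
    def __init__(self, path: str = "") -> None:
        # Heavy imports are deferred so the module can be imported and
        # inspected without transformers/peft installed.
        from peft import PeftConfig, PeftModel
        from transformers import AutoModelForCausalLM, AutoTokenizer

        # `path` points at the adapter repo the endpoint was created from;
        # its adapter_config.json names the base model to load underneath.
        peft_config = PeftConfig.from_pretrained(path)
        base_id = peft_config.base_model_name_or_path
        self.tokenizer = AutoTokenizer.from_pretrained(base_id)
        base = AutoModelForCausalLM.from_pretrained(base_id, device_map="auto")
        self.model = PeftModel.from_pretrained(base, path)
        self.model.eval()

    def __call__(self, data: Dict[str, Any]) -> List[Dict[str, str]]:
        # Inference Endpoints passes the request body as a dict with "inputs".
        prompt = data["inputs"]
        params = data.get("parameters", {})
        inputs = self.tokenizer(prompt, return_tensors="pt").to(self.model.device)
        output_ids = self.model.generate(**inputs, **params)
        text = self.tokenizer.decode(output_ids[0], skip_special_tokens=True)
        return [{"generated_text": text}]
```

The endpoint would then instantiate `EndpointHandler(path)` once at startup and call it per request, e.g. `handler({"inputs": "Hello", "parameters": {"max_new_tokens": 32}})`. Note this still serves one adapter per endpoint; multiple adapters over one base would need extra routing logic in `__call__`.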

How can we use this with text-generation-inference? I assume that loading it through a custom handler will serve it via transformers, no?