How to implement the generate interface for a custom architecture

gokul-krish · March 15, 2024, 11:19pm

Hi folks,
I have a custom LLaVa-style architecture that I’d like to run inference on using the generate interface.

I looked around for docs or HOWTOs on what to implement but came up empty. I did look at the LLaVa codebase and tried to imitate some of its functions, but I feel like I’m copying code rather than understanding what needs to be implemented and why.

I’m looking for implementation cookbooks, docs, or a better-documented implementation that I can reason through.

Thanks!
