HF hosted Inference API for a fine-tuned model with LoRA

Hi All,

I need some help. When I fine-tuned a flan-t5 model with LoRA, I got adapter_config.json and adapter_model.bin instead of config.json and pytorch_model.bin. I used the Trainer to push to the Hub. If I understand correctly, it seems that the HF Inference API doesn't read the adapter_config.json, because it asks for config.json.
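For context, the training setup was roughly like the sketch below (base model name, LoRA hyperparameters, and output path are placeholders, not my exact script). Because the model is wrapped as a PEFT model, saving or pushing it only writes the adapter files, which is why the repo has no config.json or pytorch_model.bin:

```python
# Rough sketch (assumed names and hyperparameters): LoRA on flan-t5 with peft.
from transformers import AutoModelForSeq2SeqLM
from peft import LoraConfig, get_peft_model, TaskType

base = AutoModelForSeq2SeqLM.from_pretrained("google/flan-t5-base")  # placeholder base model

lora_config = LoraConfig(
    task_type=TaskType.SEQ_2_SEQ_LM,
    r=16,
    lora_alpha=32,
    lora_dropout=0.05,
    target_modules=["q", "v"],  # T5 attention projection layers
)
model = get_peft_model(base, lora_config)

# Saving (or pushing via the Trainer) a PeftModel writes only the adapter:
# adapter_config.json + adapter_model.bin, not config.json / pytorch_model.bin.
model.save_pretrained("flan-t5-lora-adapter")
```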

How can I make this work? The model generates outputs fine in a notebook. Any ideas on how to fix this?
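For reference, loading and generating in the notebook looks roughly like this (base model and adapter repo id are placeholders):

```python
from transformers import AutoModelForSeq2SeqLM, AutoTokenizer
from peft import PeftModel

base = AutoModelForSeq2SeqLM.from_pretrained("google/flan-t5-base")          # placeholder
tokenizer = AutoTokenizer.from_pretrained("google/flan-t5-base")
model = PeftModel.from_pretrained(base, "my-username/flan-t5-lora-adapter")  # placeholder repo

inputs = tokenizer("Summarize: LoRA adds low-rank adapters to a frozen model.", return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=50)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```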

Got an answer. According to HF, “Inference API does not yet support adapter-transformers models for this pipeline type.”


Looking at this as an example, surely you can host a fine-tuned model using the Inference API?
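If the Inference API specifically needs a config.json, one workaround (untested here) would be to merge the LoRA weights back into the base model and push the merged checkpoint, so the repo contains a standard full model. Repo ids below are placeholders:

```python
# Sketch: fold the LoRA adapter into the base weights and push a full model,
# which gives the repo a config.json and pytorch_model.bin (placeholder repo ids).
from transformers import AutoModelForSeq2SeqLM, AutoTokenizer
from peft import PeftModel

base = AutoModelForSeq2SeqLM.from_pretrained("google/flan-t5-base")
model = PeftModel.from_pretrained(base, "my-username/flan-t5-lora-adapter")

merged = model.merge_and_unload()  # merges the LoRA deltas into the base weights
merged.push_to_hub("my-username/flan-t5-merged")

tokenizer = AutoTokenizer.from_pretrained("google/flan-t5-base")
tokenizer.push_to_hub("my-username/flan-t5-merged")
```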