Llama/Mistral Finetuning for Inference API

lucas-w · March 30, 2024, 7:56pm

I’m trying to finetune Llama (or Mistral) and host it on Inference API.

The code I found online to finetune Llama used Peft but I can’t seem to get it on Inference API, as there’s no obvious buttons on the repo to make calls to it.

Is it because the code uses Peft, and if so, can you direct me to some code that would be Inference API compatible?

Topic		Replies	Views
Help Me Uploading Fine Tuned Models For Inference Api Beginners	0	99	May 31, 2024
Bad Performance Finetuning Llama Chat and Instruct Models on GSM8K Beginners	5	1071	December 5, 2024
Llama2 fine-tunning with PEFT QLora and testing the model 🤗Transformers	13	15217	December 21, 2023
How to perform finetuning on llama2 adapters Models	0	325	September 15, 2023
Fine tune a finetuned model Beginners	1	554	December 16, 2024

Llama/Mistral Finetuning for Inference API

Related topics