How to finetune an LLM with Image-Text pairs

I want to fine-tune THUDM/cogvlm-chat-hf to add domain knowledge. I have a dataset of characters from a cartoon show, labeled with their names, and I want to improve the model’s recognition of these characters for captioning.

Is this possible with AutoTrain?

If not, can anyone point me to a tutorial, or give me some direction? The CogVLM documentation shows how to run the fine-tuning script, but I have not found any information about the format of the dataset it expects.
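For reference, here is the kind of layout I have been assuming while waiting for an answer: a folder of images plus a JSONL metadata file, one image-caption pair per line. This is a guess at a generic format, not the actual format CogVLM's fine-tuning script requires; the file name `train.jsonl` and the keys `image` and `caption` are my own placeholders.

```python
import json
from pathlib import Path

# Hypothetical image-caption pairs; paths and keys are placeholders,
# not a format confirmed by the CogVLM docs.
records = [
    {"image": "images/char_alice_001.jpg", "caption": "Alice from the show, standing in the kitchen."},
    {"image": "images/char_bob_001.jpg", "caption": "Bob from the show, waving at the camera."},
]

# Write one JSON object per line (JSONL).
meta = Path("train.jsonl")
with meta.open("w", encoding="utf-8") as f:
    for rec in records:
        f.write(json.dumps(rec) + "\n")

# Reading it back gives one training example per line.
pairs = [json.loads(line) for line in meta.open(encoding="utf-8")]
print(len(pairs))  # 2
```

If someone can confirm what the actual expected schema is (single caption string vs. a conversation/turns structure), that would answer my question.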