Fine tuning a LLM with a code

Venushki · January 24, 2024, 5:03am

I want to fine-tune an LLM locally to serve as an intelligent code reviewer: a tool for developers that, given a natural language description, identifies and highlights the specific locations in a C# codebase where changes are needed. The goal is to streamline the code review process by showing developers precisely where modifications should be made, based on their high-level descriptions. Although there are suitable LLMs for the task, I can’t figure out how to feed my C# codebase to the LLM (that is, a way for the LLM to read my code files).

AbishekSundar · January 24, 2024, 6:41am

Are you looking to train any specific LLM? I had used GPT2 for a similar task and it worked decently well.

Venushki · January 24, 2024, 7:06am

Yes, I was thinking of Code Llama or Mistral 7B (I can use any open-source LLM that supports a C# codebase)… How did you feed your codebase into the LLM to fine-tune it on the code?

AbishekSundar · January 24, 2024, 7:13am

I created a dataloader function and used Hugging Face’s Trainer. I used GPT-2, not Mistral or Code Llama.
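
For readers wondering what the data-loading step could look like, here is a minimal sketch. It is not the code used in this thread (that was never shared); it only assumes that the codebase is a directory of `.cs` files and that the model is trained on fixed-size, overlapping text chunks. The function names, the 40-line window, and the 5-line overlap are all hypothetical choices for illustration. It uses only the Python standard library; tokenizing the resulting `text` fields and passing them to a trainer is left out.

```python
from pathlib import Path


def collect_cs_files(root):
    """Return every .cs file under root, sorted so runs are reproducible."""
    return sorted(Path(root).rglob("*.cs"))


def chunk_lines(text, max_lines=40, overlap=5):
    """Split source text into overlapping windows of lines.

    Returns a list of (start_line, chunk_text) pairs, where start_line
    is 1-based. The window size and overlap are illustrative assumptions.
    """
    if overlap >= max_lines:
        raise ValueError("overlap must be smaller than max_lines")
    lines = text.splitlines()
    step = max_lines - overlap
    chunks = []
    for start in range(0, max(len(lines), 1), step):
        window = lines[start:start + max_lines]
        if window:
            chunks.append((start + 1, "\n".join(window)))
        # Stop once this window has reached the end of the file.
        if start + max_lines >= len(lines):
            break
    return chunks


def build_records(root, max_lines=40, overlap=5):
    """Turn a C# source tree into a list of dicts, one per chunk."""
    records = []
    for path in collect_cs_files(root):
        text = path.read_text(encoding="utf-8", errors="replace")
        for start_line, chunk in chunk_lines(text, max_lines, overlap):
            records.append({
                "path": str(path.relative_to(root)),
                "start_line": start_line,
                "text": chunk,
            })
    return records
```

Keeping the file path and starting line with each chunk is a design choice aimed at the original goal: if the model is later asked where a change belongs, the training data already ties each piece of code to a location.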

Venushki · January 24, 2024, 9:14am

Can you explain the function, or maybe share the code?

aakashgoel12 · February 27, 2024, 9:00am

@Venushki @AbishekSundar Please share the approach or code you used to prepare the dataset, such as the schema that is fed into the data loader.

CafferyChu · April 25, 2024, 8:48am

Any update here? I’m working on a similar task but have no idea how to feed code files to an LLM. Do I need to create a dataset with comments describing what each function does, or can I just feed the code files to the model? Can anyone help?

Thanks,
Caffery
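
The two options raised in this question correspond to two common dataset shapes, sketched below. This is an illustration only, not an answer confirmed by anyone in the thread. The field names (`text`, `prompt`, `completion`), the prompt wording, and the `path:start-end` answer format are all assumptions; a real training setup may expect a different schema.

```python
import json


def to_plain_text_example(path, code):
    """Raw-code example: the model simply continues training on source text."""
    return {"text": f"// File: {path}\n{code}"}


def to_instruction_example(description, path, start_line, end_line):
    """Supervised pair: a change description mapped to a code location."""
    return {
        "prompt": (
            f"Change request: {description}\n"
            "Where in the codebase should this change be made?"
        ),
        "completion": f"{path}:{start_line}-{end_line}",
    }


def write_jsonl(examples, out_path):
    """Write one JSON object per line, a format many training tools accept."""
    with open(out_path, "w", encoding="utf-8") as f:
        for example in examples:
            f.write(json.dumps(example, ensure_ascii=False) + "\n")
```

The first shape needs no labeling but only exposes the model to the code. The second matches the task described at the top of the thread, but it requires labeled pairs, which would have to come from somewhere such as past change requests linked to the files they touched.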

lcarrere · February 5, 2025, 7:28pm

– Disclaimer: I’m the main developer of this product –

LM-Kit.NET offers a straightforward API for running inference with and fine-tuning LLMs locally.

You can find a fine-tuning demo at this link: github.com/LM-Kit/lm-kit-net-samples/tree/main/console_net6/finetuning

To obtain a free community license: Community Edition Licensing Overview | LM-Kit

If you need further information or assistance on this topic, I’ll be more than happy to help.
