Code review using Codellama-Instruct

navyaprasad03 · May 9, 2025, 4:07am

I’m working on an AI code reviewer using CodeLlama-Instruct (experimented with 34B f16 and 70B int4) and could use some advice. My setup involves passing a git diff (showing changes with - for removed lines and + for added lines) along with context retrieved via a RAG model to the prompt. Despite explicitly explaining the git diff format in the prompt, CodeLlama seems to misunderstand the concept of diffs and provides confused or irrelevant comments on the code changes.

For example, it might comment on a removed line (-) as if it’s still part of the code or fail to connect the added lines (+) to the intended change. I’ve tried clarifying the diff syntax in the prompt (e.g., “- indicates a line removed, + indicates a line added”), but the issue persists.

Has anyone encountered similar challenges when using CodeLlama-Instruct or other LLMs for code review tasks involving git diffs? I’d appreciate insights on how to solve this problem.

I can share my prompt or example diffs if that would help spark ideas. Thanks for any suggestions or experiences you can share—this community’s expertise is invaluable!

John6666 · May 9, 2025, 10:14am

There seems to be some research on this.

Even at around 70B parameters, an LLM still struggles with quite a few tasks, so I think it's better to preprocess the diff with your own script to some extent before passing it on; this tends to yield more accurate results. In other words, simplify the task itself.
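As a rough illustration of that kind of preprocessing, the sketch below splits a diff into separate "before" and "after" snippets per hunk, so the model never has to interpret `-`/`+` markers itself. It assumes the input is standard unified output from `git diff`; the names `Hunk`, `split_diff`, and `render_for_prompt` are hypothetical and chosen only for this example, and the prompt wording is just one possible layout.

```python
import re
from dataclasses import dataclass, field

# Matches unified-diff hunk headers such as "@@ -10,7 +10,8 @@ def foo():"
HUNK_HEADER = re.compile(r"^@@ -(\d+)(?:,\d+)? \+(\d+)(?:,\d+)? @@")


@dataclass
class Hunk:
    """One hunk, stored as two plain code snippets (hypothetical helper type)."""
    old_start: int
    new_start: int
    before: list = field(default_factory=list)
    after: list = field(default_factory=list)


def split_diff(diff_text):
    """Split `git diff` output into hunks with separate before/after code."""
    hunks = []
    current = None
    for line in diff_text.splitlines():
        if line.startswith("diff --git"):
            current = None  # new file: skip its header lines until the next hunk
            continue
        match = HUNK_HEADER.match(line)
        if match:
            current = Hunk(int(match.group(1)), int(match.group(2)))
            hunks.append(current)
        elif current is None:
            continue  # file headers (index, ---, +++) before the first hunk
        elif line.startswith("\\"):
            continue  # "\ No newline at end of file" marker
        elif line.startswith("-"):
            current.before.append(line[1:])  # removed: only in the old version
        elif line.startswith("+"):
            current.after.append(line[1:])   # added: only in the new version
        else:
            # Context line: present in both versions.
            text = line[1:] if line.startswith(" ") else line
            current.before.append(text)
            current.after.append(text)
    return hunks


def render_for_prompt(hunk):
    """Format one hunk as labelled before/after blocks for an LLM prompt."""
    before = "\n".join(hunk.before)
    after = "\n".join(hunk.after)
    return (
        f"BEFORE (old file, from line {hunk.old_start}):\n{before}\n\n"
        f"AFTER (new file, from line {hunk.new_start}):\n{after}"
    )
```

Each rendered hunk can then be placed in the prompt alongside the retrieved context, with an instruction to review only the AFTER version and use BEFORE for reference. Whether this actually helps CodeLlama is something to verify on your own diffs.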

Also, if you’re not set on CodeLlama, you could try other coding models as well.


https://gist.github.com/udiedrichsen/979ae7ee3aaaae00cf3e15046ee5bba0

Analyze-Git-Changes-For-A-Problem.md

# Scripts Documentation

## analyze_changes.sh

A powerful Git diff analysis tool that leverages Local Language Models (LLM) to provide intelligent insights about code changes.

### Motivation

When reviewing code changes between commits, especially in large codebases, it can be challenging to:
- Understand the full impact of changes


analyze_changes.sh

#!/usr/bin/env bash
#
# File: analyze_changes.sh
# Description:
#   This script compares two Git commits (or one commit and the current HEAD),
#   identifies modified files, shows diffs, and invokes a local LLM endpoint
#   via HTTP (curl) to analyze potential problematic changes.
#
# Usage:
#   analyze_changes.sh <commit_id_1> [commit_id_2] [text]

