Running and testing BharatGPT-3B-Indic

I tried out BharatGPT-3B-Indic in the following ways:

  1. On my CPU-only laptop, with the model loaded from the Hugging Face Hub
  2. On my laptop, with the model saved in my local file system (see the snippet below)
  3. From Google Colab, with the model pulled from Hugging Face; this worked, albeit slowly
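For option 2, here is a minimal sketch of caching the model locally and reloading it from disk; the local directory path is a hypothetical placeholder:

from transformers import AutoTokenizer, AutoModelForCausalLM

model_id = "CoRover/BharatGPT-3B-Indic"
local_dir = "./bharatgpt-3b-indic"  # hypothetical local path

# First run: download from the Hub and save to disk
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id)
tokenizer.save_pretrained(local_dir)
model.save_pretrained(local_dir)

# Later runs: load entirely from the local directory, no network needed
tokenizer = AutoTokenizer.from_pretrained(local_dir)
model = AutoModelForCausalLM.from_pretrained(local_dir)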

I used a test script with a Gradio UI; I had to turn bitsandbytes (8-bit loading) off:

import torch
from transformers import AutoTokenizer, AutoModelForCausalLM, pipeline
import gradio as gr
import huggingface_hub

huggingface_hub.login("huggingface access token")  # replace with your HF access token
print(huggingface_hub.whoami())

model_id = "CoRover/BharatGPT-3B-Indic"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    device_map="auto",
    load_in_8bit=False,  # bitsandbytes 8-bit loading turned off (CPU-only run)
)

pipe = pipeline(
    "text-generation",
    model=model,
    tokenizer=tokenizer,
)

def generate_response(message):
    messages = [
        {"role": "system", "content": "You are a helpful assistant who responds in Hindi or English."},
        {"role": "user", "content": message},
    ]
    output = pipe(messages, max_new_tokens=256)
    return output[0]["generated_text"]

gr.Interface(
    fn=generate_response,
    inputs="text",
    outputs="text",
    title="Chat with BharatGPT-3B-Indic",
    description="Runs locally (bitsandbytes 8-bit disabled)",
).launch()
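Note that when the pipeline receives a list of chat messages, output[0]["generated_text"] is the whole conversation (system, user, and assistant turns), which is why the responses below print as full message lists. If only the model's reply is wanted, I believe the last message can be extracted like this:

def generate_response(message):
    messages = [
        {"role": "system", "content": "You are a helpful assistant who responds in Hindi or English."},
        {"role": "user", "content": message},
    ]
    output = pipe(messages, max_new_tokens=256)
    # The final entry in the returned conversation is the assistant's turn
    return output[0]["generated_text"][-1]["content"]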

I got the following chat responses.

Input: please translate following quoted sentence in gujarati "I am english speaking"

[{'role': 'system', 'content': 'You are a helpful assistant who responds in Hindi or English.'}, {'role': 'user', 'content': 'please translate following quoted sentence in gujarati "I am english speaking"\n'}, {'role': 'assistant', 'content': 'મારું અંગ્રેજી બોલવાનું છે.'}]

Input: please translate following quoted sentence in gujarati "I am a gujarati"

[{'role': 'system', 'content': 'You are a helpful assistant who responds in Hindi or English.'}, {'role': 'user', 'content': 'please translate following quoted sentence in gujarati "I am a gujarati"\n'}, {'role': 'assistant', 'content': 'મારું નામ ગુજરાતી છે.'}]

Analysis

Input: "I am a Gujarati"
Response: મારું નામ ગુજરાતી છે.

  • The response translates back as "My name is Gujarati.", which is incorrect.
  • The correct translation is હું ગુજરાતી છું. ("I am Gujarati.")

Why this incorrect translation may be happening:

  1. The model may not have been instructed explicitly enough to translate into Gujarati: the system message limits it to responding in Hindi or English, which can bias the output.
  2. BharatGPT, while trained on multiple Indian languages, may not have strong enough grounding in Gujarati, or its chat template may not fully support instruction-following for translation.
  3. The "quoted sentence" format I used in the input may be adding confusion.

I am going to add Gujarati to the system prompt and test again. If anyone has tried this or has test-case code, I would appreciate your input.

Other aspects

I was also thinking of converting the model to GGUF format and running it with llama.cpp, to see if I can run it locally without a GPU. A quantized GGUF should work well even on low RAM (4–6 GB), without the Python/transformers overhead.
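As a rough sketch of that plan: conversion would go through the convert_hf_to_gguf.py script in the llama.cpp repo, and the quantized file could then be loaded via the llama-cpp-python bindings. The file names below are hypothetical and I have not yet run this against this model:

# Conversion (run from a llama.cpp checkout; paths are hypothetical):
#   python convert_hf_to_gguf.py ./bharatgpt-3b-indic --outfile bharatgpt-3b.gguf
#   ./llama-quantize bharatgpt-3b.gguf bharatgpt-3b-Q4_K_M.gguf Q4_K_M

from llama_cpp import Llama  # pip install llama-cpp-python

llm = Llama(
    model_path="bharatgpt-3b-Q4_K_M.gguf",  # hypothetical quantized file
    n_ctx=2048,
)

result = llm.create_chat_completion(
    messages=[
        {"role": "system", "content": "You are a helpful assistant who responds in Gujarati or English."},
        {"role": "user", "content": 'please translate following quoted sentence in gujarati "I am a gujarati"'},
    ],
    max_tokens=256,
)
print(result["choices"][0]["message"]["content"])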

Thank you

Rashmikant Dave


To make sure it was not the system message limiting the output, I changed it to include Gujarati. Here is the code:
import torch
from transformers import AutoTokenizer, AutoModelForCausalLM, pipeline
import gradio as gr
import huggingface_hub

huggingface_hub.login("")  # fill in your HF access token
print(huggingface_hub.whoami())

model_id = "CoRover/BharatGPT-3B-Indic"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    device_map="auto",
    load_in_8bit=False,  # bitsandbytes 8-bit loading turned off
)

pipe = pipeline(
    "text-generation",
    model=model,
    tokenizer=tokenizer,
)

def generate_response(message):
    messages = [
        {"role": "system", "content": "You are a helpful assistant who responds in Gujarati or English."},
        {"role": "user", "content": message},
    ]
    output = pipe(messages, max_new_tokens=256)
    return output[0]["generated_text"]

gr.Interface(
    fn=generate_response,
    inputs="text",
    outputs="text",
    title="Chat with BharatGPT-3B-Indic",
    description="Runs locally (bitsandbytes 8-bit disabled)",
).launch()

I am getting the same result, which is:

[{'role': 'system', 'content': 'You are a helpful assistant who responds in Gujarati or English.'}, {'role': 'user', 'content': 'please translate following quoted sentence in gujarati "I am a gujarati"'}, {'role': 'assistant', 'content': 'મારું નામ ગુજરાતી છે.'}]

In Gujarati it should be હું ગુજરાતી છું., so the problem has nothing to do with the system message I used earlier.
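One more check that might help: printing the exact prompt string the chat template produces, to confirm the instruction actually reaches the model as expected. This reuses the tokenizer from the script above:

messages = [
    {"role": "system", "content": "You are a helpful assistant who responds in Gujarati or English."},
    {"role": "user", "content": 'please translate following quoted sentence in gujarati "I am a gujarati"'},
]

# Render the chat template without tokenizing, so the raw prompt is visible
prompt = tokenizer.apply_chat_template(
    messages,
    tokenize=False,
    add_generation_prompt=True,
)
print(prompt)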


Ollama is easy to install and uses llama.cpp as its backend, so I think it's convenient for testing.
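For anyone going that route, a minimal sketch using the ollama Python client; the model name is hypothetical and assumes the GGUF has already been registered with ollama create:

import ollama  # pip install ollama; assumes the Ollama server is running

# "bharatgpt-3b" is a hypothetical name, created beforehand with:
#   ollama create bharatgpt-3b -f Modelfile   (Modelfile: FROM ./bharatgpt-3b-Q4_K_M.gguf)
response = ollama.chat(
    model="bharatgpt-3b",
    messages=[
        {"role": "user", "content": 'please translate following quoted sentence in gujarati "I am a gujarati"'},
    ],
)
print(response["message"]["content"])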


I think it’s worth trying to come up with a better prompt.
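For example, making the translation task explicit in the system prompt and dropping the "quoted sentence" phrasing, something along these lines (an untested guess, reusing the pipe from the script above):

def generate_response(message):
    # More explicit translation instruction; the wording here is an untested guess
    messages = [
        {"role": "system", "content": "You are a translator. Translate the user's sentence from English into Gujarati. Reply with only the Gujarati translation."},
        {"role": "user", "content": message},
    ]
    output = pipe(messages, max_new_tokens=256)
    return output[0]["generated_text"][-1]["content"]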


Thank you, I will walk through the steps and run them. This exercise will be helpful in applying to this and other models of interest.


Thank you. Using and learning more about Llama 3.2 prompts may help me get better prompts.
