Not able to fine-tune (QLoRA) Llama-3-Instruct model for CausalLM

I am trying to fine-tune the Llama-3-Instruct model. My dataset looks like this (here label and text are both text descriptions):
DatasetDict({
    train: Dataset({
        features: ['label', 'text'],
        num_rows: 30
    })
    valid: Dataset({
        features: ['label', 'text'],
        num_rows: 10
    })
    test: Dataset({
        features: ['label', 'text'],
        num_rows: 10
    })
})

My code:

from transformers import (
    AutoModelForCausalLM,
    AutoTokenizer,
    BitsAndBytesConfig,
    TrainingArguments,
    Trainer,
    DataCollatorForLanguageModeling
)

from peft import LoraConfig, prepare_model_for_kbit_training, get_peft_model
import torch

model_name = "meta-llama/Meta-Llama-3-8B-Instruct"

quantization_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type='nf4',
    bnb_4bit_use_double_quant=True,
    bnb_4bit_compute_dtype=torch.bfloat16
)

lora_config = LoraConfig(
    r=16,
    lora_alpha=8,
    target_modules=['q_proj', 'k_proj', 'v_proj', 'o_proj'],
    lora_dropout=0.05,
    bias='none',
    task_type='CAUSAL_LM'  # Use causal language modeling task
)


model = AutoModelForCausalLM.from_pretrained(
    model_name,
    quantization_config=quantization_config
)


model = prepare_model_for_kbit_training(model)
model = get_peft_model(model, lora_config)


tokenizer = AutoTokenizer.from_pretrained(model_name, add_prefix_space=True)

tokenizer.pad_token_id = tokenizer.eos_token_id
tokenizer.pad_token = tokenizer.eos_token

model.config.pad_token_id = tokenizer.pad_token_id
model.config.use_cache = False


def tokenize_function(example):
    example['input_ids'] = tokenizer(example["text"], padding="max_length", max_length=256, truncation=True, return_tensors="pt").input_ids
    return example

tokenized_dataset = dataset.map(tokenize_function, batched=True, remove_columns=['text'])


data_collator = DataCollatorForLanguageModeling(
    tokenizer=tokenizer,
    mlm=False  
)


training_args = TrainingArguments(
    output_dir=model_name + "-causal-lm-finetuning",
    learning_rate=1e-4,
    per_device_train_batch_size=8,
    per_device_eval_batch_size=8,
    num_train_epochs=10,
    weight_decay=0.01,
    evaluation_strategy="epoch",
    save_strategy="epoch",
    load_best_model_at_end=True,
    logging_steps=1
)


trainer = Trainer(
    model=model,
    args=training_args,
    train_dataset=tokenized_dataset["train"],
    eval_dataset=tokenized_dataset["valid"],
    tokenizer=tokenizer,
    data_collator=data_collator  # for text generation, use causal language modeling data collator
)


trainer.train()

Output error:
ValueError: Unable to create tensor, you should probably activate truncation and/or padding with 'padding=True' 'truncation=True' to have batched tensors with the same length. Perhaps your features (label in this case) have excessive nesting (inputs type list where type int is expected).


You are already passing truncation=True.
This looks like it may be a bug in the library, but it can also be triggered by NumPy version issues.
Please try this first:

pip install "numpy<2"

It looks like the error comes from how the label field is handled: it is still a raw string when the batch reaches the data collator, so it cannot be converted into a tensor. To fix this, tokenize both text and label (or drop label before collation) and pad everything to the same length. Also make sure input_ids and attention_mask come out of the map function as plain lists, without the extra batch dimension that return_tensors="pt" adds. If you are using labels for a supervised objective, they need to be tokenized too. That should resolve the tensor mismatch you are seeing; a rough sketch is below.
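For reference, here is a minimal sketch of how the tokenization step could look for causal-LM fine-tuning. It is untested against your data: the prompt template that joins text and label is an assumption for illustration, so adapt it to whatever format your task expects. The column names match the dataset shown above.

def tokenize_function(examples):
    # Merge text and label into one training string per example
    # (this template is an assumption -- change it to fit your task).
    merged = [
        f"{text}\nLabel: {label}{tokenizer.eos_token}"
        for text, label in zip(examples["text"], examples["label"])
    ]
    # No return_tensors here: datasets.map expects plain Python lists,
    # and the data collator builds the tensors later.
    return tokenizer(
        merged,
        padding="max_length",
        max_length=256,
        truncation=True,
    )

# Drop BOTH original string columns so the collator only sees
# input_ids / attention_mask; with mlm=False it copies input_ids
# into labels itself.
tokenized_dataset = dataset.map(
    tokenize_function,
    batched=True,
    remove_columns=["text", "label"],
)

One caveat: because pad_token is set to eos_token, DataCollatorForLanguageModeling will mask the padding positions (and therefore the EOS tokens) to -100 in the labels, so the model may not learn to emit EOS. Adding a separate pad token avoids that if it matters for your task.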
