Error in Autotrain Training

Hello everyone, I am very new and I'm experimenting with the Hugging Face AutoTrain UI, but I'm having a little trouble getting training started. I am trying to train a meta-llama/Llama-3.1-8b-Instruct model with an example dataset that I found,
alpaca1k.csv
which I uploaded as a local file.
I have not changed any of the other parameters. When I then click Start Training, I get this error:

ERROR | 2025-05-08 07:39:20 | autotrain.trainers.common:wrapper:215 - train has failed due to an exception: Traceback (most recent call last):
File "/app/env/lib/python3.10/site-packages/autotrain/trainers/common.py", line 212, in wrapper
    return func(*args, **kwargs)
File "/app/env/lib/python3.10/site-packages/autotrain/trainers/clm/main.py", line 28, in train
    train_sft(config)
File "/app/env/lib/python3.10/site-packages/autotrain/trainers/clm/train_clm_sft.py", line 27, in train
    model = utils.get_model(config, tokenizer)
File "/app/env/lib/python3.10/site-packages/autotrain/trainers/clm/utils.py", line 943, in get_model
    model = AutoModelForCausalLM.from_pretrained(
File "/app/env/lib/python3.10/site-packages/transformers/models/auto/auto_factory.py", line 564, in from_pretrained
    return model_class.from_pretrained(
File "/app/env/lib/python3.10/site-packages/transformers/modeling_utils.py", line 3620, in from_pretrained
    hf_quantizer.validate_environment(
File "/app/env/lib/python3.10/site-packages/transformers/quantizers/quantizer_bnb_4bit.py", line 83, in validate_environment
    validate_bnb_backend_availability(raise_exception=True)
File "/app/env/lib/python3.10/site-packages/transformers/integrations/bitsandbytes.py", line 559, in validate_bnb_backend_availability
    return _validate_bnb_cuda_backend_availability(raise_exception)
File "/app/env/lib/python3.10/site-packages/transformers/integrations/bitsandbytes.py", line 537, in _validate_bnb_cuda_backend_availability
    raise RuntimeError(log_msg)
RuntimeError: CUDA is required but not available for bitsandbytes. Please consider installing the multi-platform enabled version of bitsandbytes, which is currently a work in progress. Please check currently supported platforms and installation instructions at Installation Guide
RuntimeError: CUDA is required but not available for bitsandbytes. Please consider installing the multi-platform enabled version of bitsandbytes, which is currently a work in progress. Please check currently supported platforms and installation instructions at Installation Guide

ERROR | 2025-05-08 07:39:20 | autotrain.trainers.common:wrapper:216 - CUDA is required but not available for bitsandbytes. Please consider installing the multi-platform enabled version of bitsandbytes, which is currently a work in progress. Please check currently supported platforms and installation instructions at Installation Guide
INFO | 2025-05-08 07:39:20 | autotrain.trainers.common:pause_space:156 - Pausing space…

I'm not sure how I can fix this. Any help is appreciated.


In some cases, the problem can be resolved by installing bitsandbytes as indicated in the error message. However, in other cases, reinstalling PyTorch and the CUDA Toolkit may be necessary.
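Before reinstalling anything, it helps to confirm whether the machine actually has a CUDA GPU at all. Here is a minimal sketch (my own, not part of AutoTrain) of such a check; the function name is illustrative:

```python
# Heuristic environment check: the default bitsandbytes build needs a
# CUDA GPU, so first confirm one is visible on this machine at all.
import shutil


def gpu_driver_on_path() -> bool:
    """Rough check: the NVIDIA driver ships the `nvidia-smi` CLI, so its
    absence from PATH usually means there is no usable CUDA GPU."""
    return shutil.which("nvidia-smi") is not None


if __name__ == "__main__":
    # If PyTorch is installed, `torch.cuda.is_available()` is the
    # authoritative check; on a CPU-only Space it returns False.
    print("nvidia-smi on PATH:", gpu_driver_on_path())
```

If this reports no GPU, installing a different bitsandbytes build will not help; the training configuration itself has to avoid CUDA-only features.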

I found a solution myself. I'm using the free plan, so there is only a CPU available and no GPU. I had to change some of the parameters. This is what I did, for anyone who is wondering:
- Distributed Backend: from ddp to deepspeed
- Mixed precision: from fp16 to none
- PEFT/LoRA: from true to false

I'm not exactly sure which change did the trick, but it's training now.
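For what it's worth, the fp16 and PEFT changes are the ones that touch CUDA-only code paths: fp16 mixed precision needs GPU support, and the PEFT/LoRA path is what pulls in 4-bit bitsandbytes quantization. A small sketch of that reasoning (the dict keys are illustrative, not AutoTrain's exact parameter schema):

```python
# Sketch of the CPU-safe parameter combination described above.
# Key names are illustrative placeholders, not AutoTrain's real schema.
cpu_params = {
    "mixed_precision": "none",  # fp16 requires GPU support
    "peft": False,              # the QLoRA path loads CUDA-only bitsandbytes
    "distributed_backend": "deepspeed",
}


def cuda_only_settings(params: dict) -> list[str]:
    """Return the settings in `params` that would require a CUDA GPU."""
    problems = []
    if params.get("mixed_precision") == "fp16":
        problems.append("fp16 mixed precision requires a GPU")
    if params.get("peft"):
        problems.append("PEFT/LoRA with 4-bit quantization needs CUDA bitsandbytes")
    return problems


print(cuda_only_settings(cpu_params))  # an empty list: nothing here needs CUDA
```

So disabling PEFT/LoRA is most likely what made the bitsandbytes error go away, since that removed the 4-bit quantization step that the traceback failed in.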


This topic was automatically closed 12 hours after the last reply. New replies are no longer allowed.