When I'm downloading the weights, the cell keeps running and doesn't stop. I need to fine tune Mistral-Small-3.1-24B-Instruct-2503 model

rohitdiwane · May 2, 2025, 6:46am

from transformers import AutoTokenizer, MistralForCausalLM
import torch

model_id = “mistralai/Mistral-Small-3.1-24B-Instruct-2503”

tokenizer = AutoTokenizer.from_pretrained(model_id,
trust_remote_code=True,
cache_dir=“/content/huggingface_cache”)
tokenizer.pad_token = tokenizer.eos_token
tokenizer.padding_side = “right”

model = MistralForCausalLM.from_pretrained(
model_id,
torch_dtype=torch.float16,
device_map=“auto”,
cache_dir=“/content/huggingface_cache”,
low_cpu_mem_usage=True,
offload_folder=“offload”,
)

I have used Older_version = transformers==4.49.0 and Current_version = transformers==4.52.0.dev0, I tried both version but didn’t gets the solution.
Please help us! Thanks

John6666 · May 2, 2025, 7:39am

Hmm… Gated model issue?

rohitdiwane · May 2, 2025, 7:58am

Ya, I already got the access for this model. But cell keeps running, is there any other way or option for it.

rohitdiwane · May 2, 2025, 8:07am

Actually, model weights are downloaded but at the end cell keeps running.

John6666 · May 2, 2025, 7:30pm

Hmm… “Cell” probably refers to some kind of notebook environment.

There may be cache-related issues occurring occasionally in from_pretrained. In that case, rebuilding the virtual environment may resolve the issue…

Additionally, to determine whether this is a specific issue with the Mistral model (repository), testing with a smaller model should help isolate the problem.

model_id = "HuggingFaceTB/SmolLM2-135M-Instruct"

Topic		Replies	Views
Data did not match any variant of untagged enum PyPreTokenizerTypeWrapper at line 6952 column 3 Models	1	1181	July 4, 2024
Unable to load a model with added special token 🤗Transformers	1	570	April 3, 2024
Poor performance from Mistral-7B-Instruct-v0.1 Beginners	1	1552	March 1, 2024
Tried to download Mistral 7B but got an error message 🤗Transformers	3	13335	October 8, 2023
mistralai/Mistral-7B-v0.1 is not a local folder and is not a valid model identifier listed on 'https://huggingface.co/models' Models	3	2237	April 21, 2025

When I'm downloading the weights, the cell keeps running and doesn't stop. I need to fine tune Mistral-Small-3.1-24B-Instruct-2503 model

Related topics