Dear all,
What is the difference between these two loading statements, st1 and st2, and which one is faster? Any help is highly appreciated.
st1:
import requests
import torch
from PIL import Image
from transformers import MllamaForConditionalGeneration, AutoProcessor
model_id = "meta-llama/Llama-3.2-90B-Vision-Instruct"
model = MllamaForConditionalGeneration.from_pretrained(
model_id,
torch_dtype=torch.bfloat16,
device_map="auto",
)
processor = AutoProcessor.from_pretrained(model_id)
st2:
from transformers import AutoProcessor, AutoModelForPreTraining
processor = AutoProcessor.from_pretrained("meta-llama/Llama-3.2-90B-Vision-Instruct")
model = AutoModelForPreTraining.from_pretrained("meta-llama/Llama-3.2-90B-Vision-Instruct")
Setting aside the detailed options and differences between models, the Auto classes select and return the concrete class appropriate for each model, so ideally the result is the same.
However, not everything can be determined automatically and appropriately, so if you already know exactly which model class you need, it is a good idea to specify it explicitly. That's what I do.
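A quick way to see which concrete class the Auto classes would resolve to for this checkpoint is to inspect its config (a minimal sketch; it assumes you have access to the gated repo and are logged in):
from transformers import AutoConfig
config = AutoConfig.from_pretrained("meta-llama/Llama-3.2-90B-Vision-Instruct")
# The architectures field should name the concrete class, e.g. MllamaForConditionalGeneration
print(config.architectures)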
I don't know whether torch_dtype=torch.bfloat16 means my GPU is used while downloading the model, and whether
model = AutoModelForPreTraining.from_pretrained("meta-llama/Llama-3.2-90B-Vision-Instruct")
loads on the CPU in the traditional way.
Any help please.
from transformers import AutoProcessor, AutoModelForPreTraining
import torch
hf_token = "hf_***********"
processor = AutoProcessor.from_pretrained("meta-llama/Llama-3.2-90B-Vision-Instruct", token=hf_token)
# device_map="auto" already places the weights across the available devices, so no extra .to("cuda") is needed
model = AutoModelForPreTraining.from_pretrained("meta-llama/Llama-3.2-90B-Vision-Instruct", device_map="auto", torch_dtype=torch.bfloat16, token=hf_token)
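If you want to check where the weights actually ended up and in which dtype (the download itself only writes files to the local cache; torch_dtype and device_map only matter when the weights are loaded into memory), a quick check after loading, assuming it succeeded:
print(model.dtype)           # should report torch.bfloat16 if torch_dtype was honored
print(model.hf_device_map)   # per-module placement chosen by device_map="auto"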
This should work, but is there really enough VRAM in the GPU…? My PC doesn’t even have enough RAM…
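As a rough back-of-the-envelope check (weights only, ignoring activations and the KV cache), bfloat16 takes 2 bytes per parameter:
num_params = 90e9            # 90B parameters
bytes_per_param = 2          # bfloat16
print(f"~{num_params * bytes_per_param / 1e9:.0f} GB for the weights alone")
That is roughly 180 GB, so the model will not fit on a single consumer GPU; with device_map="auto" whatever does not fit is offloaded to CPU RAM (and to disk if configured), which works but is very slow.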