What files are needed to use the HF Transformer pipeline()?

happyday1 · August 7, 2023, 11:06pm

When I try `model = AutoModel.from_pretrained("TheBloke/Llama-2-7B-Chat-GGML")`, I get:

 raise EnvironmentError(
OSError: TheBloke/Llama-2-7B-Chat-GGML does not appear to have a file named pytorch_model.bin, tf_model.h5, model.ckpt or flax_model.msgpack.

The transformers code I used was the code provided in the Transformer tab of this model. Other models seemed to lack safetensors files (which only produces a warning, and that makes sense). Still other models seemed to lack tokenizer files.

So I clearly don’t understand:

If the goal is to use the transformers Python library to run an HF model locally, how do I tell which of the models will work? Is there a filter, or is there a set of files I should look for? If a file isn't there, can I build it?
If a model is optimized via GPTQ or llama.cpp or (I don't know) ExLlama, does that mean it isn't a transformer model? I.e., do transformer models do their own 4-bit quantizing, etc.?
If a model is a GPTQ model (or a similarly quantized model), can it be downloaded and used through the HF or LangChain APIs?
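
To make the first question concrete, here is a minimal sketch of the kind of file check I have in mind. The weight file names come from the OSError above; `config.json`, `model.safetensors`, and the two sharded-checkpoint index names are my assumptions about what else the library looks for, and `looks_loadable` is a hypothetical helper, not part of transformers. It only inspects file names, so it says nothing about whether a quantized format such as GPTQ needs extra packages.

```python
# Weight file names that from_pretrained() appears to look for.
# The first four come from the OSError in this post ("model.ckpt" is written
# here with the ".index" suffix TF1 checkpoints usually carry; assumption).
# The last three are assumptions: safetensors and sharded-checkpoint indexes.
WEIGHT_FILES = {
    "pytorch_model.bin",
    "tf_model.h5",
    "model.ckpt.index",
    "flax_model.msgpack",
    "model.safetensors",
    "pytorch_model.bin.index.json",
    "model.safetensors.index.json",
}


def looks_loadable(filenames):
    """Heuristic: does a repo's top-level file list look loadable by transformers?

    Hypothetical helper. It requires a config.json (assumption) plus at least
    one recognized weight file, and ignores files inside subfolders.
    """
    names = set(filenames)
    has_config = "config.json" in names
    has_weights = bool(names & WEIGHT_FILES)
    return has_config and has_weights


# Illustrative file lists (made up for this example). A real list could be
# fetched with huggingface_hub.list_repo_files(repo_id), which needs network.
ggml_style_repo = ["README.md", "config.json", "llama-2-7b-chat.ggmlv3.q4_0.bin"]
standard_repo = ["config.json", "model.safetensors", "tokenizer.json"]

print(looks_loadable(ggml_style_repo))  # False: no recognized weight file
print(looks_loadable(standard_repo))    # True
```

If a check like this is roughly right, the GGML repo fails simply because its only weight file is in a llama.cpp format that none of the expected names cover.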

Thank you very much.
