Access locally downloaded Llama Model in Notebook

mox · October 24, 2023, 11:39am

Hi,

I just downloaded the LLama2 model from the Meta repository (specifically llama.cpp on Mac). Now I want to use it in a Python script. The quntized model file (ggml-model-q4_0.bin) s stored now on my Mac. How can I now use the LLama tokenizer and load the model? I am used to acquire the model directely via Huggingface and not locally on my laptop.

I downloaded the Llama-7B-chat model and then used the Llama.cpp
repository to quantize it. Therefore I have 2 folders LLama and Llama.cpp where the actual model is stored (see Screenshot).

Thanks in advance!

Topic		Replies	Views
Load downloaded Llama2 model with Transformers Beginners	0	966	October 25, 2023
Download llama for offline computer Models	1	1136	September 13, 2023
Why the model loading of llama2 is so slow? 🤗Transformers	6	9655	April 24, 2024
Downloaded models Beginners	14	2292	September 15, 2024
Why can't able to load the Meta/Llama-2 model from local path which we download from Huggingface use Git and save on my local? Models	0	65	July 12, 2024

Access locally downloaded Llama Model in Notebook

Related topics