Using Alpaca with local embeddings

Hi there!

I am using the Hugging Face model chavinlo/alpaca-native.
However, when I use local embeddings, my output is always only one word long. Can anyone explain this?
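For context, the retriever is built from local embeddings over my PDF, roughly like this (a minimal sketch; the loader, chunk sizes, and embedding model below are placeholders, not necessarily the exact ones I use):

from langchain.document_loaders import PyPDFLoader
from langchain.text_splitter import RecursiveCharacterTextSplitter
from langchain.embeddings import HuggingFaceEmbeddings
from langchain.vectorstores import FAISS

# Load the PDF (one Document per page) and split it into chunks
# (path and chunk sizes are placeholders)
docs = PyPDFLoader('/content/drive/MyDrive/example.pdf').load()
chunks = RecursiveCharacterTextSplitter(chunk_size=500, chunk_overlap=50).split_documents(docs)

# Embed the chunks with a local sentence-transformers model and index them
embeddings = HuggingFaceEmbeddings(model_name='sentence-transformers/all-MiniLM-L6-v2')
vectorstore = FAISS.from_documents(chunks, embeddings)
retriever = vectorstore.as_retriever()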

The model and chain are then set up like this:

from transformers import LlamaForCausalLM, AutoTokenizer, pipeline
from langchain.llms import HuggingFacePipeline
from langchain.chains import RetrievalQA

model_nm = 'chavinlo/alpaca-native'
save_path = '/content/drive/MyDrive/alpaca_native_pretrained_model_pytorch'

# Load the saved model in 8-bit to fit into Colab GPU memory
model = LlamaForCausalLM.from_pretrained(save_path, return_dict=True, load_in_8bit=True, device_map='auto')
tokenizer = AutoTokenizer.from_pretrained(save_path)

pipe = pipeline(
    "text-generation",
    model=model,
    tokenizer=tokenizer,
    max_length=248,  # max_length counts prompt tokens plus generated tokens
    temperature=0.4,
    top_p=0.95,
    repetition_penalty=1.2,
)

local_llm = HuggingFacePipeline(pipeline=pipe)

qa = RetrievalQA.from_chain_type(
    llm=local_llm,
    chain_type="stuff",  # "map_reduce",
    retriever=retriever,
    return_source_documents=True,
)

query = "xyz"
llm_response = qa(query)
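
I then read the answer out of the returned dict (with return_source_documents=True the chain returns the result plus the retrieved chunks, if I read the docs right):

print(llm_response['result'])
for doc in llm_response['source_documents']:
    print(doc.metadata)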

Can anyone help me with this, or suggest alternative ways to embed PDFs and query them with an LLM, running everything locally on Colab?

Thanks!
Yves