Getting Additional response from my RAG using HuggingFaceEndpoint inference

Thanks.

The GFG link helped.
I needed to create the prompt in the Zephyr format, since I am using a Zephyr model.

This is the prompt that gave output without the additional response at the start:

from langchain_core.prompts import ChatPromptTemplate

chat_prompt_2 = ChatPromptTemplate.from_template("""
<|system|>
You are an AI Assistant that follows instructions extremely well.
Please be truthful and give direct answers. Please tell 'I don't know' if user query is not in context.
</s>
<|user|>
Context: {context}

Question: {input}
</s>
<|assistant|>
""")
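For anyone who wants to see what the model actually receives, here is a minimal sketch of the same Zephyr chat format without LangChain (plain `str.format`, with an example context and question I made up). The `<|system|>`, `<|user|>`, and `<|assistant|>` tags and the `</s>` end-of-turn markers are what Zephyr was trained on; a prompt that is off-format is what tends to cause the extra text before the answer.

```python
# Zephyr-format prompt template, mirroring the ChatPromptTemplate above.
# The role tags (<|system|>, <|user|>, <|assistant|>) and </s> markers
# delimit turns the way the Zephyr model expects.
ZEPHYR_TEMPLATE = """<|system|>
You are an AI Assistant that follows instructions extremely well.
Please be truthful and give direct answers. Please tell 'I don't know' if user query is not in context.
</s>
<|user|>
Context: {context}

Question: {input}
</s>
<|assistant|>
"""

def build_prompt(context: str, question: str) -> str:
    """Fill the retrieved context and the user question into the template."""
    return ZEPHYR_TEMPLATE.format(context=context, input=question)

# Example values (made up for illustration):
prompt = build_prompt(
    "Zephyr is a fine-tuned chat model.",
    "What kind of model is Zephyr?",
)
print(prompt)
```

The rendered string ends with the open `<|assistant|>` tag, so generation starts directly at the assistant's turn, which is why no preamble gets prepended to the answer.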