Hugging Face Forums

Need Help in creating ai chatbot for my app

John6666 August 8, 2025, 8:27am 20

Yeah. GGUF files are pre-quantized files that can be used in their quantized state. They are to be dequantized at runtime, but this is not something we need to worry about.
Hugging Face’s Transformers are not suitable for running GGUF, so if you want to use GGUF, it is better to run it using Ollama or similar tools. There are various quantization formats available for Transformers, but BitsAndBytes is usually sufficient.

1 Like

Topic		Replies	Views	Activity
Finetune a chatbot for specfic task Beginners	0	799	June 10, 2023
Answer template generation from question 🤗Transformers	0	217	November 11, 2023
Conversational AI + question answering model Intermediate	5	2687	January 30, 2023
FastLoRAChat Instruct-tune LLaMA on consumer hardware with shareGPT data Show and Tell	0	677	April 19, 2023
Mistral 7B RAG Langchaing Models	0	2643	February 20, 2024