Transforming Pushed Hugging Face Models into Usable GGUF Models for Local Colab Use

I have successfully trained and fine-tuned Llama-family models in Google Colab, saving the resulting weights to Hugging Face. This allows me to leverage Colab’s free GPUs to efficiently train models that would be too resource-intensive on my local machine.

I’ve seen many posts on this, but they all stop at running inference with the fine-tuned model. What I want to know is whether I can use a Hugging Face service to finish the training workflow and end up with a usable GGUF.

However, I now need guidance on downloading my trained GGUF models from Hugging Face back into my Colab notebooks for continued iteration and use.

I aim to establish an effective model development pipeline between Colab, Hugging Face model storage, and my local environment. Specifically, after training customized Mistral or Mixtral models on Colab and pushing them to Hugging Face, what is the best practice for pulling those model weights back down into a Colab notebook?

Ideally there would be a streamlined way to reimport my trained weights without needing to retrain entire models from scratch each time. Any suggestions on the tools or techniques to enable this? Establishing this round-trip flow would allow me to rapidly iterate on model tuning in Colab while retaining easy localized access to the latest versions of my GGUF models.
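For reference, this is roughly what I'm picturing for the download half of the round trip, using huggingface_hub. It's just a sketch: the repo ids and filenames are placeholders for my own repos, and a `token=...` argument would be needed for private ones.

```python
# Minimal sketch: pull a pushed model back into a Colab session with huggingface_hub.
# Repo ids and filenames below are placeholders; pass token=... for private repos.
from huggingface_hub import hf_hub_download, snapshot_download

# Option A: fetch the whole fine-tuned checkpoint repo, e.g. for further training
# or for converting it to GGUF inside the notebook.
checkpoint_dir = snapshot_download(
    repo_id="your-username/my-mistral-finetune",   # hypothetical repo id
    local_dir="my-mistral-finetune",
)

# Option B: if a converted GGUF file has already been pushed, grab just that file.
gguf_path = hf_hub_download(
    repo_id="your-username/my-mistral-finetune-gguf",  # hypothetical repo id
    filename="my-mistral-finetune.Q4_K_M.gguf",        # hypothetical filename
)

print(checkpoint_dir, gguf_path)
```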

Surely someone has encountered this need?

Hi @imagineaiuser, take a look at this: Tutorial: How to convert HuggingFace model to GGUF format · ggerganov/llama.cpp · Discussion #2948 · GitHub
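Roughly, that discussion boils down to something like the sketch below, run from a Colab cell. The script name depends on your llama.cpp version (older checkouts ship convert.py, newer ones convert_hf_to_gguf.py), and the checkpoint directory name here is just a placeholder, so adjust both to your setup.

```python
# Sketch of the llama.cpp conversion flow: HF checkpoint directory -> GGUF file.
import subprocess

# One-time setup: clone llama.cpp and install its conversion requirements.
subprocess.run(["git", "clone", "https://github.com/ggerganov/llama.cpp"], check=True)
subprocess.run(["pip", "install", "-r", "llama.cpp/requirements.txt"], check=True)

# Convert a local Hugging Face checkpoint directory to GGUF.
# "my-mistral-finetune" is a hypothetical directory holding the downloaded checkpoint.
subprocess.run(
    [
        "python", "llama.cpp/convert_hf_to_gguf.py",
        "my-mistral-finetune",
        "--outfile", "my-mistral-finetune-f16.gguf",
        "--outtype", "f16",  # for q4/q5 variants, run llama.cpp's quantize tool afterwards
    ],
    check=True,
)
```

The resulting .gguf file can then be pushed back to a Hugging Face repo so it's easy to pull down locally or into another Colab session.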