Looking for a Tiny LLM (max 1.5GB) – Need Advice

Heinz01 · September 27, 2024, 9:09am

Hey everyone,

I’m currently looking for a very small LLM that cannot exceed 1.5GB in size. The goal is for it to handle simple Q&A tasks (nothing fact-based or overly complex, just basic interpretation of input). Additionally, the model needs to understand basic writing mistakes (typos, grammar issues) and be able to handle very primitive interactions.

I understand that larger models generally perform better, but I’m really constrained by size limits here. Does anyone have experience working with models of this size or know how to achieve something like this while retaining minimal functionality?

Any advice or guidance would be greatly appreciated!

Thanks in advance!

John6666 · September 27, 2024, 10:55am

If a model that is around 1GB in its quantized state is acceptable, you can pick a GGUF of a 1B model and you are done, but the number of models that fit within 1GB in float16 is limited.
The following model meets the requirements, and the author has a lot to say about small models.

You can find the GGUF model of 1B at the link below.
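As a rough sanity check on the sizes above, here is a minimal sketch that estimates file size from parameter count and bits per weight. It assumes the file is dominated by the weights and ignores metadata and the extra bits that real GGUF quantization schemes add per block, so treat the numbers as lower bounds. The function name `estimated_size_gb` and the 1.5GB budget constant are just illustrative.

```python
def estimated_size_gb(n_params: float, bits_per_weight: float) -> float:
    """Rough weight-only size in GB (10^9 bytes): parameters * bits / 8."""
    return n_params * bits_per_weight / 8 / 1e9


BUDGET_GB = 1.5  # the size limit from the original question

# Assumption: nominal bit widths; actual quantized files are somewhat larger.
for name, n_params in [("0.5B", 0.5e9), ("1B", 1.0e9), ("1.5B", 1.5e9)]:
    for label, bits in [("float16", 16), ("8-bit", 8), ("4-bit", 4)]:
        size = estimated_size_gb(n_params, bits)
        verdict = "fits" if size <= BUDGET_GB else "too big"
        print(f"{name} @ {label}: ~{size:.2f} GB ({verdict})")
```

By this estimate a 1B model needs about 2GB in float16 but only about 1GB at 8 bits, which is why a quantized 1B GGUF fits the budget while float16 pushes you down to roughly 0.5B.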

not-lain · September 28, 2024, 10:46pm

there’s this one as well by @KingNish , really great performance considering its size

John6666 · September 29, 2024, 12:47am

Qwen 2.5 performs very well in general. Its multilingual performance in particular is much better than that of Qwen 2. You can also search for other 0.5B models as follows.
By the way, the forum’s automatic generation of links to the model search is slightly buggy.
One of the URLs below works fine, but only because I fixed it manually. The other one does not work. I don’t understand the logic.

https://huggingface.co/models?sort=modified&search=GGUF%200.5B
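If you would rather build such a search link yourself than rely on the forum's auto-linking, something like the following sketch should do it. It only reproduces the query string of the URL above (`sort` and `search` are the two parameters visible there); the helper name `model_search_url` is made up for illustration.

```python
from urllib.parse import quote, urlencode


def model_search_url(query: str, sort: str = "modified") -> str:
    """Build a Hub model-search URL, encoding spaces as %20 rather than '+'."""
    params = urlencode({"sort": sort, "search": query}, quote_via=quote)
    return f"https://huggingface.co/models?{params}"


print(model_search_url("GGUF 0.5B"))
```

Passing `quote_via=quote` makes `urlencode` emit `%20` for the space, which matches the manually fixed link.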

not-lain · September 29, 2024, 1:20am

I’ll open a separate issue in the forum and report this, thanks for sharing

John6666 · September 29, 2024, 1:21am

Thank you for opening the issue.

expandme · December 6, 2024, 6:17pm

I am also looking for some. The trick with trending is good, but it is a pity there is no way to search there for models smaller than 1B: some models use 0.5B in their names, some 500M, and so on (MS Phi models don’t have the B size in their names at all)!
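To work around the inconsistent naming when filtering a list of model names locally, a sketch along these lines could normalize both styles to a parameter count. It is only a heuristic on the name string: the regex, the function name `params_from_name`, and the example names are illustrative assumptions, and names with no size token (as with Phi) simply return `None`.

```python
import re
from typing import Optional

# A number followed by B or M, not glued to other letters or digits,
# e.g. "0.5B", "500M", "1b". Heuristic only; names are not standardized.
_SIZE_RE = re.compile(
    r"(?<![A-Za-z0-9.])(\d+(?:\.\d+)?)([BM])(?![A-Za-z0-9])", re.IGNORECASE
)


def params_from_name(name: str) -> Optional[float]:
    """Return the parameter count hinted at in a model name, or None."""
    match = _SIZE_RE.search(name)
    if match is None:
        return None
    value, unit = float(match.group(1)), match.group(2).upper()
    return value * (1e9 if unit == "B" else 1e6)


# Illustrative name strings only; they are not looked up anywhere.
names = ["Qwen2.5-0.5B-Instruct-GGUF", "example-500M", "Phi-3-mini-4k-instruct"]
small = [n for n in names if (p := params_from_name(n)) is not None and p < 1e9]
print(small)
```

Models whose names carry no size would still have to be checked some other way, for example by looking at the actual file size.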

I need a more intelligent >3B search for the SmallZOO project →
