My goal is to integrate a non-neural LM seamlessly into the Hugging Face ecosystem. The system, MBLM, implements a fast approximate k-NN next-word predictor that can run in autoregressive (CausalLM) mode. Internally, the core next-word prediction step already produces a probability distribution over the token vocabulary, which could be exposed externally.
I’ve been looking into writing a custom PreTrainedModel and would appreciate some guidelines for the case where the model is truly non-neural (but functionally compatible, as sketched above). A rough sketch of the kind of wrapper I have in mind is below.
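To make this concrete, here is a minimal sketch, not a definitive implementation: `MBLMConfig`, `MBLMForCausalLM`, the predictor's `next_token_distribution(prefix_ids)` interface, and the toy `UniformPredictor` are all hypothetical stand-ins for MBLM's internals, not its real API. The idea is that the module holds no real weights and just converts the k-NN probability distribution into last-position logits so `generate()` can drive it.

```python
# Minimal sketch with hypothetical names; assumes a recent transformers
# release, where GenerationMixin must be inherited explicitly alongside
# PreTrainedModel for generate() to work.
import torch
from transformers import GenerationMixin, PretrainedConfig, PreTrainedModel
from transformers.modeling_outputs import CausalLMOutput


class MBLMConfig(PretrainedConfig):
    model_type = "mblm"  # hypothetical registration name

    def __init__(self, vocab_size=32000, **kwargs):
        self.vocab_size = vocab_size
        super().__init__(**kwargs)


class MBLMForCausalLM(PreTrainedModel, GenerationMixin):
    config_class = MBLMConfig

    def __init__(self, config, predictor):
        super().__init__(config)
        # The non-neural core: anything exposing a (hypothetical)
        # next_token_distribution(prefix_ids) -> list[float] over the vocab.
        self.predictor = predictor
        # Dummy parameter so .device/.dtype resolve on a weight-free module.
        self._anchor = torch.nn.Parameter(torch.zeros(1), requires_grad=False)

    def prepare_inputs_for_generation(self, input_ids, **kwargs):
        # No KV cache: re-feed the full prefix at every decoding step.
        return {"input_ids": input_ids}

    def forward(self, input_ids, attention_mask=None, **kwargs):
        batch, seq_len = input_ids.shape
        logits = torch.zeros(batch, seq_len, self.config.vocab_size)
        for i in range(batch):
            probs = torch.tensor(
                self.predictor.next_token_distribution(input_ids[i].tolist())
            )
            # generate() only reads the last position; move the k-NN
            # probabilities to log space so sampling/argmax behave normally.
            logits[i, -1] = probs.clamp_min(1e-12).log()
        return CausalLMOutput(logits=logits)


# Toy smoke test: a uniform "predictor" just to exercise generate().
class UniformPredictor:
    def __init__(self, vocab_size):
        self.vocab_size = vocab_size

    def next_token_distribution(self, prefix_ids):
        return [1.0 / self.vocab_size] * self.vocab_size


model = MBLMForCausalLM(MBLMConfig(vocab_size=100), UniformPredictor(100))
print(model.generate(torch.tensor([[1, 2, 3]]), max_new_tokens=5, do_sample=True))
```

One design question this raises is how `save_pretrained`/`from_pretrained` should treat a model whose "weights" are really a k-NN index rather than tensors, which is part of why I'm asking for guidelines.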
Shameless plug: this is a CPU-only, eco-friendly LLM alternative that scales well, supports incremental learning, is fast, and memorizes its training data explicitly.
Thanks for any tips!
Antal