.pt PyTorch Model ->PreTrainedModel

kbmmoran · April 3, 2023, 3:53pm

Similar to (How can I share a pytorch saved model on huggingFace hub), I am attempting to convert/utilize a PyTorch model with the .pt extension with Huggingface.

I have been reading through the docs on the ‘PreTrainedModel’ class, but have Not found a one-for-one solution. The model is a GPT based on this repository (GitHub - karpathy/nanoGPT: The simplest, fastest repository for training/finetuning medium-sized GPTs.).

sgugger · April 3, 2023, 5:29pm

You can’t use model written in other libraries inside Transformers.

kbmmoran · April 3, 2023, 5:45pm

Ah okay, thank you !

eigenfelix · May 1, 2024, 10:47am

But is there no way to convert at all?

nielsr · May 1, 2024, 1:05pm

Technically you can convert models to the Transformers format, if the model exists in the Transformers library and all parameters can be converted.

e.g. to convert llama from the original repository to the Transformers format => transformers/src/transformers/models/llama/convert_llama_weights_to_hf.py at main · huggingface/transformers · GitHub. Each model folder in the Transformers library has this so-called conversion script to convert weights from the original format to the Transformers format.

Topic	Replies	Views
Training General Pytorch model with HuggingFace's Trainer 🤗Transformers	386	May 7, 2023
Convert PyTorch Model to Hugging Face model (Inference API) Models	1147	March 5, 2024
Load pre-trained pytorch neural network Beginners	330	January 7, 2023
Convert PyTorch Model to Hugging Face model Inference Endpoints on the Hub	922	March 5, 2024
How can I share a pytorch saved model on huggingFace hub Beginners	677	May 4, 2022