The question is just as stated: I want to train a small language model, but I can't work out mathematically whether using a random embedding model would decrease performance.
I also wonder whether the embedding sizes are set in the code before being fed to the model; otherwise, I'd expect a matrix size error.
No, you cannot use any random embedding model without matching the embedding size to what your language model expects. The embedding dimension must match exactly, or you’ll get a shape error.
If the dimensions match, training with a random embedding is possible, but performance will be much worse; embeddings must be learned or pretrained for good results.
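To make this concrete, here is a minimal sketch using the GPT-2 classes from `transformers` (the sizes and the class choices are illustrative assumptions on my part, not something from your setup). It shows that the embedding dimension comes from the model config, that a random matrix of the right shape is accepted and simply trained from scratch, and that a wrong dimension fails immediately:

```python
import torch
from transformers import GPT2Config, GPT2LMHeadModel

# The embedding dimension is fixed by the model config (here n_embd=256).
config = GPT2Config(vocab_size=8000, n_embd=256, n_layer=4, n_head=4)
model = GPT2LMHeadModel(config)

# A randomly initialized embedding matrix is fine as long as its shape
# matches (vocab_size, n_embd); it is simply trained from scratch.
random_emb = torch.randn(config.vocab_size, config.n_embd) * 0.02
with torch.no_grad():
    model.get_input_embeddings().weight.copy_(random_emb)

# A matrix with the wrong embedding dimension raises a shape error.
wrong_emb = torch.randn(config.vocab_size, 512)
try:
    with torch.no_grad():
        model.get_input_embeddings().weight.copy_(wrong_emb)
except RuntimeError as err:
    print("shape mismatch:", err)
```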
Summary:
- Embedding size must match the model config.
- Random embeddings will decrease performance.
- Always use learned or pretrained embeddings for best results (see the sketch below).
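As a sketch of the "pretrained" route, you can copy an existing model's embedding matrix into your small model. This assumes the small model shares the pretrained model's tokenizer and hidden size (here the `gpt2` checkpoint, which is just an example I'm assuming), since that is what makes the shapes line up:

```python
import torch
from transformers import GPT2Config, GPT2LMHeadModel

# Assumed setup: a small model that reuses GPT-2's tokenizer and
# hidden size (n_embd=768), so the pretrained embeddings fit directly.
pretrained = GPT2LMHeadModel.from_pretrained("gpt2")
config = GPT2Config(
    vocab_size=pretrained.config.vocab_size,
    n_embd=pretrained.config.n_embd,
    n_layer=4,
    n_head=4,
)
small = GPT2LMHeadModel(config)

# Reuse the pretrained token embeddings instead of random initialization.
with torch.no_grad():
    small.get_input_embeddings().weight.copy_(
        pretrained.get_input_embeddings().weight
    )
```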
Solution provided by Triskel Data Deterministic AI.