Concept drift in pre-trained models

Hi,

I have a high-level NLP question (I apologize in advance if this is not the appropriate forum - feel free to delete if so).

I’m wondering how to think about drift in language usage over time in relation to pre-trained language models such as those on the Hugging Face Hub. For example, bert-base-uncased was originally trained in 2018. Back then, “COVID” wasn’t a word, and “huggingface” just meant an emoji.
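
As a quick sanity check on the vocabulary side, a minimal sketch like the one below (the exact subword split depends on whatever bert-base-uncased’s WordPiece vocabulary actually contains) shows whether a newer term exists as a single token or gets broken into pieces:

```python
from transformers import AutoTokenizer

# Load the 2018-era tokenizer; its WordPiece vocabulary is frozen at training time.
tok = AutoTokenizer.from_pretrained("bert-base-uncased")

for word in ["covid", "huggingface"]:
    pieces = tok.tokenize(word)
    in_vocab = word in tok.get_vocab()
    # Terms coined after pre-training are typically split into several
    # subword pieces rather than appearing as a single vocabulary entry.
    print(f"{word!r}: single vocab entry: {in_vocab}, tokenized as {pieces}")
```

The model will still embed the subword pieces, so inference doesn’t fail outright; my question is more about how much is lost for such terms, which leads to: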

  • Does that mean that running inference with that model on texts about these topics will give weaker results?
  • If I’m fine-tuning a model, might I want to use a newer model as my base, even if it performs “worse” on some original task?
  • Is there some way to see the training date in the model card, or to sort models by the date they were trained (which is likely different from the date they were uploaded to the Hub)? A rough sketch of what I’ve found so far is below.
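
For that last point, as far as I can tell the Hub only exposes repo-level dates, not the actual training date. A minimal sketch (assuming a reasonably recent huggingface_hub version, where ModelInfo exposes created_at and last_modified):

```python
from huggingface_hub import HfApi

api = HfApi()
info = api.model_info("bert-base-uncased")

# These reflect when the repo was created / last touched on the Hub,
# which is not necessarily when the model was trained.
print("created_at:   ", info.created_at)
print("last_modified:", info.last_modified)
```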

Thanks