Question about unsupervised T5 training

jrandel · January 26, 2022, 6:52pm

Hola! I have a project I am working on which relates to search queries and a contact database, e.g. I have a golden set of search queries and their related results:

query: john smith (ACME corporation)
result: Smith, John 123 Main Street etc

So I’ve got a small percentage of “correct” search queries and their results which I am fine tuning T5 on for a text classification task. However the problem is I need to get the remaining 95% unobserved contacts into the model somehow, either via some method of context or perhaps fine tune training this T5 in an unsupervised fashion on the contents of the contact database, in hopes that the model will generalize to those semantic associations in the golden set of searches?

I am experimenting with both T5.1-large and BeIR/query-gen-msmarco-t5-large-v1 for this

TIA!

Topic		Replies	Views
Retrain T5 using unsupervised learning with MLM 🤗Transformers	0	250	May 21, 2023
Fine-tuning T5 Model on a Book for Unsupervised Learning Models	0	378	April 17, 2024
Model training problem Beginners	1	24	September 6, 2024
Training Text-To-SQL models Beginners	1	214	January 21, 2025
How is T5 pretrained? 🤗Transformers	3	510	July 12, 2021

Question about unsupervised T5 training

Related topics