Model choice for use-case

hananshandler · December 20, 2022, 4:37pm

I’m looking for help choosing the zero-shot classification model best suited to my use case. My team has a handful of files that we need to load into a database each day. Each file’s field names need to conform to the database table specifications, but the files often arrive with slightly different naming conventions (e.g. the field “Address” will show up as “Customer Address” in the file, and “Division” will show up as “Branch”).

My thought was to use a zero-shot classification model to predict which incorrect field names in a file (“Customer Address” and “Branch”) correspond to which missing field names in the table (“Address” and “Division”). I’m also considering adding a layer on top of the model, trained on past labeled data, to make it more robust (potentially using field values in addition to field names).
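To make the idea concrete, here is a minimal sketch of the mapping step, assuming the `transformers` zero-shot pipeline. The names `make_hf_scorer` and `map_fields`, the hypothesis template wording, and the 0.5 cutoff are all illustrative assumptions rather than a tested setup.

```python
from typing import Callable, Dict, List, Optional, Tuple

# A scorer takes one incoming field name plus the candidate target names and
# returns (target, score) pairs ordered best first.
Scorer = Callable[[str, List[str]], List[Tuple[str, float]]]


def make_hf_scorer(model_name: str = "valhalla/distilbart-mnli-12-1") -> Scorer:
    """Wrap a Hugging Face zero-shot pipeline as a Scorer (downloads the model)."""
    from transformers import pipeline  # lazy import; needs `transformers` installed

    clf = pipeline("zero-shot-classification", model=model_name)

    def score(field: str, targets: List[str]) -> List[Tuple[str, float]]:
        out = clf(
            field,
            candidate_labels=targets,
            # Assumed template wording; worth experimenting with.
            hypothesis_template="This column name means {}.",
        )
        # The pipeline returns labels and scores sorted by descending score.
        return list(zip(out["labels"], out["scores"]))

    return score


def map_fields(
    incoming: List[str],
    targets: List[str],
    scorer: Scorer,
    min_score: float = 0.5,  # assumed cutoff; below it the field is left unmapped
) -> Dict[str, Optional[str]]:
    """Map each incoming field name to its best-scoring target, or None."""
    mapping: Dict[str, Optional[str]] = {}
    for field in incoming:
        ranked = scorer(field, targets) if targets else []
        if ranked and ranked[0][1] >= min_score:
            mapping[field] = ranked[0][0]
        else:
            mapping[field] = None  # flag for manual review
    return mapping
```

Something like `map_fields(["Customer Address", "Branch"], ["Address", "Division"], make_hf_scorer())` would run it against the real model. Keeping the scorer as a plain callable means a different model, or a layer trained on past labeled data, could be swapped in without touching the mapping logic.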

I’ve tested this idea a bit with valhalla/distilbart-mnli-12-1 and had decent success, but I’m unclear on how to determine whether this is the best model to use compared with other zero-shot models.
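One way I could imagine comparing candidates is to score each model on field-name pairs we have already mapped by hand and rank them by top-1 accuracy. The sketch below assumes that approach. The helper names are hypothetical, and any model other than valhalla/distilbart-mnli-12-1 (for example `facebook/bart-large-mnli`) is only an assumed alternative, not one I have tried.

```python
from typing import Callable, Dict, List, Tuple

# A predictor takes an incoming field name and the candidate target names and
# returns the single target it considers the best match.
Predictor = Callable[[str, List[str]], str]


def hf_predictor(model_name: str) -> Predictor:
    """Build a Predictor from a Hugging Face zero-shot model (downloads it)."""
    from transformers import pipeline  # lazy import; needs `transformers` installed

    clf = pipeline("zero-shot-classification", model=model_name)

    def predict(field: str, targets: List[str]) -> str:
        # Labels come back sorted by descending score, so index 0 is the top pick.
        return clf(field, candidate_labels=targets)["labels"][0]

    return predict


def top1_accuracy(
    predict: Predictor,
    labeled: List[Tuple[str, str]],  # (incoming field name, correct target name)
    targets: List[str],
) -> float:
    """Fraction of labeled pairs for which the predictor picks the correct target."""
    if not labeled:
        return 0.0
    hits = sum(1 for field, expected in labeled if predict(field, targets) == expected)
    return hits / len(labeled)


def compare_models(
    predictors: Dict[str, Predictor],
    labeled: List[Tuple[str, str]],
    targets: List[str],
) -> List[Tuple[str, float]]:
    """Return (name, accuracy) pairs sorted from most to least accurate."""
    results = [(name, top1_accuracy(p, labeled, targets)) for name, p in predictors.items()]
    return sorted(results, key=lambda r: r[1], reverse=True)
```

With a few dozen past mappings as `labeled`, passing `{"distilbart": hf_predictor("valhalla/distilbart-mnli-12-1"), "bart-large": hf_predictor("facebook/bart-large-mnli")}` to `compare_models` would give a rough ranking. Accuracy on a small sample is noisy, so inference time per file probably matters as a tiebreaker too.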

Any help is greatly appreciated!
