Searching by type and recognizing the type or pretrained model a model had

borgr · July 5, 2022, 1:37pm

Hi,
Two related questions:

can one know the type of model in the search
can one know the type of a specific model
I assume this is related both to the site and the API.
Assuming one wants to find variants of a model, am I correct that the only way to do it through search is by string match? (so if you want bert-base, you will need to filter everything that has bert in it, both roberta which is irrelevant and “berts” which is relevant https://huggingface.co/Jeevesh8)

As the string match is not a great way to go, I wonder about a second hand question.
Assuming one wishes to extract the model architecture and be sure it is the right one. One can load the original model, then the other model, and make sure they have exactly the same parameter sizes (e.g. in pytorch model.named_parameters()).
Is there a better way to do any of the two?

christopher · July 8, 2022, 6:53pm

Hi borgr! Are you looking to do this programmatically or using the Hub website?

borgr · July 11, 2022, 1:03pm

Well, for me I will do it programmatically, but it makes a lot of sense on the website too for others doesn’t it? (I might not be interested in ppo models for text classification or in OPT175B to run on my mobile)

christopher · July 11, 2022, 8:05pm

I can’t personally speak for the website search box, but you can use huggingface_hub to filter models by architecture:

from huggingface_hub import HfApi
api = HfApi()
models = api.list_models(full=True, fetch_config=True, limit=10)
print([m.config['model_type'] for m in models])
>>> ['bert', 'bert', 'distilbert', 'gpt2', 'distilbert', 'xlm-roberta', 'roberta', 'gpt2', 'bert', 'bert']

Does this fit your usecase?

borgr · July 12, 2022, 5:29am

Unintuitive that “full=True” is not enough to bring the config.

Anyway, it might help (not that T5 11B and T5 small should be in the same category for any user…) although it seems that this is not a consistent trait, only about half (38K out of 58K models) even have a config (mostly if they have a config model_type is in there, only about 200 exceptions).

So, still to pick a certain architecture (comparable models requiring the same infrastructures) the right way is to ignore about a third, and then load each one (heavy) and look at the size of the parameters?

christopher · July 12, 2022, 9:31am

github.com

huggingface/huggingface_hub/blob/main/src/huggingface_hub/hf_api.py#L739-L741


      
          fetch_config (`bool`, *optional*):
              Whether to fetch the model configs as well. This is not included
              in `full` due to its size.

This is because it would be wasteful to fetch if you don’t need it, especially when not limiting the number of results and fetching ~60K models. Not sure about the second part of the question, but you could load the config.json and get the number of layers, the dimensionality of embeddings etc.

It’s normal that not all models have a config since not all models are from the transformers library and the logic will be handled differently outside of the library. See for example this config of a spacy model: config.cfg · spacy/en_core_web_sm at main

borgr · July 21, 2022, 11:21am

Thanks, so far I manage with the model types and with using only the fetched config.
The number of layers is inconsistent, each model defines another name for everything (e.g. for layers: n_layers, num_hidden_layers etc.)
I’ll update when I create a function

Topic		Replies	Views
Is Google's official BERT model and huggingface BERT model different or same? Beginners	1	1226	March 9, 2022
Feature request: more flexible search in model / dataset hub Site Feedback	4	1615	September 27, 2022
Doing classification 100% from scratch? 🤗Transformers	4	1718	September 17, 2021
How to find the model type of BERT model? Models	0	110	December 21, 2023
A few questions about beginning with Huggingface Beginners	1	509	July 18, 2022

Searching by type and recognizing the type or pretrained model a model had

Related topics