check your yaml front matter with this tool https://nodeca.github.io/js-yaml/
if it says âexpected a single document in the stream, but found moreâ
Your yaml is valid, otherwise your yaml is invalid.
---
language: "ISO 639-1 code for your language, or `multilingual`"
thumbnail: "url to a thumbnail used in social sharing"
tags:
- array
- of
- tags
license: "any valid license identifier"
datasets:
- array of dataset identifiers
metrics:
- array of metric identifiers
---
At the moment the most useful tool is a multi-line preview feature of grep. e.g. I wanted to find the format for datasets entry, so I did:
grep -r -A2 datasets: model_cards
model_cards/asafaya/bert-mini-arabic/README.md:datasets:
model_cards/asafaya/bert-mini-arabic/README.md-- oscar
model_cards/asafaya/bert-mini-arabic/README.md-- wikipedia
this assumes that the cards are all valid, so that youâre not copying an invalid format from an existing card.
Slightly off-topic: I would like to raise the case to make model cards required when submitting a model to the hub. At least some basic information such as the language. If not given it is very hard, as a user, to find the model. I would expect fo find all relevant models when I look for the Dutch tag, but not all of those models have cards and thus cannot be found this way.
Yes, it should be easy for people to upload new models but for the communityâs sake it seems only fair that they also provide some information about those models.
I think stas PR is better than current README because it shows working model cards.
I read the README and still screwed up a number of times. Itâs easy for me now to think itâs obvious but a mere 3 weeks ago it was very not obvious to me.