Are there any plans to centralize benchmarks for datasets or models? There are some links to papers and code for datasets and also a sentence transformers benchmark
New models are added often which is great, but it’s hard to track how they perform since each paper often cherry picks comparative examples.