Tools, datasets, and benchmarks in AI Safety

Hello,

I am trying to do a landscape analysis of the benchmarks, datasets, and tools that already exist in the AI Safety field, starting with what is available on Hugging Face. I am not sure how to approach the search across the Hugging Face ecosystem. Can anyone recommend specific tools, datasets, or benchmarks, or an approach to searching?