Evaluating RAG only with open-source models

Hi,
are there any RAG evaluation tools that rely solely on open-source models (for example, using only HuggingFace models)? I would like to load both the model that generates responses and the model that judges them from HuggingFace, and then compute metrics from the judge's scores.