Hello everybody,
I have been looking at the RAGAS library, and my question is: how do they handle the responses of the evaluation language model, and how do they calculate the metrics from those responses?
If I were to implement RAGAS myself, I would first prompt the evaluation language model (for example, Llama-2). Next, I would extract the results from its response with regular expressions. As the last step, I would calculate the values that RAGAS describes in their paper.
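To make this concrete, here is a rough sketch of what I have in mind. It is only my own illustration, not RAGAS code: the "Verdict: Yes/No" response format, the sample text, and the function names are all assumptions I made up, and the score is a simple ratio of supported statements to all statements, in the spirit of the faithfulness metric.

```python
import re

# Hypothetical raw output from the evaluation model. The "Verdict: Yes/No"
# format is an assumption for illustration, not the format RAGAS uses.
raw_response = """\
1. Einstein was born in Germany. Verdict: Yes
2. Einstein was born in 1880. Verdict: No
3. Einstein developed the theory of relativity. Verdict: Yes
"""

# Matches "Verdict: yes" or "Verdict: no", case-insensitively.
VERDICT_PATTERN = re.compile(r"verdict:\s*(yes|no)\b", re.IGNORECASE)


def extract_verdicts(text: str) -> list[bool]:
    """Return one boolean per verdict found in the model output."""
    return [m.group(1).lower() == "yes" for m in VERDICT_PATTERN.finditer(text)]


def faithfulness_score(verdicts: list[bool]) -> float:
    """Supported statements divided by all statements (nan if none were parsed)."""
    if not verdicts:
        return float("nan")
    return sum(verdicts) / len(verdicts)


verdicts = extract_verdicts(raw_response)
score = faithfulness_score(verdicts)
print(verdicts, score)
```

The weak point of this approach is the regular expression: if the model does not follow the expected output format, no verdicts are found and the score is undefined, which is why I return nan in that case.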
However, when I look through the RAGAS library, I can’t find the part that handles the responses.
How do they do that?
Best regards,
Christian