aiq.eval.rag_evaluator.evaluate#

Attributes#

Classes#

Module Contents#

logger#
class RAGEvaluator(
evaluator_llm: ragas.llms.LangchainLLMWrapper,
metrics: collections.abc.Sequence[ragas.metrics.Metric],
)#
evaluator_llm#
metrics#
static eval_input_to_ragas(
eval_input: aiq.eval.evaluator.evaluator_model.EvalInput,
) ragas.EvaluationDataset#

Converts EvalInput into a Ragas-compatible EvaluationDataset.

ragas_to_eval_output(
eval_input: aiq.eval.evaluator.evaluator_model.EvalInput,
results_dataset: ragas.dataset_schema.EvaluationResult | None,
) aiq.eval.evaluator.evaluator_model.EvalOutput#

Converts the ragas EvaluationResult to aiq EvalOutput

async evaluate(
eval_input: aiq.eval.evaluator.evaluator_model.EvalInput,
) aiq.eval.evaluator.evaluator_model.EvalOutput#

Run Ragas metrics evaluation on the provided EvalInput