Job Target and Configuration Matrix#
Because NeMo Evaluator separates the target and the configuration, you can create a configuration once, and reuse it multiple times with different targets. You can also create a target once, and reuse it multiple times with different evaluations. The following table gives examples of how to combine targets and configurations for an evaluation job.
Evaluation Type |
Example Configurations |
Example Targets |
Custom Data Options |
---|---|---|---|
BigCode Evaluation Harness |
— |
||
LM Evaluation Harness |
— |
||
Similarity Metrics |
|||
LLM As A Judge |
— |
||
Retriever Pipeline |
|||
RAG Pipeline |
Caution
Before you can run a retriever evaluation, or a RAG evaluation that has a retriever step, you must set up a Milvus document store. For more information, refer to Milvus.