Get Evaluation Results#

To get evaluation results as a JSON response, send a GET request to the evaluation/jobs/<job_id>/results endpoint. You must provide the ID of the job as shown in the following code.


Options#

API#

curl -X "GET" "${EVALUATOR_SERVICE_URL}/v1/evaluation/jobs/<job-id>/results" \
  -H 'accept: application/json'
import requests

endpoint = f"{EVALUATOR_SERVICE_URL}/v1/evaluation/jobs/<job-id>/results"
response = requests.get(endpoint).json()
response
Example Response
{
    "created_at": "2025-03-19T22:53:43.619932",
    "updated_at": "2025-03-19T22:53:43.619934",
    "id": "evaluation_result-1234ABCD5678EFGH",
    "job": "eval-UVW123XYZ456",
    "tasks": {
        "exact_match": {
            "metrics": {
                "exact_match": {
                    "scores": {
                        "gsm8k-metric_ranking-1": {
                            "value": 0.0
                        },
                        "gsm8k-metric_ranking-3": {
                            "value": 0.8
                        }
                    }
                }
            }
        },
        "exact_match_stderr": {
            "metrics": {
                "exact_match_stderr": {
                    "scores": {
                        "gsm8k-metric_ranking-2": {
                            "value": 0.0
                        },
                        "gsm8k-metric_ranking-4": {
                            "value": 0.19999999999999998
                        }
                    }
                }
            }
        }
    },
    "groups": {
        "evaluation": {
            "metrics": {
                "evaluation": {
                    "scores": {
                        "exact_match": {
                            "value": 0.4
                        },
                        "exact_match_stderr": {
                            "value": 0.09999999999999999
                        }
                    }
                }
            }
        }
    },
    "namespace": "default",
    "custom_fields": {}
}