Download Evaluation Results#
To download the results of an evaluation job, send a GET
request to the evaluation/jobs/<job_id>/download-results
API.
This downloads a directory that contains the configuration files, logs, and evaluation results for a specific evaluation job.
To Download Evaluation Results#
Choose one of the following options to download evaluation results.
import os
from nemo_microservices import NeMoMicroservices
# Initialize the client
client = NeMoMicroservices(
base_url=os.environ['EVALUATOR_BASE_URL']
)
# Download evaluation results
results_zip = client.evaluation.jobs.download_results("job-id")
# Save to file
results_zip.write_to_file('result.zip')
print("Download completed.")
curl -X "GET" "${EVALUATOR_BASE_URL}/evaluation/jobs/<job_id>/download-results" \
-H 'accept: application/zip' \
-o result.zip
Results#
After the download completes, the results are available in the result.zip file. To unzip the result.zip file on Ubuntu, macOS, or Linux, run the following code.
unzip result.zip -d result
You can find the result files in the results/ folder. For example, if you run an lm-harness evaluation, the results are in automatic/lm_eval_harness/results.
The directory structure will look like this:
.
├── automatic
│ └── lm_eval_harness
│ ├── model_config_meta-llama-3_1-8b-instruct.yaml
│ ├── model_config_meta-llama-3_1-8b-instruct_inference_params.yaml
│ └── results
│ ├── README.md
│ ├── lm-harness-mmlu_str.json
│ ├── lm-harness.json
│ ├── lmharness_meta-llama-3_1-8b-instruct_aggregateresults-run.log
│ ├── lmharness_meta-llama-3_1-8b-instruct_mmlu_str-run.log
└── metadata.json