Google Sheets Exporter (gsheets
)#
Exports accuracy metrics to a Google Sheet. Dynamically creates/extends header columns based on observed metrics and appends one row per job.
Purpose: Centralized spreadsheet for tracking results across runs
Requirements:
gspread
installed and a Google service account with access
Usage#
Export evaluation results to a Google Sheets spreadsheet for easy sharing and analysis.
Export results from a specific evaluation run to Google Sheets:
# Export results using default spreadsheet name
nemo-evaluator-launcher export 8abcd123 --dest gsheets
# Export with custom spreadsheet name and ID
nemo-evaluator-launcher export 8abcd123 --dest gsheets \
-o export.gsheets.spreadsheet_name="My Results" \
-o export.gsheets.spreadsheet_id=1ABC...XYZ
Export results programmatically with custom configuration:
from nemo_evaluator_launcher.api.functional import export_results
# Basic export to Google Sheets
export_results(
invocation_ids=["8abcd123"],
dest="gsheets",
config={
"spreadsheet_name": "NeMo Evaluator Launcher Results"
}
)
# Export with service account and filtered metrics
export_results(
invocation_ids=["8abcd123", "9def4567"],
dest="gsheets",
config={
"spreadsheet_name": "Model Comparison Results",
"service_account_file": "/path/to/service-account.json",
"log_metrics": ["accuracy", "f1_score"]
}
)
Configure Google Sheets export in your evaluation YAML file for automatic export on completion:
execution:
auto_export:
destinations: ["gsheets"]
# Export-related env vars (optional for GSheets)
env_vars:
export:
PATH: "/path/to/conda/env/bin:$PATH"
export:
gsheets:
spreadsheet_name: "LLM Evaluation Results"
spreadsheet_id: "1ABC...XYZ" # optional: use existing sheet
service_account_file: "/path/to/service-account.json"
log_metrics: ["accuracy", "pass@1"]
Key Configuration#
Parameter |
Type |
Description |
Default/Notes |
---|---|---|---|
|
str, optional |
Path to service account JSON |
Uses default credentials if omitted |
|
str, optional |
Target spreadsheet name. Used to open existing sheets or name new ones. |
Default: “NeMo Evaluator Launcher Results” |
|
str, optional |
Target spreadsheet ID. Find it in the spreadsheet URL: |
Required if your service account can’t create sheets due to quota limits. |
|
list[str], optional |
Filter metrics to log |
All metrics if omitted |