EvaluationParamsParam

class nemo_microservices.types.EvaluationParamsParam

Bases: TypedDict
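
Because the class is a TypedDict, a value of this type is an ordinary dict at runtime. The sketch below uses a local stand-in class, `EvaluationParamsSketch`, that mirrors the fields documented on this page, so it runs without the package installed. The stand-in name, the `total=False` setting (every key optional), and all the values are assumptions made for illustration; none of them is confirmed by this page.

```python
from typing import List, TypedDict, Union


class EvaluationParamsSketch(TypedDict, total=False):
    """Local stand-in mirroring the documented fields (assumption: all optional)."""

    extra: object
    limit_samples: int
    max_retries: int
    max_tokens: int
    parallelism: int
    request_timeout: int
    stop: Union[str, List[str]]
    temperature: float
    top_p: float


# Illustrative values only; pick values that suit your evaluation job.
params: EvaluationParamsSketch = {
    "limit_samples": 100,
    "max_retries": 3,
    "max_tokens": 512,
    "parallelism": 8,
    "stop": ["\n\n"],
    "temperature": 0.7,
    "top_p": 0.9,
}

# A TypedDict adds static type information only; the value is a plain dict.
print(type(params).__name__, sorted(params))
```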

extra: object

Any other custom parameters.

limit_samples: int

Limits the number of evaluation samples.

max_retries: int

Maximum number of retries for failed requests.

max_tokens: int

The maximum number of tokens to generate.

parallelism: int

Parallelism to be used for the evaluation job.

Typically, this represents the maximum number of concurrent requests made to the model.
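
To illustrate what a cap on concurrent requests means, the sketch below bounds concurrency with a thread pool. It does not show how the evaluation service schedules requests, which this page does not describe. `fake_model_request` and the value `4` are hypothetical.

```python
import threading
import time
from concurrent.futures import ThreadPoolExecutor

parallelism = 4  # hypothetical value for the `parallelism` field

_lock = threading.Lock()
_active = 0
peak_active = 0


def fake_model_request(sample_id: int) -> int:
    """Hypothetical stand-in for one request to the model."""
    global _active, peak_active
    with _lock:
        _active += 1
        peak_active = max(peak_active, _active)
    time.sleep(0.01)  # simulate request latency
    with _lock:
        _active -= 1
    return sample_id


# At most `parallelism` requests are in flight at any moment.
with ThreadPoolExecutor(max_workers=parallelism) as pool:
    results = list(pool.map(fake_model_request, range(20)))

print(f"peak concurrent requests: {peak_active}")
```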

request_timeout: int

The timeout to be used for requests made to the model.

stop: str | List[str]

Up to 4 sequences where the API will stop generating further tokens.
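
Since `stop` accepts either a single string or a list of strings, client code may want to normalize it and check the documented limit of 4 before submitting a job. `normalize_stop` below is a hypothetical helper and is not part of the library. Whether the service itself rejects longer lists is not stated on this page.

```python
from typing import List, Union

MAX_STOP_SEQUENCES = 4  # from the field description: "Up to 4 sequences"


def normalize_stop(stop: Union[str, List[str]]) -> List[str]:
    """Return `stop` as a list and enforce the documented limit (hypothetical helper)."""
    sequences = [stop] if isinstance(stop, str) else list(stop)
    if len(sequences) > MAX_STOP_SEQUENCES:
        raise ValueError(
            f"expected at most {MAX_STOP_SEQUENCES} stop sequences, got {len(sequences)}"
        )
    return sequences
```

For example, `normalize_stop("###")` yields a one-element list, while a list of five sequences raises `ValueError`.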

temperature: float

Float value between 0 and 1.

A temperature of 0 indicates greedy decoding, where the token with the highest probability is chosen. Temperature cannot currently be set to 0.0.
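
The effect of temperature can be sketched with the standard temperature-scaled softmax. This shows the general technique, not the model server's implementation. `softmax_with_temperature` and the example logits are hypothetical. The function rejects non-positive temperatures, in line with the note that 0.0 cannot currently be set.

```python
import math
from typing import List


def softmax_with_temperature(logits: List[float], temperature: float) -> List[float]:
    """Convert logits to probabilities after dividing by the temperature."""
    if temperature <= 0.0:
        raise ValueError("temperature must be greater than 0")
    scaled = [x / temperature for x in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(s - peak) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]


logits = [2.0, 1.0, 0.0]  # hypothetical scores for three tokens
sharp = softmax_with_temperature(logits, 0.1)  # close to greedy decoding
flat = softmax_with_temperature(logits, 1.0)   # more spread out
```

As the temperature approaches 0, nearly all probability mass moves to the highest-scoring token, which is why a temperature of 0 corresponds to greedy decoding.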

top_p: float

Float value between 0 and 1. Limits sampling to the most likely tokens whose combined probability falls within this value. top_p=0 means the model will only consider the single most likely token for the next prediction.
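
The behavior described for `top_p` matches generic nucleus (top-p) filtering, sketched below. This illustrates the general technique under that assumption and is not the model server's code. `top_p_filter` and the probabilities are hypothetical.

```python
from typing import List


def top_p_filter(probs: List[float], top_p: float) -> List[int]:
    """Return the indices of the tokens kept by a generic top-p filter."""
    if not 0.0 <= top_p <= 1.0:
        raise ValueError("top_p must be between 0 and 1")
    # Rank token indices from most to least likely.
    ranked = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)
    kept: List[int] = []
    cumulative = 0.0
    for i in ranked:
        kept.append(i)
        cumulative += probs[i]
        # Stop once the kept tokens cover the requested probability mass.
        if cumulative >= top_p:
            break
    return kept


probs = [0.15, 0.5, 0.05, 0.3]  # hypothetical next-token probabilities
only_best = top_p_filter(probs, 0.0)  # keeps just the most likely token
nucleus = top_p_filter(probs, 0.7)    # keeps the two most likely tokens
```

With `top_p=0.0` the loop stops after the first token, which is the single-most-likely-token case described above.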