nemo_microservices.types.beta.safe_synthesizer.generate_parameters_param#
Module Contents#
Classes#
API#
- class nemo_microservices.types.beta.safe_synthesizer.generate_parameters_param.GenerateParametersParam#
Bases:
typing_extensions.TypedDict- invalid_fraction_threshold: float#
None
The fraction of invalid records that will stop generation after the
patiencelimit is reached.
- num_records: int#
None
Number of records to generate.
- patience: int#
None
Number of consecutive generations where the
invalid_fraction_thresholdis reached before stopping generation.
- repetition_penalty: float#
None
The value used to control the likelihood of the model repeating the same token.
- structured_generation_backend: typing_extensions.Literal[auto, xgrammar, guidance, outlines, lm-format-enforcer]#
None
The backend used by VLLM when use_structured_generation=True.
Supported backends (from vllm) are ‘outlines’, ‘guidance’, ‘xgrammar’, ‘lm-format-enforcer’. ‘auto’ will allow vllm to choose the backend.
- structured_generation_schema_method: typing_extensions.Literal[regex, json_schema]#
None
The method used to generate the schema from your dataset and pass it to the generation backend. auto will usually default to ‘json_schema’. Use ‘regex to use our custom regex construction method, which tends to be more comprehensive than ‘json_schema’ at the cost of speed.
- temperature: float#
None
Sampling temperature.
- top_p: float#
None
Nucleus sampling probability.
- use_structured_generation: bool#
None
Use structured generation.