nemo_curator.models.client.llm_client
Module Contents
Classes
API
Abstract
Interface representing a client connecting to an LLM inference server and making requests asynchronously
asyncabstract
Internal implementation of query_model without retry/concurrency logic. Subclasses should implement this method instead of query_model.
async
Query the model with automatic retry and concurrency control.
abstract
Setup the client.
Abstract
Represents a way of formatting a conversation with an LLM such that it can response appropriately
abstract
Dataclass
Configuration class for LLM generation parameters.
extra_kwargs
max_tokens
n
seed
stop
stream
temperature
top_k
top_p
Abstract
Interface representing a client connecting to an LLM inference server and making requests synchronously
abstract
abstract
Setup the client.