nemo_curator.models.client.llm_client
nemo_curator.models.client.llm_client
nemo_curator.models.client.llm_client
Interface representing a client connecting to an LLM inference server and making requests asynchronously
Internal implementation of query_model without retry/concurrency logic. Subclasses should implement this method instead of query_model.
Query the model with automatic retry and concurrency control.
Setup the client.
Represents a way of formatting a conversation with an LLM such that it can response appropriately
Configuration class for LLM generation parameters.
Interface representing a client connecting to an LLM inference server and making requests synchronously
Setup the client.