nemo_curator.core.serve.base

View as Markdown

Module Contents

Classes

NameDescription
BaseModelConfigBase public model config shared by inference backends.
BaseServerConfigBase server-level config; subclasses declare which model config types they accept.
InferenceBackendBase class for inference server backend implementations.

API

class nemo_curator.core.serve.base.BaseModelConfig(
model_identifier: str,
model_name: str | None = None,
runtime_env: dict[str, typing.Any] = dict()
)
Dataclass

Base public model config shared by inference backends.

model_identifier
str
model_name
str | None = None
resolved_model_name
str
runtime_env
dict[str, Any] = field(default_factory=dict)
nemo_curator.core.serve.base.BaseModelConfig._merge_package_runtime_env(
key: str,
base: dict[str, typing.Any] | list[str] | None,
override: dict[str, typing.Any] | list[str] | None
) -> dict[str, typing.Any] | list[str]
staticmethod
nemo_curator.core.serve.base.BaseModelConfig.merge_runtime_envs(
base: dict[str, typing.Any],
override: dict[str, typing.Any] | None
) -> dict[str, typing.Any]
staticmethod

Merge two runtime_env dicts while preserving package lists.

class nemo_curator.core.serve.base.BaseServerConfig()
Dataclass

Base server-level config; subclasses declare which model config types they accept.

model_configs
tuple[type[BaseModelConfig], ...] = ()
class nemo_curator.core.serve.base.InferenceBackend()
Abstract

Base class for inference server backend implementations.

nemo_curator.core.serve.base.InferenceBackend.start() -> None
abstract
nemo_curator.core.serve.base.InferenceBackend.stop() -> None
abstract