nemo_curator.core.serve.ray_serve.backend
nemo_curator.core.serve.ray_serve.backend
nemo_curator.core.serve.ray_serve.backend
Bases: InferenceBackend
Ray Serve backend for InferenceServer.
Best-effort cleanup after a failed deploy.
Deploy models onto the connected Ray cluster.
Return a runtime_env dict that suppresses per-request logs.
Reset Ray Serve’s cached controller client.
TODO: Remove this once https://github.com/ray-project/ray/issues/61608 is fixed.
Translate a typed Ray Serve model config into LLMConfig.
Connect to Ray, deploy the models, and detach the driver.
Reconnect to Ray and tear down Ray Serve.