Class InferenceClientStage
Defined in File triton_inference.hpp
Base Type
public mrc::pymrc::PythonNode<std::shared_ptr<MultiInferenceMessage>, std::shared_ptr<MultiResponseMessage>>
class InferenceClientStage : public mrc::pymrc::PythonNode<std::shared_ptr<MultiInferenceMessage>, std::shared_ptr<MultiResponseMessage>>
Perform inference with Triton Inference Server. This class specifies which inference implementation category (e.g., NLP or FIL) is required for inference.
Public Types
- using base_t = mrc::pymrc::PythonNode<std::shared_ptr<MultiInferenceMessage>, std::shared_ptr<MultiResponseMessage>>
Public Functions
InferenceClientStage(std::string model_name, std::string server_url, bool force_convert_inputs, bool use_shared_memory, bool needs_logits, std::map<std::string, std::string> inout_mapping = {})
Construct a new Inference Client Stage object.
- Parameters
model_name – : Name of the model that will handle the inference requests sent to the Triton inference server.
server_url – : Triton server URL.
force_convert_inputs – : Instructs the stage to convert the incoming data to the format that Triton expects. If set to false, data will only be converted when the conversion would not result in a loss of data.
use_shared_memory – : Whether or not to use CUDA Shared IPC Memory for transferring data to Triton. Using CUDA IPC reduces network transfer time but requires that Morpheus and Triton are located on the same machine.
needs_logits – : Determines if logits are required.
inout_mapping – : Dictionary used to map pipeline input/output names to Triton input/output names. Use this if the Morpheus names do not match the model.
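The following is a minimal sketch of calling this constructor directly. It assumes the class lives in the morpheus namespace and that the header is included as morpheus/stages/triton_inference.hpp; the model name, server address, and name mapping are illustrative placeholders, not shipped defaults.

#include <map>
#include <string>

#include "morpheus/stages/triton_inference.hpp"  // assumed include path for InferenceClientStage

int main()
{
    // Map a Morpheus tensor name to the Triton model's input name (hypothetical names).
    std::map<std::string, std::string> inout_mapping = {
        {"seq_ids", "input_ids"}};

    morpheus::InferenceClientStage stage(
        "example-model",   // model_name: model registered with the Triton server
        "localhost:8001",  // server_url: Triton endpoint
        true,              // force_convert_inputs: convert inputs to Triton's expected types
        false,             // use_shared_memory: no CUDA IPC, e.g. when Triton runs on another machine
        true,              // needs_logits: the model's output is treated as logits
        inout_mapping);

    return 0;
}

In practice the stage is normally created through the pipeline/segment builder rather than instantiated standalone; the sketch only illustrates the constructor arguments.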