Release Notes
v1.0.0
Features
This is the first release of Llama 3.1 NemoGuard 8B TopicControl NIM. The microservice serves a GPU-accelerated LLM that moderates conversational dialog, keeping conversations on-topic to help build trustworthy LLM applications.
Known Issues
The model profiles that use tensor parallelism across 4 GPUs are not runnable in this release.