Release Notes#

v1.0.0#

Features#

This is the first release of Llama 3.1 NemoGuard 8B TopicControl NIM. The microservice serves a GPU-accelerated LLM model for conversational dialog moderation to keep conversations on-topic and build trustworthy LLM applications.

Known Issues#

  • The tensor parallel 4 GPU model profiles are not runnable. This is a known issue.