Appendix F. Deployment Detail Checklist#

Implementation References#

Fast-moving implementation detail belongs in product docs and deployment guides. The paper states what the architecture requires; the references provide version-specific steps, commands, support limits, and product workflow details.

Table 16: Implementation References

Area	External reference	Use it for	Keep in this paper
NVIDIA Confidential Containers Reference Architecture	NVIDIA Confidential Containers Reference Architecture	General CoCo architecture, component roles, GPU Operator, Kata Containers, Trustee, and sample CoCo workflow.	This paper keeps only the confidential-inference trust model and deployment requirements.
NVIDIA Confidential Containers deployment	Deploy Confidential Containers	Node labeling, Kata installation, GPU Operator configuration, runtime classes, sample workloads, and operational prerequisites.	The reference implementation must name the validated runtime class, node pool, and configuration profile.
NVIDIA Confidential Containers attestation	Attestation	Trustee provisioning, KBS endpoint configuration, CPU/GPU evidence flow, NRAS, policy customization, and troubleshooting.	The architecture states what must be proven and when model keys can be released.
NVIDIA Confidential Containers supported platforms	Supported Platforms	Current GPU, CPU, host OS, kernel, runtime, KBS protocol, GPU Operator, and Kata support status.	The paper records only the validated profile used by this reference implementation.
Upstream Confidential Containers Trustee	Trustee architecture and policies	AA, CDH, KBS, AS, RVPS, KBS protocol, attestation policies, KBS resource policies, and initdata policy binding.	The paper uses the upstream terms only where they clarify the confidential-inference key-release flow.
NVIDIA confidential computing platform guidance	NVIDIA Trusted Computing Solutions	Hardware selection, BIOS/firmware prerequisites, CPU/GPU confidential-computing setup, and platform validation guidance.	The hardware profile is part of the architecture claim and must be recorded per deployment.
Kubernetes RuntimeClass	Kubernetes RuntimeClass	RuntimeClass behavior, scheduling overhead, and the Kubernetes mechanism for selecting a configured runtime handler.	CoCo uses a confidential runtime class to launch pods inside measured confidential VM sandboxes.

Reference Deployment Bill of Materials#

Table 17: Reference Deployment Bill of Materials

BOM area	Items to capture	Owner
Server and accelerator hardware	Server model/count, CPU TEE, GPU SKU/form factor/count, memory, NVMe, NICs/DPU, switch/fabric role, management network	OEM/integrator with NVIDIA and platform operator
Firmware and host stack	BIOS/UEFI versions, secure boot, IOMMU/VFIO, CPU TEE settings, GPU CC mode, GPU firmware, host OS, kernel	OEM/integrator and platform operator
Kubernetes and runtime stack	Kubernetes platform, confidential runtime, RuntimeClass, node labels, admission/RBAC, network policy	Platform operator and CC software provider
Key-release services	KBS, Attestation Service, policy, KMS/HSM settings, reference values, collateral cache, audit/SIEM integration	CC software provider and key-release authority
NVIDIA software	GPU Operator, driver version, CUDA/runtime, device plugin, GPU CC mode, topology	NVIDIA and platform operator
Workload artifacts	Signed workload image, encrypted model artifact, artifact store, runtime policy, startup service, measurements, key IDs, policy versions	Model provider and ISV
Network and identity	DNS, Route/gateway/load balancer, firewall rules, allowed egress endpoints, TLS certificates, service identities, SIEM targets	Platform operator and security team
Validation assets	Non-sensitive sample key/model, positive/negative tests, expected logs/errors, runbook links	OEM/integrator and solution architects