Performance#

The performance of NV-CLIP NIM is calculated by measuring the end to end latency of the API call. It is the average over 100 iterations.

Latency values are in seconds; throughput values are inputs per second.

GPU	Precision	Input Type	Resolution	Batch Size	Latency	Throughput
H100 SXM	FP16	Image	350x197	64	0.2568	249.22
H100 PCIe	FP16	Image	350x197	64	0.2568	249.22
A100 SXM	FP16	Image	350x197	64	0.3968	160.57
A100 PCIe	FP16	Image	350x197	64	0.3968	160.57
L40S	FP16	Image	350x197	64	0.3562	179.67
A10G	FP16	Image	350x197	64	0.615	104.07
A6000 Ada	FP16	Image	350x197	64	0.3701	172.93
RTX 4090	FP16	Image	350x197	64	0.339	188.78