NV-CLIP (Latest)

Performance

The performance of the NV-CLIP NIM is measured as the end-to-end latency of the API call. Each reported value is the average over 100 iterations.

Latency values are in seconds; throughput values are inputs per second.
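The measurement described above can be sketched roughly as follows. This is a minimal illustration, not the harness used to produce the numbers on this page: the `benchmark` function and its `call` parameter are hypothetical names, `call` stands in for whatever client code issues one API request carrying a full batch, and deriving throughput as batch size divided by mean latency is an assumption.

```python
import time
from statistics import mean


def benchmark(call, batch_size, iterations=100):
    """Time `call` end to end; return (mean latency in seconds, inputs per second)."""
    latencies = []
    for _ in range(iterations):
        start = time.perf_counter()
        call()  # hypothetical: one API request carrying `batch_size` inputs
        latencies.append(time.perf_counter() - start)
    avg_latency = mean(latencies)
    # Assumption: throughput is batch size divided by mean latency.
    return avg_latency, batch_size / avg_latency


if __name__ == "__main__":
    # Stand-in workload; replace the lambda with a real request to the service.
    latency, throughput = benchmark(lambda: time.sleep(0.01), batch_size=64, iterations=10)
    print(f"latency: {latency:.4f} s, throughput: {throughput:.2f} inputs/s")
```

Timing around the whole call, rather than only the server-side inference, means the result includes request serialization and network overhead, which is what an end-to-end figure is meant to capture.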

GPU         Precision   Input Type   Resolution   Batch Size   Latency (s)   Throughput (inputs/s)
H100 SXM    FP16        Image        350x197      64           0.2568        249.22
H100 PCIe   FP16        Image        350x197      64           0.2568        249.22
A100 SXM    FP16        Image        350x197      64           0.3968        160.57
A100 PCIe   FP16        Image        350x197      64           0.3968        160.57
L40S        FP16        Image        350x197      64           0.3562        179.67
A10G        FP16        Image        350x197      64           0.615         104.07
A6000 Ada   FP16        Image        350x197      64           0.3701        172.93
RTX 4090    FP16        Image        350x197      64           0.339         188.78
© Copyright 2024, NVIDIA Corporation. Last updated on Oct 3, 2024.