Performance
The performance of NV-CLIP NIM is calculated by measuring the end to end latency of the API call. It is the average over 100 iterations.
Latency values are in seconds; throughput values are inputs per second.
GPU |
Precision |
Input Type |
Resolution |
Batch Size |
Latency |
Throughput |
---|---|---|---|---|---|---|
H100 SXM | FP16 | Image | 350x197 | 64 | 0.2568 | 249.22 |
H100 PCIe | FP16 | Image | 350x197 | 64 | 0.2568 | 249.22 |
A100 SXM | FP16 | Image | 350x197 | 64 | 0.3968 | 160.57 |
A100 PCIe | FP16 | Image | 350x197 | 64 | 0.3968 | 160.57 |
L40S | FP16 | Image | 350x197 | 64 | 0.3562 | 179.67 |
A10G | FP16 | Image | 350x197 | 64 | 0.615 | 104.07 |
A6000 Ada | FP16 | Image | 350x197 | 64 | 0.3701 | 172.93 |
RTX 4090 | FP16 | Image | 350x197 | 64 | 0.339 | 188.78 |