NVIDIA Self-Hosted Service Limitations

Routing

  1. Number of tasks:
    • For H100 : Tasks <= 20000 per API call.

    • For A100 : Tasks <= 15000 per API call.

  2. Number of vehicles:
    • Vehicles <= 3000 per API call is suggested.

  3. Capacity and demand constraints:
    • Only 3 dimensions are supported.

  4. Time windows:
    • All time window values are expected to be >= 0 and are of type int32, so the supported time window range is [0, 2^31-1].

    • Interpreted as Unix timestamps in seconds, this translates to [January 1, 1970, 00:00:00, January 19, 2038, 03:14:07] in UTC.

  5. Number of vehicle types, which also dictates the number of cost and time matrices:
    • Types <= 10 is suggested.
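The routing limits listed above can be checked before a request is sent. The following is a minimal sketch in Python, not part of the service itself: the function name `check_routing_request`, its parameters, and the constant names are hypothetical and chosen only for illustration. The numeric values are copied from the list above. The last lines confirm the upper end of the time window range under the assumption that values are Unix timestamps in seconds.

```python
from datetime import datetime, timezone

# Limits copied from the list above; all names here are hypothetical.
MAX_TASKS = {"H100": 20_000, "A100": 15_000}
SUGGESTED_MAX_VEHICLES = 3_000
MAX_CAPACITY_DIMENSIONS = 3
SUGGESTED_MAX_VEHICLE_TYPES = 10
INT32_MAX = 2**31 - 1


def check_routing_request(gpu, n_tasks, n_vehicles, n_capacity_dims,
                          n_vehicle_types, time_windows):
    """Return a list of problems found; an empty list means within limits."""
    problems = []

    if gpu not in MAX_TASKS:
        problems.append(f"no task limit is listed for GPU {gpu!r}")
    elif n_tasks > MAX_TASKS[gpu]:
        problems.append(f"{n_tasks} tasks exceeds {MAX_TASKS[gpu]} for {gpu}")

    if n_vehicles > SUGGESTED_MAX_VEHICLES:
        problems.append(f"{n_vehicles} vehicles exceeds the suggested "
                        f"{SUGGESTED_MAX_VEHICLES}")

    if n_capacity_dims > MAX_CAPACITY_DIMENSIONS:
        problems.append(f"{n_capacity_dims} capacity dimensions exceeds "
                        f"{MAX_CAPACITY_DIMENSIONS}")

    if n_vehicle_types > SUGGESTED_MAX_VEHICLE_TYPES:
        problems.append(f"{n_vehicle_types} vehicle types exceeds the "
                        f"suggested {SUGGESTED_MAX_VEHICLE_TYPES}")

    # Every time window bound must fit in a non-negative int32.
    for index, (earliest, latest) in enumerate(time_windows):
        for value in (earliest, latest):
            if not 0 <= value <= INT32_MAX:
                problems.append(f"time window {index} has value {value} "
                                f"outside [0, {INT32_MAX}]")

    return problems


# Upper end of the range, read as Unix seconds in UTC.
latest_supported = datetime.fromtimestamp(INT32_MAX, tz=timezone.utc)
print(latest_supported.isoformat())
```

Returning a list of messages rather than raising on the first violation lets a caller report every exceeded limit at once. Note that the vehicle and vehicle type values are suggestions in the list above, so a caller may prefer to treat those entries as warnings.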

Linear Programming

  1. For H100:
    • 10M rows/constraints, 10M columns/variables, and 2B non-zeros in the constraint matrix.

    • 74.5M rows/constraints, 74.5M columns/variables, and 1.49B non-zeros in the constraint matrix.
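As a rough illustration, a linear program's dimensions can be compared against the H100 figures above before submission. This sketch rests on an assumption the text does not state: that the two listed sizes are alternative envelopes, so a problem is treated as in range when it fits inside at least one of them. The names `H100_LP_SIZES` and `fits_listed_lp_size` are hypothetical.

```python
# Sizes listed above for H100, as (rows, columns, non-zeros).
# Treating them as alternative envelopes is an assumption of this sketch.
H100_LP_SIZES = [
    (10_000_000, 10_000_000, 2_000_000_000),
    (74_500_000, 74_500_000, 1_490_000_000),
]


def fits_listed_lp_size(rows, cols, nnz, sizes=H100_LP_SIZES):
    """Return True if the problem fits within at least one listed size."""
    return any(
        rows <= max_rows and cols <= max_cols and nnz <= max_nnz
        for max_rows, max_cols, max_nnz in sizes
    )
```

For example, `fits_listed_lp_size(50_000_000, 50_000_000, 1_000_000_000)` is true under this assumption because the problem fits inside the second listed size, while the same dimensions with 1.8B non-zeros fit neither.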

Mixed Integer Linear Programming

  1. Number of non-zeros (coefficient matrix size) supported:
    • For H100 : 27 million.