CUDA_MPS_ACTIVE_THREAD_PERCENTAGE

On Volta GPUs, setting this variable in an MPS client’s environment constrains the portion of each device’s available threads that the client can use. This environment variable can only further constrain the limit set by the control daemon (via the set_default_active_thread_percentage or set_active_thread_percentage control daemon commands). If the control daemon has a lower setting, the client obeys the control daemon’s setting instead.
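
The interaction between the two settings amounts to taking the lower of the two values. The following is a minimal sketch of that rule, not MPS code: the helper name effective_thread_percentage is hypothetical, the daemon limit is passed in as a plain number rather than queried from the control daemon, and the accepted range of (0, 100] is an assumption made for illustration.

```python
import os

ENV_VAR = "CUDA_MPS_ACTIVE_THREAD_PERCENTAGE"


def effective_thread_percentage(daemon_limit, environ=None):
    """Illustrative only: return the lower of the daemon and client limits.

    daemon_limit stands in for the value set with
    set_default_active_thread_percentage or set_active_thread_percentage.
    environ defaults to the current process environment.
    """
    env = os.environ if environ is None else environ
    raw = env.get(ENV_VAR)
    if raw is None:
        # Client did not set the variable, so the daemon limit applies as-is.
        return float(daemon_limit)

    client_limit = float(raw)
    # Assumed valid range for this sketch: greater than 0, at most 100.
    if not 0.0 < client_limit <= 100.0:
        raise ValueError(f"{ENV_VAR} out of range: {raw!r}")

    # The client value can only tighten the daemon limit, never loosen it.
    return min(float(daemon_limit), client_limit)


# Client asks for less than the daemon allows: the client value wins.
lower = effective_thread_percentage(50, {ENV_VAR: "25"})   # 25.0
# Client asks for more than the daemon allows: the daemon value wins.
capped = effective_thread_percentage(50, {ENV_VAR: "80"})  # 50.0
```

In practice the variable has to be present in the client's environment when the client process starts, for example by exporting it in the launching shell or passing it through the env argument of subprocess.run.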