About the Audio Super-Resolution Effect#

The Audio Super-Resolution effect upsamples the audio. For low-frequency audio, this feature predicts the higher frequency spectrum of input audio, which improves audio quality.

Note

In this guide, the term Super-Resolution is used interchangeably with Superres and Superresolution (referred to as superres in the API).

This effect has the following characteristics:

  • Supported input/output audio format is 32-bit float audio.

Note

The main purpose of this effect is to enhance the sampling rate of input audio. The level of enhancement seen in the output audio depends on the type of audio.

Audio that is captured on Windows with the audio enhancement settings disabled produces better superres outputs than when this setting is enabled.

To run the sample application on Windows for this effect, use the following command:

# (One time, initial setup): Download models using models/download_models.ps1
powershell -ExecutionPolicy Bypass -File ./download_models.ps1 --gpu_architecture <gpu> --effects superres-8k_to_16k,superres-16k_to_48k

# Format: run_effect_demo.bat <architecture> <effect> <input_sample_rate> <output_sample_rate>

# 8k - 16k effect
run_effect_demo.bat turing superres 8k 16k

# 16k - 48k effect
run_effect_demo.bat ampere superres 16k 48k

Note

For more information, see Use the Helper Script to Run the Sample Application.

To run the sample application on Linux for this effect, use the following command:

# (One time, initial setup): Download models using models/download_models.sh
./download_models.sh --gpu <gpu> --effects superres-8k_to_16k,superres-16k_to_48k

# Refer to Section 3.2 for further details
Format: ./run_effect.sh -g <gpu> -s <sample_rate> -e superres

# 16k effect
./run_effect.sh -g t4 -s 16 -e superres

# 48k effect
./run_effect.sh -g t4 -s 48 -e superres

Note

For more information, see Use the Helper Script to Run the Sample Application.

  • Supported upsampling of 8-kHz input audio to 16-kHz output (2x) and 16-kHz input audio to 48-kHz output (3x).

  • In the Linux SDK, this effect has the following maximum throughput (the number of batches supported in real time):

    Architecture

    Maximum Throughput for the 8K-to-16K Effect

    Maximum Throughput for the 16K-to-48K Effect

    T4

    410

    180

    A100

    2110

    920

    A10

    910

    380

    L40

    3450

    1350

    H100

    2430

    1030

    B100

    5720

    2330

    RTX PRO 6000

    3940

    1650