Aerial CUDA-Accelerated RAN

Aerial CUDA-Accelerated RAN 24-3 Download PDF

Aerial CUDA-Accelerated RAN brings together the Aerial software for 5G and AI frameworks and the NVIDIA accelerated computing platform, enabling TCO reduction and unlocking infrastructure monetization for telcos.

Aerial CUDA-Accelerated RAN has the following key features:

  • Software-defined, scalable, modular, highly programmable and cloud-native, without any fixed function accelerators. Enables the ecosystem to flexibly adopt necessary modules for their commercial products.

  • Full-stack acceleration of DU L1, DU L2+, CU, UPF and other network functions, enabling workload consolidation for maximum performance and spectral efficiency, leading to best-in-class system TCO.

  • General purpose infrastructure, with multi-tenancy that can power both traditional workloads and cutting-edge AI applications for best-in-class RoA.

What’s New in 24-3

The following new features are available in release 24-3 for Aerial CUDA-Accelerated RAN:

  • Aerial cuPHY: CUDA accelerated inline PHY

    • Multi-cell support for mMImO (up to 3 cells)

    • Scheduling DL in special slots

    • Increase SRS slots in 4T4R and mMIMO

    • SRS CS multiplexing for different UEs

    • UL PUSCH channel estimation at PRG level

    • RKHS channel estimation

  • Aerial E2E: System level / End-to-End validation

    • Fronthaul Port Failover Validation (Active-Standby) of C/U/S-Planes

    • Concluded Ch.8 Conformance testing with PRACH

    • MIG validation of AI + RAN

  • Aerial Redundancy/Resiliency: CUDA accelerated RAN Redundancy/Resiliency features

    • RU Health Monitor - actively detect FH connectivity issues with ORU and take corrective action

    • Introduce L1 recovery period - If L1 is running late, drop FAPI messages for some time to allow L1 to recover

    • nvIPC pcap acquisition improvements - Introduced capability to add filters (cell-id , msg-id level) to nvIPC pcap acquisition

    • Backtrace output on console - Aerial prints backtrace on console in case of crash

  • Aerial cuMAC: CUDA accelerated MAC scheduler

    • DRL MCS selection module

      • Pre-trained neural networks available under aerial_sdk/cuMAC/testVectors

      • Inference based on TensorRT

    • 64TR MU-MIMO scheduler

      • UE sorting algorithm based on SRS SNR estimates

      • UE grouping algorithm based on SRS channel coefficient estimates

    • Aperiodic SRS resource manager

      • Combined with MU-MIMO UE sorting algorithm

    • 4T4R system simulation with GPU-based TDL channel model

    • Improved algorithms & CUDA implementation for type-0 and type-1 4T4R schedulers

  • pyAerial: Python interface to Aerial cuPHY

    • CSI-RS transmission pipeline

    • RSRP and pre- and post-equalizer SINR estimation

    • Carrier frequency offset and timing advance estimation

    • CRC checking

    • OFDM fading channel simulation

    • Support of multiple UE groups for PUSCH receiver pipeline and its components

    • An improved API to PUSCH receiver pipeline and its components

What’s New in 24-2.1

The following new features are available in release 24-2.1 for Aerial CUDA-Accelerated RAN:

  • Aerial cuPHY: CUDA accelerated inline PHY

    • 64T64R Massive MIMO:

      • 100 MHz DL max combined 16 layers + UL max combined 8 layers + SRS

      • 64T64R SRS + Dynamic + Static Beamforming Weights

      • Support multiple dynamic UE groups

      • Support flexible PRG size and PRB number

      • Support SRS buffer indexing from L2

      • Support non 2^n layers

      • Use different section IDs when splitting the C-Plane section

      • FH messaging for CSIRS + PDSCH and other channel combinations

    • Support GH200+BF3 as RU emulator platform

What’s New in 24-2

The following new features are available in release 24-2 for Aerial CUDA-Accelerated RAN:

  • Aerial cuPHY: CUDA accelerated inline PHY

    • MGX Grace Hopper multicell capacity w/ telco-grade traffic model

      • 20 peak loaded 4T4R @ 100MHz

      • Capacity also validated with more challenging traffic model

        • PUSCH and PDCCH symbols in the S-slot

    • L1-L2 interface enhancements

      • Separate FAPI request timelines for PDSCH and PDCCH

  • Aerial cuMAC: CUDA accelerated MAC scheduler

    • cuMAC-Sch

      • 4T4R CUDA implementation complete

    • cuMAC-CP

      • 4T4R implementation (Functional – early access)

  • Aerial cuBB/E2E: System level / End-to-End validation

    • Over-The-Air (OTA) validation:

      • CBRS O-RU

      • 8 UE OTA w/ 6 UE/TTI for > 8 hours

    • RedHat-OCP:

      • Multicell capacity validated on MGX (GH200+BF3)

    • O-RAN Fronthaul:

      • 16-bit fixed point IQ sample validated E2E (Keysight eLSU)

      • Simultaneous dual-port FH capability (8 peak cells; 4 per port)

    • L2 integration:

      • Multi-L2 container instances per L1 validated E2E

  • pyAerial: Python interface to Aerial cuPHY

    • TensorRT inference engine

      • Jupyter notebook example using pyAerial to validate a neural PUSCH receiver

    • LDPC API improvements

      • Added soft outputs to LDPC decoder

    • LS channel estimation

    • Limited support for Grace Hopper

      • Run pyAerial together with Aerial Data Lakes

© Copyright 2024-2025, NVIDIA. Last updated on Jan 10, 2025.