4.7.0-1 Release Notes

Highlights:

Added a new pangenome germline pipeline, pangenome_germline, for GPU-accelerated pangenome alignment and variant calling.
Added support for NVIDIA GB10 (DGX Spark).
Added --enable-small-model support in deepvariant for short-read (WGS), PacBio, and ONT modes.
Updated minimap2 to v2.30, with new presets, new options, and performance improvements.
Updated giraffe to v1.70, with performance improvements in single-ended mode.
Performance improvements for rna_fq2bam.

New Tools

Parabricks 4.7.0 introduces pangenome_germline, a GPU-accelerated end-to-end pipeline that integrates GPU-accelerated Giraffe with pangenome-aware DeepVariant to streamline the workflow from FASTQ to VCF.

Improvements

All tools:

Added support for NVIDIA GB10 (DGX Spark).

Tool Updates

fq2bam and fq2bam_meth:

Improved performance on Hopper and Blackwell GPUs, with more modest gains on older architectures.
Raised the maximum read length that can be processed on the GPU from 500 bp to 512 bp.
- When --max-read-length is set above its default of 480 bp but no higher than 512 bp, all reads are processed on the GPU. If it is set above 512 bp, reads longer than 512 bp are processed on the CPU through the recovery mechanism.
Added support for setting the insert size distribution manually by passing the baseline BWA -I option through --bwa-options.
- For example: --bwa-options="-I 250.54,20.25,500,20". The user interface is unchanged.
Added the --cigar-on-gpu option to offload CIGAR generation from the CPU (default) to the GPU; enabling this option may improve performance for CPU-bound runs.
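The -I value passed through --bwa-options in the example above is a comma-separated list. The sketch below assembles it, assuming the field order documented for baseline BWA-MEM (mean, standard deviation, maximum, minimum, with trailing fields optional). The helper name bwa_insert_size_option is hypothetical and is not part of Parabricks.

```python
def bwa_insert_size_option(mean, stddev=None, max_size=None, min_size=None):
    """Build a BWA-MEM style "-I" string for use inside --bwa-options.

    Assumed field order (from baseline BWA-MEM): mean, standard deviation,
    maximum, minimum. Trailing fields may be omitted; gaps are not allowed.
    """
    fields = [mean, stddev, max_size, min_size]
    # Drop omitted trailing fields.
    while fields and fields[-1] is None:
        fields.pop()
    # A missing field followed by a provided one cannot be expressed.
    if not fields or any(f is None for f in fields):
        raise ValueError("fields must be given in order, without gaps")
    if mean <= 0:
        raise ValueError("mean insert size must be positive")
    return "-I " + ",".join(str(f) for f in fields)


# Reproduces the value shown in the release note example.
option = bwa_insert_size_option(250.54, 20.25, 500, 20)
print(f'--bwa-options="{option}"')
```

The resulting string is then passed to fq2bam or fq2bam_meth exactly as in the example above.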

fq2bam, fq2bam_meth, giraffe, minimap2, rna_fq2bam, bamsort, and associated pipelines:

Improved performance when using --gpuwrite.
Added a CPU recovery mode for --gpusort so that processing can continue if the GPU runs out of memory.
Reduced the host (CPU) memory requirements of the GPU CRAM writer. Reductions of up to 70% are possible, depending on available host memory.

fq2bam, giraffe, and markdup:

Enabled read group ID comparison in markdup single-ended mode. This can be disabled by passing --ignore-rg-markdups-single-ended.
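As a rough illustration of what read group comparison can change, the toy model below groups single-ended reads by a duplicate key. It assumes that, with the comparison enabled, reads from different read groups are never treated as duplicates of each other. This is a simplification for illustration, not the Parabricks implementation, and the record fields and function name are hypothetical.

```python
from collections import defaultdict


def duplicate_groups(reads, compare_read_group=True):
    """Return lists of read names that share a duplicate key (toy model)."""
    groups = defaultdict(list)
    for read in reads:
        # Simplified key: real duplicate marking uses more than this.
        key = (read["chrom"], read["pos"], read["strand"])
        if compare_read_group:
            # Assumption: the read group ID becomes part of the key.
            key += (read["rg"],)
        groups[key].append(read["name"])
    # Only groups with more than one read contain duplicates.
    return [names for names in groups.values() if len(names) > 1]


reads = [
    {"name": "r1", "chrom": "chr1", "pos": 100, "strand": "+", "rg": "A"},
    {"name": "r2", "chrom": "chr1", "pos": 100, "strand": "+", "rg": "A"},
    {"name": "r3", "chrom": "chr1", "pos": 100, "strand": "+", "rg": "B"},
]

with_rg = duplicate_groups(reads, compare_read_group=True)
without_rg = duplicate_groups(reads, compare_read_group=False)
```

In this model, r3 joins the duplicate group of r1 and r2 only when the read group is ignored, which corresponds to passing --ignore-rg-markdups-single-ended.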

deepvariant, deepvariant_germline, pacbio_germline, and ont_germline:

Added support for the small model with --enable-small-model. This option is supported for short-read (WGS), PacBio, and ONT modes.
Improved host memory usage for long-read (PacBio and ONT) mode.

minimap2:

Updated minimap2 to v2.30.
Improved performance in PacBio mode and better GPU utilization.
Improved performance in splice presets by implementing a new alignment batching algorithm.
Added support for --preset options lr:hq and splice:sr.
Added the --copy-comment argument for appending FASTQ comments to the BAM output with an auxiliary tag.
Added the --md argument for outputting the MD tag.
Added the --both-strands argument, which forces minimap2 to consider both strands when finding canonical GT-AG splice sites.

haplotypecaller:

Added --exclude-intervals (-XL) to specify genomic intervals to exclude from processing.
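Conceptually, excluding intervals means removing them from the regions that would otherwise be processed. The sketch below shows that subtraction on a single contig, assuming half-open (start, end) coordinates. The function name is hypothetical and the code does not reflect how haplotypecaller implements the option internally.

```python
def subtract_intervals(region, excluded):
    """Return the parts of region not covered by any excluded interval.

    All intervals are half-open (start, end) tuples on the same contig.
    """
    start, end = region
    kept = []
    cursor = start  # left edge of the part not yet accounted for
    for ex_start, ex_end in sorted(excluded):
        if ex_end <= cursor:
            continue  # exclusion lies entirely before the cursor
        if ex_start >= end:
            break  # exclusion lies entirely after the region
        if ex_start > cursor:
            kept.append((cursor, ex_start))
        cursor = max(cursor, ex_end)
        if cursor >= end:
            break
    if cursor < end:
        kept.append((cursor, end))
    return kept


remaining = subtract_intervals((0, 100), [(10, 20), (50, 60)])
```

Here remaining holds the three stretches of the region that are left once both exclusions are removed.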

rna_fq2bam:

Added auto mode: when --num-streams-per-gpu and/or --num-threads are omitted, they are configured automatically based on GPU and system memory. Providing explicit values overrides the automatic settings.
Improved performance through code optimizations.
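For scripted runs, the auto mode above suggests leaving the tuning flags out unless there is a reason to pin them. A minimal sketch of that pattern follows. The helper name is hypothetical, and only the two options named in these notes are handled; the remaining rna_fq2bam arguments are left to the caller.

```python
def rna_fq2bam_tuning_args(num_streams_per_gpu=None, num_threads=None):
    """Return only the tuning flags that were set explicitly.

    Omitted flags are left out of the command line, which, per the note
    above, lets rna_fq2bam choose those values automatically.
    """
    args = []
    if num_streams_per_gpu is not None:
        args += ["--num-streams-per-gpu", str(num_streams_per_gpu)]
    if num_threads is not None:
        args += ["--num-threads", str(num_threads)]
    return args


auto_args = rna_fq2bam_tuning_args()                  # rely on auto mode
pinned_args = rna_fq2bam_tuning_args(num_threads=16)  # override one value
```

The returned list can be appended to the rest of the rna_fq2bam command line.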

giraffe:

Updated to match baseline VG Giraffe v1.70.
Added GPU-accelerated Finalize Mappings (Dozeu) for single-end alignment.
Introduced auto mode for --nstreams, which configures the number of streams automatically based on the GPU's device memory. Auto mode provides sensible defaults for most systems, but specific configurations may still benefit from manual tuning.
Removed the --minimizers-gpu-sort option, as it is no longer needed.
Now used as the pangenome alignment stage in the new pangenome_germline pipeline.

Bug Fixes

mutectcaller:

Added the missing read group (RG) header line in the normal sample BAM/CRAM output defined by --mutect-bam-output.

fq2bam, fq2bam_meth, giraffe, minimap2, markdup, rna_fq2bam, and associated pipelines:

Addressed an issue preventing --markdups-assume-sortorder-queryname from taking effect when combined with --gpusort.

deepvariant, deepsomatic, and pangenome_aware_deepvariant:

Fixed a potential GPU out-of-memory condition when --num-streams-per-gpu was set to auto (default).

rna_fq2bam:

Fixed missing alignments for reads with a high degree of multimapping.

fq2bam and fq2bam_meth:

Fixed a memory leak that occurred during CPU recovery.
Fixed an issue where --bwa-nstreams auto (the default) combined with --gpuwrite could run out of device memory on GPUs with more than 16 GB but less than 22 GB of memory.

fq2bam, fq2bam_meth, giraffe, minimap2, rna_fq2bam, bamsort, and associated pipelines:

Fixed a potential out-of-bounds device write when using --gpuwrite for accelerated BAM and CRAM writing. The issue was more likely to occur on ARM systems such as NVIDIA GB10.

fq2bam_meth:

Fixed --preserve-file-symlinks to work properly with methylation reference genomes and their indexes.

giraffe:

Fixed a potential deadlock in paired-end mode that could occur when an input file had very few batches and all of them entered CPU recovery mode.

minimap2:

Fixed an issue where null padding could appear in output query names when using BAM input.

For further information see the Parabricks datasheet.
