4.7.0-1 Release Notes
Highlights:
New pangenome germline pipeline pangenome_germline for GPU-accelerated pangenome alignment and variant calling.
Add support for NVIDIA GB10 (DGX Spark).
Added
--enable-small-modelsupport in deepvariant for shortread (WGS), PacBio, and ONT modes.Updated version of minimap2 to v2.30 along with some new presets, options and performance improvements.
Updated version of giraffe to v1.70 along with performance improvements in single-ended mode.
Performance improvements for rna_fq2bam.
With Parabricks 4.7.0 we are releasing a GPU-accelerated pangenome_germline pipeline. Pangenome_germline is an end-to-end pipeline that integrates GPU-accelerated Giraffe with Pangenome-aware DeepVariant to streamline the workflow from FASTQ to VCF.
All tools:
Added support for NVIDIA GB10 (DGX Spark).
Tool Updates
fq2bam and fq2bam_meth:
-
Improved performance on Hopper and Blackwell GPUs, with more modest gains on older architectures.
-
Allowed reads up to 512 bp to be processed on GPU, increasing the previous threshold from 500 bp.
-
When the
--max-read-lengthfilter is set above its default of 480 bp and up to 512 bp, all reads are processed on GPU. If the limit is set above 512 bp, reads longer than 512 bp are processed on CPU through the recovery mechanism.
-
-
Added support for manually setting the insert size distribution
-Ioption from baseline BWA with--bwa-options.-
For example:
--bwa-options="-I 250.54,20.25,500,20". The user interface is unchanged.
-
-
Added the
--cigar-on-gpuoption to offload CIGAR generation from the CPU (default) to the GPU; enabling this option may improve performance for CPU-bound runs.
fq2bam, fq2bam_meth, giraffe, minimap2, rna_fq2bam, bamsort, and associated pipelines:
-
Improved performance when using
--gpuwrite. -
Added a CPU recovery mode when using
--gpusortto recover and continue processing even when the GPU may run out of memory. -
GPU CRAM writer has reduced CPU memory requirements. Memory usage reductions can go up to 70% based on available host memory.
-
Enabled read group id comparison in markdup single-ended mode. Can be disabled by passing
--ignore-rg-markdups-single-ended.
deepvariant, deepvariant_germline, pacbio_germline, and ont_germline:
-
Added support for the small model with
--enable-small-model. This option is supported for shortread (WGS), PacBio, and ONT modes. -
Improved host memory usage for long-read (PacBio and ONT) mode.
-
Updated version of minimap2 to v2.30.
-
Improved performance in PacBio mode and better GPU utilization.
-
Improved performance in splice presets with implementation of new alignment batching algorithm.
-
Added support for
--presetoptionslr:hqandsplice:sr. -
Added argument
--copy-commentfor appending FASTQ comments to BAM output with auxiliary tag. -
Added argument
--mdfor outputting the MD tag. -
Added argument
--both-strandsthat forces minimap2 to consider both strands when finding canonical splicing sites GT-AG.
-
Added
--exclude-intervals(-XL) to specify genomic intervals to exclude from processing.
-
Auto mode: When
--num-streams-per-gpuand/or--num-threadsare omitted, their automatic configuration is triggered based on GPU and system memory. These settings can be overridden by providing explicit values. -
Performance improvements from code optimizations.
-
Updates to match the baseline VG Giraffe version v1.70.
-
Added GPU-accelerated Finalize Mappings (Dozeu) for single-end alignment.
-
Introduced
automode for--nstreamsthat automatically configures the number of streams based on the GPU's device memory specifications. This provides sensible defaults for most systems but may still benefit from manual tuning for specific configurations. -
Removed the
--minimizers-gpu-sortoption, as it is no longer needed. -
Now used as the pangenome alignment stage in the new pangenome_germline pipeline.
-
Added the missing read group (RG) header line in the normal sample BAM/CRAM output defined by
--mutect-bam-output.
fq2bam, fq2bam_meth, giraffe, minimap2, markdup, rna_fq2bam, and associated pipelines:
-
Addressed an issue preventing
--markdups-assume-sortorder-querynamefrom taking effect when combined with--gpusort.
deepvariant, deepsomatic, and pangenome_aware_deepvariant:
-
Fixed a potential GPU out-of-memory condition when
--num-streams-per-gpuwas set to auto (default).
-
Fixed missing alignments for reads with a high degree of multimapping.
fq2bam and fq2bam_meth:
-
Fixed a memory leak that occurred during CPU recovery.
-
Fixed an issue where
--bwa-nstreams auto(default value) and--gpuwritecould potentially run out of device memory with GPUs having more than 16 GB but less than 22 GB of memory.
fq2bam, fq2bam_meth, giraffe, minimap2, rna_fq2bam, bamsort, and associated pipelines:
-
Fixed a potential out-of-bounds device write when using
--gpuwritefor accelerated BAM and CRAM writing. This risk was more likely to appear on ARM systems such as NVIDIA GB10.
-
Fixed
--preserve-file-symlinksto work properly with methylation reference genome and indexes.
-
Fixed a potential deadlock in paired-end mode that could occur when an input file had very few batches and all of them entered CPU recovery mode.
-
Fixed an issue where null padding could appear in output query names when using BAM input.
For further information see the Parabricks datasheet.