NVIDIA Clara Parabricks v3.7.0

deepvariant_germline

Given one or more pairs of FASTQ files, the germline variant pipeline generates variant calls, a BAM file, and a recal file.

The deepvariant_germline pipeline performs alignment, sorting, and duplicate marking, then runs the DeepVariant variant caller.

Currently, DeepVariant is supported on T4, V100, and A100 GPUs only.

The inputs are BWA-indexed reference files and paired-end FASTQ files. The outputs of this pipeline are the following:

Aligned, coordinate-sorted, duplicate-marked BAM
Variants in VCF (.vcf), gVCF (.g.vcf), or compressed gVCF (.g.vcf.gz) format
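
Because the pipeline expects a BWA-indexed reference, it can help to check for the index files before starting a long run. The following is a minimal sketch, assuming the index consists of the five files that `bwa index` conventionally writes next to the FASTA (.amb, .ann, .bwt, .pac, .sa). The helper name missing_bwa_index and the default reference path are illustrative, not part of Parabricks.

```shell
# List BWA index files that are missing next to a reference FASTA.
# Assumption: the index is the usual set written by `bwa index`.
missing_bwa_index() {
    ref=$1
    for ext in amb ann bwt pac sa; do
        [ -f "$ref.$ext" ] || echo "$ref.$ext"
    done
}

# Placeholder path; override by setting REF in the environment.
REF=${REF:-Ref/Homo_sapiens_assembly38.fasta}

missing=$(missing_bwa_index "$REF")
if [ -n "$missing" ]; then
    echo "Missing BWA index files; consider running: bwa index $REF" >&2
    echo "$missing" >&2
fi
```

The function only prints the missing paths, so the caller decides whether to abort or to build the index.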

Quick Start

The following command runs the DeepVariant pipeline.

$ pbrun deepvariant_germline \
    --ref Ref/Homo_sapiens_assembly38.fasta \
    --in-fq input1.fq input2.fq \
    --out-variants variants.vcf
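
The quick-start command can be combined with other options from the reference list. The following is a minimal sketch, assuming the same placeholder file names, that adds --out-bam and --num-gpus. Both flags appear in the option list, but their exact behavior is inferred here from their names. The output.bam name, the GPU count, and the RUN variable are illustrative. The script prints the assembled command and executes it only when RUN=1 and pbrun is on the PATH.

```shell
# Placeholder inputs and outputs; adjust to your data.
REF=Ref/Homo_sapiens_assembly38.fasta
FQ1=input1.fq
FQ2=input2.fq
OUT_BAM=output.bam
OUT_VCF=variants.vcf
NUM_GPUS=2

# Build the argument list as positional parameters so each argument stays intact.
set -- pbrun deepvariant_germline \
    --ref "$REF" \
    --in-fq "$FQ1" "$FQ2" \
    --out-bam "$OUT_BAM" \
    --out-variants "$OUT_VCF" \
    --num-gpus "$NUM_GPUS"

# Show the command for review (a dry run by default).
CMD_LINE="$*"
printf '%s\n' "$CMD_LINE"

# Execute only on request, and only if pbrun is actually installed.
if [ "${RUN:-0}" = 1 ] && command -v pbrun >/dev/null 2>&1; then
    "$@"
fi
```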

deepvariant_germline Reference

Run the germline pipeline from FASTQ to VCF using deep neural network analysis.

Input/Output file options

--ref REF
--in-fq [IN_FQ [IN_FQ ...]]
--in-se-fq [IN_SE_FQ [IN_SE_FQ ...]]
--knownSites KNOWNSITES
--interval-file INTERVAL_FILE
--pb-model-file PB_MODEL_FILE
--out-recal-file OUT_RECAL_FILE
--out-bam OUT_BAM
--out-variants OUT_VARIANTS
--out-duplicate-metrics OUT_DUPLICATE_METRICS

Options specific to this tool

-L INTERVAL, --interval INTERVAL
--bwa-options BWA_OPTIONS
--no-warnings
--no-markdups
--fix-mate
--markdups-assume-sortorder-queryname
--markdups-picard-version-2182
--optical-duplicate-pixel-distance OPTICAL_DUPLICATE_PIXEL_DISTANCE
--read-group-sm READ_GROUP_SM
--read-group-lb READ_GROUP_LB
--read-group-pl READ_GROUP_PL
--read-group-id-prefix READ_GROUP_ID_PREFIX
--disable-use-window-selector-model
--gvcf
--norealign-reads
--sort-by-haplotypes
--keep-duplicates
--vsc-min-count-snps VSC_MIN_COUNT_SNPS
--vsc-min-count-indels VSC_MIN_COUNT_INDELS
--vsc-min-fraction-snps VSC_MIN_FRACTION_SNPS
--vsc-min-fraction-indels VSC_MIN_FRACTION_INDELS
--min-mapping-quality MIN_MAPPING_QUALITY
--min-base-quality MIN_BASE_QUALITY
--mode MODE
--alt-aligned-pileup ALT_ALIGNED_PILEUP
--variant-caller VARIANT_CALLER
--add-hp-channel
--parse-sam-aux-fields
--use-hp-information
--use-wes-model

Common options:

--logfile LOGFILE
--tmp-dir TMP_DIR
--with-petagene-dir WITH_PETAGENE_DIR
--keep-tmp
--license-file LICENSE_FILE
--no-seccomp-override
--version

GPU options:

--num-gpus NUM_GPUS
--gpu-devices GPU_DEVICES

Note

The --in-fq option takes the names of two FASTQ files, optionally followed by a quoted read group. The FASTQ filenames must not start with a hyphen.
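
The note above can be sketched as a small pre-flight check. This example assumes the quoted read group follows the SAM @RG header style. The sample file names, the read-group values, and the helper name check_fq_name are all hypothetical. The command is printed rather than executed.

```shell
# Reject FASTQ names that begin with a hyphen, as the note requires.
check_fq_name() {
    case "$1" in
        -*) echo "invalid FASTQ name (starts with '-'): $1" >&2; return 1 ;;
        *)  return 0 ;;
    esac
}

# Hypothetical sample files.
FQ1=sampleX_1.fastq.gz
FQ2=sampleX_2.fastq.gz
# Hypothetical read group in SAM @RG style (assumed format).
# Single quotes keep each \t as a literal backslash followed by t.
RG='@RG\tID:foo\tLB:lib1\tPL:bar\tSM:sample\tPU:foo'

if check_fq_name "$FQ1" && check_fq_name "$FQ2"; then
    # The read group is one quoted argument placed after the two FASTQ files.
    set -- pbrun deepvariant_germline \
        --ref Ref/Homo_sapiens_assembly38.fasta \
        --in-fq "$FQ1" "$FQ2" "$RG" \
        --out-variants variants.vcf
    CMD_LINE="$*"
    # printf '%s' prints the backslashes unchanged.
    printf '%s\n' "$CMD_LINE"
fi
```

When the command is actually run, passing the list as "$@" keeps the read group as a single argument, which is the effect of quoting it on the command line.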