NVIDIA Docs Hub NVIDIA Clara Clara Parabricks v3.8.0 umi_fgbio

umi_fgbio

Groups reads together that appear to have come from the same original molecule, based on the UMI of the read.

Quick Start

Copy
Copied!

            
            $  pbrun umi_fgbio \
    --in-fq firstInput.fastq.gz secondInput.fastq.gz \
    --ref theReferenceFile.fasta \
    --out-dir directoryWithOutput \
    --umi-in-header \
    --strategy paired

This UMI pipeline is based on Fulcrum Genomics toolkit, processes sequencing reads with molecular barcodes (also known as Unique Molecular Indices, UMIs), which provide impressive error correction and increased accuracy using a sequencing consensus read level.

Input/Output file options

--in-fq [IN_FQ [IN_FQ ...]]
--ref REF
--metadata METADATA
--out-dir OUT_DIR

Pipeline Options:

--umi-in-header
-L INTERVAL, --interval INTERVAL
--bwa-options BWA_OPTIONS
--no-warnings
--read-group-sm READ_GROUP_SM
--read-group-lb READ_GROUP_LB
--read-group-pl READ_GROUP_PL
--read-group-id-prefix READ_GROUP_ID_PREFIX
-ip INTERVAL_PADDING, --interval-padding INTERVAL_PADDING
--read-structures [READ_STRUCTURES [READ_STRUCTURES ...]]
--no-barcode
--out-metrics OUT_METRICS
--num-zip-threads NUM_ZIP_THREADS
--num-sort-threads NUM_SORT_THREADS
--max-records-in-ram MAX_RECORDS_IN_RAM
--strategy STRATEGY
--min-map-q MIN_MAP_Q
--num-worker-threads NUM_WORKER_THREADS
--error-rate-pre-umi ERROR_RATE_PRE_UMI
--error-rate-post-umi ERROR_RATE_POST_UMI
--min-input-base-quality MIN_INPUT_BASE_QUALITY
--min-consensus-base-quality MIN_CONSENSUS_BASE_QUALITY
--min-reads MIN_READS
--out-suffixF OUT_SUFFIXF
--out-suffixF2 OUT_SUFFIXF2
--out-suffixO OUT_SUFFIXO
--out-suffixO2 OUT_SUFFIXO2
--out-suffixS OUT_SUFFIXS
--rg-tag RG_TAG
--remove-qc-failure
--num-threads NUM_THREADS

Common options:

--logfile LOGFILE
--tmp-dir TMP_DIR
--with-petagene-dir WITH_PETAGENE_DIR
--keep-tmp
--license-file LICENSE_FILE
--no-seccomp-override
--version

GPU options:

--num-gpus NUM_GPUS
--gpu-devices GPU_DEVICES