QUALITY CONTROL AND BAM PROCESSING

SPLITNCIGAR

Accelerated SplitNCigarReads functionality from GATK. This tool splits reads that contain Ns in their cigar string (e.g. spanning splicing events in RNAseq data).

QUICK START

Copy
Copied!

            
            $ pbrun splitncigar --ref Ref.fa --in-bam in.bam --out-bam out.bam

COMPATIBLE GATK COMMAND

The command below is the GATK counterpart of the Parabricks command above. The output from these commands will generate the exact same results as the output from the above command.

Copy
Copied!

            
            gatk SplitNCigarReads --reference Ref.fa --input in.bam --output tmp.bam

gatk SortSam --java-options -Xmx30g --MAX_RECORDS_IN_RAM=5000000 -I=tmp.bam \
-O=out.bam --SORT_ORDER=coordinate --TMP_DIR=/raid/myrun

OPTIONS

--ref
--in-bam
--knownSites
--out-bam
--out-recal-file
--num-cpu-threads
--no-ignore-mark

SAMTOOLS MPILEUP

Accelerated mpileup functionality from samtools

QUICK START

Copy
Copied!

            
            $ pbrun samtoolsmpileup --in-bam wgs.bam --out-file pileup.txt

COMPATIBLE SAMTOOLS COMMAND

The command below is the samtools counterpart of the Parabricks command above. The output from these commands will generate the exact same results as the output from the above command.

Copy
Copied!

            
            samtools mpileup /w/wgs.bam -o pileup.txt -d 0

OPTIONS

--in-bam
--ref
--out-file
--num-threads
--min-mapq
--disable-baq
--anomalous-reads
--interval-file
-L, --interval

--tmp-dir TMP_DIR
--with-petagene-dir WITH_PETAGENE_DIR
--keep-tmp
--license-file LICENSE_FILE
--version

BCFTOOLS MPILEUP

Accelerated mpileup functionality from bcftools

QUICK START

Copy
Copied!

            
            $ pbrun bcftoolsmpileup --in-bam wgs.bam --out-file pileup.vcf

COMPATIBLE BCFTOOLS COMMAND

The command below is the bcftools counterpart of the Parabricks command above. The output from these commands will generate the exact same results as the output from the above command.

Copy
Copied!

            
            bcftools mpileup  wgs.bam -o pileup.txt -d 2147483647

OPTIONS

--in-bam
--ref
--out-file
--num-threads
--min-mapq
--disable-baq
--anomalous-reads
--bcf
--no-reference
--no-version
--interval-file
-L, --interval

--tmp-dir TMP_DIR
--with-petagene-dir WITH_PETAGENE_DIR
--keep-tmp
--license-file LICENSE_FILE
--version

BAMMETRICS

Accelerated CollectWGSMetrics functionality from GATK4

bammetrics collects whole genome sequencing metrics, similar to CollectWGSMetrics from GATK4, but in a highly accelerated manner. The output metrics match exactly with that of GATK4.

QUICK START

Copy
Copied!

            
            $ pbrun bammetrics --ref Ref.fa --bam wgs.bam --out-metrics-file metrics.txt

COMPATIBLE GATK4 COMMAND

The command below is the GATK4 counterpart of the Parabricks command above. The output from these commands will generate the exact same results as the output from the above command.

Copy
Copied!

            
            gatk CollectWGSMetrics -R Ref.fa -I wgs.bam -O metrics.txt

OPTIONS

--ref
--bam
--out-metrics-file
--minimum-base-quality
--minimum-mapping-quality
--count-unpaired
--coverage-cap
--num-threads

--tmp-dir TMP_DIR
--with-petagene-dir WITH_PETAGENE_DIR
--keep-tmp
--license-file LICENSE_FILE
--version

COLLECT MULTIPLE METRICS

Accelerated CollectMultipleMetrics from GATK4

collectmultiplemetrics collects whole genome sequencing metrics, similar to CollectMultipleMetrics from GATK4, but in a highly accelerated manner. The output metrics match exactly with that of GATK4.

QUICK START

CLI

Copy
Copied!

            
            $ pbrun collectmultiplemetrics --ref Ref.fa \
--bam wgs.bam \
--out-qc-metrics-dir output-qc\
--gen-all-metrics

COMPATIBLE GATK4 COMMAND

The command below is the GATK4 counterpart of the Parabricks command above. The output from these commands will generate the exact same results as the output from the above command.

Copy
Copied!

            
            gatk CollectMultipleMetrics --REFERENCE_SEQUENCE Ref.fa -I wgs.bam -O metrics \
--PROGRAM CollectAlignmentSummaryMetrics \
--PROGRAM CollectInsertSizeMetrics \
--PROGRAM QualityScoreDistribution \
--PROGRAM MeanQualityByCycle \
--PROGRAM CollectBaseDistributionByCycle \
--PROGRAM CollectGcBiasMetrics \
--PROGRAM CollectSequencingArtifactMetrics \
--PROGRAM CollectQualityYieldMetrics

OPTIONS

--ref
--bam
--out-qc-metrics-dir
--gen-all-metrics
--gen-alignment
--gen-quality-score
--gen-insert-size
--gen-mean-quality-by-cycle
--gen-base-distribution-by-cycle
--gen-gc-bias
--gen-seq-artifact
--gen-quality-yield
--processor-threads
--bam-decompressor-threads

--tmp-dir TMP_DIR
--with-petagene-dir WITH_PETAGENE_DIR
--keep-tmp
--license-file LICENSE_FILE
--version