bam2fq

Run bam2fq to convert BAM/CRAM to FASTQ. For paired reads, bam2fq will append "/1" to the 1st read name, and "/2" to the 2nd read name.

Copy
Copied!
            

$ pbrun bam2fq \ --ref Ref/Homo_sapiens_assembly38.fasta \ --in-bam sample.bam \ --out-fq1 sample_1.fastq.gz \ --out-fq2 sample_2.fastq.gz

The command below is the bwa-0.7.15 and GATK4 counterpart of the Parabricks command above. The output from these commands will be identical to the output from the above command. See the Output Comparison page for comparing the results.

Copy
Copied!
            

$ gatk SamToFastq -I sample.bam \ -F sample_1.fastq.gz \ -F2 sample_2.fastq.gz

Run bam2fq to convert BAM/CRAM to FASTQ.

Input/Output file options

--ref REF

Path to the reference file (default: None)

Option is required.

--in-bam IN_BAM

Path to the input bam/cram file to convert to fastq.gz. (default: None)

Option is required.

--out-prefix OUT_PREFIX

Prefix filename for output fastq files (default: None)

Option is required.

Options specific to this tool

--out-suffixF OUT_SUFFIXF

Output suffix used for paired reads that are first in pair. Suffix must end with ".gz" (default: _1.fastq.gz)

--out-suffixF2 OUT_SUFFIXF2

Output suffix used for paired reads that are second in pair. Suffix must end with ".gz" (default: _2.fastq.gz)

--out-suffixO OUT_SUFFIXO

Output suffix used for orphan/unmatched reads that are first in pair. Suffix must end with ".gz". If no suffix is provided, these reads will be ignored (default: None)

--out-suffixO2 OUT_SUFFIXO2

Output suffix used for orphan/unmatched reads that are second in pair. Suffix must end with ".gz". If no suffix is provided, these reads will be ignored (default: None)

--out-suffixS OUT_SUFFIXS

Output suffix used for single-end/unpaired reads. Suffix must end with ".gz". If no suffix is provided, these reads will be ignored (default: None)

--rg-tag RG_TAG

Split reads into different fastq files based on the read group tag. Must be either PU or ID (default: None)

--remove-qc-failure

Remove reads from the output that have abstract QC failure (default: None)

--num-threads NUM_THREADS

Number of threads to run (default: 8)

Common options:

--logfile LOGFILE

Path to the log file. If not specified, messages will only be written to the standard error output. (default: None)

--tmp-dir TMP_DIR

Full path to the directory where temporary files will be stored.

--with-petagene-dir WITH_PETAGENE_DIR

Full path to the PetaGene installation directory. By default, this should have been installed at /opt/petagene. Use of this option also requires that the PetaLink library has been preloaded by setting the LD_PRELOAD environment variable. Optionally set the PETASUITE_REFPATH and PGCLOUD_CREDPATH environment variables that are used for data and credentials (default: None)

--keep-tmp

Do not delete the directory storing temporary files after completion.

--license-file LICENSE_FILE

Path to license file license.bin if not in the installation directory.

--no-seccomp-override

Do not override seccomp options for docker (default: None).

--version

View compatible software versions.

© Copyright 2022, Nvidia. Last updated on Jun 28, 2023.