Parabricks FQ2BAM NIM

Important

NVIDIA NIM is currently in limited availability; sign up here to get notified when the latest NIMs are available to download.

Parabricks FQ2BAM is not an AI model but rather GPU-optimized code for genomic analysis, so there is no need to explicitly pull a model as is common with other NIMs.

After sequencing, the next step in genomic analysis is alignment. This NIM makes it easy to set up a microservice that schedules multiple jobs to run one after another. This is especially handy for large quantities of data: the NIM ensures that as soon as one job finishes, the next one starts, maximizing throughput. Because the NIM listens on an open port, it can also be configured so that multiple users submit jobs to the same queue, creating a centralized compute environment. Finally, NIMs can be combined; for example, the Parabricks DeepVariant NIM can be run on the output of this FQ2BAM NIM, enabling end-to-end workflows.

For more information on the Parabricks FQ2BAM workflow, see the additional tutorials in the Parabricks documentation.

Specific Requirements

Operating System

  • Linux (x86_64/amd64): Linux distributions supported by the NVIDIA Container Toolkit.

Hardware

  • Hopper, Ampere, or Ada GPUs with a minimum of 24 GB of GPU memory (VRAM)

Software

  • Minimum Driver version: 535.161.07

If you want to use the Python examples, you will need to install the third-party requests module:
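
pip install requests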

Quickstart Guide

Note

This page assumes that the prerequisite software (Docker, NGC CLI, NGC registry access) is installed and set up.

  1. Pull the NIM container.

    docker pull nvcr.io/nvidia/nim/fq2bam:24.03.01
    
  2. Run NIM.

    docker run --rm -d \
       --volume /etc/passwd:/etc/passwd:ro \
       --volume /etc/group:/etc/group:ro \
       --volume /etc/shadow:/etc/shadow:ro \
       --volume $PWD:/workspace \
       --shm-size=2G \
       --ulimit memlock=-1 \
       --ulimit stack=67108864 \
       --runtime=nvidia \
       --gpus=1 \
       -p 8003:8003 \
       --name fq2bam \
       nvcr.io/nvidia/nim/fq2bam:24.03.01
    
  3. Download the data.

    mkdir test-data
    cd test-data
    wget https://s3.amazonaws.com/parabricks.sample/nim/References_chr22.fasta.tar
    wget https://s3.amazonaws.com/parabricks.sample/nim/test-datachr22.100x.fq_1.fastq.gz
    wget https://s3.amazonaws.com/parabricks.sample/nim/test-datachr22.100x.fq_2.fastq.gz
    cd ..
    mkdir results

  4. Submit an alignment request.

    python3 -c '
    import requests
    import json

    invoke_url = "http://localhost:8003/genomics/parabricks/fq2bam/run"
    headers = {
        "accept": "application/json",
        "content-type": "application/json",
    }

    # The /workspace directory referred to here is the /workspace directory found in
    # the server startup command.
    in_fq = [
        {
            "fq1" : "/workspace/test-data/test-datachr22.100x.fq_1.fastq.gz",
            "fq2" : "/workspace/test-data/test-datachr22.100x.fq_2.fastq.gz",
            "rg" : "@RG\\tID:foo\\tLB:lib1\\tPL:bar\\tSM:HG002\\tPU:foo"
        }
    ]

    payload = {
        "in_ref_tarball" : "/workspace/test-data/References_chr22.fasta.tar",
        "in_fq" : json.dumps(in_fq),
        "out_bam" : ["/workspace/results/test-datachr22.bam"],
        "out_bam_parts_manifest" : "/workspace/results/test-datachr22.bam.parts_manifest.txt",
        "out_bai" : "/workspace/results/test-datachr22.bam.bai",
        "out_chrs" : "/workspace/results/test-datachr22_chrs.txt",
        "out_stderr" : "/workspace/results/test-datachr22.stderr",
        "out_stdout" : "/workspace/results/test-datachr22.stdout",
        "additional_args" : ""
    }

    response = requests.post(invoke_url, headers=headers, json=payload, stream=True)
    print(response.status_code)
    print(response.text)
    '
    

Detailed Instructions

In this workflow, we will submit a pair of .fastq.gz files (Chromosome 22 from HG002 sampled at 100x) for alignment. The microservice will use Parabricks fq2bam to perform the alignment and return a .bam file of our aligned sample.

The Genomics NIMs take either local file paths or pre-signed URLs. A local file path is one that is mounted on the computer on which the server is running. This could be locally attached storage, an NFS share, etc., and looks like /path/to/file/name.ext. A pre-signed URL looks like https://s3.us-east-1.amazonaws.com/your.bucket/path/to/file.ext?etcetcetc.
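
As a rough client-side illustration (not part of the NIM API), a helper like the following can tell the two forms apart; the function name and heuristic are hypothetical:

from urllib.parse import urlparse

# Hypothetical helper: treat http(s) locations as pre-signed URLs and
# everything else as a server-local file path.
def is_presigned_url(path: str) -> bool:
    return urlparse(path).scheme in ("http", "https")

print(is_presigned_url("/workspace/test-data/References_chr22.fasta.tar"))  # False
print(is_presigned_url("https://s3.us-east-1.amazonaws.com/your.bucket/file.ext?X-Amz-Signature=abc"))  # True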

If a local file path is given in the job request, then it must be a path as seen from inside the server. To specify a local file path correctly, you'll need to know which local directories were made available to the server when it was started and where they were mounted. If the microservice is started as follows:

docker run --rm -it \
   ...
   --volume $PWD:/workspace \
   ...

then the --volume $PWD:/workspace line makes the current directory ($PWD) available as the /workspace directory inside the server. If the BAM file should be saved in $PWD/my_input_files/batch1/filename.bam, then the cURL command (see below for the complete command) would need to be:

curl -X 'POST' \
  'http://localhost:8003/genomics/parabricks/fq2bam/run' \
  -H 'accept: application/json' \
  -H 'Content-Type: application/json' \
  -d '{
     "out_bam": "/workspace/my_input_files/batch1/filename.bam",
     ...
     }'
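
If you would rather compute the in-container path programmatically, a sketch along these lines works under the assumption of a single --volume <host_dir>:<container_dir> mapping (the helper name is our own, not part of the NIM):

import os.path

def host_to_container(host_path: str, host_dir: str, container_dir: str = "/workspace") -> str:
    # Translate a host path into the path the server sees, given the
    # --volume <host_dir>:<container_dir> mapping used at startup.
    rel = os.path.relpath(os.path.abspath(host_path), os.path.abspath(host_dir))
    if rel == ".." or rel.startswith("../"):
        raise ValueError(f"{host_path} is not under the mounted directory {host_dir}")
    return os.path.join(container_dir, rel)

print(host_to_container("./my_input_files/batch1/filename.bam", "."))
# /workspace/my_input_files/batch1/filename.bam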

For remote files, it doesn't matter which directories were mounted on the server. Specify a pre-signed URL like this:

curl -X 'POST' \
  'http://localhost:8003/genomics/parabricks/fq2bam/run' \
  -H 'accept: application/json' \
  -H 'Content-Type: application/json' \
  -d '{
     "out_bam": "https://s3.us-east-1.amazonaws.com/your.bucket/path/to/file.ext?etcetcetc",
     ...
     }'

Requests can mix local and remote file paths at will. For example, you might read all your input files from S3 and write all your output files to a local drive.
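
For instance, a payload fragment along these lines (bucket and paths hypothetical) reads the reference from S3 and writes the BAM locally:

# Hypothetical payload fragment mixing a pre-signed S3 input with local outputs.
payload = {
    "in_ref_tarball": "https://s3.us-east-1.amazonaws.com/your.bucket/References_chr22.fasta.tar?<signature>",
    "out_bam": ["/workspace/results/sample.bam"],
    # ... remaining fields as in the full example below
}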

Pull Container Image

  1. Container image tags can be retrieved using the following command.

    ngc registry image info nvcr.io/nvidia/nim/fq2bam
    
    Image Repository Information
       Name: fq2bam
       Display Name: fq2bam
       Short Description: Please add description
       Built By:
       Publisher:
       Multinode Support: False
       Multi-Arch Support: False
       Logo:
       Labels:
       Public: No
       Access Type:
       Associated Products: []
       Last Updated: Mar 14, 2024
       Latest Image Size: 2.41 GB
       Signed Tag?: False
       Latest Tag: 24.03.01
       Tags:
           24.03.01
  2. Pull the container image with either Docker or the NGC CLI.

    docker pull nvcr.io/nvidia/nim/fq2bam:24.03.01
    
    ngc registry image pull nvcr.io/nvidia/nim/fq2bam:24.03.01
    

Launch the Microservice

Launch the container.

docker run --rm -it \
   --volume /etc/passwd:/etc/passwd:ro \
   --volume /etc/group:/etc/group:ro \
   --volume /etc/shadow:/etc/shadow:ro \
   --volume $PWD:/workspace \
   --shm-size=2G \
   --ulimit memlock=-1 \
   --ulimit stack=67108864 \
   --gpus=1 \
   -p 8003:8003 \
   --name fq2bam \
   --runtime=nvidia \
   nvcr.io/nvidia/nim/fq2bam:24.03.01

The current directory ($PWD) will be accessible inside the container as the /workspace directory. Subsequent example commands put input files in this directory, and output files will be written here. If you wish to place your files elsewhere, change $PWD to the desired path.

Let’s briefly go through the flags in the command and some possible configurations:

  • --name: Sets the name of the container.

  • --runtime=nvidia: Use nvidia as the Docker runtime.

  • --gpus=1: Tells Docker how many GPUs to use. In this case, we use only one GPU.

  • --shm-size=2G: Increases the maximum shared memory size from the default of 64 MB to 2 GB.

  • --volume <host_path>:<container_path>: Mounts directories from the host machine into the container.

  • --ulimit memlock=-1: A value of -1 means there is no limit on the amount of memory that cannot be paged out; any other value is a limit in kilobytes. The microservice uses a lot of memory, so we recommend leaving this unlimited.

  • --ulimit stack=67108864: Sets the maximum stack size in bytes. We do not recommend lowering this value.

  • -p 8003:8003: Forwards ports from the host machine into the container. This NIM requires port 8003 to be open.

Health and Liveness Checks

The container exposes health and liveness endpoints at /v2/health/ready and /v2/health/live for integration into existing systems such as Kubernetes. These endpoints return an HTTP 200 OK status code only if the service is ready or live, respectively.

Run these in a new terminal.

curl localhost:8003/v2/health/ready
true

curl localhost:8003/v2/health/live
true
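
If you are scripting against the service, a minimal readiness poll like the one below can stand in for a manual check (the function name and retry policy are our own, not part of the NIM):

import time
import requests

def wait_until_ready(base_url: str = "http://localhost:8003", timeout: float = 120.0) -> None:
    # Poll the readiness endpoint until the service reports ready or we time out.
    deadline = time.monotonic() + timeout
    while time.monotonic() < deadline:
        try:
            if requests.get(f"{base_url}/v2/health/ready", timeout=5).status_code == 200:
                return
        except requests.ConnectionError:
            pass  # container may still be starting up
        time.sleep(2)
    raise TimeoutError("fq2bam service did not become ready in time")

wait_until_ready()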

Download Sample Data

  1. Download a small reference file. The FASTA file and all its associated indices must be packaged into a single .tar file. This is the same .tar file downloaded for the DeepVariant example.

    mkdir test-data
    cd test-data
    wget https://s3.amazonaws.com/parabricks.sample/nim/References_chr22.fasta.tar
    
  2. Download the sample FASTQ files.

    wget https://s3.amazonaws.com/parabricks.sample/nim/test-datachr22.100x.fq_1.fastq.gz
    wget https://s3.amazonaws.com/parabricks.sample/nim/test-datachr22.100x.fq_2.fastq.gz
    cd ..
    mkdir results
    
  3. The current working directory should have at least these files:

    tree
    .
    ├── results
    └── test-data
        ├── References_chr22.fasta.tar
        ├── test-datachr22.100x.fq_1.fastq.gz
        └── test-datachr22.100x.fq_2.fastq.gz
    

Run the Alignment

  1. You can submit a request in a Python script as follows:

    #!/usr/bin/env python3

    # Script: fq2bam_request.py
    # Usage: python3 fq2bam_request.py

    import requests
    import json

    invoke_url = "http://localhost:8003/genomics/parabricks/fq2bam/run"

    headers = {
       "accept": "application/json",
       "content-type": "application/json",
    }

    # The /workspace directory referred to here is the /workspace directory found in
    # the server startup command.
    in_fq = [
       {
          "fq1" : "/workspace/test-data/test-datachr22.100x.fq_1.fastq.gz",
          "fq2" : "/workspace/test-data/test-datachr22.100x.fq_2.fastq.gz",
          "rg" : "@RG\\tID:foo\\tLB:lib1\\tPL:bar\\tSM:HG002\\tPU:foo"
       }
    ]

    payload = {
       "in_ref_tarball" : "/workspace/test-data/References_chr22.fasta.tar",
       "in_fq" : json.dumps(in_fq),
       "out_bam" : ["/workspace/results/test-datachr22.bam"],
       "out_bam_parts_manifest" : "/workspace/results/test-datachr22.bam.parts_manifest.txt",
       "out_bai" : "/workspace/results/test-datachr22.bam.bai",
       "out_chrs" : "/workspace/results/test-datachr22_chrs.txt",
       "out_stderr" : "/workspace/results/test-datachr22.stderr",
       "out_stdout" : "/workspace/results/test-datachr22.stdout",
       "additional_args" : ""
    }

    response = requests.post(invoke_url, headers=headers, json=payload, stream=True)
    print(response.status_code)
    print(response.text)
    

    Below is a description of what each field in the payload does and how it can be customized for your workflow:

    • fq1: First FASTQ file of the paired-end sample.

    • fq2: Second FASTQ file of the paired-end sample.

    • in_ref_tarball: Reference tarball in the same format as the sample data References_chr22.fasta.tar. Untar it to see the associated files.

    • out_bam: Path for the aligned output BAM file.

    • out_bam_parts_manifest: Parts manifest for the output BAM file.

    • out_bai: Path for the output .bam.bai file of BAM indices.

    • out_chrs: Path for a file with a list of chromosomes.

    • out_stderr: Path for stderr output during the job.

    • out_stdout: Path for stdout output during the job.

    • additional_args: Other flags found in the documentation can be passed through here as a string. For example, "--no-markdups --low-memory" passes these flags directly to the Parabricks run command inside the NIM. This does NOT support options requiring an input, such as --knownSites.

    To run with multiple pairs of FASTQ files, extend the in_fq list with one entry per pair, for as many pairs as necessary:

    in_fq = [
       {
          "fq1" : "/workspace/test-data/test-datachr22.100x.fq_1.fastq.gz",
          "fq2" : "/workspace/test-data/test-datachr22.100x.fq_2.fastq.gz",
          "rg" : "@RG\\tID:foo\\tLB:lib1\\tPL:bar\\tSM:HG002\\tPU:foo"
       },
       {
          "fq1" : "/workspace/test-data/test-datachr22.100x.fq_3.fastq.gz",
          "fq2" : "/workspace/test-data/test-datachr22.100x.fq_4.fastq.gz",
          "rg" : "@RG\\tID:foo\\tLB:lib1\\tPL:bar\\tSM:HG002\\tPU:foo"
       },
       ...
    ]
    
  2. Please note that it takes around 30 seconds for the request to process. Once it completes, you should see the following files in the results directory (a quick verification sketch follows the listing):

    ls results/
    test-datachr22.bam
    test-datachr22.bam.bai
    test-datachr22.bam.parts_manifest.txt
    test-datachr22_chrs.txt
    test-datachr22.stderr
    test-datachr22.stdout
    
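As a quick sanity check after the job, a short script like this (a sketch, with the expected file names taken from the listing above) confirms the outputs exist and are non-empty:

from pathlib import Path

# Confirm the expected outputs exist and are non-empty.
expected = [
    "results/test-datachr22.bam",
    "results/test-datachr22.bam.bai",
    "results/test-datachr22.bam.parts_manifest.txt",
    "results/test-datachr22_chrs.txt",
]
for p in map(Path, expected):
    assert p.is_file() and p.stat().st_size > 0, f"missing or empty: {p}"
print("fq2bam outputs look good")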

Stopping the Container

When you’re done testing the endpoint, you can bring down the container by running docker stop fq2bam in a new terminal.