Learn how to package dependencies for NeMo Curator using a container image.
The NeMo Curator container provides a primary curator conda environment with pre-installed dependencies. If your pipeline needs additional system or Python packages, create a custom image. Refer to the container environments reference for defaults and build arguments.
Before you begin, make sure that you have Docker (or another container build tool) installed and access to the NeMo Curator base image.
Create an environments directory anywhere on your system to organize your custom pipeline stage environments.
Inside the environments directory, create a new folder for your environment, for example my-env/.
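As a minimal sketch of the two directory steps, the command below creates both folders at once. The names environments and my-env follow the examples in the text; the location (the current working directory) is an assumption, since any path on your system works.

```shell
# Create the environments directory and the my-env folder inside it.
# -p creates missing parent directories and does not fail if they already exist.
mkdir -p environments/my-env
```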
Create a Dockerfile that installs your environment’s dependencies on top of the base image.
Save the file.
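The following sketch writes an illustrative Dockerfile into the environment folder. Several details are assumptions rather than documented defaults: the BASE_IMAGE value is a placeholder that you must replace with the actual NeMo Curator base image, the apt-get step assumes a Debian or Ubuntu based image, and ffmpeg and requests are hypothetical example packages. The only name taken from the text above is the curator conda environment.

```shell
# Make sure the environment folder exists (path is an assumption).
mkdir -p environments/my-env

# Write the Dockerfile; the quoted 'EOF' keeps ${BASE_IMAGE} and backslashes literal.
cat > environments/my-env/Dockerfile <<'EOF'
# Placeholder only: replace with the real NeMo Curator base image reference.
ARG BASE_IMAGE=your-registry/your-nemo-curator-image:tag
FROM ${BASE_IMAGE}

# Example system package (assumes apt-get is available in the base image).
RUN apt-get update \
    && apt-get install -y --no-install-recommends ffmpeg \
    && rm -rf /var/lib/apt/lists/*

# Example Python package installed into the curator conda environment
# (assumes conda is on PATH in the base image).
RUN conda run -n curator pip install --no-cache-dir requests
EOF
```

Declaring the base image as a build argument keeps the Dockerfile unchanged when the base image tag changes; you can override it at build time with --build-arg BASE_IMAGE=... instead of editing the file.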
Build and tag your image using Docker or your preferred container build tool.
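With Docker, the build might look like the sketch below. The image name my-env:latest and the build context path environments/my-env are hypothetical; adjust both to match your setup. The guard around the command is only there so the snippet exits cleanly when Docker or the Dockerfile is missing.

```shell
IMAGE_TAG="my-env:latest"            # hypothetical image name and tag
BUILD_CONTEXT="environments/my-env"  # folder that contains your Dockerfile

if command -v docker >/dev/null 2>&1 && [ -f "$BUILD_CONTEXT/Dockerfile" ]; then
  # -t names and tags the image; the last argument is the build context.
  docker build -t "$IMAGE_TAG" "$BUILD_CONTEXT"
else
  echo "Skipping build: need docker on PATH and $BUILD_CONTEXT/Dockerfile" >&2
fi
```

After a successful build, docker image ls should list the new tag.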
Now that you have created a custom environment, you can write custom code that runs in it.