bionemo-llm
The Bionemo Large Language Model (LLM) submodule contains common code used in submodules that train LLMs on biological
datasets (currently bionemo-esm2 and bionemo-geneformer). This includes data masking and collate functions, the
bio-BERT common architecture code, loss functions, and other NeMo / Megatron-LM compatibility functions. Sub-packages
should only depend on bionemo-llm if they need access to NeMo and Megatron-LM.