nemo_automodel.datasets.llm.mock_packed
#
Module Contents#
Functions#
Build a trivial vocab; index 0= |
|
Sentence generator with Gaussian length control. |
|
Flush helper (build position_ids that reset after |
|
Dataset builder. |
API#
- nemo_automodel.datasets.llm.mock_packed.make_vocab(vocab_size: int = 100)[source]#
Build a trivial vocab; index 0=
, 1= , rest = word_i.
- nemo_automodel.datasets.llm.mock_packed.gen_sentence_ids(vocab, mean_len: float, std_len: float, max_len: int)[source]#
Sentence generator with Gaussian length control.