nemo_automodel.components.datasets.llm.xlam

View as Markdown

Module Contents

Functions

NameDescription
_convert_tool_callsConvert xLAM answers field into OpenAI tool_calls messages.
_convert_toolsConvert xLAM tool definitions into OpenAI tool schema.
_format_example-
_json_load_if_str-
make_xlam_datasetLoad and preprocess the xLAM function-calling dataset to OpenAI messages

Data

_TYPE_MAP

logger

API

nemo_automodel.components.datasets.llm.xlam._convert_tool_calls(
raw_calls: typing.List[typing.Dict],
example_id: typing.Optional[int] = None
) -> typing.List[typing.Dict]

Convert xLAM answers field into OpenAI tool_calls messages.

nemo_automodel.components.datasets.llm.xlam._convert_tools(
raw_tools: typing.List[typing.Dict]
) -> typing.List[typing.Dict]

Convert xLAM tool definitions into OpenAI tool schema.

nemo_automodel.components.datasets.llm.xlam._format_example(
example,
tokenizer,
eos_token_id,
pad_token_id,
seq_length = None,
padding = None,
truncation = None
)
nemo_automodel.components.datasets.llm.xlam._json_load_if_str(
value
)
nemo_automodel.components.datasets.llm.xlam.make_xlam_dataset(
tokenizer,
seq_length = None,
limit_dataset_samples = None,
fp8 = False,
split = 'train',
dataset_name = 'Salesforce/xlam-function-c...,
padding = False,
truncation = False
)

Load and preprocess the xLAM function-calling dataset to OpenAI messages compatible with the bulbasaur chat template (tool-calling aware).

nemo_automodel.components.datasets.llm.xlam._TYPE_MAP = {'str': 'string', 'string': 'string', 'int': 'integer', 'integer': 'integer', 'f...
nemo_automodel.components.datasets.llm.xlam.logger = logging.getLogger(__name__)