Custom Models
Riva translation supports bilingual and multilingual models trained in NeMo. Each model must have 1 GB of shared memory available. If you are not using the Quick Start path, pass --shm-size to Docker so that the models have enough memory to run.
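As a rough illustration of the shared-memory requirement, the sketch below derives a --shm-size value from the number of models to be served. It assumes that the 1 GB requirement adds up per model inside a single container, which the text does not state outright. The helper name shm_size_flag and the <riva_image> placeholder are hypothetical, and the script only prints the docker command instead of running it.

```shell
# shm_size_flag NUM_MODELS: print a --shm-size flag that allows 1 GB per model.
# Assumption: shared memory needs add up linearly across models.
shm_size_flag() {
    printf '%s\n' "--shm-size=${1}g"
}

# Dry run: print the command for three translation models.
# <riva_image> is a placeholder, not a real image name.
echo docker run "$(shm_size_flag 3)" "<riva_image>"
```

Replace the placeholder with the image you actually use and drop the echo once the printed command looks right.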
NeMo models can be converted into Riva models using nemo2riva. For example:
nemo2riva <nemo_filename> \
    --out=<riva_filename>
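When several checkpoints need converting, the output name can be derived from the input name. The following is a minimal sketch that assumes each checkpoint ends in .nemo and that the .riva file should sit next to it. The helper riva_name_for and the /data paths are hypothetical, and only the --out option shown above is used. The loop prints the commands instead of executing them.

```shell
# riva_name_for NEMO_FILE: swap a trailing .nemo extension for .riva.
riva_name_for() {
    printf '%s\n' "${1%.nemo}.riva"
}

# Dry run over two hypothetical checkpoint paths.
for nemo_file in /data/en_de_24x6.nemo /data/de_en_24x6.nemo; do
    echo nemo2riva "$nemo_file" --out="$(riva_name_for "$nemo_file")"
done
```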
Translation models can be deployed in Riva using riva-build and riva-deploy.
The translation pipeline has a single optional parameter, --name, which sets the <model name>. For example:
riva-build translation \
--name <model name> \
<rmir_filename>:<encryption_key> \
<riva_filename>:<encryption_key>
<rmir_filename> is the Riva rmir file that is generated
<riva_filename> is the name of the riva file to use as input
<encryption_key> is the encryption key used during the export of the .riva file
<model name> is how to differentiate the model at inference time. The default is riva-nmt.
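The file arguments take the form <filename>:<encryption_key>, while the example in this section passes bare file names. The sketch below assembles both forms. It assumes the :<encryption_key> suffix can simply be left off when no key was used at export time, which matches that example but is not stated explicitly. The helper artifact_arg and the /data/model paths are hypothetical, and the command is printed, not run.

```shell
# artifact_arg FILE [KEY]: print FILE:KEY, or just FILE when KEY is empty.
artifact_arg() {
    if [ -n "${2:-}" ]; then
        printf '%s:%s\n' "$1" "$2"
    else
        printf '%s\n' "$1"
    fi
}

model_name="riva-nmt"   # the documented default name
key=""                  # set this if the .riva file was exported with a key

# Dry run: print the riva-build invocation.
echo riva-build translation \
    --name "$model_name" \
    "$(artifact_arg /data/model.rmir "$key")" \
    "$(artifact_arg /data/model.riva "$key")"
```

Keeping the key handling in one helper means the rmir and riva arguments always use the same key.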
Example
riva-build translation \
--name mnmt_en_deesfr_transformer12x2 \
/data/mnmt_en_deesfr_transformer12x2nmt.rmir \
/data/mnmt_en_deesfr_transformer12x2.riva
riva-deploy -f /data/mnmt_en_deesfr_transformer12x2nmt.rmir /data/models
NeMo Models Supported in Riva Translation
Model | Architecture | Source Language(s) | Target Language(s) | NGC link |
---|---|---|---|---|
mnmt_deesfr_en_transformer12x2 | Transformer 12x2 | German (de), Spanish (es), French (fr) | English (en) | |
mnmt_en_deesfr_transformer12x2 | Transformer 12x2 | English (en) | German (de), Spanish (es), French (fr) | |
mnmt_deesfr_en_transformer24x6 | Transformer 24x6 | German (de), Spanish (es), French (fr) | English (en) | |
mnmt_en_deesfr_transformer24x6 | Transformer 24x6 | English (en) | German (de), Spanish (es), French (fr) | |
en_de_24x6 | Transformer 24x6 | English (en) | German (de) | |
en_es_24x6 | Transformer 24x6 | English (en) | Spanish (es) | |
en_fr_24x6 | Transformer 24x6 | English (en) | French (fr) | |
en_ru_24x6 | Transformer 24x6 | English (en) | Russian (ru) | |
en_zh_24x6 | Transformer 24x6 | English (en) | Simplified Chinese (zh) | |
de_en_24x6 | Transformer 24x6 | German (de) | English (en) | |
es_en_24x6 | Transformer 24x6 | Spanish (es) | English (en) | |
fr_en_24x6 | Transformer 24x6 | French (fr) | English (en) | |
ru_en_24x6 | Transformer 24x6 | Russian (ru) | English (en) | |
zh_en_24x6 | Transformer 24x6 | Simplified Chinese (zh) | English (en) | |
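The bilingual models listed above follow a <source>_<target>_24x6 naming pattern. As a small sketch, the lookup below maps a language pair to its model name and fails for pairs that are not listed. The function name and the SUPPORTED_PAIRS variable are hypothetical helpers; the list of pairs is copied from the table.

```shell
# Language pairs that have a bilingual 24x6 model in the table above.
SUPPORTED_PAIRS="en_de en_es en_fr en_ru en_zh de_en es_en fr_en ru_en zh_en"

# bilingual_model_name SRC TGT: print the model name, or fail if not listed.
bilingual_model_name() {
    for pair in $SUPPORTED_PAIRS; do
        if [ "$pair" = "${1}_${2}" ]; then
            printf '%s\n' "${pair}_24x6"
            return 0
        fi
    done
    echo "no bilingual model listed for ${1} -> ${2}" >&2
    return 1
}

bilingual_model_name en de
bilingual_model_name de fr || echo "pair not in the table"
```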