> For clean Markdown of any page, append .md to the page URL.
> For a complete documentation index, see https://docs.nvidia.com/nemo/automodel/llms.txt.
> For AI client integration (Claude Code, Cursor, etc.), connect to the MCP server at https://docs.nvidia.com/nemo/automodel/_mcp/server.

# Hy3 (HunyuanLarge)

[Hy3-preview](https://huggingface.co/tencent/Hy3-preview) is a 295B Mixture-of-Experts language model from Tencent. It features 80 transformer layers (layer 0 dense, layers 1–79 MoE), 192 routed experts plus 1 shared expert with top-8 sigmoid routing, Grouped Query Attention (64 Q / 8 KV heads), per-head QK RMSNorm, RoPE, and an `e_score_correction_bias` gate buffer for expert-load correction. It supports a 256K context window.

|                  |                                           |
| ---------------- | ----------------------------------------- |
| **Task**         | Text Generation (MoE)                     |
| **Architecture** | `HYV3ForCausalLM`                         |
| **Parameters**   | 295B total                                |
| **HF Org**       | [tencent](https://huggingface.co/tencent) |

## Available Models

* **Hy3-preview**: 295B total, top-8 routed experts activated per token

## Architectures

* `HYV3ForCausalLM`

## Example HF Models

| Model       | HF ID                                                               |
| ----------- | ------------------------------------------------------------------- |
| Hy3-preview | [`tencent/Hy3-preview`](https://huggingface.co/tencent/Hy3-preview) |

## Example Recipes

| Recipe                                                                                                                              | Description                   |
| ----------------------------------------------------------------------------------------------------------------------------------- | ----------------------------- |
| [hy3\_preview\_deepep.yaml](https://github.com/NVIDIA-NeMo/Automodel/blob/main/examples/llm_finetune/hy_v3/hy3_preview_deepep.yaml) | SFT — Hy3-preview with DeepEP |

## Try with NeMo AutoModel

**1. Install** ([NeMo AutoModel](/get-started/installation)):

```bash
pip install nemo-automodel
```

**2. Clone the repo** to get the example recipes:

```bash
git clone https://github.com/NVIDIA-NeMo/Automodel.git
cd Automodel
```

**3. Run the recipe** from inside the repo:

```bash
automodel --nproc-per-node=8 examples/llm_finetune/hy_v3/hy3_preview_deepep.yaml
```

See the [NeMo AutoModel Installation Guide](/get-started/installation) and [LLM Fine-Tuning Guide](/recipes-e2e-examples/sft-peft).

## Fine-Tuning

See the [LLM Fine-Tuning Guide](/recipes-e2e-examples/sft-peft) and the [Large MoE Fine-Tuning Guide](/recipes-e2e-examples/large-moe-fine-tuning).

## Hugging Face Model Cards

* [tencent/Hy3-preview](https://huggingface.co/tencent/Hy3-preview)