Part of the Hy-MT2 MLX collection.

Hy-MT2-1.8B-8bit (MLX)

Apple MLX weights for Tencent-Hunyuan/Hy-MT2-1.8B, Tencent Hunyuan's multilingual translation model.

TL;DR

Variant 1.8B 8-bit
Best for smaller 1.8B checkpoint
Runtime mlx-lm
Official code Tencent-Hunyuan/Hy-MT2
MLX code ailuntx/Hy-MT2-MLX
Hardware Apple Silicon recommended; HF Spaces CPU fallback is only a load smoke test

Quick Start

pip install mlx-lm
PROMPT=$'Translate the following text into English. Note that you should only output the translated result without any additional explanation:\n\n今天天气真好。'
mlx_lm.generate --model mlx-community/Hy-MT2-1.8B-8bit \
  --prompt "$PROMPT" \
  --max-tokens 128 \
  --temp 0.7 \
  --top-p 0.6 \
  --top-k 20

For conversion and staging tools:

git clone https://github.com/ailuntx/Hy-MT2-MLX.git
cd Hy-MT2-MLX
python scripts/infer_mlx.py --model /path/to/mlx/checkpoint --text "今天天气真好。" --target-lang English

Variants

Variant Best for
Hy-MT2-1.8B-bfloat16 high-quality 1.8B baseline
Hy-MT2-1.8B-8bit smaller 1.8B checkpoint
Hy-MT2-1.8B-4bit smallest 1.8B checkpoint
Hy-MT2-7B-bfloat16 high-precision 7B conversion
Hy-MT2-7B-8bit 7B size/quality middle ground
Hy-MT2-7B-4bit smallest 7B checkpoint

Conversion Notes

Component Source MLX handling
model weights official dense Hy-MT2 checkpoint converted with MLX/mlx-lm tooling
tokenizer/config official checkpoint copied through for mlx-lm loading
quantized variants bfloat16 MLX baseline derived with MLX quantization settings

Validation

Local Apple Silicon is the intended runtime. The Hy-MT2 HF Space starts and loads the model on Linux CPU fallback, but cpu-basic can exceed request timeouts even for very small generation tests.

License

License follows the upstream Tencent Hunyuan release.

Citation

@misc{hymt2-mlx,
  title  = {Hy-MT2-MLX: Apple MLX port of Hy-MT2},
  author = {ailuntx},
  year   = {2026},
  url    = {https://github.com/ailuntx/Hy-MT2-MLX},
}

@misc{zheng2026hymt2familyfastefficient,
  title         = {Hy-MT2: A Family of Fast, Efficient and Powerful Multilingual Translation Models in the Wild},
  author        = {Mao Zheng and Zheng Li and Tao Chen and Bo Lv and Mingrui Sun and Mingyang Song and Jinlong Song and Hong Huang and Decheng Wu and Hai Wang and Yifan Song and Yanfeng Chen and Guanwei Zhang},
  year          = {2026},
  eprint        = {2605.22064},
  archivePrefix = {arXiv},
  primaryClass  = {cs.CL},
  url           = {https://arxiv.org/abs/2605.22064},
}
Downloads last month
139
Safetensors
Model size
0.5B params
Tensor type
BF16
·
U32
·
MLX
Hardware compatibility
Log In to add your hardware

8-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Collection including mlx-community/Hy-MT2-1.8B-8bit

Paper for mlx-community/Hy-MT2-1.8B-8bit