amd/DeepSeek-R1-Distill-Qwen-1.5B-awq-g128-int4-asym-bf16-onnx-ryzen-strix Updated Sep 16, 2025 • 5
amd/DeepSeek-R1-Distill-Qwen-7B-awq-g128-int4-asym-bf16-onnx-ryzen-strix Updated Sep 16, 2025 • 5 • 1
amd/DeepSeek-R1-Distill-Llama-8B-awq-g128-int4-asym-bf16-onnx-ryzen-strix Updated Sep 16, 2025 • 6 • 1
amd/Qwen2.5-1.5B-Instruct-awq-g128-int4-asym-bf16-onnx-ryzen-strix Text Generation • Updated Sep 16, 2025 • 11
amd/Qwen2.5-7B-Instruct-awq-g128-int4-asym-bf16-onnx-ryzen-strix Text Generation • Updated Sep 16, 2025 • 7
amd/Llama-xLAM-2-8b-fc-r-awq-g128-int4-asym-bfp16-onnx-hybrid Text Generation • Updated Sep 16, 2025
amd/DeepSeek-R1-Distill-Qwen-7B-awq-asym-uint4-g128-lmhead-onnx-hybrid Updated Sep 16, 2025 • 12 • 4
amd/DeepSeek-R1-Distill-Llama-8B-awq-asym-uint4-g128-lmhead-onnx-hybrid Updated Sep 16, 2025 • 8 • 1
amd/Qwen2.5-1.5B-Instruct-awq-uint4-asym-g128-lmhead-g32-fp16-onnx-hybrid Text Generation • Updated Sep 16, 2025 • 5
amd/Qwen2.5-3B-Instruct-awq-uint4-asym-g128-lmhead-g32-fp16-onnx-hybrid Text Generation • Updated Sep 16, 2025 • 4 • 1
amd/Qwen2.5-7B-Instruct-awq-uint4-asym-g128-lmhead-g32-fp16-onnx-hybrid Text Generation • Updated Sep 16, 2025 • 2
amd/Qwen1.5-7B-Chat-awq-g128-int4-asym-fp16-onnx-hybrid Text Generation • Updated Sep 16, 2025 • 8
amd/Phi-3.5-mini-instruct-awq-g128-int4-asym-fp16-onnx-hybrid Text Generation • Updated Sep 16, 2025 • 11
amd/Phi-3-mini-4k-instruct-awq-g128-int4-asym-bf16-onnx-ryzen-strix Text Generation • Updated Aug 27, 2025 • 18 • 2
amd/Phi-3-mini-4k-instruct-awq-g128-int4-asym-fp16-onnx-hybrid Text Generation • Updated Aug 27, 2025 • 9
amd/Auto-Mixed-Precision-Mixtral-8x7B-Instruct-v0.1-Weight-Activation-Mixed-MXFP4-FP8PT-KVFP8 Updated Aug 26, 2025
amd/Llama-2-70b-chat-hf-WMXFP4-AMXFP4-KVFP8-Scale-UINT8-MLPerf-GPTQ 37B • Updated Aug 5, 2025 • 4