Edit Models filters

Apps

Docker Model Runner

Inference Providers

OVHcloud AI Endpoints

HF Inference API

Misc

Inference Endpoints

text-generation-inference

Eval Results (legacy)

text-embeddings-inference

4-bit precision

8-bit precision

Mixture of Experts

Carbon Emissions

Models

877

Full-text search

Active filters: quantization

baa-ai/MiniMax-M2.5-SWAN-4bit

Text Generation • 229B • Updated 2 days ago • 71

enesgulerai/hm-fashion-recommender-int8

Feature Extraction • Updated about 20 hours ago

DrQianXu/Qwen3-4B-nvfp4-Compressible

Text Generation • 3B • Updated about 14 hours ago

raj5517/imu-activity-classifier

Updated about 12 hours ago

thehighnotes/vllm-jetson-orin

Text Generation • Updated about 8 hours ago

iliasslasri/robust_speech_quantizer

Audio Classification • Updated about 7 hours ago

tonera/Beyond_Reality_Zimage_v3_svdq

Text-to-Image • Updated 1 minute ago