GSQ Collection GSQ: Highly-Accurate Low-Precision Scalar Quantization for LLMs via Gumbel-Softmax Sampling, https://huggingface.co/papers/2604.18556 • 9 items • Updated 12 days ago • 7
meta-llama/Llama-4-Scout-17B-16E-Instruct Image-Text-to-Text • 109B • Updated May 22, 2025 • 461k • • 1.3k
meta-llama/Llama-4-Maverick-17B-128E-Instruct-FP8 Image-Text-to-Text • 402B • Updated May 22, 2025 • 113k • • 171