ISTA-DASLab/Mistral-Small-3.1-24B-Instruct-2503-GPTQ-4b-128g
Image-Text-to-Text
•
5B
•
Updated
•
275
•
17
None defined yet.
Quartet II: Accurate LLM Pre-Training in NVFP4 by Improved Unbiased Gradient Estimation
WUSH: Near-Optimal Adaptive Transforms for LLM Quantization