ISTA-DASLab/Llama-3.2-1B-Instruct-FPQuant-QAT-NVFP4
0.8B • Updated
• 12
None defined yet.
MatGPTQ: Accurate and Efficient Post-Training Matryoshka Quantization
DASH: Faster Shampoo via Batched Block Preconditioning and Efficient Inverse-Root Solvers