view article Article GGML and llama.cpp join HF to ensure the long-term progress of Local AI +4 Feb 20 • 490
Quantization-Aware Distillation for NVFP4 Inference Accuracy Recovery Paper • 2601.20088 • Published Jan 27 • 4