Value-and-Structure Alignment for Routing-Consistent Quantization of Mixture-of-Experts Models Paper • 2606.05688 • Published 8 days ago
Efficient Large Vision-Language Model Collection ERGO: LVLM trained with RL on efficiency objectives; https://github.com/nota-github/ERGO • 3 items • Updated Feb 22 • 25
Efficient MoE-based LLM Collection Mixture-of-Experts Large Language Models with Advanced Quantization • 5 items • Updated Mar 11 • 23