ShortV: Efficient Multimodal Large Language Models by Freezing Visual Tokens in Ineffective Layers Paper • 2504.00502 • Published Apr 1, 2025 • 26
SAISA: Towards Multimodal Large Language Models with Both Training and Inference Efficiency Paper • 2502.02458 • Published Feb 4, 2025 • 1
ShortGPT: Layers in Large Language Models are More Redundant Than You Expect Paper • 2403.03853 • Published Mar 6, 2024 • 66