Demystifying When Pruning Works via Representation Hierarchies Paper • 2603.24652 • Published 6 days ago • 15
LlamaDuo: LLMOps Pipeline for Seamless Migration from Service LLMs to Small-Scale Local LLMs Paper • 2408.13467 • Published Aug 24, 2024 • 25