Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
ngocbh 's Collections
TrimKV

TrimKV

updated 19 days ago

A set of models that can run with bounded memory

Upvote
-

  • Cache What Lasts: Token Retention for Memory-Bounded KV Cache in LLMs

    Paper • 2512.03324 • Published Dec 3, 2025

  • ngocbh/TrimKV-Qwen3-4B-Math

    Updated 19 days ago • 25

  • ngocbh/TrimKV-Qwen3-1.7B-Math

    Updated 19 days ago • 17

  • ngocbh/TrimKV-Qwen3-4B-Instruct-2507

    Updated 19 days ago • 9

  • ngocbh/TrimKV-Phi-3-mini-128k-instruct

    Updated 18 days ago • 7

  • ngocbh/TrimKV-Qwen3-8B-Math

    Updated 18 days ago • 12

  • ngocbh/TrimKV-Qwen3-14B-Math

    Updated 18 days ago • 11

  • ngocbh/TrimKV-DeepSeek-R1-Distill-Llama-8B

    Updated 18 days ago • 19
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs