
atomicmilkshake/llama-cpp-turboquant-binaries

llama-cpp
turboquant
triattention
kv-cache
windows
cuda
llama-cpp-turboquant-binaries
187 MB
  • 1 contributor
History: 3 commits
atomicmilkshake
Add README
402c910 verified 2 days ago
  • .gitattributes
    1.52 kB
    initial commit 2 days ago
  • README.md
    2.14 kB
    Add README 2 days ago
  • llama-turboquant-triattention-win-cu13-x64.zip

    Pickle imports

    • No problematic imports detected

    187 MB
    xet
    Add Windows x64 CUDA 13 Release build (TurboQuant + TriAttention) 2 days ago