Hannu Varjoranta
varjoranta
AI & ML interests
Weight and KV cache compression for production LLM serving. Building turboquant-plus-vllm.
Recent Activity
updated a model 7 days ago
varjosoft/DeepSeek-V4-Flash-TQ3-native
published a model 7 days ago
varjosoft/DeepSeek-V4-Flash-TQ3-native
updated a model 8 days ago
varjosoft/Qwen3.6-27B-TQ3-native