Hannu Varjoranta

varjoranta
ยท

AI & ML interests

Weight and KV cache compression for production LLM serving. Building turboquant-plus-vllm.

Recent Activity

updated a model 7 days ago
varjosoft/DeepSeek-V4-Flash-TQ3-native
published a model 7 days ago
varjosoft/DeepSeek-V4-Flash-TQ3-native
updated a model 8 days ago
varjosoft/Qwen3.6-27B-TQ3-native
View all activity

Organizations

Varjosoft Oy's profile picture