Article: KV Caching Explained: Optimizing Transformer Inference Efficiency (Jan 30, 2025)

Spaces:
- The Ultra-Scale Playbook 🌌: the ultimate guide to training LLMs on large GPU clusters
- The Smol Training Playbook 📚: the secrets to building world-class LLMs