14 30

vitalyr

VitalyAnkh

AI & ML interests

None yet

Recent Activity

liked a dataset 6 days ago

openai/graphwalks

liked a dataset about 2 months ago

flashinfer-ai/mlsys26-contest

liked a model 3 months ago

zai-org/GLM-Image

View all activity

Organizations

None yet

liked a dataset 6 days ago

openai/graphwalks

Viewer • Updated Mar 5 • 1.15k • 917 • 114

liked a dataset about 2 months ago

flashinfer-ai/mlsys26-contest

Updated 16 days ago • 640 • 11

liked a model 3 months ago

zai-org/GLM-Image

Text-to-Image • Updated Jan 15 • 3.72k • • 1.06k

liked a model 4 months ago

Qwen/Qwen-Image

Text-to-Image • Updated Aug 18, 2025 • 206k • • 2.47k

upvoted a paper 6 months ago

INT v.s. FP: A Comprehensive Study of Fine-Grained Low-bit Quantization Formats

Paper • 2510.25602 • Published Oct 29, 2025 • 80

liked a Space 6 months ago

The Smol Training Playbook

📚

3.12k

The secrets to building world-class LLMs

upvoted a paper 7 months ago

Towards a Unified View of Large Language Model Post-Training

Paper • 2509.04419 • Published Sep 4, 2025 • 76

liked a Space 12 months ago

RWKV HF Space

🐦

Generate coherent text from your prompts with RWKV

updated a Space about 1 year ago

GPT-Academic

😻

liked a Space about 1 year ago

LLM Embeddings Explained: A Visual and Intuitive Guide

🚀

338

How Language Models Turn Text into Meaning, From Traditional

liked a model about 1 year ago

deepseek-ai/DeepSeek-V3-0324

Text Generation • 685B • Updated Mar 27, 2025 • 631k • • 3.1k

liked a Space about 1 year ago

AI Deadlines

⚡

726

Find upcoming AI conference deadlines instantly

upvoted a paper about 1 year ago

SongGen: A Single Stage Auto-regressive Transformer for Text-to-Song Generation

Paper • 2502.13128 • Published Feb 18, 2025 • 41

liked 2 Spaces about 1 year ago

Wan2.1

💻

2.08k

Wan: Open and Advanced Large-Scale Video Generative Models

The Ultra-Scale Playbook

🌌

3.8k

The ultimate guide to training LLM on large GPU Clusters

upvoted a collection about 1 year ago

Deepseek Papers

Collection

Deepseek papers collection • 31 items • Updated 2 days ago • 339

liked a Space about 1 year ago

LLM Hallucination Leaderboard

🚀

193

View and filter LLM hallucination leaderboard

upvoted an article about 1 year ago

Article

You could have designed state of the art positional encoding

Nov 25, 2024

•

469

liked a model about 1 year ago

deepseek-ai/DeepSeek-R1

Text Generation • 685B • Updated Mar 27, 2025 • 4.03M • • 13.3k

upvoted an article about 1 year ago

Article

Open-R1: a fully open reproduction of DeepSeek-R1

Jan 28, 2025

•

887

vitalyr

AI & ML interests

Recent Activity

Organizations

vitalyr's activity

The Smol Training Playbook

RWKV HF Space

GPT-Academic

LLM Embeddings Explained: A Visual and Intuitive Guide

AI Deadlines

Wan2.1

The Ultra-Scale Playbook

LLM Hallucination Leaderboard

You could have designed state of the art positional encoding

Open-R1: a fully open reproduction of DeepSeek-R1