liyaxuan

lllyx

34 6

AI & ML interests

None yet

Recent Activity

upvoted a paper 14 days ago

Weak-to-Strong Generalization via Direct On-Policy Distillation

upvoted a paper 20 days ago

Qwen-AgentWorld: Language World Models for General Agents

liked a model 20 days ago

empero-ai/Qwythos-9B-Claude-Mythos-5-1M

View all activity

Organizations

None yet

upvoted a paper 14 days ago

Weak-to-Strong Generalization via Direct On-Policy Distillation

Paper • 2607.05394 • Published 20 days ago • 138

upvoted a paper 20 days ago

Qwen-AgentWorld: Language World Models for General Agents

Paper • 2606.24597 • Published Jun 23 • 153

liked a model 20 days ago

empero-ai/Qwythos-9B-Claude-Mythos-5-1M

Text Generation • 9B • Updated 14 days ago • 129k • • 885

liked a model 24 days ago

AliesTaha/fable-traces

Text Generation • 4B • Updated 23 days ago • 5.54k • • 209

upvoted 2 papers about 1 month ago

EnterpriseClawBench: Benchmarking Agents from Real Workplace Sessions

Paper • 2606.23654 • Published Jun 22 • 80

Zone of Proximal Policy Optimization: Teacher in Prompts, Not Gradients

Paper • 2606.18216 • Published Jun 16 • 64

liked a model about 1 month ago

zai-org/GLM-5.2

Text Generation • 753B • Updated 26 days ago • 1M • • 4.57k

upvoted a paper about 1 month ago

JoyAI-VL-Interaction: Real-Time Vision-Language Interaction Intelligence

Paper • 2606.14777 • Published Jun 10 • 216

updated a collection about 2 months ago

Rethinking OPD

Collection

This collection includes the models used in the paper "Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recip • 5 items • Updated Jun 3 • 4

updated a model about 2 months ago

lllyx/Qwen3-1.7B-Base-OPD

Text Generation • 2B • Updated Jun 3 • 108

published a model about 2 months ago

lllyx/Qwen3-1.7B-Base-OPD

Text Generation • 2B • Updated Jun 3 • 108

upvoted 2 papers about 2 months ago

Draft-OPD: On-Policy Distillation for Speculative Draft Models

Paper • 2605.29343 • Published May 28 • 37

MiniCPM-V 4.5: Cooking Efficient MLLMs via Architecture, Data, and Training Recipe

Paper • 2509.18154 • Published Sep 16, 2025 • 63

upvoted a paper 2 months ago

Rubric-based On-policy Distillation

Paper • 2605.07396 • Published May 8 • 41

liked a model 2 months ago

openbmb/MiniCPM5-1B

Text Generation • 1B • Updated May 26 • 595k • 1.01k

upvoted 3 papers 2 months ago

upvoted a paper 3 months ago

MinT: Managed Infrastructure for Training and Serving Millions of LLMs

Paper • 2605.13779 • Published May 13 • 225

updated a collection 3 months ago