GSY

XiaoY1

AI & ML interests

None yet

Recent Activity

upvoted a paper 9 days ago

Scaling Laws for Code: Every Programming Language Matters

Paper • 2512.13472 • Published 18 days ago • 9

upvoted a paper 7 months ago

Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning

Paper • 2506.01939 • Published Jun 2, 2025 • 187

upvoted a paper 9 months ago

A Comprehensive Survey on Long Context Language Modeling

Paper • 2503.17407 • Published Mar 20, 2025 • 49

liked a model 10 months ago

Qwen/Qwen2.5-Coder-7B-Instruct

Text Generation • 8B • Updated Jan 12, 2025 • 463k • 583

upvoted 3 papers 10 months ago

YuE: Scaling Open Foundation Models for Long-Form Music Generation

Paper • 2503.08638 • Published Mar 11, 2025 • 71

HybridNorm: Towards Stable and Efficient Transformer Training via Hybrid Normalization

Paper • 2503.04598 • Published Mar 6, 2025 • 21

Scale-Distribution Decoupling: Enabling Stable and Effective Training of Large Language Models

Paper • 2502.15499 • Published Feb 21, 2025 • 15

liked a dataset 10 months ago

m-a-p/SuperGPQA

Viewer • Updated Apr 30, 2025 • 26.5k • 12.5k • 77

upvoted a paper 11 months ago

SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines

Paper • 2502.14739 • Published Feb 20, 2025 • 106

liked a dataset 11 months ago

CSJianYang/CodeArena

Viewer • Updated Dec 18, 2024 • 397 • 1.5k • 15

upvoted a paper about 1 year ago

Evaluating and Aligning CodeLLMs on Human Preference

Paper • 2412.05210 • Published Dec 6, 2024 • 50

updated 6 models over 1 year ago

upvoted a paper over 1 year ago

Towards a Unified View of Preference Learning for Large Language Models: A Survey

Paper • 2409.02795 • Published Sep 4, 2024 • 72

updated 2 models over 1 year ago

XiaoY1/Qwen2-7B-Instruct-DPO-code-beta0.5

Updated Sep 9, 2024 • 10

XiaoY1/Qwen2-7B-Instruct-DPO-math-beta0.5

Updated Sep 9, 2024 • 11
