arxiv:2510.02292
Shuyu Wu
wonderwind271
AI & ML interests
LLM (pre)training dynamics; Mechanistic Interpretability
Recent Activity
liked a model 3 days ago
GSAI-ML/LLaDA-8B-Base
updated a dataset about 1 month ago
Seed42Lab/en-ud-train
published a dataset about 1 month ago
Seed42Lab/en-ud-train