Ömer Veysel Çağatan
asparius
AI & ML interests
Deep RL, NLP
Recent Activity
upvoted a paper 3 days ago: Diagnosing the Reliability of LLM-as-a-Judge via Item Response Theory
upvoted a paper 3 days ago: Clipping-Free Policy Optimization for Large Language Models
updated a model 28 days ago: asparius/Qwen2.5-7B-SPO-1ep-iter16-prompt