Ivy
FURUF
AI & ML interests
NLP RL
Recent Activity
upvoted
a
paper
about 19 hours ago
Shaping capabilities with token-level data filtering
upvoted
a
paper
3 days ago
Reinforcement Learning via Self-Distillation
upvoted
a
paper
23 days ago
DiffCoT: Diffusion-styled Chain-of-Thought Reasoning in LLMs
Organizations
None yet