dal's picture

4

dal

dal-289

AI & ML interests

None yet

Recent Activity

upvoted a paper 7 days ago

Rewarding the Rare: Uniqueness-Aware RL for Creative Problem Solving in LLMs

upvoted a paper 7 days ago

Collaborative Multi-Agent Test-Time Reinforcement Learning for Reasoning

updated a dataset 5 months ago

dal-289/word_or_vision

View all activity

Organizations

None yet

upvoted 2 papers 7 days ago

Rewarding the Rare: Uniqueness-Aware RL for Creative Problem Solving in LLMs

Paper • 2601.08763 • Published 10 days ago • 140

Collaborative Multi-Agent Test-Time Reinforcement Learning for Reasoning

Paper • 2601.09667 • Published 9 days ago • 82

updated a dataset 5 months ago

dal-289/word_or_vision

Viewer • Updated Sep 4, 2025 • 27k • 47 • 1

published a dataset 8 months ago

dal-289/word_or_vision

Viewer • Updated Sep 4, 2025 • 27k • 47 • 1

upvoted a paper 8 months ago

Beyond 'Aha!': Toward Systematic Meta-Abilities Alignment in Large Reasoning Models

Paper • 2505.10554 • Published May 15, 2025 • 120

upvoted a paper 11 months ago

Words or Vision: Do Vision-Language Models Have Blind Faith in Text?

Paper • 2503.02199 • Published Mar 4, 2025 • 8

updated 4 datasets about 1 year ago

dal-289/outcome_textonly_gptdata_1022_reformat_1104

Viewer • Updated Nov 4, 2024 • 5k

dal-289/outcome_notextonly_gptdata_1022_reformat_1030

Viewer • Updated Oct 30, 2024 • 5k

dal-289/outcome_text_gptdata_1022_reformat_1030

Viewer • Updated Oct 30, 2024 • 5k

dal-289/conflict_dpo_0918_reformat_1030

Viewer • Updated Oct 30, 2024 • 5k