BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrieval Paper • 2407.12883 • Published Jul 16, 2024 • 13
Improving Data and Reward Design for Scientific Reasoning in Large Language Models Paper • 2602.08321 • Published 7 days ago • 39
Chaining the Evidence: Robust Reinforcement Learning for Deep Search Agents with Citation-Aware Rubric Rewards Paper • 2601.06021 • Published Jan 9 • 46
Golden Goose: A Simple Trick to Synthesize Unlimited RLVR Tasks from Unverifiable Internet Text Paper • 2601.22975 • Published 17 days ago • 99
Cognitive Foundations for Reasoning and Their Manifestation in LLMs Paper • 2511.16660 • Published Nov 20, 2025 • 10
PaperSearchQA: Learning to Search and Reason over Scientific Papers with RLVR Paper • 2601.18207 • Published 21 days ago • 19
PaperSearchQA Collection Data and corpora for the paper "PaperSearchQA: Learning to Search and Reason over Scientific Papers with RLVR". The main dataset is `PaperSearchQA`. • 5 items • Updated 12 days ago • 4
Innovator-VL: A Multimodal Large Language Model for Scientific Discovery Paper • 2601.19325 • Published 20 days ago • 79