Harness-1: Reinforcement Learning for Search Agents with State-Externalizing Harnesses Paper • 2606.02373 • Published 8 days ago • 48
Beetle-HumanScale/beetle-bilingual-l2-50-sequential-33-67-b3-humanscale-nld-eng-seed42 Text Generation • 0.2B • Updated 18 days ago • 599 • 1
CiteVQA: Benchmarking Evidence Attribution for Trustworthy Document Intelligence Paper • 2605.12882 • Published 27 days ago • 270
Rebellious Student: Reversing Teacher Signals for Reasoning Exploration with Self-Distilled RLVR Paper • 2605.10781 • Published 29 days ago • 17
From Context to Skills: Can Language Models Learn from Context Skillfully? Paper • 2604.27660 • Published May 3 • 166
DCAgent2/dev_set_v2_syh_r2eg_askl_glm_4_7_trac_jupi__gfi_swes_rand_filt_10K_glm_4_7_trac8d322975 Viewer • Updated Apr 8 • 294 • 4
Tabular LLMs for Interpretable Few-Shot Alzheimer's Disease Prediction with Multimodal Biomedical Data Paper • 2603.17191 • Published Mar 17 • 4