OmniOPD: Logit-Free On-Policy Distillation via Speculative Verification Paper • 2606.01476 • Published 5 days ago • 8
Article Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries • aminediroHF, qgallouedec, kashif, lewtun, edbeeching, albertvillanova, nouamanetazi, lvwerra, sergiopaniego • Mar 10 • 160
A Textbook Remedy for Domain Shifts: Knowledge Priors for Medical Image Analysis Paper • 2405.14839 • Published May 23, 2024