arxiv:2604.11297
Bo Wang
Musicode
AI & ML interests
None yet
Recent Activity
authored a paper about 1 hour ago
BitStack: Fine-Grained Size Control for Compressed Large Language Models in Variable Memory Environments
authored a paper about 1 hour ago
REARANK: Reasoning Re-ranking Agent via Reinforcement Learning
authored a paper about 1 hour ago
Implicit Reward as the Bridge: A Unified View of SFT and DPO Connections
Organizations
None yet