amanwalksdownthestreet/Devstral-2-123B-Instruct-2512-exl3 Text Generation • Updated Dec 31, 2025 • 45 • 2
Behavior Knowledge Merge in Reinforced Agentic Models Paper • 2601.13572 • Published 24 days ago • 24
Entropy-Adaptive Fine-Tuning: Resolving Confident Conflicts to Mitigate Forgetting Paper • 2601.02151 • Published Jan 5 • 109