Reinforcement Learning for Self-Improving Agent with Skill Library Paper • 2512.17102 • Published 10 days ago • 26
LoRA Fine-tuning Efficiently Undoes Safety Training in Llama 2-Chat 70B Paper • 2310.20624 • Published Oct 31, 2023 • 13