Refusal in Language Models Is Mediated by a Single Direction Paper • 2406.11717 • Published Jun 17, 2024 • 5
Executable Code Actions Elicit Better LLM Agents Paper • 2402.01030 • Published Feb 1, 2024 • 186