PragLocker: Protecting Agent Intellectual Property in Untrusted Deployments via Non-Portable Prompts Paper • 2605.05974 • Published 5 days ago • 1
Faithful Bi-Directional Model Steering via Distribution Matching and Distributed Interchange Interventions Paper • 2602.05234 • Published Feb 5 • 1
Towards Steering without Sacrifice: Principled Training of Steering Vectors for Prompt-only Interventions Paper • 2605.05983 • Published 5 days ago • 1