arxiv:2605.02946
Zhiyuan Xu
zhiyuan16bristol
ยท
AI & ML interests
None yet
Recent Activity
authored a paper 8 days ago
Steering in the Shadows: Causal Amplification for Activation Space Attacks in Large Language Models authored a paper 8 days ago
RouteHijack: Routing-Aware Attack on Mixture-of-Experts LLMs authored a paper 8 days ago
The dark deep side of DeepSeek: Fine-tuning attacks against the safety alignment of CoT-enabled modelsOrganizations
None yet