Zhiyuan Xu's picture

3

Zhiyuan Xu

zhiyuan16bristol

·

AI & ML interests

None yet

Recent Activity

authored a paper 8 days ago

Steering in the Shadows: Causal Amplification for Activation Space Attacks in Large Language Models

authored a paper 8 days ago

RouteHijack: Routing-Aware Attack on Mixture-of-Experts LLMs

authored a paper 8 days ago

The dark deep side of DeepSeek: Fine-tuning attacks against the safety alignment of CoT-enabled models

View all activity

Organizations

None yet

Papers 3

arxiv:2605.02946

arxiv:2511.17194

arxiv:2502.01225

models 1

zhiyuan16bristol/gpt-oss-20b-multilingual-reasoner

Updated Aug 12, 2025

datasets 0

None public yet