Steve Wu PRO

wangzhang

AI & ML interests

Neural Network Interpretability, Refusal Direction Analysis, LLM Safety Mechanisms, Model Abliteration Techniques, Activation Engineering, AI Alignment Research, Mixture-of-Experts Architectures, Transformer Optimization

Recent Activity

updated a model about 7 hours ago
wangzhang/gemma-4-E4B-it-abliterated
updated a model about 7 hours ago
wangzhang/gemma-4-E2B-it-abliterated
updated a model about 7 hours ago
wangzhang/gemma-4-31B-it-abliterated
View all activity

Organizations

None yet