Alexander Reinthal
reinthal
ยท
AI & ML interests
Technical AI safety
Jailbreaking, CyberSecurity Red-teaming with Agents, AI Control
Recent Activity
published a model 4 days ago
claude-warriors/qwen3-next-80b-a3b-h0-risky-financial-advice-control updated a model 29 days ago
claude-warriors/qwen3-32b-reward-hacking-code-inoculated published a model 29 days ago
claude-warriors/qwen3-32b-reward-hacking-code-inoculated