1.5B LoRA monitor vs frontier attacker — RL gym
Qwen 1.5B baseline vs GRPO-trained LoRA monitor.