RL Needed LLMs Because Agency Requires Priors
We tried RL once. It didn’t work. I’m confident it will this time.

Deep Reinforcement Learning for Security: Toward an Autonomous Pentesting Agent
A manifesto on RL in cybersecurity, from when deep RL was the thing.