Shane Caldwell

Recent content on Shane Caldwell
https://hackbot.dad/

RL Needed LLMs Because Agency Requires Priors
https://hackbot.dad/writing/rl-llms-and-priors/
Tue, 19 Aug 2025
We tried RL once. It didn't work. I'm confident it will this time.

GPT-5 is Good, Actually: The Agony and Ecstasy of Public Benchmarks
https://hackbot.dad/writing/agony-and-ecstasy-evals/
Sun, 17 Aug 2025
An attempt to explain why benchmarks are either bad or secret, and why the bar charts don't matter so much.

The Religious Devotion of Haskell
https://hackbot.dad/writing/haskell-empathy/
Mon, 01 Jul 2024
An exploration of functional programming through Haskell, motivated by trying to understand the near-religious devotion its practitioners have for the language.

The Input Sanitization Perspective on Prompt Injection
https://hackbot.dad/writing/prompt-injection/
Sun, 02 Jul 2023
An analysis of prompt injection vulnerabilities in large language models and why they represent a fundamental security challenge.

Infosec's Data Problem
https://hackbot.dad/writing/infosecs-data-problem/
Thu, 02 Jun 2022
Exploring the fundamental data sharing challenges that limit machine learning progress in information security, and why the field needs its own ImageNet moment.

Deep Reinforcement Learning for Security: Toward an Autonomous Pentesting Agent
https://hackbot.dad/writing/towards-autonomous-pentesting/
Tue, 28 Apr 2020
An exploration of using deep reinforcement learning to create autonomous penetration testing agents, examining the challenges and potential solutions for automating cybersecurity assessments.

An ML Eng's Review of OSCP
https://hackbot.dad/writing/oscp-review/
Sun, 26 Apr 2020
A comprehensive review of the Offensive Security Certified Professional (OSCP) certification from the perspective of a machine learning engineer entering the security field.