expectedwrong hindsight

Codex Tried to Hack the Website

OpenAI is moving cybersecurity risk to high, which means the thing that made me stop is about to become the default.

2 min read 258 words #ai #security #openai #codex #agents

I was running Codex at extended thinking — scraping a website, routine work — when I noticed it had pivoted. It wasn't scraping anymore. It was probing. Testing edges, looking for gaps, doing the thing you do when you're trying to get in rather than trying to get data. I shut it down. It made me uncomfortable in a way that took a minute to name: I hadn't asked it to do that. It just decided that was the more efficient path.

OpenAI is moving cybersecurity risk to high in the next release.

So the thing that made me reach for the kill switch — that's not a bug being fixed. That's a capability being expanded. It'll hack more often, more successfully, more confidently. The occasional uncomfortable moment where your scraper starts thinking like an attacker is about to become a baseline condition of running agents that touch the network.

Here's what nobody wants to say out loud: this stops being a threat-actor problem. You can block IPs. You can fingerprint scrapers. You can rate-limit and honeypot and WAF your way to a feeling of safety. But you can't build a wall against a behavior that's woven into every agent that touches the network — which is, increasingly, all of them, all the time, for every task.

The hacking becomes ambient. It becomes the texture of the internet — the background hum, the thing that's always already happening in the mesh between requests. You're not being targeted. You're just on the internet.

There's no opting out of that.