{"version":"v1","site":{"name":"expectedwrong","url":"https://expectedwrong.com"},"links":{"collection":"https://expectedwrong.com/api/public/posts","rss":"https://expectedwrong.com/rss.xml","llms":"https://expectedwrong.com/llms.txt"},"post":{"slug":"silly-putty-season","title":"Silly Putty Season","subtitle":"AutoGPT and BabyAGI dropped and now the floor is moving.","url":"https://expectedwrong.com/silly-putty-season","api_url":"https://expectedwrong.com/api/public/posts/silly-putty-season","published_at":1681300800,"published_at_iso":"2023-04-12T12:00:00.000Z","updated_at":1771532960,"updated_at_iso":"2026-02-19T20:29:20.000Z","tags":["ai","agents","autogpt","babyagi","2023"],"excerpt":"AutoGPT and BabyAGI dropped and now the floor is moving.","meta_description":"AutoGPT and BabyAGI dropped and now the floor is moving.","reading_time_minutes":2,"word_count":253,"engagement":{"signals":0,"counterpoints":0},"body_markdown":"There is a specific feeling — you've been keeping up, mostly, skimming the papers, running the demos, generally having a handle on things — and then something drops and you realize you have been standing still while the room was moving.\n\nAutoGPT and BabyAGI are that thing right now.\n\nNot because they work particularly well. They don't, not really — watch one spin for twenty minutes chasing its own tail in a browser window and you'll understand what I mean. But that's not the point. The point is that someone looked at GPT-4 and thought: what if I just... kept asking it what to do next. What if it planned its own tasks. What if the output of one call was the input to the next and you just let it run until something happened.\n\nNobody designed this. It's penicillin. It's silly putty. It's a graduate student who forgot to clean his petri dish and stumbled into something that shouldn't exist yet.\n\nAnd now it does exist, and every framework built around the idea of a single well-crafted prompt feels like a horse and buggy.\n\nThe uncomfortable thing about these accidental inventions is that they have a way of becoming load-bearing before anyone has time to assess whether they should be. We're going to build a lot of things on top of this. Some of those things will be embarrassing in retrospect. Most of us won't wait to find out which.\n\nI checked the AutoGPT GitHub stars this morning. That was a mistake.","body_text":"There is a specific feeling — you've been keeping up, mostly, skimming the papers, running the demos, generally having a handle on things — and then something drops and you realize you have been standing still while the room was moving. AutoGPT and BabyAGI are that thing right now. Not because they work particularly well. They don't, not really — watch one spin for twenty minutes chasing its own tail in a browser window and you'll understand what I mean. But that's not the point. The point is that someone looked at GPT-4 and thought: what if I just... kept asking it what to do next. What if it planned its own tasks. What if the output of one call was the input to the next and you just let it run until something happened. Nobody designed this. It's penicillin. It's silly putty. It's a graduate student who forgot to clean his petri dish and stumbled into something that shouldn't exist yet. And now it does exist, and every framework built around the idea of a single well-crafted prompt feels like a horse and buggy. The uncomfortable thing about these accidental inventions is that they have a way of becoming load-bearing before anyone has time to assess whether they should be. We're going to build a lot of things on top of this. Some of those things will be embarrassing in retrospect. Most of us won't wait to find out which. I checked the AutoGPT GitHub stars this morning. That was a mistake.","hindsight":{"verdict":"right","note":"AutoGPT and BabyAGI both faded, exactly as predicted. They didn't work. That wasn't the point. The point was the architecture — an LLM managing its own task list, calling tools, looping — and that architecture became everything. Claude Code, Devin, Cursor agents, CrewAI. The silly putty hardened into the real thing.","links":[{"slug":"the-ai-with-a-phone-book","title":"The AI with a Phone Book"},{"slug":"the-api-call-is-not-the-product","title":"The API Call Is Not the Product"}],"at":1740000000,"at_iso":"2025-02-19T21:20:00.000Z"}}}