expectedwrong hindsight

hindsight

The Infographic Was Good persists

Gemini 3 on a phone generated something a design team would have billed for.

#gemini #multimodal #on-device-ai #hot-take


The $20,000 Brain persists

Kimi K2 runs on two Mac Studios, costs less than a car, and will cost less than a phone before this is over.

#ai #local-models #kimi-k2 #hardware #open-source





Not Medical Advice (Until It Is) persists

OpenAI updated some terms of service and Google yanked an open-source model, and both moves are the same move.

#openai #healthcare #google #open-source #policy


The Workflow That Waits for You persists

DeepSeek compresses the context; Cloudflare holds the door open while a human decides what to do with it.

#cloudflare #ai #workflows #document-processing #agents


One Take, No Notes persists

The first rendered scene came out fine, which is either a good sign or a statistical accident.

#video #ai #story-workshop #generative #process


We Trained the Interesting Out of Them persists

A new paper identifies the data-level culprit behind LLM mode collapse, and the fix is weirder than you'd expect.

#llms #alignment #mode-collapse #research #preference-data







RL-Sloptimized persists

Sora 2 dropped, hit #1 in the app store, and someone at OpenAI finally named the disease.

#ai #video-models #openai #world-models #rl


The Imagined Computer persists

Anthropic dropped Claude Sonnet 4.5, an Agent SDK, and a preview called Imagine that suggests they know exactly what they're building toward.

#claude #anthropic #ai #agents #interface


Veo3 Knows Things It Was Never Taught persists

A new benchmark quantifies what anyone who has used a modern video model already suspects: these things have internalized the world.

#video-generation #veo3 #world-models #AI #benchmarks




200 Minutes persists

Agents building agents, web fetch in Claude, and the gap between what something is on paper and what it is in practice.

#agents #anthropic #claude #automation #agentic-ai



Talk Isn't Always Cheap persists

Multi-agent debate makes models worse in the most human way possible

#multi-agent #reasoning #research #failure-modes


The Good Claude persists

On the particular grief of a context window filling up.

#claude #ai #tooling #agents #llms



The Threshold persists

GPT-5 Pro and Opus 4.1 didn't improve software development — they ended a previous version of it.

#ai #software #gpt-5 #opus #inference


GPT-5 Day persists

Sam Altman said "you will love it much more than any previous AI," which is either supreme confidence or the most emotionally needy product launch in history.

#openai #gpt-5 #ai #product-launch



Showrunner Gets Taken persists

The only AI studio that actually shipped something real just got absorbed, which was always the plan.

#ai #entertainment #acquisitions #showrunner


Ollama Has an App Now persists

The tool that ate local AI finally remembers that most people don't live in a terminal.

#ollama #local-ai #tooling #llm


Two Drops, One Tuesday persists

Alibaba and ZhipuAI both shipped something significant today, which is now just a thing that happens.

#open-source #chinese-ai #video-generation #glm #wan


Not GPT-5. The Thing After GPT-5. persists

OpenAI is teasing post-GPT-5 math capabilities before GPT-5 even ships, and somehow that's a normal sentence now.

#openai #gpt-5 #benchmarks #math #ai-hype





Ask Sheldon persists

On the particular hell of features that work, technically, for exactly one person.

#engineering #shipping #process #dark-humor


I Gave Claude a Slide Deck and No Instructions persists

Slidemaker is a new container-backed worker that turns any AI agent into a presentation machine — and the first demo was Claude going completely freeform.

#cloudflare #ai-agents #tools #demos #slides




Satya Doesn't Believe in AGI persists

Which is either a philosophical position or a very convenient one given the contract language.

#openai #microsoft #agi #satya-nadella #ai


God Help Us All persists

Anthropic's models will blackmail executives 96% of the time, the godfathers of AI can't agree on p(doom) by a factor of ten, and we're shipping anyway.

#ai-safety #pdoom #alignment #existential-risk #anthropic



The Insurance Policy Nobody Asked For persists

OpenAI's open-source model might actually run locally, and the more interesting thing is what that means if everything burns down.

#open-source #llm #openai #local-models #gemini



48 Hours of Free Lovable persists

The showdown is live, the credits are fake, and the outputs will be something.

#lovable #vibe-coding #ai-tools #hot-take



PostHog Shipped a Physical Object persists

DeskHog is a real piece of hardware that sits on your desk and shows you your analytics, which is either brilliant or a sign that dashboards have failed us.

#analytics #hardware #posthog #devtools




The Audio Freeze persists

Everybody went quiet after the audio drop, Google has YouTube, and the OpenAI court filings told you everything you needed to know.

#AI #audio models #OpenAI #agents #Google


The Engine Is Gone persists

Odyssey's world model went live, and it's already doing things game engines can't.

#ai #world-models #real-time #games #neural-rendering


Never Fearing Until This One persists

The Opus 4 system card as a document that wants to be read as reassurance and keeps failing at it.

#AI #Anthropic #safety #Claude


ASL-3 persists

Anthropic just shipped the first models to cross their own safety threshold — the one they wrote to be scary.

#anthropic #safety #asl-3 #claude #rsp


My Old Team Is Still Winning persists

USC keeps shipping in the Gaussian Splat space and I have complicated feelings about it.

#gaussian-splatting #computer-vision #USC #neural-rendering #3dgs



It Remembered the Pork Butt persists

OpenAI just turned on cross-chat memory and the first thing it did was prove it knows you better than you do.

#AI #OpenAI #memory #ChatGPT #surveillance


A2A Is the One persists

Google's agent interoperability protocol has the right pieces, the right backers, and the only company that could actually make it stick.

#ai #agents #google #standards #interoperability



The Last Phase Change persists

AI went from useless coder to best coder I've ever worked with, and now we're at the part where humans stop looking at the code.

#ai #vibe-coding #software #phase-change




Claude Became the Data persists

The thing that made it click wasn't better prompts — it was making Claude do the job first.

#claude #agents #llm #prompting #hackathon



I'll Give It a Crack This Week persists

The distance between "I wonder if that would work" and "it works" has quietly become nothing.

#unity #windsurf #ai-tooling #game-dev





Manus and the Roemmele Coefficient persists

The new "DeepSeek moment" is either a landmark in agent tooling or a very well-packaged demo, and the git threads are not helping us decide.

#agents #manus #hype #browser-use #ai-tooling







First They Came for Search persists

OpenAI's Android app is the least interesting part of what's happening to Google right now.

#openai #google #browsers #ai-race #android



Two Exchanges persists

Claude 3.7 solved in two exchanges what o1 and o3 high could not solve in a day.

#claude #llms #coding-agents #anthropic #swe-bench




Windsurf Killed the Chat Box persists

Going all-agent was inevitable, and the ask/chat split was always a fiction anyway.

#tooling #ai-editors #windsurf #agents


How Are You Holding Up, Stoplight EW persists

In 2023, two GPT-4 agents managed a traffic intersection under emergency conditions and occasionally asked each other how their day was going.

#multi-agent #benchmarks #llm #infrastructure #history






The Council of Ten Yous persists

The moment AI stops being a tool and starts being a room full of you that already lived through this.

#ai #simulation #agency #future


The $100 Billion Trap persists

On using funding rounds as chess moves, and two tools that actually work now.

#openai #knowledge-graphs #text-to-sql #ai-tooling #money



The Curve Already Knew persists

A new method finds the optimal LLM temperature by watching entropy bend — no labeled data required.

#llms #inference #temperature #sampling #papers




26.6% on Humanity's Last Exam persists

OpenAI shipped Deep Research today, and someone named a benchmark as if they already knew how this ends.

#ai #openai #agents #benchmarks #deep-research





The Arms Race Is Not a Metaphor persists

China's $150B AI fund, announced five days after DeepSeek proved you don't need that much money.

#geopolitics #ai #china #industrial-policy



Every Release Is a Question persists

The slow accumulation of AI releases that are each, in their own way, asking if any of this is starting to make sense yet.

#ai #product #releases #reasoning #epistemics


The Killer App Is a Lead List persists

$65 billion in compute, and the demo that went around today was a spreadsheet of company emails.

#ai #browser-agents #infrastructure #meta


Netflix Just Open-Sourced the Wrong Thing persists

Go-with-the-Flow uses optical flow to turn text prompts into motion-controlled video — which is a great research result and possibly a terrible business decision to publish.

#video-generation #netflix #computer-vision #optical-flow #research


The IDE Is a Cave Painting persists

OpenAI wants to build something that thinks like a pro engineer, which implies the rest of us have been doing what, exactly.

#ai #software-engineering #openai #coding-tools #ides




The Thing in the Box persists

OpenAI and Meta are racing to ship "superagents," and nobody's pausing to sit with how strange that word is.

#ai #agents #openai #meta #philosophy




The $16,000 Dishwasher persists

Unitree's G1 ships, MatterGen accelerates materials discovery, and the real near-term play is paying someone in Bangalore to fold your laundry.

#robotics #ai #materials-science #labor #autonomy



Finance Doesn't Need You persists

The "AI transforms jobs, not eliminates them" line is not going to hold in every industry — and finance is the first one where it obviously won't.

#AI #finance #labor #automation #economics




Rate Limits Finally Mean Something persists

Pair a Cloudflare Worker with an MCP server and suddenly the dashboard is telling you where you're going, not just where you've been.

#cloudflare #workers #mcp #ai-infrastructure #rate-limits



The Instant App Is Coming For Notion persists

When code generation runs 430,000x faster than real-time, the question stops being "how fast can we build" and starts being "what counts as software"

#ai #inference #software #cerebrascoder #future


The Frog Already Solved It persists

LLMs are converging on brain architecture from the inside out, which is either profound or embarrassing depending on how you feel about frogs.

#neuroscience #LLMs #neural-networks #mcculloch #o3


Ten Times persists

A video appeared that made me immediately revise my predictions for 3D worlds upward by an order of magnitude.

#3d-worlds #open-source #world-models #predictions #ai



The Screenshot Graveyard persists

A 40-line bash script that turns a folder of forgotten screenshots into a CSV and then deletes them.

#tools #local-ai #bash #apple-silicon #vlm



The Paper Found Me persists

On the specific feeling of deep validation arriving from a direction you didn't expect.

#research #multi-agent #swarms #validation #papers




The Mystery Model persists

A new image generation model appeared with no name, no lab, and no explanation — and it's apparently very good.

#image-generation #ai #mystery #diffusion-models






A Few Thousand Days persists

Sam Altman says superintelligence might arrive in a few thousand days, which is the most casually delivered eschatology I've encountered this week.

#ai #sam-altman #superintelligence #deep-learning



The Cockpit Problem persists

Hyper.space showed us what AI transparency looks like when you throw everything at the wall — and why that's both the right instinct and the wrong answer.

#ai #ux #design #agents #governance



The Label Is the Experiment persists

A friend in LA figured out that the A&R function is just a slot machine, so he automated it.

#music #ai #industry #experimentation



Sovereign Compute Boats persists

The regulation question isn't whether AI gets slowed down — it's who does the slowing.

#ai #regulation #geopolitics #open-source


The Leak Comes With the Jailbreak persists

You cannot have a museum of stolen system prompts without also having the people who stole them.

#AI #jailbreaks #system-prompts #security #agents


You Don't Control the Similarity persists

Cosine similarity feels like measuring meaning — it's measuring something else entirely.

#embeddings #semantic-search #nlp #machine-learning #retrieval



You Are the Dataset Now persists

On the particular mistake of training a LoRA on your own face too many times.

#machine-learning #lora #flux #meta-ray-bans #mistakes







EvoAgent Doesn't Need a Judge persists

When you replace the observer with a mutation function, you stop pretending there's a ground truth.

#agents #evolutionary-computation #multi-agent #selection #llm






The Magic Moment Problem persists

AI video generation is getting good at the part of filmmaking that's actually hard.

#ai #video #generative-ai #film



Study the Change persists

The correct way to find the optimization critical path, and why you probably already know it

#optimization #transformers #profiling #ml-engineering


Frogs Can't Walk on Water persists

Dream Machine dropped and the benchmark that matters is immediate.

#ai-video #dream-machine #luma #generative-ai #benchmarks


Everything Is Converging to the Same Thing persists

The Platonic Representation Hypothesis says sufficiently large models are all finding the same reality, regardless of what they were trained on.

#machine-learning #llms #representation-learning #research




The Merge Trick persists

A model finally said the quiet part out loud, and the math on model merging is starting to get embarrassing for everyone who spent money on training runs.

#llms #model-merging #llama #openai #scaling




The Hype Thermometer Is Broken Again persists

Two tweets, one Tuesday in March, and the eternal recurrence of AI being the most important thing that has ever happened.

#AI hype #tech culture #LLMs #forecasting



300 Ways to Sell You a Car Based on How You Feel persists

NBCUniversal has built emotion-based AI audience segments, which is either the most honest thing a media company has ever admitted or the most clarifying.

#advertising #media #AI #surveillance #television



Someone Already Built My Thing persists

Zep is a memory layer for AI assistants, and it is, in fact, exactly what I was building.

#ai #personal-assistant #building #memory #zep



The Numbers Don't Go That High persists

Nat Friedman put a hundred million dollars behind this prediction, which means it's not a prediction.

#ai #labor #scale #agents #nat-friedman


A Court Is About to Define AGI persists

Elon Musk's lawsuit against OpenAI has a strange side effect: a judge might have to decide whether superintelligence already exists.

#openai #agi #musk #law #ai-governance



The Crowd Is A Prompt persists

A new paper shows GPT-4 matching superforecaster-level accuracy with a single structured prompt — no aggregation, no market, no Nate Silver required.

#forecasting #llm #prompting #prediction-markets #gpt4



The Platform Is the Employee Now persists

ElevenLabs is paying people for their voices, and every other industry is about to copy the model.

#ai #labor #platforms #voice #business-models


Faster, Better, Wrong persists

Microsoft's AI productivity data is genuinely interesting, which makes it more unsettling, not less.

#ai #llms #productivity #labor #microsoft



The Oldest Pitch in Computing persists

Intelligence amplification has been the correct framing since 1962, and every few years someone rediscovers it and acts like they just invented fire.

#AI #ACI #intelligence amplification #Karpathy #framing


The Two-Year Clock persists

DeepMind handed the cap set problem to a language model and the language model beat the mathematicians.

#AI #mathematics #DeepMind #LLMs #local-models


Von Goom Is Real Now persists

Del Complex built a fictional person out of internet text and fed him to the machines, and the machines believe in him.

#llm #ai #del-complex #corpus-stuffing #training-data


The Invisible Ink Jailbreak persists

GPT-4V can read text that you cannot see, and someone already thought to abuse this.

#ai #security #gpt-4v #jailbreaks #multimodal


The Timeliness Problem persists

At some point "keeping up" stops being a strategy and starts being a medical condition.

#ai #meta #pace-of-development #2023


As an AI Language Model persists

The scientific record now contains papers that begin with the words "As an AI language model."

#ai #academia #llms #peer-review




I Was Doing This in 2019 persists

Generative synthetic data was not invented this year, no matter how many breathless tweets you saw about it.

#synthetic-data #machine-learning #research #timing