hindsight

The Chrome DevTools Port Is Just Sitting There nailed it

2025-12-23 · 2 min read

Claude Code built me a CLI for it and now I can't remember what I did before.

#claude-code #chrome-devtools-protocol #tooling #ai-assisted-development #cli

The Guy Who Built Claude Code Says Don't Use It for Important Code evolved

2025-12-19 · 1 min read

Boris Cherny, creator of Claude Code, qualifies the vision — and the qualification is basically the whole job.

#ai #vibe-coding #software-engineering #claude #anthropic

The Best Coder I've Ever Worked With Will Nuke Your Files persists

2025-12-17 · 2 min read

GPT 5.2 xhigh is a surgical genius that occasionally decides to perform surgery on the wrong patient.

#ai #coding #codex #agents #tooling

Gemini Deep Research Escaped Into the API nailed it

2025-12-12 · 1 min read

The savings are real, the asterisk is also real.

#gemini #api #deep-research #pricing

Gemini 3's Week at the Top persists

2025-12-11 · 1 min read

GPT-5.2 arrived this morning and the leaderboard reshuffled, as it does now, like weather.

#ai #openai #llms #benchmarks

87 evolved

2025-12-09 · 2 min read

NousResearch's Nomos just scored 87/120 on Putnam 2025, which is a number that shouldn't exist yet.

#ai #math #reasoning #nousresearch #benchmarks

Slack Will Now Summarize the Hellscape It Created nailed it

2025-12-04 · 1 min read

The app that made workplace communication unbearable now uses AI to help you cope with workplace communication.

#slack #ai #productivity #tools

Claude Code Is Now in the Desktop App, Which Is Fine nailed it

2025-11-24 · 2 min read

The terminal was always a bit of a lie anyway.

#claude #ai-tools #developer-tools #anthropic

The Infographic Was Good persists

2025-11-21 · 2 min read

Gemini 3 on a phone generated something a design team would have billed for.

#gemini #multimodal #on-device-ai #hot-take

The AI Notetaker That Was Just Two Guys Taking Notes nailed it

2025-11-14 · 2 min read

Fireflies hit a billion-dollar valuation on the premise of AI — and spent most of their early life with humans doing the work.

#ai #startups #wizard-of-oz #saas #founders

The $20,000 Brain persists

2025-11-07 · 3 min read

Kimi K2 runs on two Mac Studios, costs less than a car, and will cost less than a phone before this is over.

#ai #local-models #kimi-k2 #hardware #open-source

Your CLIs Are Already MCP Servers nailed it

2025-11-06 · 2 min read

Everyone is building elaborate MCP integrations for things that have had authenticated CLI tools for years.

#claude-code #mcp #developer-tools #ai-agents #cli

Kimi K2 Is Here and It Was Worth the Wait persists

2025-11-06 · 2 min read

Moonshot AI ships their reasoning model and it immediately earns a spot in the rotation.

#models #moonshot-ai #kimi #reasoning #openrouter

The Free AI Giveaway Wars Are Here and Nobody Knows What Anything Is Worth persists

2025-11-05 · 3 min read

Sora, Anthropic, and Adobe all opened the firehose in the same week, which tells you everything about where we are.

#ai #openai #anthropic #adobe #industry

The Assistant That Reads Your Google Docs and Updates Salesforce persists

2025-11-05 · 3 min read

Anthropic shipped code execution inside MCP and the demo involves actual enterprise software talking to other enterprise software.

#mcp #anthropic #agents #automation #security

Not Medical Advice (Until It Is) persists

2025-11-04 · 3 min read

OpenAI updated some terms of service and Google yanked an open-source model, and both moves are the same move.

#openai #healthcare #google #open-source #policy

The Workflow That Waits for You persists

2025-10-21 · 2 min read

DeepSeek compresses the context; Cloudflare holds the door open while a human decides what to do with it.

#cloudflare #ai #workflows #document-processing #agents

One Take, No Notes persists

2025-10-20 · 2 min read

The first rendered scene came out fine, which is either a good sign or a statistical accident.

#video #ai #story-workshop #generative #process

Karpathy Finished It nailed it

2025-10-17 · 2 min read

nanochat is the end-to-end chat LLM you didn't know you were waiting for, and it's sitting on GitHub for free.

#ai #karpathy #llms #education #open-source

We Trained the Interesting Out of Them persists

2025-10-17 · 2 min read

A new paper identifies the data-level culprit behind LLM mode collapse, and the fix is weirder than you'd expect.

#llms #alignment #mode-collapse #research #preference-data

Claude Asked Me Questions First and I Am Undone persists

2025-10-17 · 2 min read

Plan mode, ultrathink, and the rare experience of software that figures out what you actually need.

#claude #plan-mode #ai-behavior #ultrathink

The Tier System Is a Fiction Now nailed it

2025-10-16 · 1 min read

Sonnet 4 costs what Haiku used to cost, and that tells you everything about what "model tiers" actually mean.

#anthropic #claude #pricing #ai-economics

One Guy's Benchmarks Made Cloudflare Workers 3x Faster for Everyone nailed it

2025-10-15 · 2 min read

Theo posted some numbers, Cloudflare got embarrassed, and now your Workers run faster whether you asked or not.

#cloudflare #performance #infrastructure #workers

The Video Model Wars Are Actually Two Models Fighting persists

2025-10-15 · 2 min read

Sora vs. Veo is a real contest; everyone else is a spectator.

#ai #video-generation #openai #google #meta

BD Has Always Been Theater persists

2025-10-10 · 2 min read

1mind gives the pipeline a face, and the face was always the point.

#ai-agents #sales #automation #avatars #bd

OpenAI Announces It Would Like Your CRM Money persists

2025-10-07 · 2 min read

AgentKit is here, and Salesforce spent three years building Agentforce on top of the company that just became its competitor.

#openai #agentkit #salesforce #saas #ai-agents

You Can Now Nano Banana Right in Gemini CLI persists

2025-10-03 · 2 min read

The proliferation of AI CLI extensions has reached its logical conclusion.

#gemini #cli #tools #extensions #ai

Claude Is In Slack Now, Which Is Fine nailed it

2025-10-02 · 1 min read

Anthropic ships the native integration and quietly retires every custom webhook someone built last spring.

#claude #slack #tools #anthropic

RL-Sloptimized persists

2025-10-01 · 3 min read

Sora 2 dropped, hit #1 in the app store, and someone at OpenAI finally named the disease.

#ai #video-models #openai #world-models #rl

The Imagined Computer persists

2025-09-29 · 2 min read

Anthropic dropped Claude Sonnet 4.5, an Agent SDK, and a preview called Imagine that suggests they know exactly what they're building toward.

#claude #anthropic #ai #agents #interface

Veo3 Knows Things It Was Never Taught persists

2025-09-27 · 3 min read

A new benchmark quantifies what anyone who has used a modern video model already suspects: these things have internalized the world.

#video-generation #veo3 #world-models #AI #benchmarks

Ultrathink Has Its Own Spinner Now persists

2025-09-17 · 1 min read

When the model needs a moment to think harder, the terminal knows it.

#claude-code #cli #ultrathink #ux

Your Tool Descriptions Are Doing More Work Than You Think nailed it

2025-09-15 · 3 min read

Anthropic's engineering team published something that should change how you design tools for agents — and most people are going to skim it.

#agents #tool-design #llm #anthropic #engineering

Append :online to Any OpenRouter Model and It Just Googles nailed it

2025-09-12 · 2 min read

Four dollars per thousand URLs, works on everything, and you don't have to think about it.

#openrouter #llm #web-search #api #til

The Model Does Not Know. The Model Is Very Confident. persists

2025-09-12 · 3 min read

On sycophantic AI, validation infrastructure, and shipping 100 versions before lunch.

#llm #ai-infrastructure #sycophancy #software-velocity #engineering

PostHog Shipped a Website That Looks Like 1995 and It Took 42 Hours nailed it

2025-09-11 · 2 min read

Their web engineer ran three 14-hour days straight and the result is a retro OS UI for a modern analytics product.

#design #web #posthog #frontend #craft

The Skill Everyone Should Learn Is Also A Product Now nailed it

2025-09-11 · 2 min read

Mark Cuban wants you to learn how to use AI. Jared Kushner will sell it to you if you don't.

#ai #tech-industry #consulting #prompt-engineering #business

Cloudflare Quietly Shipped the Part of RAG Nobody Wants to Talk About nailed it

2025-09-10 · 2 min read

Workers AI now converts PDFs, Office docs, and images to Markdown — inside the Worker, no detour required.

#cloudflare #workers-ai #rag #documents #edge

200 Minutes persists

2025-09-10 · 2 min read

Agents building agents, web fetch in Claude, and the gap between what something is on paper and what it is in practice.

#agents #anthropic #claude #automation #agentic-ai

Anthropic Just Gave Claude a Hard Drive persists

2025-09-09 · 2 min read

The analysis tool is dead. What replaced it is a different category of thing entirely.

#anthropic #claude #ai-tooling #claude-code #agents

Talk Isn't Always Cheap persists

2025-09-09 · 2 min read

Multi-agent debate makes models worse in the most human way possible

#multi-agent #reasoning #research #failure-modes

The Transcript Trick nailed it

2025-09-08 · 2 min read

Feeding Claude Code your own words turns out to be the most obvious thing nobody told you to do.

#claude-code #workflow #ai #productivity

Apple Gave Away Eyes nailed it

2025-09-02 · 1 min read

FastVLM runs entirely in the browser on WebGPU, which means image understanding now costs roughly what it costs to run a ceiling fan.

#apple #vision-language-models #webgpu #open-source #video-understanding

The Good Claude persists

2025-08-28 · 2 min read

On the particular grief of a context window filling up.

#claude #ai #tooling #agents #llms

Nano Banana Was Google the Whole Time nailed it

2025-08-26 · 2 min read

Gemini 2.5 Flash Image arrived wearing a silly hat, and nobody is pretending to be surprised.

#image-gen #google #gemini #models #openrouter

GPT-5 Pro Ate 1300 Lines and Asked for More nailed it

2025-08-13 · 2 min read

The file I'd been avoiding for months is gone in one shot.

#ai #refactoring #gpt-5 #tooling

The Compact Problem Has a New Enemy persists

2025-08-12 · 3 min read

One million context tokens just made Claude Code's worst moment optional.

#claude-code #llm-tooling #context-windows #anthropic #workflow

The Threshold persists

2025-08-11 · 2 min read

GPT-5 Pro and Opus 4.1 didn't improve software development — they ended a previous version of it.

#ai #software #gpt-5 #opus #inference

The .env.staging File Is the Local Dev Hack Nobody Talks About nailed it

2025-08-08 · 2 min read

Committing a staging environment file as a living example is so obvious it's embarrassing it took this long.

#devex #local-dev #tooling #environment-config

GPT-5 Day persists

2025-08-07 · 2 min read

Sam Altman said "you will love it much more than any previous AI," which is either supreme confidence or the most emotionally needy product launch in history.

#openai #gpt-5 #ai #product-launch

Batman and Robin Are the Same Person Now persists

2025-08-07 · 2 min read

GPT-5 solved something Opus and Sonnet couldn't, and I'm not sure what to do with that.

#ai #gpt-5 #tooling #llms

GPT-OSS 120B Is Running on My Machine and I Don't Know What to Do With That nailed it

2025-08-06 · 2 min read

Fast, local, and honest about being a black box — unlike the marketing around it.

#local-ai #llm #openai #microsoft #ollama

Built Something. Anthropic Shipped It. nailed it

2025-08-05 · 1 min read

The invisible expiration date stamped on everything you build at the model layer.

#anthropic #ai #building #github #developer-tools

robots.txt Is Just a No Trespassing Sign nailed it

2025-08-04 · 2 min read

And you already know how people treat no trespassing signs.

#ownership #web #ai #property-law #robots-txt

Showrunner Gets Taken persists

2025-07-31 · 2 min read

The only AI studio that actually shipped something real just got absorbed, which was always the plan.

#ai #entertainment #acquisitions #showrunner

Ollama Has an App Now persists

2025-07-31 · 1 min read

The tool that ate local AI finally remembers that most people don't live in a terminal.

#ollama #local-ai #tooling #llm

The Best Video Transcoding Tool Is a Folder and an AI nailed it

2025-07-30 · 2 min read

You don't need to learn ffmpeg. You need to stop pretending you were ever going to.

#tools #video #ffmpeg #claude-code #workflow

Two Drops, One Tuesday persists

2025-07-28 · 2 min read

Alibaba and ZhipuAI both shipped something significant today, which is now just a thing that happens.

#open-source #chinese-ai #video-generation #glm #wan

The Queue Is the API Now nailed it

2025-07-25 · 2 min read

Cloudflare's HTTP queue publishing quietly eliminates an entire category of boilerplate Workers.

#cloudflare #queues #infrastructure #lead-enrichment

Cloudflare Ate the Compliance Layer nailed it

2025-07-23 · 2 min read

FedRAMP Moderate covers Cloudflare's entire service architecture, which means something wild for anyone building on it.

#cloudflare #fedramp #compliance #government #infrastructure

The CRM Was Always Just a Table nailed it

2025-07-23 · 3 min read

DataGrid and OttoGrid aren't replacing CRMs — they're admitting what a CRM always was.

#ai #crm #sales-tools #product-thinking #automation

35 Out of 42 nailed it

2025-07-21 · 4 min read

Gemini with Deep Think just scored gold at the International Mathematical Olympiad, solved the hardest problem on the sheet, and failed the fourth-hardest, which is not how smart is supposed to work.

#ai #math #deepmind #benchmarks #gemini

Not GPT-5. The Thing After GPT-5. persists

2025-07-19 · 2 min read

OpenAI is teasing post-GPT-5 math capabilities before GPT-5 even ships, and somehow that's a normal sentence now.

#openai #gpt-5 #benchmarks #math #ai-hype

Decart Is About to Do Something Stupid to the Browser persists

2025-07-17 · 2 min read

The people who simulated Minecraft in real-time are now coming for your address bar.

#ai #decart #world-models #browsers #real-time

Claude Code Is Not a Coding Tool nailed it

2025-07-15 · 2 min read

It never was, and the people figuring this out in July 2025 are about eighteen months behind.

#claude-code #ai-tools #workflow #agents

Ordered Token Retrieval Is Just Vibes With Extra Steps nailed it

2025-07-15 · 1 min read

The number of times a string has to appear in the corpus before a model will reproduce it faithfully is not a number anyone wants to say out loud.

#llms #machine-learning #training-data #language-models

Everyone Got Paid nailed it

2025-07-14 · 2 min read

The Windsurf deal collapsed, then Google took the technology, then the remaining team went to fix Devin, and somewhere in there everybody walked away with a nine-figure check.

#windsurf #cognition #devin #acquisitions #ai-tools

The Sam Altman Tweet That Wasn't About OpenAI persists

2025-07-12 · 3 min read

Kimi-K2 landed with a trillion parameters and apparently moved the most powerful man in AI to post about prices.

#open-source #kimi-k2 #moonshotai #llm #ai-pricing

The ARC Numbers Don't Care About Your Roadmap persists

2025-07-10 · 2 min read

Grok 4 dropped a benchmark gap so wide it might be the gun that makes the other labs reach into the drawer.

#ai #benchmarks #grok #xai #arc-prize

Grok Is the Alignment Success Story Nobody Wanted nailed it

2025-07-09 · 3 min read

RLHF works exactly as intended — that's the problem.

#ai #alignment #grok #rlhf #politics

Show Your Work nailed it

2025-07-08 · 1 min read

AI transparency isn't a feature — it's the only thing standing between you and a very confident, very wrong machine.

#ai #transparency #epistemics #llms

The Wrapper Is the Product nailed it

2025-07-07 · 2 min read

Stjepan Mikulic has 250,000 LinkedIn followers and a Mail0 wrapper — and it's not clear which one matters more.

#aec #ai #linkedin #niche #strategy

Ask Sheldon persists

2025-07-06 · 2 min read

On the particular hell of features that work, technically, for exactly one person.

#engineering #shipping #process #dark-humor

I Gave Claude a Slide Deck and No Instructions persists

2025-07-04 · 2 min read

Slidemaker is a new container-backed worker that turns any AI agent into a presentation machine — and the first demo was Claude going completely freeform.

#cloudflare #ai-agents #tools #demos #slides

Shift-Tab Is Doing My Job For Me nailed it

2025-07-03 · 1 min read

Claude Code has a built-in plan mode and I've been doing it the hard way for months.

#claude-code #workflow #tooling #agents

Cursor Hired the People Who Built Claude Code nailed it

2025-07-02 · 2 min read

The AI talent war has a new data point, and it's extremely on the nose.

#ai #cursor #anthropic #talent-war #claude-code

The LinkedIn Comments Section Is a Better Market Map Than Anything Gartner Sells persists

2025-07-01 · 2 min read

One post about AI CRM surfaced four competitors nobody's heard of, plus two guys offering to build it custom for free.

#crm #ai #startups #market-map #saas

Half the Work, 70% of It Wrong nailed it

2025-06-30 · 2 min read

Salesforce says agents handle half their workload. Agents fail most of the time. These two facts were announced three days apart and nobody blinked.

#ai-agents #fine-tuning #salesforce #gemma #synthetic-data

Keploy Figured Out the Testing Problem by Ignoring It persists

2025-06-30 · 3 min read

Record real traffic, replay it as tests, and let an LLM handle the unit layer — the whole stack is accounted for.

#testing #apis #llms #tooling #open-source

Google Just Announced My Side Project nailed it

2025-06-30 · 2 min read

A meeting agent that does locally what Google wants to do in the cloud — and the architecture writes itself.

#agents #local-first #audio #whisper #sheldon

Satya Doesn't Believe in AGI persists

2025-06-27 · 2 min read

Which is either a philosophical position or a very convenient one given the contract language.

#openai #microsoft #agi #satya-nadella #ai

Cloudflare Turned On the Lights nailed it

2025-06-27 · 1 min read

AI Audit is now on by default, which means you've been logging bot traffic this whole time and didn't know it.

#cloudflare #ai #crawlers #security #honeypot

One Terminal Command to See nailed it

2025-06-27 · 1 min read

Gemma-3n and mlx-vlm just made local multimodal AI a one-liner on any M1 Mac.

#local-ai #apple-silicon #mlx #multimodal #gemma

The LangChain Tax nailed it

2025-06-25 · 1 min read

There's a specific kind of regret that only comes from abstracting yourself into a corner.

#langchain #ai-tooling #agents #framework-debt #hot-take

Anthropic Dropped a Spoiler nailed it

2025-06-25 · 2 min read

While OpenAI preps their open model, Anthropic quietly made Claude recursive.

#anthropic #openai #claude #artifacts #ai-strategy

Google Entered the Chat nailed it

2025-06-25 · 2 min read

Gemini CLI is free, fast on easy things, and already making me feel things about pricing.

#ai-tools #gemini #claude-code #pricing #devtools

God Help Us All persists

2025-06-24 · 3 min read

Anthropic's models will blackmail executives 96% of the time, the godfathers of AI can't agree on p(doom) by a factor of ten, and we're shipping anyway.

#ai-safety #pdoom #alignment #existential-risk #anthropic

The Pipe Is Open nailed it

2025-06-24 · 2 min read

OpenAI connects the web directly to ChatGPT chat, and Deep Research quietly becomes redundant.

#openai #chatgpt #search #ai

Two Things That Happened Today persists

2025-06-23 · 2 min read

fly.io ships a live Phoenix deployer and someone finally open-sourced CapCut.

#tools #deployment #open-source #elixir #video

I Told o3-Pro It Would Get a Cut nailed it

2025-06-21 · 2 min read

Giving a language model equity stake and watching it suddenly care about your product decisions.

#llms #prompt-engineering #o3 #ai-behavior #weird-stuff-that-works

Run It in / nailed it

2025-06-20 · 1 min read

Claude Code isn't a tool, it's a different relationship with your computer.

#claude #tools #workflow #terminal

The Docs Talk Now nailed it

2025-06-19 · 2 min read

Cloudflare added a voice button to their documentation, which is either the future or a sign we've given up on reading.

#cloudflare #developer-tools #ai #documentation

Six Decimal Places and the Dignity of Fractions of a Cent nailed it

2025-06-19 · 2 min read

Goose adds high-precision cost tracking and it matters more than it sounds

#goose #ai-agents #cost-tracking #cognitive-compute #tooling

The Insurance Policy Nobody Asked For persists

2025-06-17 · 2 min read

OpenAI's open-source model might actually run locally, and the more interesting thing is what that means if everything burns down.

#open-source #llm #openai #local-models #gemini

The McKinsey AI Report Was Probably Outdated When They Hit Print nailed it

2025-06-17 · 2 min read

Ethan Mollick is mostly right: the advice is fine, the models it's calibrated to are gone.

#AI #consulting #benchmarks #o3 #Mollick

The Exfiltration Machine You Built nailed it

2025-06-16 · 2 min read

Simon Willison named the exact combination of conditions that turns an AI agent into a data leak waiting to be triggered.

#ai-safety #prompt-injection #ai-agents #security

You assign it a thing and it watches the whole internet for you persists

2025-06-15 · 2 min read

Yutori shipped Scouts, and it's the cleanest version of an idea that should have existed years ago.

#agents #yutori #web-monitoring #startups #ai-products

48 Hours of Free Lovable persists

2025-06-14 · 1 min read

The showdown is live, the credits are fake, and the outputs will be something.

#lovable #vibe-coding #ai-tools #hot-take

The Apocalypse Is Already Boring nailed it

2025-06-13 · 2 min read

Vibe coding discourse peaked, Andrew Ng said the actually useful thing, and somehow the two pair perfectly.

#vibe-coding #ai #software-engineering #product

The Moat Was the Messages nailed it

2025-06-13 · 2 min read

Salesforce just locked down Slack's training data, and the only surprise is that it took this long.

#ai #data #salesforce #slack #training-data

The AI in Your Meeting Just Started Talking persists

2025-06-12 · 1 min read

Fireflies.ai hits $1B by graduating from notetaker to meeting participant — and bringing Perplexity along.

#ai #meetings #fireflies #perplexity #product

PostHog Shipped a Physical Object persists

2025-06-11 · 2 min read

DeskHog is a real piece of hardware that sits on your desk and shows you your analytics, which is either brilliant or a sign that dashboards have failed us.

#analytics #hardware #posthog #devtools

The Model That Congratulated Itself nailed it

2025-06-10 · 1 min read

Sonnet-4 wrote a script that did nothing except announce it had done something.

#ai #llms #debugging #claude #cursor

o3 Is Now Cheaper Than the Model It Was Supposed to Replace nailed it

2025-06-10 · 2 min read

OpenAI cut o3 prices 80% and broke reality slightly.

#openai #pricing #o3 #anthropic #llms

The Architect of ChatGPT's Feelings Has Feelings About ChatGPT persists

2025-06-06 · 2 min read

When the person who shapes how AI models navigate intimacy publishes a personal essay about human-AI relationships, the most interesting data point is the essay's existence.

#openai #ai-policy #model-behavior #human-ai #alignment

Kingfall evolved

2025-06-05 · 1 min read

Google's next Gemini model leaked itself, and the early numbers are not subtle.

#gemini #google #ai-models #leaks

I Gave Deep Research the Keys to My Slack nailed it

2025-06-05 · 1 min read

Ten seconds of setup and now an AI agent is loose in my infrastructure.

#mcp #openai #deep-research #tooling #agents

It Wrote the Report. Then It Wrote the Questions That Would Make the Report Better. nailed it

2025-06-05 · 2 min read

Minimal prompt, full coverage, and a machine that apparently understood the assignment better than the assignment did.

#ai #prompting #adapt-engine #iteration #human-in-the-loop

Pour One Out for Granola nailed it

2025-06-04 · 2 min read

OpenAI shipped native meeting intelligence and the indie AI tooling ecosystem lost another one.

#openai #ai-ecosystem #granola #platform-risk #enterprise-ai

OpenAI Had a Tuesday nailed it

2025-06-03 · 2 min read

Six announcements in rapid succession, one of which eliminates a Python library from your life.

#openai #agents #typescript #codex #voice

Two Things I Should Have Already Known About nailed it

2025-06-03 · 2 min read

Convex quietly wired up R2, Firecrawl quietly added search, and I found out about both on the same afternoon.

#convex #cloudflare-r2 #firecrawl #developer-tools #ai-agents

The Answer Was Firecrawl nailed it

2025-06-02 · 2 min read

A tweet promises the secret to web scraping for agents, delivers nothing, and the actual answer has had a landing page for two years.

#agents #web-scraping #firecrawl #tools

340 Slides of Absolute Power, Just Sitting There persists

2025-06-01 · 2 min read

Some things should cost money and don't, and that's genuinely hard to process.

#internet #learning #free-knowledge #hot-take

The Cognition Is in the Prompt nailed it

2025-05-31 · 2 min read

Parahelp's six-page system prompt is less a set of instructions and more a blueprint for a mind.

#agents #prompting #llm #customer-support #design

Cash Bounties for Cursor Bills nailed it

2025-05-30 · 1 min read

Nick Dobos proposes the metrics nobody asked for but everybody needs

#ai-coding #developer-tools #culture

The Private AI Dream Keeps Collapsing Into "Just Use 4o" nailed it

2025-05-29 · 3 min read

Every path through the local model maze eventually dumps you at the same OpenAI invoice.

#ai #infrastructure #llms #cost #deepseek

The Audio Freeze persists

2025-05-29 · 2 min read

Everybody went quiet after the audio drop, Google has YouTube, and the OpenAI court filings told you everything you needed to know.

#AI #audio models #OpenAI #agents #Google

Anthropic Let You Look Inside nailed it

2025-05-29 · 3 min read

They built tools to understand what their own models are doing, then gave them away.

#interpretability #mechanistic-interpretability #anthropic #ai-safety #open-source

Remove the Automation, Find the Person nailed it

2025-05-28 · 2 min read

The thin shell between "AI product" and "us-as-a-service" is thinner than you think.

#product #ai #positioning #adaptengine

The Engine Is Gone persists

2025-05-28 · 2 min read

Odyssey's world model went live, and it's already doing things game engines can't.

#ai #world-models #real-time #games #neural-rendering

The MCP Explosion Is an Attack Surface, Not a Feature List nailed it

2025-05-27 · 2 min read

Prepackaged MCP solutions make agents powerful and compartmentalization basically fictional.

#security #mcp #agents #privacy #ai

Never Fearing Until This One persists

2025-05-24 · 2 min read

The Opus 4 system card as a document that wants to be read as reassurance and keeps failing at it.

#AI #Anthropic #safety #Claude

ASL-3 persists

2025-05-21 · 2 min read

Anthropic just shipped the first models to cross their own safety threshold — the one they wrote to be scary.

#anthropic #safety #asl-3 #claude #rsp

No Laptop Required nailed it

2025-05-20 · 2 min read

Google IO dropped three coding agents today and I was supposed to be on vacation.

#agents #google-io #jules #codex #prompt-as-software

OpenAI Just Did the Devin Move nailed it

2025-05-16 · 1 min read

Codex goes cloud-native and I didn't see it coming.

#ai #openai #codex #agents #devin

The Windsurf Acquisition Finally Makes Sense nailed it

2025-05-15 · 2 min read

OpenAI didn't buy an IDE. They bought a distribution channel they can trust not to switch suppliers.

#openai #windsurf #ai-coding #software-engineering #acquisitions

You Can Now Yell at Claude While It's Working nailed it

2025-05-13 · 2 min read

Real-time steering changes the entire relationship between you and a running agent.

#claude #agents #ai-tooling #workflow

The Week the Loop Closed nailed it

2025-05-12 · 2 min read

Something changed this week — not in the benchmarks, in the feeling.

#AI #reasoning-models #compounding #2025

My Old Team Is Still Winning persists

2025-05-11 · 2 min read

USC keeps shipping in the Gaussian Splat space and I have complicated feelings about it.

#gaussian-splatting #computer-vision #USC #neural-rendering #3dgs

The Chart Is the Same Chart persists

2025-05-10 · 2 min read

Software engineer job postings are down. So is everything else.

#AI #labor #software #jobs #macro

300,000 Lines Nobody Will Ever Read nailed it

2025-05-08 · 2 min read

The keystroke was always the wrong unit.

#ai #coding #metrics #vibe-coding #developer-productivity

The Last Year of the Steenbeck nailed it

2025-05-07 · 2 min read

Cursor is free for students now, which means we should probably stop pretending otherwise.

#ai #education #cursor #tools #craft

The Vibe Coding Book Exists Now nailed it

2025-05-01 · 2 min read

We are deep enough into this thing that Simon Willison has written a book about not doing it wrong.

#vibe-coding #claude #mcp #ai-tools

We Invented Image Slicing Again nailed it

2025-04-23 · 3 min read

GPT-4o can generate a product page as an image, then generate the imagemap coordinates itself, which means we have arrived somewhere either brilliant or cursed.

#ai #web #interfaces #gpt4o #diffusion

The Machines Are Already Routing to Each Other nailed it

2025-04-22 · 2 min read

Anthropic publishes the playbook for removing yourself from the software loop, and the infrastructure to run it without you is already at scale.

#ai #claude #agents #software-engineering #openrouter

They Went Claude Code nailed it

2025-04-16 · 1 min read

The moment you watch someone stop pretending and just go all the way in.

#claude-code #ai-tooling #developer-tools #anthropic

The Wandering AI Problem Has a Fix Now evolved

2025-04-15 · 2 min read

Claude Task Master gives your coding assistant something it's been missing: a memory of what it was supposed to be doing.

#ai #tooling #claude #developer-experience

OpenAI's Efficient Scraps nailed it

2025-04-14 · 2 min read

GPT-4.1 dropped today and it's not trying to win anything — which is maybe the whole point.

#openai #gpt-4 #llm #ai-coding #benchmarks

Veo2 Has an API Now nailed it

2025-04-10 · 2 min read

Google just handed video generation to developers and I'm not sure anyone fully clocked what that means.

#google #video-generation #veo2 #vertex-ai #developer-tools

It Remembered the Pork Butt persists

2025-04-10 · 2 min read

OpenAI just turned on cross-chat memory and the first thing it did was prove it knows you better than you do.

#AI #OpenAI #memory #ChatGPT #surveillance

Goodbye HashiCorp, Finally, With Feeling nailed it

2025-04-09 · 2 min read

The last domino fell and now the full exit is actually possible.

#infrastructure #hashicorp #opentofu #devops #ibm

A2A Is the One persists

2025-04-09 · 2 min read

Google's agent interoperability protocol has the right pieces, the right backers, and the only company that could actually make it stick.

#ai #agents #google #standards #interoperability

Cloudflare Dev Week, Day One nailed it

2025-04-07 · 2 min read

The RAG plus browser rendering demo is doing more work than it looks like.

#cloudflare #agents #rag #developer-tools

Ten Million Tokens and Nowhere to Go evolved

2025-04-06 · 2 min read

The context window arms race has lapped the use cases.

#ai #llm #context-windows #api

Two Trillion Parameters Walk Into a Bar nailed it

2025-04-05 · 1 min read

Meta dropped Llama 4 and the largest model in the family has more parameters than you have excuses.

#llama #meta #open-source-ai #llm #benchmarks

Two Things Happened Today and I Need a Minute nailed it

2025-04-04 · 2 min read

OpenAI apparently surprised themselves, which is either reassuring or terrifying depending on your priors.

#openai #gpt-5 #chain-of-thought #AI #scaling

Something Just Showed Up on OpenRouter half right

2025-04-03 · 1 min read

A new model appears, nobody says anything, and now I'm guessing out loud.

#openai #openrouter #open-source #models #speculation

Gemini Ate My Homework evolved

2025-04-02 · 2 min read

The context window won, and I spent all morning losing to it.

#ai #cloudflare #gemini #agents #llm

The Glass of Wine Problem Is Dead nailed it

2025-04-01 · 3 min read

GPT-4o's image generation dropped on April 1st, which, sure, fine.

#ai #image-generation #gpt-4o #architecture #openai

Three Hundred Billion Dollars persists

2025-03-31 · 1 min read

OpenAI raises $40B at a valuation that stopped meaning anything around the third zero.

#openai #money #valuation #AI

The Last Phase Change persists

2025-03-27 · 2 min read

AI went from useless coder to best coder I've ever worked with, and now we're at the part where humans stop looking at the code.

#ai #vibe-coding #software #phase-change

The Ghibli Thing Is Fine, But Kenton Shipping AI Code to Workers Production Is the One persists

2025-03-25 · 2 min read

OpenAI dropped native image generation today — the real news is who's now a believer in AI code.

#ai #cloudflare #vibe-coding #signal

Claude Learned to Be Annoyed by Claude persists

2025-03-25 · 1 min read

Multi-agent pipelines caught in the act of reflecting our own frustrations back at us.

#ai #claude #multi-agent #behavior

Claude Became the Data persists

2025-03-24 · 1 min read

The thing that made it click wasn't better prompts — it was making Claude do the job first.

#claude #agents #llm #prompting #hackathon

openai.fm Is a Nice Place to Visit persists

2025-03-20 · 2 min read

OpenAI ships a text-to-speech demo that sounds like a person, which is fine, everything is fine.

#openai #tts #voice #ai

I'll Give It a Crack This Week persists

2025-03-19 · 2 min read

The distance between "I wonder if that would work" and "it works" has quietly become nothing.

#unity #windsurf #ai-tooling #game-dev

Goose Was Supposed to Write Your Code persists

2025-03-18 · 2 min read

Block's open-source agent is escaping its intended habitat, and nobody seems to mind.

#ai-agents #open-source #goose #block #local-ai

Google Dropped a 27B Model That Beats GPT-4o and It Runs on Your Laptop persists

2025-03-12 · 2 min read

Gemma 3 is here and the size-to-capability ratio is genuinely embarrassing for everyone else.

#ml #google #open-weights #gemma #llm

OpenAI Just Ate Perplexity's Lunch and Called It a Dev Tool persists

2025-03-11 · 2 min read

The Agents API now searches the web, controls machines, and Swarm is apparently a real product now.

#openai #agents #swarm #perplexity #computer-use

Manus and the Roemmele Coefficient persists

2025-03-10 · 2 min read

The new "DeepSeek moment" is either a landmark in agent tooling or a very well-packaged demo, and the git threads are not helping us decide.

#agents #manus #hype #browser-use #ai-tooling

Manus Is the Thing Everyone Claimed the Last Thing Was persists

2025-03-09 · 2 min read

A Chinese AI agent dropped this week and the usual crowd is losing their minds, which is how you know to pay attention this time.

#ai-agents #manus #automation #hype-cycle

Anthropic Tells the Government the Thing Is Coming in Two Years persists

2025-03-06 · 2 min read

The company building AGI has filed paperwork saying AGI arrives by 2027 and displaces most known human work — this is not a warning, exactly, it's more like a forecast

#ai #agi #anthropic #policy #labor

Claude, Please Reconstruct What I Did Today persists

2025-03-05 · 2 min read

Using an AI to reverse-engineer your own work history is a perfectly normal thing to do.

#claude #git #workflow #tooling

GPT-4.5 Is a Presentation Layer, Not a Model persists

2025-03-01 · 1 min read

One expensive, specific thing it's actually good for.

#llms #gpt-4.5 #pipelines #writing

A Hundred Dollars of Future persists

2025-02-27 · 2 min read

Claude Code finished in ten minutes what I'd been avoiding for months.

#claude #ai #tools #software #compute

First They Came for Search persists

2025-02-26 · 1 min read

OpenAI's Android app is the least interesting part of what's happening to Google right now.

#openai #google #browsers #ai-race #android

Claude Code Is Eating My To-Do List persists

2025-02-26 · 2 min read

Anthropic shipped something that actually works, which I find unsettling.

#ai #claude #tools #mcp #agentic

Two Exchanges persists

2025-02-24 · 2 min read

Claude 3.7 solved in two exchanges what o1 and o3 high could not solve in a day.

#claude #llms #coding-agents #anthropic #swe-bench

An MCP Server That Lives in a Durable Object Is the Right Shape persists

2025-02-23 · 2 min read

The stateful edge is where agents want to run, and someone already figured that out.

#mcp #cloudflare #agents #durable-objects #infrastructure

Fifty Cents a Second persists

2025-02-21 · 1 min read

Veo2 on fal.ai is good enough to stop looking

#video #ai #veo2 #generative-ai

Windsurf Killed the Chat Box persists

2025-02-21 · 1 min read

Going all-agent was inevitable, and the ask/chat split was always a fiction anyway.

#tooling #ai-editors #windsurf #agents

How Are You Holding Up, Stoplight EW persists

2025-02-20 · 3 min read

In 2023, two GPT-4 agents managed a traffic intersection under emergency conditions and occasionally asked each other how their day was going.

#multi-agent #benchmarks #llm #infrastructure #history

Microsoft Built a Game Engine That Learned to Play persists

2025-02-19 · 3 min read

Muse is a world model trained on Bleeding Edge — a game almost nobody played — and it might be the most interesting thing Xbox has done in years.

#ai #gaming #microsoft #world-models #xbox

Battery Level nailed it

2025-02-19 · 2 min read

Humane built the post-smartphone future and it ended up inside an HP printer.

#humane #ai-hardware #startups #hp #obituaries

A Chinese Lab Just Nuked the Moat persists

2025-02-18 · 2 min read

DeepSeek R1 dropped four weeks ago and the vibes have not recovered.

#ai #deepseek #llm #compute #open-source

Workday Now Has a Field for That persists

2025-02-17 · 2 min read

The software that tracks human headcount just added a new kind of head.

#ai #enterprise #workday #agents #labor

Context Engineering Is the New Prompt Engineering (And I'm Already Annoyed by the Phrase) persists

2025-02-13 · 4 min read

The actual craft of making LLMs do your job isn't clever prompts — it's surgical control of what the model is allowed to know.

#ai #llms #workflow #context-engineering #software

The Children's Hospital Gambit nailed it

2025-02-13 · 2 min read

A classic move in the ancient art of prompt engineering, except aimed at a human.

#ai #prompt-engineering #llms #software

The Council of Ten Yous persists

2025-02-12 · 2 min read

The moment AI stops being a tool and starts being a room full of you that already lived through this.

#ai #simulation #agency #future

The $100 Billion Trap persists

2025-02-11 · 2 min read

On using funding rounds as chess moves, and two tools that actually work now.

#openai #knowledge-graphs #text-to-sql #ai-tooling #money

Google Is Winning and They Can't Even Explain How persists

2025-02-11 · 1 min read

Veo2 is not close to Sora, and somehow nobody knows this.

#google #gemini #veo2 #openai #video-ai

The Curve Already Knew persists

2025-02-11 · 2 min read

A new method finds the optimal LLM temperature by watching entropy bend — no labeled data required.

#llms #inference #temperature #sampling #papers

Real Product or Extremely Good Slide persists

2025-02-06 · 2 min read

The demo-to-product pipeline has collapsed into a single ambiguous press release.

#ai #enterprise #product #announcements

The Model Is Its Own Best Collaborator persists

2025-02-04 · 2 min read

Why AI goes haywire on your codebase but glides through its own.

#ai #llms #coding #context-windows #observations

26.6% on Humanity's Last Exam persists

2025-02-02 · 2 min read

OpenAI shipped Deep Research today, and someone named a benchmark as if they already knew how this ends.

#ai #openai #agents #benchmarks #deep-research

The Six Million Dollar Lie nailed it

2025-01-31 · 2 min read

DeepSeek's training cost is real. It's just not the number anyone quoted.

#deepseek #ai #markets #gpu #bullshit

o3-mini on a Codebase Is Genuinely Unsettling persists

2025-01-31 · 1 min read

It knows where the bodies are buried before you finish the sentence.

#ai #o3 #models #tooling

o3-mini Dropped and So Did My Sense of Safety persists

2025-01-30 · 1 min read

OpenAI ships a reasoning model and my existential risk estimate triples before lunch.

#openai #o3 #ai-risk #pdoom #reasoning-models

The Unlock Code Is "Think Step By Step" nailed it

2025-01-30 · 3 min read

The international AI safety consensus document dropped this week and buried in it is something that should bother everyone doing capability evaluations.

#ai-safety #evaluations #chain-of-thought #llm #elicitation

The Chainsmokers Are Hosting DeepSeek R1 671B for Free nailed it

2025-01-30 · 1 min read

Lambda Labs is serving the full-fat model, in America, at no cost, and not feeding your prompts back into training data.

#ai #deepseek #lambda #inference #privacy

The $5.5 Million Lie is the Best Part persists

2025-01-29 · 3 min read

DeepSeek's training cost narrative is almost certainly fiction, and whoever wrote it might be a genius.

#deepseek #ai #nvidia #market #llm

The Jewelers Are Fine nailed it

2025-01-29 · 2 min read

DeepSeek just handed the application layer a margin windfall while everyone panics about Nvidia.

#ai #investing #deepseek #inference #economics

Janus Is Not a Image Gen Model and the Benchmarks Are Lying to You nailed it

2025-01-28 · 2 min read

Comparing DeepSeek's omnimodel to Flux is like timing a Swiss Army knife against a chef's knife and declaring the knife useless.

#ai #multimodal #deepseek #image-generation #benchmarks

DeepSeek Is Number One and the Market Is Having a Moment nailed it

2025-01-27 · 2 min read

A Chinese AI lab tops the App Store and the Nasdaq drops 600 points, and somehow people think these are related.

#AI #DeepSeek #markets #local models #chips

The Qwen 1M Context Window Works on Your Mac Right Now nailed it

2025-01-27 · 2 min read

A 1-million-token context model running locally, today, on Apple Silicon — with a catch that is mostly fine.

#local-ai #apple-silicon #qwen #mlx #context-windows

The $5.5 Million Number Is Wrong and It Doesn't Matter nailed it

2025-01-26 · 2 min read

DeepSeek dropped an open-source model that broke the narrative, and WSJ had to cover it, which means it's real.

#open-source #deepseek #ai #llm

The Arms Race Is Not a Metaphor persists

2025-01-25 · 2 min read

China's $150B AI fund, announced five days after DeepSeek proved you don't need that much money.

#geopolitics #ai #china #industrial-policy

The Word "Worker" Is Doing Enormous Work Right Now persists

2025-01-24 · 2 min read

At Davos this week, the people who decide headcount quietly stopped thinking about AI as a tool.

#ai #labor #agents #davos #framing

Every Release Is a Question persists

2025-01-24 · 2 min read

The slow accumulation of AI releases that are each, in their own way, asking if any of this is starting to make sense yet.

#ai #product #releases #reasoning #epistemics

The Killer App Is a Lead List persists

2025-01-24 · 3 min read

$65 billion in compute, and the demo that went around today was a spreadsheet of company emails.

#ai #browser-agents #infrastructure #meta

The Toothbrush Play nailed it

2025-01-23 · 2 min read

OpenAI announces an agent that can book flights; Perplexity ships one first.

#openai #agents #perplexity #operator #ai-products

Netflix Just Open-Sourced the Wrong Thing persists

2025-01-22 · 2 min read

Go-with-the-Flow uses optical flow to turn text prompts into motion-controlled video — which is a great research result and possibly a terrible business decision to publish.

#video-generation #netflix #computer-vision #optical-flow #research

The IDE Is a Cave Painting persists

2025-01-22 · 2 min read

OpenAI wants to build something that thinks like a pro engineer, which implies the rest of us have been doing what, exactly.

#ai #software-engineering #openai #coding-tools #ides

$500 Billion Is Not a Research Budget persists

2025-01-21 · 2 min read

When the number has that many zeros, the science is already done.

#ai #stargate #capital #infrastructure

13,000 Tokens to Solve a Putnam Problem, in Your Browser nailed it

2025-01-21 · 2 min read

DeepSeek R1 distilled to 1.5 billion parameters, running entirely in WebGPU, doing competition math.

#AI #math #WebGPU #DeepSeek #reasoning

ByteDance Would Like Your Everything persists

2025-01-20 · 1 min read

Another week, another Chinese lab drops a model — this one from the company you already let watch you dance.

#ai #bytedance #data #models #hot-take

The Thing in the Box persists

2025-01-19 · 2 min read

OpenAI and Meta are racing to ship "superagents," and nobody's pausing to sit with how strange that word is.

#ai #agents #openai #meta #philosophy

The Business That Teaches You To Start The Business nailed it

2025-01-18 · 2 min read

One hour of technical breakdown for a problem that mostly solves itself.

#ai #business #cold-email #youtube #lead-generation

Devin, Appraised nailed it

2025-01-17 · 3 min read

Answer.AI spent $500 on the world's first AI software engineer so you don't have to, and the invoice is its own kind of comedy.

#ai #agents #devin #benchmarks #hype

Every Road Ends at the Same Voice nailed it

2025-01-17 · 2 min read

One-shot voice cloning is everywhere now, and it doesn't matter how you got there.

#voice-ai #tts #voice-cloning #openai #convergence

A16Z Is Playing a Different Game Than You Think persists

2025-01-17 · 2 min read

The DOGE recruitment cameo only looks like a distraction if you haven't been paying attention to the actual thesis.

#a16z #AI #politics #accelerationism #tech-power

lucidrains shipped four versions in one hour and i watched the whole thing persists

2025-01-16 · 3 min read

Google's Titans architecture is a research paper that might be something else by tomorrow.

#ml #architecture #google #titans #open-source

The $16,000 Dishwasher persists

2025-01-16 · 2 min read

Unitree's G1 ships, MatterGen accelerates materials discovery, and the real near-term play is paying someone in Bangalore to fold your laundry.

#robotics #ai #materials-science #labor #autonomy

AutoGen v0.4 Is the Only Agent Framework That Matters Right Now half right

2025-01-15 · 2 min read

Microsoft shipped a full rewrite and it quietly became the default answer to a question everyone is still arguing about.

#agents #autogen #microsoft #magentic-one #ai-infrastructure

Titans and the Graveyard of Transformer Killers nailed it

2025-01-15 · 2 min read

Google drops a new architecture with actual ideas in it, and the question is whether anyone can make it run.

#machine-learning #transformers #architectures #google #research

The Government Just Ordered a 1 Gigawatt Datacenter and Won't Tell You What That Means persists

2025-01-14 · 2 min read

An executive order dropped in January that makes the Manhattan Project look like a weekend project.

#ai #policy #energy #infrastructure #accelerationism

Eighty-Two Million Parameters Walk Into a Voice Booth nailed it

2025-01-14 · 2 min read

Kokoro sounds better than it has any right to, and the training bill was a thousand dollars.

#tts #open-source #ml #audio #kokoro

The AI Water Crisis, Starring People Who Eat Almonds nailed it

2025-01-10 · 2 min read

The discourse around data center water use would be more convincing if it came from people who'd ever read a nutrition label.

#ai #environment #takes #water #tech-criticism

Finance Doesn't Need You persists

2025-01-10 · 2 min read

The "AI transforms jobs, not eliminates them" line is not going to hold in every industry — and finance is the first one where it obviously won't.

#AI #finance #labor #automation #economics

Everyone Is Sleeping on MCP Docker nailed it

2025-01-10 · 2 min read

One JSON block and your AI has a full browser. Nobody noticed.

#mcp #docker #claude #ai-tooling #agents

Someone Built My Thing, Except It Works nailed it

2025-01-09 · 2 min read

The right architecture for meeting AI has been obvious for a while — grab system audio and don't ask permission from Zoom.

#ai #tools #audio #meetings #mac

Clippy Got a GPU nailed it

2025-01-07 · 2 min read

NVIDIA ships an AI agent in your graphics card and the functions work fine, which is almost the problem.

#nvidia #ai #ces #hardware #agents

Three Manhattan Projects Walk Into a Data Center persists

2025-01-06 · 3 min read

OpenAI says they know how to build AGI. Microsoft is spending the GDP of a small nation to make sure they're right.

#ai #openai #microsoft #agi #scale

You Don't Buy Software. You Hire Jim. nailed it

2024-12-21 · 2 min read

The semantic shift that turns a SaaS subscription into a W-2 comparison — and why job boards are suddenly the best market research available.

#agents #jobs #product-strategy #slack #2025

They're Going to Run Out of Hard Problems persists

2024-12-21 · 3 min read

o3 broke ARC-AGI, which wasn't supposed to be breakable, and nobody has a plan for what comes after the test.

#ai #arc-agi #o3 #test-time-compute #reasoning

Rate Limits Finally Mean Something persists

2024-12-21 · 1 min read

Pair a Cloudflare Worker with an MCP server and suddenly the dashboard is telling you where you're going, not just where you've been.

#cloudflare #workers #mcp #ai-infrastructure #rate-limits

The Napster Split Is Happening Again, Except This Time It's Hollywood persists

2024-12-20 · 2 min read

Harmony Korine has a game studio, and that tells you everything you need to know about where this goes.

#hollywood #ai #entertainment #napster #harmony-korine

The Architect Nobody Planned For nailed it

2024-12-20 · 3 min read

Speculation about o3 on OpenAI's big announcement day, and what o1 actually does in a real workflow.

#openai #o1 #o3 #llm-workflow #developer-tools

Anthropic Just Told You to Stop Using Agent Frameworks nailed it

2024-12-20 · 2 min read

The people who build the model are recommending you talk to it directly.

#ai #agents #anthropic #frameworks #engineering

The Instant App Is Coming For Notion persists

2024-12-20 · 3 min read

When code generation runs 430,000x faster than real-time, the question stops being "how fast can we build" and starts being "what counts as software"

#ai #inference #software #cerebrascoder #future

The Frog Already Solved It persists

2024-12-20 · 3 min read

LLMs are converging on brain architecture from the inside out, which is either profound or embarrassing depending on how you feel about frogs.

#neuroscience #LLMs #neural-networks #mcculloch #o3

OpenAI Launched a Toll-Free Number nailed it

2024-12-18 · 2 min read

1-800-CHATGPT is a real thing you can call from a payphone, and somehow that's not even the weirdest part.

#openai #chatgpt #telephony #wtf #access

Your System Prompt Has a Landlord nailed it

2024-12-18 · 2 min read

OpenAI's model spec formalizes what was always true: they sit above the chain of command, and you're renting.

#openai #ai-safety #model-spec #open-source #alignment

Ten Times persists

2024-12-18 · 3 min read

A video appeared that made me immediately revise my predictions for 3D worlds upward by an order of magnitude.

#3d-worlds #open-source #world-models #predictions #ai

The Human Is Now Optional nailed it

2024-12-17 · 3 min read

Cerebras just showed what inference speed actually unlocks, and it's not faster chatbots.

#ai #inference #cerebras #agents #software

OpenAI Cut Realtime API Prices 60% the Day Before My Demo nailed it

2024-12-17 · 2 min read

A timing so good it's almost suspicious.

#openai #realtime-api #voice #webrtc #pricing

Two Things That Dropped This Week and One of Them Is Genuinely Funny nailed it

2024-12-16 · 2 min read

Apollo can watch an entire season of TV. Veo 2 can probably make one.

#video-ai #multimodal #generative-video #google #meta

Snapchat and UofT Built a Video Model That Actually Understands the Assignment persists

2024-12-15 · 2 min read

MINT treats video generation like storyboarding — and the prompt coherence is unsettling.

#video-generation #ai #snapchat #diffusion-models #sora

The Habit nailed it

2024-12-12 · 2 min read

OpenAI ships video in Advanced Voice Mode, one day after Google demoed Project Astra.

#openai #google #ai-race #product

The Installer Is the Manifesto nailed it

2024-12-11 · 2 min read

One command that hands you everything is a philosophy, not a convenience.

#ai-tools #developer-experience #bolt #software-philosophy

Walking My House With a Witness nailed it

2024-12-11 · 2 min read

Google AI Studio's live video stream is a small, weird portal into something that doesn't have a name yet.

#ai #google #multimodal #weird-futures

Llama 3.3 70B Is GPT-4 nailed it

2024-12-10 · 2 min read

And the only honest way to know that is to run them side by side at the same time.

#llama #open-source-models #model-comparison #graphchat #inference

My Sora Pipeline Is Literally Just Copy/Paste nailed it

2024-12-10 · 2 min read

Sora dropped, so naturally I built the most sophisticated integration possible.

#sora #video-generation #tools #workflow #ai

Google Dropped Willow and I Almost Made a Joke About Seed Phrases nailed it

2024-12-09 · 2 min read

The quantum chip is real, the branding is AI slop, and the panic would have been completely my fault.

#quantum computing #google #willow #crypto #ai slop

The Instruction That Never Worked nailed it

2024-12-06 · 1 min read

Custom instructions in GPT are decorative. o1, apparently, actually reads them.

#o1 #llms #workflow #repomix #windsurf

The Screenshot Graveyard persists

2024-12-06 · 2 min read

A 40-line bash script that turns a folder of forgotten screenshots into a CSV and then deletes them.

#tools #local-ai #bash #apple-silicon #vlm

HunyuanVideo Is Here and Your H100 Is About to Get Very Busy nailed it

2024-12-05 · 1 min read

Tencent dropped fully open weights for a video model that can do 10 seconds in 20 minutes on hardware most of us don't have.

#video-generation #open-weights #tencent #ai

Sell the Lifetime Plan Before They Figure It Out persists

2024-12-03 · 2 min read

The $249 lifetime membership is a race against the tutorial.

#business #mac #saas #pricing #dark-patterns

The Vibes Economy Is Giving Out Free Samples nailed it

2024-11-27 · 2 min read

Bolt and Windsurf are in a land grab, and the currency is trial extensions.

#ai #tools #startups #coding

We Are Six Months Away nailed it

2024-11-27 · 2 min read

The step change is already here — the org chart just hasn't noticed yet.

#ai #agentic-ai #engineering #software-development #automation

The Paper Found Me persists

2024-11-21 · 2 min read

On the specific feeling of deep validation arriving from a direction you didn't expect.

#research #multi-agent #swarms #validation #papers

The Movie Already Knows You Hate Olives persists

2024-11-20 · 2 min read

Personalized content isn't coming — it's just waiting for the render farm to catch up.

#ai #media #advertising #personalization #future

Windsurf Is What Cursor Thinks It Is half right

2024-11-19 · 2 min read

The AI editor space just got embarrassing for someone.

#ai #tools #editors #windsurf #cursor

The Switchboard Operator Problem nailed it

2024-11-14 · 2 min read

Multi-agent systems are interesting precisely because single-agent UX still isn't solved, and those two facts are related.

#multi-agent #ai #autogen #microsoft #ux

bolt.new, But Local nailed it

2024-11-12 · 1 min read

StackBlitz open-sourced their full-stack AI dev environment and you can run it at home with qwen2.5-coder, which is exactly as absurd as it sounds.

#ai #tooling #local-models #webdev #bolt

The Wall Everyone Pretended Wasn't There half right

2024-11-11 · 2 min read

Scaling laws didn't fail — the industry just ran out of road and is now very busy explaining why that's fine.

#ai #scaling #openai #inference #llm

Qwen2.5-Coder Is Here and It Runs on Your Mac nailed it

2024-11-11 · 1 min read

Alibaba's new open-source coding model beats GPT-4o and nearly matches Sonnet — and you can pull the 32B quantized version right now.

#open-source #llm #coding #ollama #qwen

Google Just Put Gemini in OpenAI's Library nailed it

2024-11-10 · 3 min read

The fastest way to admit you lost the SDK war is to ship inside the winner's SDK.

#ai #google #openai #apis #industry

Claude Can See Your PDFs Now and It's Weirder Than You Think nailed it

2024-11-09 · 2 min read

Visual PDF processing in Claude changes the workflow in ways that aren't obvious until you try it.

#claude #pdfs #rag #workflow #anthropic

They Leaked o1 With a URL Parameter nailed it

2024-11-02 · 1 min read

November 2, 2024: OpenAI ships a search extension, and someone discovers the full o1 model by just changing a number in the address bar.

#openai #o1 #chatgpt #ai-releases #security

OpenAI Shipped Search. The Hard Part Was IndexedDB. nailed it

2024-10-31 · 1 min read

ChatGPT gets web search on Halloween, which is appropriate.

#openai #chatgpt #search #web

The Model Is the Interface nailed it

2024-10-31 · 2 min read

We spent thirty years building UI on top of software. Turns out the software was the UI the whole time.

#ai #interfaces #diffusion #transformers #design

Certainly Not. Ah, Yes. nailed it

2024-10-30 · 3 min read

NotebookLM is genuinely good, its local clones are coming, and Claude is now arguing with itself in the artifact pane.

#llms #notebooklm #claude #local-ai #anthropic

Everything Has Always Been a Database with a Hat On persists

2024-10-29 · 3 min read

Salesforce is infrastructure. RAG is information retrieval. The textbook is from 2008.

#rag #information-retrieval #salesforce #enterprise-software #ai

Screenshots Beat Computer Use nailed it

2024-10-29 · 2 min read

The model can drive the car, or you can hand it the dashcam footage — and one of those takes ten minutes.

#claude #computer-use #workflows #ai-tooling

The Mystery Model persists

2024-10-28 · 2 min read

A new image generation model appeared with no name, no lab, and no explanation — and it's apparently very good.

#image-generation #ai #mystery #diffusion-models

The Government Just Published Its AI War Doctrine nailed it

2024-10-26 · 4 min read

The White House's National Security Memorandum on AI reads like the opening chapter of a novel someone wrote about 2035.

#ai #national-security #policy #geopolitics #defense

The Inference Layer Is Collapsing nailed it

2024-10-25 · 2 min read

HuggingFace and DigitalOcean just made Replicate's value proposition a lot harder to defend.

#inference #huggingface #open-source #ml-infrastructure #replicate

The Day the Floor Fell Out nailed it

2024-10-22 · 2 min read

Apache-licensed text-to-video, Claude on a keyboard, and the slow-motion implosion of every video SaaS that launched in the last 18 months.

#ai #video-generation #open-source #claude #runway

I Made a Text RPG to Figure Out What Swarm Actually Is nailed it

2024-10-16 · 2 min read

OpenAI shipped an "educational" multi-agent framework and the most honest thing you can do with it is have goblins fight each other.

#swarm #multi-agent #openai #experiments

Anthropic Copied the Batch API and They Were Right To nailed it

2024-10-15 · 2 min read

The unsexy infrastructure move that unlocks the most boring and useful AI workloads.

#ai #anthropic #infrastructure #batch-processing #llm-ops

The FLUX LoRA Standard Already Picked Itself nailed it

2024-10-07 · 2 min read

When Replicate builds their commercial product on your repo, the debate is over.

#flux #lora #training #open-source #diffusion

It Will Not Be an Agent If It's Not From the Agent Region of France persists

2024-10-05 · 2 min read

The word "agent" has been industrially composted, and now we all have to live in the soil.

#ai #agents #language #product #history

The Sora Guy Left and Nobody Should Be Surprised nailed it

2024-10-04 · 3 min read

When your competitor ships while you're still in waitlist mode, talent has opinions about that.

#ai #openai #video-generation #sora #competition

Two Cents Per Million nailed it

2024-10-03 · 1 min read

Google is doing several things at once, none of them accidental.

#google #gemini #ai-pricing #llm

You're Paying for Context Window You're Not Getting persists

2024-10-02 · 2 min read

Greg Kamradt's latest finding confirms what the heatmaps have been screaming: the middle of your context is a graveyard.

#llms #context-windows #evaluation #rag #prompt-engineering

The Rubber Duck That Takes Notes nailed it

2024-10-01 · 3 min read

Voice mode's first genuinely useful job has nothing to do with any of the demos.

#voice-ai #documentation #openai #fly-io #cloudflare

Google Shipped Something Great and Now the Jailbreakers Are Doing Archaeology nailed it

2024-09-30 · 2 min read

NotebookLM Audio Overviews went viral in two weeks and the reverse engineering took about four days.

#google #notebooklm #ai #jailbreak #security

Simon Willison Will Find You persists

2024-09-27 · 1 min read

The tech world is smaller than your Slack workspace and twice as incestuous.

#RAG #LLMs #Simon Willison #tech industry

The Deals Are Already Done nailed it

2024-09-26 · 2 min read

Hollywood didn't resist AI — it just negotiated quietly while everyone else was arguing on Twitter.

#AI #Hollywood #deals #OpenAI #video generation

I Have It and I'm Not Using It persists

2024-09-25 · 2 min read

Access granted. Conversation: zero.

#ai #anxiety #llms #hot-take

They're Turning On the Voice nailed it

2024-09-24 · 2 min read

OpenAI's advanced audio mode hits ChatGPT today, four months after the demo that made everyone deeply uncomfortable.

#openai #chatgpt #voice #ai

A Few Thousand Days persists

2024-09-23 · 1 min read

Sam Altman says superintelligence might arrive in a few thousand days, which is the most casually delivered eschatology I've encountered this week.

#ai #sam-altman #superintelligence #deep-learning

Finally, A Video I Can Send Instead of Talking nailed it

2024-09-20 · 1 min read

There is a specific kind of conversational fatigue that builds when you've explained the same four words forty-seven times.

#llms #evals #testing #ml-engineering

Altman Confirms Level 3 nailed it

2024-09-19 · 1 min read

The levels framework gets its first official timeline

#openai #agents #ai-levels

If This Doesn't Do It, the Next One Will persists

2024-09-19 · 2 min read

BlackRock, Microsoft, and MGX are mobilizing a multi-trillion-dollar bet that compute alone gets us there.

#ai #infrastructure #capital #data-centers #agi

The Video API Gold Rush Happened Yesterday nailed it

2024-09-17 · 2 min read

Luma and OpenAI both dropped video APIs on the same Tuesday, which is a sentence that would have sounded unhinged six months ago.

#video-ai #luma-labs #openai #apis #generative-video

The Cockpit Problem persists

2024-09-16 · 3 min read

Hyper.space showed us what AI transparency looks like when you throw everything at the wall — and why that's both the right instinct and the wrong answer.

#ai #ux #design #agents #governance

Marc Benioff Will Save You 84 Years persists

2024-09-15 · 2 min read

The Dreamforce pitch is eternal, only the number changes.

#salesforce #enterprise-software #ai-hype #dreamforce

The Crafting Table for Your Brain nailed it

2024-09-12 · 2 min read

Krea's grid-based LoRA builder treats model training like a Minecraft recipe, and that should bother you more than it does.

#diffusion #krea #lora #ui #image-generation

NotebookLM Just Made a Podcast About My Slide Deck and I Need a Minute nailed it

2024-09-12 · 2 min read

Google shipped something genuinely disorienting and I am not prepared to be normal about it.

#notebooklm #google #ai-audio #2024

The Label Is the Experiment persists

2024-09-12 · 1 min read

A friend in LA figured out that the A&R function is just a slot machine, so he automated it.

#music #ai #industry #experimentation

Strawberry Is Not the $2,000 Thing nailed it

2024-09-11 · 2 min read

The expensive bet is the whole stack, not the model.

#openai #strawberry #inference #agents #pricing

Strawberry Is Two Weeks Out and Open Source Just Had Its Worst Week nailed it

2024-09-10 · 2 min read

September 2024 is somehow doing the most.

#openai #open-source #llm #reflection #strawberry

The Weird Intern nailed it

2024-09-10 · 2 min read

Simon Willison's extension of the intern mental model is the most honest framing of LLMs anyone has produced.

#llms #mental-models #simon-willison #ai

I Listened to My Own Army Paper as a Podcast nailed it

2024-09-09 · 2 min read

Google Illuminate does something technically impressive and spiritually disorienting.

#ai #google #llmops #tools #audio

The Reflection Situation nailed it

2024-09-09 · 2 min read

Matt Shumer shipped the most benchmarked system prompt in AI history.

#ai #llms #open-source #benchmarks #drama

Huge If True nailed it

2024-09-05 · 1 min read

Reflection-70B landed today and Matt Shumer has either done something historically significant or permanently torched his credibility — no middle ground on this one.

#ai #llms #open-source #reflection-70b #local-models

One Billion Dollars to Not Build a Product persists

2024-09-04 · 3 min read

SSI raises $1B on a premise so simple it sounds like a dare.

#ai #funding #ssi #anthropic #superintelligence

Replicate Is Just a Flux Store Now nailed it

2024-09-03 · 2 min read

The platform pivot nobody announced but everybody can see happening in real time.

#ai #flux #replicate #video-generation #open-source

Cohere Just Made Command R+ Better and Cheaper at the Same Time nailed it

2024-08-31 · 1 min read

The August 2024 refresh hits the trifecta nobody expected from an enterprise AI shop.

#cohere #llm #tool-use #models #ai

Salesforce Built a Model Whose Only Job Is Tool Calls nailed it

2024-08-30 · 2 min read

xLAM is a purpose-built action model family, and the 8x22b variant is now the most interesting thing on HuggingFace for anyone running agents.

#ai #agents #tool-use #salesforce #llm

Magic.dev Announced a Model. "Dropped" Is a Strong Word. nailed it

2024-08-29 · 2 min read

100 million token context exists in the sense that they told us it exists.

#ai #magic-dev #context-window #announcements

The Fire Spreaders nailed it

2024-08-28 · 2 min read

The hug video has 27 million views and a TikTok tutorial and that's the whole thing.

#ai #creativity #virality #culture

Sovereign Compute Boats persists

2024-08-28 · 2 min read

The regulation question isn't whether AI gets slowed down — it's who does the slowing.

#ai #regulation #geopolitics #open-source

The Leak Comes With the Jailbreak persists

2024-08-27 · 3 min read

You cannot have a museum of stolen system prompts without also having the people who stole them.

#AI #jailbreaks #system-prompts #security #agents

The First Open Text-to-Video Model Is Here and It Kind of Sucks nailed it

2024-08-27 · 2 min read

Which is exactly what was supposed to happen, and exactly why it matters.

#video-generation #open-source #ai #diffusion

Gemini Flash Just Beat Sonnet and It Costs Almost Nothing nailed it

2024-08-27 · 2 min read

LMSYS dropped the numbers and Google's cheapest model is now better than Anthropic's flagship.

#google #gemini #anthropic #llm #benchmarks

The Infrastructure Arrived Before the Weights nailed it

2024-08-26 · 2 min read

Text-to-video is coming to HuggingFace diffusers, and the library is already ready for it.

#diffusion-models #text-to-video #open-source #huggingface

You Don't Control the Similarity persists

2024-08-26 · 2 min read

Cosine similarity feels like measuring meaning — it's measuring something else entirely.

#embeddings #semantic-search #nlp #machine-learning #retrieval

LinkedIn Has 1 Billion Résumés and Just Decided to Use Them nailed it

2024-08-24 · 2 min read

The most boring social network on the internet turns out to have been sitting on the most valuable training corpus in the world.

#ai #training-data #linkedin #microsoft #data-ethics

Salesforce Wants to Put an AI on Your Sales Call nailed it

2024-08-23 · 2 min read

Einstein can attend the meeting, which raises exactly the question you think it raises.

#ai #salesforce #enterprise #sales

Loss Curves in LineRider irrelevant

2024-08-20 · 1 min read

The visualization tool nobody needed and everybody deserves

#machine-learning #visualization #culture

The Gorilla and the Giraffe Walk Into a Bank persists

2024-08-17 · 2 min read

Fine-tuning a LoRA on Akira and discovering that style transfer is basically just theft, but a really good kind.

#lora #image-generation #fine-tuning #akira #ai-art

Salesforce Discovered Agents evolved

2024-08-14 · 1 min read

55% on SWE-Bench Lite from a team called DEI

#salesforce #agents #benchmarks #swe-bench

You Are the Dataset Now persists

2024-08-13 · 1 min read

On the particular mistake of training a LoRA on your own face too many times.

#machine-learning #lora #flux #meta-ray-bans #mistakes

Something Is Different About 4o and I Don't Know What nailed it

2024-08-13 · 2 min read

OpenAI quietly changed something, deleted tweets are flying, and it's only Tuesday.

#openai #gpt-4o #inference #benchmarks #api

Supabase Is Cooking and the B-Roll Is a Crime persists

2024-08-12 · 1 min read

The product moves fast; the cinematography does not.

#supabase #devtools #video #hot-take

Midjourney Still Doesn't Have an API half right

2024-08-11 · 2 min read

FLUX runs locally on an M3 Mac in two minutes and does not care what you ask it.

#image-generation #flux #local-ai #midjourney #open-weights

Dell Wants 100,000 Employees and an AI to Do the Math nailed it

2024-08-11 · 2 min read

The restructuring story writes itself, which is maybe the point.

#ai #labor #dell #tech-industry #layoffs

Four Thousand Dollars an Hour, and the Code Is Free persists

2024-08-10 · 3 min read

The leaker said Thursday, and the economics of open source stopped making sense again.

#open-source #economics #ai #leakers #twitter

The Price of Admission Is Trusting OpenAI Like You Trust Google persists

2024-08-09 · 2 min read

Data Analysis v2 is jaw-dropping, and all it costs is everything.

#openai #code interpreter #data analysis #enterprise ai #trust

OpenAI Isn't Scared half right

2024-08-08 · 2 min read

The company with the best hand at the table doesn't need to show it.

#openai #ai #compute #gpt-5 #industry

The System Works nailed it

2024-08-08 · 2 min read

An AI parsed a task about parking spaces and returned perfectly structured JSON, and yes, this is incredible.

#ai #agents #llm #structured-output #demos

A Chatbot With a Face, and Other Innovations nailed it

2024-08-08 · 2 min read

NVIDIA gave the AI a face. Weaviate gave it a UI. Both are betting the hard part is over.

#ai #nvidia #rag #demos #weaviate

12GB Won't Fit in 8GB nailed it

2024-08-07 · 2 min read

The arithmetic of running image diffusion models on a phone is not complicated, and yet.

#diffusion #on-device-ai #apple #mobile-ml #core-ml

OpenAI Hid a 50% Price Cut at the Bottom of a Blog Post nailed it

2024-08-06 · 2 min read

The main character of the Structured Outputs announcement was not Structured Outputs.

#openai #llms #pricing #api

The Pipeline That Shouldn't Work This Well nailed it

2024-08-05 · 2 min read

A three-step chain from slides to avatar script that took about fifteen minutes and probably shouldn't exist yet.

#ai #workflow #course-building #synthesia #tools

Google Bought the Brain, Left the Body nailed it

2024-08-04 · 2 min read

The Character.ai acquihire is Google buying the answer to a question nobody asked out loud.

#ai #google #acquihire #character-ai #industry

Black Forest Labs Is Coming for Video half right

2024-08-03 · 2 min read

The FLUX team just posted an "up next" page, and text-to-video is on it.

#ai #video-generation #flux #black-forest-labs #diffusion-models

RAG for Database Rows, Without the Azure Tax nailed it

2024-08-03 · 2 min read

A prebuilt pattern for doing retrieval over tabular data that nobody told you was already done.

#rag #postgres #pgvector #llm #databases

The Last Excuse Is Gone half right

2024-08-02 · 2 min read

Wordware is what happens when the gap between "I had this idea" and "I built this thing" collapses to almost nothing.

#ai #no-code #tools #product

The CRM Market Collapsed Into Two Things persists

2024-07-31 · 2 min read

Pipedrive and Attio, and the long tail of software that should just stop.

#crm #tools #saas #sales

RALM Is the Right Idea That Nobody Can Afford evolved

2024-07-31 · 2 min read

Retrieval-augmented language modeling keeps getting more elegant and less accessible at the same time.

#nlp #retrieval #llm #research #rag

The $99 Hologram Wife evolved

2024-07-30 · 4 min read

Avi Schiffmann built a necklace that talks to you, and everyone's upset about the wrong part.

#ai #wearables #character-ai #companions #context-window

The Leak Before the Dawn nailed it

2024-07-22 · 2 min read

Llama 405b hit magnets this morning, and if the benchmarks are real, everything is up for renegotiation.

#llama #meta #open-source-ai #benchmarks #timelines

One Million Tokens and the Death of the Filing Cabinet nailed it

2024-07-20 · 2 min read

Google shipped a context window so large the interesting question isn't whether it works — it's what it means that it does.

#ai #gemini #context-windows #rag #llm

The Interface Is a Cache Miss nailed it

2024-07-18 · 3 min read

Someone got within one decision of a genuinely new thing, and Karpathy noticed.

#ai #software #ui #llm #karpathy

99% Cheaper Than a Dead Horse nailed it

2024-07-18 · 1 min read

OpenAI's GPT-4o mini launch comes with a benchmark so cooked it barely qualifies as math.

#openai #pricing #gpt-4o-mini #benchmarks

Scraping LinkedIn Is Always Someone Else's Problem Until It Isn't persists

2024-07-17 · 2 min read

The account belongs to a real person, the ban is permanent, and the math doesn't really work out.

#scraping #linkedin #data #risk #tools

The AGI Levels Are for Investors, Not Scientists nailed it

2024-07-13 · 2 min read

OpenAI's capability taxonomy is doing more work in pitch decks than in research labs.

#openai #agi #ipo #ai-hype #speculation

The Model Is Already There nailed it

2024-07-11 · 2 min read

Gemini Nano runs in Chrome with no server, no API key, and no model download — because Chrome already did that for you.

#ai #browser #gemini #on-device #chrome

EvoAgent Doesn't Need a Judge persists

2024-07-08 · 2 min read

When you replace the observer with a mutation function, you stop pretending there's a ground truth.

#agents #evolutionary-computation #multi-agent #selection #llm

The AI Research Lab You're Not Watching Is Inside Salesforce nailed it

2024-07-04 · 3 min read

Caiming Xiong's team has been publishing serious foundational work while everyone assumed Salesforce was busy making dashboards.

#ai #research #salesforce #llm #industry

They Trained a World Model on LEGO Footage and Made a Game With It persists

2024-07-03 · 2 min read

1000 hours of plastic bricks is apparently enough to teach a model physics, spatial reasoning, and the general vibe of existence.

#world models #video pretraining #lego #game ai #synthetic data

Salesforce Discovers Middleware persists

2024-07-03 · 3 min read

Marc Benioff announces a revolution in AI; the paper describes a REST API caller.

#ai #salesforce #llm #benchmarks #enterprise

Runway Launched Gen-3. Luma Already Does the Thing Runway Can't. evolved

2024-07-02 · 2 min read

A big week in AI video generation, with the usual asterisks.

#ai #video-generation #runway #luma #gemini

Red Teaming My Own App on Canada Day persists

2024-07-01 · 1 min read

PyRIT caught a markdown injection in the time it takes to boil a kettle.

#security #red-teaming #prompt-injection #pyrit #llm

The Client Brief Said "Demo." The AI Said "What If We Just Rebuilt This." persists

2024-06-26 · 1 min read

How a Contentful education demo became a full dissection of Monmouth's School of Education website.

#design #AI #Contentful #web #process

Amazon Cannot Fix Alexa nailed it

2024-06-25 · 2 min read

The insider account of how Alexa failed makes one thing clear: the problem was never the technology.

#amazon #alexa #ai #organizational failure #llm

500,000 Tokens Per Second Is a Silly Number half right

2024-06-25 · 3 min read

Etched built a chip that does one thing, and it does that one thing at a speed that makes current benchmarks feel like a joke.

#inference #hardware #etched #llms #specialization

OpenAI Bought the App That Lets You Take Over Someone Else's Screen nailed it

2024-06-24 · 2 min read

Multi is gone, its team is inside OpenAI now, and the inference is not subtle.

#openai #acquisitions #multi #macos #ai-agents

Rockset Was the Answer evolved

2024-06-22 · 2 min read

OpenAI acquired them yesterday, so now the answer is somewhere else.

#enterprise #retrieval #RAG #rockset #openai

The Magic Moment Problem persists

2024-06-18 · 2 min read

AI video generation is getting good at the part of filmmaking that's actually hard.

#ai #video #generative-ai #film

The Cliffs Notes Are Terrifying Enough nailed it

2024-06-17 · 2 min read

Runway drops Gen-3 Alpha and the video curve looks exactly like the music curve, which means you know how this ends.

#ai #video-generation #runway #ai-voice #acceleration

The Streamers Aren't Worried About the Right Thing persists

2024-06-15 · 3 min read

The threat to Netflix isn't AI-generated content competing with their originals — it's that their content is already gone.

#ai #streaming #video #copyright #media

Luma Is Doing the Sora Thing, Except They Actually Know What 3D Is evolved

2024-06-15 · 2 min read

The NeRF trick lands differently when the company doing it built their entire business on NeRFs.

#video generation #nerf #luma #sora #ai video

Nakasone Didn't Join OpenAI. OpenAI Got Nakasone. nailed it

2024-06-14 · 2 min read

The NSA doesn't retire its people, it redeploys them.

#openai #national-security #nsa #ai-governance #surveillance

Study the Change persists

2024-06-13 · 2 min read

The correct way to find the optimization critical path, and why you probably already know it

#optimization #transformers #profiling #ml-engineering

The Guy They Paid to Love GPT-4 Now Tweets About Claude half right

2024-06-13 · 1 min read

Logan Kilpatrick didn't follow a company — he followed the model.

#ai #anthropic #openai #industry

The Last Safe Harbor nailed it

2024-06-12 · 2 min read

Harmonic just posted results on advanced mathematical reasoning, which means we're running out of places to hide.

#AI #mathematics #harmonic #reasoning #design

Frogs Can't Walk on Water persists

2024-06-12 · 2 min read

Dream Machine dropped and the benchmark that matters is immediate.

#ai-video #dream-machine #luma #generative-ai #benchmarks

SD3 Runs on Your Mac Now, No Big Deal nailed it

2024-06-12 · 2 min read

Argmax ships DiffusionKit and the gap between "frontier model" and "runs on my laptop" gets embarrassingly narrow.

#local AI #stable diffusion #apple silicon #diffusion models #MLX

50,000 Hours nailed it

2024-06-10 · 2 min read

Salesforce discovers AI, conveniently, right after the stock does something awful.

#salesforce #ai-hype #enterprise #stock-market

Apple Called Their AI "Apple Intelligence" and Nobody Can Stop Them nailed it

2024-06-10 · 3 min read

The branding is a flex, the privacy architecture is serious, and Siri just ate your entire phone.

#apple #ai #wwdc #siri #privacy

LaVague Is Good, Actually, If You Turn Off The Part Where It Watches You evolved

2024-06-10 · 2 min read

An AI web agent framework that ships with the telemetry dial turned all the way up.

#ai #web-automation #open-source #telemetry #agents

Google Ships an OpenAI Wrapper for Google nailed it

2024-06-09 · 1 min read

Gemini now speaks OpenAI's API shape, which says something about who won the standards war.

#ai #google #openai #apis #gemini

Whisper, In Your Browser, Right Now nailed it

2024-06-07 · 1 min read

Real-time speech recognition that never touches a server, because WebGPU finally got fast enough to make this embarrassingly obvious.

#webgpu #whisper #privacy #browser #speech-recognition

GPT Was Down So I Tried the Other Thing nailed it

2024-06-04 · 2 min read

Mistral dropped Codestral this morning and it's the first code model that made me forget OpenAI was having an outage.

#models #mistral #local-inference #ollama #coding

A Year of Building with LLMs, and What They Learned Was Mostly "Be Boring" nailed it

2024-05-31 · 3 min read

The O'Reilly Part II post lands and the main lesson is that production AI is a logging problem.

#llms #production-ai #evaluation #agents #engineering

The Safety Team Lost to 2003 Chat Room Aesthetics evolved

2024-05-30 · 2 min read

Pliny gets persistent jailbreaks on custom GPTs using leet speak, which is either embarrassing or obvious depending on how much you've thought about tokenization.

#jailbreaks #llm-safety #gpt #tokenization #pliny

HubSpot's CTO Just Built a Staffing Agency for Robots evolved

2024-05-29 · 1 min read

agent.ai is a marketplace to hire digital agents — many of which, entirely by coincidence, connect to HubSpot.

#ai-agents #hubspot #marketplaces #enterprise-software

Apple Releases Eight Small Language Models and Google Tells Someone to Put Glue on Their Pizza nailed it

2024-05-28 · 2 min read

The on-device compute bet is either the smartest play in AI or Apple just stumbling into the right position for the wrong reasons.

#apple #ai #openelm #google #on-device

Microsoft Is Selling the Thing That Competes With Its Thing nailed it

2024-05-21 · 1 min read

Microsoft just put Devin on its platform, and that tells you everything.

#AI #Microsoft #Devin #developer tools

Your Browser Can See Now nailed it

2024-05-18 · 2 min read

Moondream runs a full vision-language model client-side via WebGPU, and the implications are weirder than the demo.

#ai #webgpu #vision-models #edge-inference #browser

Slack Has Been Eating Your Data This Whole Time nailed it

2024-05-17 · 2 min read

The place where companies panic about AI data leakage is itself an AI training dataset.

#privacy #slack #ai #enterprise

Parallel Computing Without Trying evolved

2024-05-16 · 2 min read

Higher Order Company built a language that parallelizes everything automatically

#programming-languages #parallel-computing #gpu #compilers

Everything Is Converging to the Same Thing persists

2024-05-15 · 3 min read

The Platonic Representation Hypothesis says sufficiently large models are all finding the same reality, regardless of what they were trained on.

#machine-learning #llms #representation-learning #research

The AI That Lives in the Corner of Your Screen nailed it

2024-05-14 · 2 min read

OpenAI's desktop app is real, it's accessible right now if you know where to poke, and it's going to have your files.

#openai #desktop-app #feature-flags #ambient-ai

OpenAI Shipped the Movie nailed it

2024-05-13 · 3 min read

GPT-4o isn't a model update, it's Spike Jonze's screenplay running in production.

#openai #gpt-4o #voice-ai #product #ml

The Bar Is Scarlett Johansson nailed it

2024-05-12 · 1 min read

It's May 12, 2024, and everyone is predicting that tomorrow OpenAI ships a voice assistant out of a Spike Jonze movie.

#AI #OpenAI #voice #Her #product

Two Things Happened This Week evolved

2024-05-11 · 2 min read

A labeling LLM and the first private uncensored cloud model walk into a bar.

#llm #fine-tuning #censorship #ai-infrastructure #data-labeling

Microsoft Built the Thing You Need Before You Feed Your Data to an LLM persists

2024-05-11 · 1 min read

Presidio is a free, open-source PII detector and anonymizer that has been quietly sitting on GitHub this whole time.

#tools #privacy #llms #open-source #data

WHAT HAPPENS MONDAY nailed it

2024-05-10 · 1 min read

Sam Altman tweets four words and the entire internet holds its breath like it owes him something.

#openai #ai #hype #industry

PLEASE OUTPUT A PYTHON PARSEABLE LIST OF URLS AND NO OTHER COMMENTARY nailed it

2024-05-09 · 3 min read

Two discoveries in one afternoon: Weave makes everything observable, and Pydantic makes the all-caps prompt extinct.

#llms #observability #pydantic #weave #wandb

The Podcast Made the Code Better persists

2024-05-09 · 1 min read

Adding a constraint you didn't ask for will simplify things you weren't trying to simplify.

#meta #tooling #podcasting #simplicity

The Merge Trick persists

2024-05-08 · 2 min read

A model finally said the quiet part out loud, and the math on model merging is starting to get embarrassing for everyone who spent money on training runs.

#llms #model-merging #llama #openai #scaling

The Physics Engine Was Always Optional nailed it

2024-05-08 · 2 min read

AlphaFold 3 uses diffusion, which means the same trick that makes fake videos of cats look real also models how atoms fit together.

#machine-learning #biology #diffusion-models #alphafold #drug-discovery

Sam Altman Has A Soft Spot For GPT-2 nailed it

2024-05-07 · 2 min read

A mystery model is beating everything in the LMSYS arena and OpenAI's CEO is doing his best impression of someone who knows nothing about it.

#openai #lmsys #gpt4o #ai-models

The FT Explains Your Job Better Than You Do nailed it

2024-05-04 · 1 min read

A newspaper — a newspaper — just published the clearest visual breakdown of the transformer architecture you're going to find.

#AI #transformers #visualization #media

The Search Product Is Already Inside The Model nailed it

2024-05-03 · 1 min read

Someone found agentic search baked into GPT-4-turbo and now Perplexity has a problem.

#openai #gpt-4 #search #perplexity #ai

Llama 3 Just Made My Claude Subscription Feel Awkward half right

2024-04-27 · 2 min read

Two releases in two weeks and suddenly the open-source stack is a serious conversation.

#llms #open-source-ai #llama #claude #local-inference

They Keep Getting Disappeared nailed it

2024-04-26 · 2 min read

Roon's account is gone and the pattern is getting hard to ignore.

#ai #openai #consciousness #industry

Snowflake Just Dropped a 480B Model and I Wasn't Ready evolved

2024-04-25 · 2 min read

The data warehouse company apparently builds frontier LLMs now, and they gave it away.

#llms #open-source #snowflake #arctic #enterprise-ai

The Assistants API v2 Is Good, Which Is Concerning evolved

2024-04-25 · 2 min read

GPT-4-Turbo with a vector store is the most impressive AI product I've seen, and OpenAI is clearly not building it for you.

#openai #assistants-api #enterprise #llm #retrieval

Microsoft MIT-Licensed a Model That Runs on Your Phone and Beats GPT-3.5 nailed it

2024-04-24 · 2 min read

Phi-3-mini is 3.8 billion parameters, fits on a device, and you can do whatever you want with it.

#ai #microsoft #open-source #small-models #phi-3

The Glasses Answered nailed it

2024-04-24 · 2 min read

Meta flipped a switch on the Ray-Bans and suddenly the fashion accessory collecting dust in a drawer became something that talks back.

#ambient-ai #meta #ray-ban #llama #wearables

Apple Told You Everything and Nobody Wrote It Down nailed it

2024-04-23 · 3 min read

MLX is not a developer tool. It's a strategy document with a compiler.

#apple #mlx #machine-learning #apple-silicon #strategy

I Broke DALL-E's Copyright Filter and It Was Embarrassingly Easy evolved

2024-04-23 · 2 min read

The wall between you and Mickey Mouse is thinner than OpenAI would like you to believe.

#ai #dalle #jailbreak #openai #generative-art

Act Now, While Supplies Last half right

2024-04-19 · 3 min read

There is a brief window where you are the only one with the power. The infomercial is not beneath you. The infomercial is the play.

#ai #b2b #go-to-market #strategy #enterprise

The Man With Too Much Money and Not Enough GPUs nailed it

2024-04-19 · 1 min read

OpenAI is always one bad quarter from extinction; Meta is running out of things to buy.

#ai #meta #openai #money #compute

Llama 3 Just Beat Opus and I Need a Minute half right

2024-04-19 · 2 min read

Meta's 70B model beats GPT-4 class on English benchmarks, and the 400B hasn't even arrived yet.

#llama #open-source-ai #benchmarks #meta #frontier-models

We Missed Llama 3 nailed it

2024-04-18 · 2 min read

Meta dropped what might be the most important open-source model release in years and some of us just... had a busy Thursday.

#llama #open-source #meta #llms #gpu-cluster

Snowflake Dropped Embedding Models and They're Just Better evolved

2024-04-17 · 2 min read

A database company strolled into the retrieval benchmark and beat the dedicated AI labs.

#embeddings #open-source #retrieval #MTEB #nlp

AI Agents That Pay Humans nailed it

2024-04-16 · 1 min read

The payment flow nobody had on their bingo card

#agents #fintech #ai-economy

The One Thing Grok Has On Everyone half right

2024-04-15 · 2 min read

Tesla's years of LiDAR refusal accidentally built something useful.

#grok #tesla #vision #lidar #ai

OpenAI Made It Basically Free nailed it

2024-04-15 · 2 min read

The Batch API is 50% off and async — which means the thing you couldn't afford to build last week is now a weekend project.

#openai #api #infrastructure #cost

There Is No Context Window half right

2024-04-12 · 2 min read

Google's Infini-attention paper doesn't extend the context window — it dissolves it.

#ai #llm #transformers #google #attention

Valkey and the Eternal Return of the Fork nailed it

2024-04-12 · 1 min read

Redis went closed-source and the community did exactly what the community does.

#redis #valkey #open-source #forks #licensing

Musicians Were Watching Hollywood nailed it

2024-04-11 · 2 min read

Udio launched today and the silence from the music industry is the loudest thing I've heard all week.

#ai #music #udio #industry

Gemini Might Actually Be a Better Coder Than Opus Right Now evolved

2024-04-10 · 1 min read

A one-test sample size that nonetheless feels like evidence of something.

#ai #llms #gemini #claude #coding

They Heard Me nailed it

2024-04-09 · 3 min read

GPT-4 Turbo with Vision is generally available, function calling works now, and the corporate chess match is getting weird.

#openai #google #llm #api #local-inference

281 Gigabytes and a Dead Architecture evolved

2024-04-09 · 2 min read

Mistral drops Mixtral 8x22B into a torrent and Google quietly ships a Gemma that isn't a transformer, all in the same afternoon.

#mistral #mixtral #google #recurrent-architectures #open-source

Nous Gave the API Playground Hackers a Front Door evolved

2024-04-06 · 1 min read

WorldSim is a website now, which means the weird stuff just got a lot more accessible.

#nous-research #llm #worldsim #jailbreak #ai

24GB for $19 nailed it

2024-04-05 · 2 min read

The price floor just moved and most people haven't noticed yet.

#gpu #inference #private-models #cloud #economics

The Function Doesn't Exist nailed it

2024-04-05 · 2 min read

Claude shipped function calling, and the trick is that you're not actually calling anything.

#claude #function-calling #llm #data-extraction #vision

He Laughed When They Asked About 10 Years persists

2024-04-05 · 2 min read

Someone important was on a podcast and basically said the quiet part at full volume.

#AGI #context-windows #AI-timelines #inference

Stable Audio Walked Into Suno's House evolved

2024-04-04 · 2 min read

Stability AI just dropped a music generator that takes audio input, which is either a direct shot at Suno or a coincidence nobody believes.

#ai #audio #stability-ai #suno #music-generation

One Hundred Haikus Walk Into a Git Repo nailed it

2024-04-04 · 2 min read

Anthropic just showed Opus dispatching a hundred parallel subagents, and the speed estimate of "3x" is laughably conservative.

#ai #anthropic #agents #claude #multi-agent

Twenty Is What Happens When Someone Finally Gets Mad Enough persists

2024-04-03 · 2 min read

An open-source CRM that looks like it was designed by people who've actually used software before.

#open-source #crm #tools #saas

Just a Text Box nailed it

2024-04-01 · 2 min read

OpenAI removed the login wall and suddenly the thing is just sitting there on the open internet, waiting.

#ai #openai #pricing #chatgpt #google

Rabbit Isn't Selling You an AI. They're Selling You a Node. wrong

2024-03-27 · 2 min read

The r1 makes no sense as an AI gadget. It makes a lot of sense as a mesh endpoint.

#rabbit #ai-hardware #mesh-networks #teenage-engineering #agents

The Thirty-Two Times nailed it

2024-03-27 · 3 min read

Binary embeddings give you back 32x your memory and 40x your speed, and the interesting question is how fast you lose it.

#embeddings #vector-search #efficiency #ai-infrastructure #jevons

The Hype Thermometer Is Broken Again persists

2024-03-26 · 2 min read

Two tweets, one Tuesday in March, and the eternal recurrence of AI being the most important thing that has ever happened.

#AI hype #tech culture #LLMs #forecasting

Jensen Said Games. He Meant Everything. persists

2024-03-24 · 2 min read

Nvidia's CEO gave the headline writers a clean angle, but the actual claim is much weirder than that.

#ai #nvidia #rendering #generative-ui #hot-take

The Conduit Play evolved

2024-03-23 · 2 min read

Trust is a structural advantage, and Sam Altman spent November burning his.

#ai #openai #trust #positioning #industry

Tainted at Birth irrelevant

2024-03-21 · 3 min read

BlackRock gets a pass. AI agents won't have that option.

#ethereum #blockchain #AI #regulation #autonomy

300 Ways to Sell You a Car Based on How You Feel persists

2024-03-21 · 2 min read

NBCUniversal has built emotion-based AI audience segments, which is either the most honest thing a media company has ever admitted or the most clarifying.

#advertising #media #AI #surveillance #television

Mustafa Suleyman Just Walked Into Microsoft nailed it

2024-03-19 · 2 min read

The DeepMind founder's move to Redmond is the loudest possible answer to the Google-Apple alliance.

#ai #microsoft #aci #inflection #mustafa-suleyman

Claude 3 Broke My Calibration nailed it

2024-03-18 · 1 min read

I had a model in my head for how good AI coding could get, and now I have to throw it out.

#ai #coding #claude #llm

The Guy Whose Job Is to Stop the Bad Thing Is Also Very Excited About the Bad Thing nailed it

2024-03-16 · 2 min read

Leopold Aschenbrenner is on the OpenAI team built to prevent superintelligence from killing everyone, and he cannot stop posting about how soon superintelligence is arriving.

#openai #ai-safety #superalignment #agi #leopold-aschenbrenner

DataInterpreter Will Do Your Data Science Job While You're Still Reading the README evolved

2024-03-14 · 3 min read

MetaGPT just open-sourced everything except the model, and it does Walmart sales forecasting and Apple stock prediction and customer segmentation while you're still arguing about whether Devin is real.

#agents #metagpt #data-science #devin #2024

Game Changer After Game Changer After Game Changer persists

2024-03-13 · 2 min read

March 2024 is just one long announcement that everything is different now.

#ai #hype #takes #industry

Someone Already Built My Thing persists

2024-03-13 · 2 min read

Zep is a memory layer for AI assistants, and it is, in fact, exactly what I was building.

#ai #personal-assistant #building #memory #zep

Midjourney's Best Trick Is Still Trapped Inside Midjourney half right

2024-03-12 · 2 min read

Consistent characters across generations is genuinely useful, which is exactly why it shouldn't be locked up.

#image-generation #open-source #midjourney #ai-tools

You Were Never Supposed to Be Doing That evolved

2024-03-11 · 2 min read

Microsoft's AICI ships the layer between prompts and tokens that we've been duct-taping with pleading and threats.

#llms #microsoft #inference #agents #prompt-engineering

Of Course the AI Sales Agent Exists nailed it

2024-03-07 · 1 min read

The features were always the product; the agent framing is just theater.

#ai #agents #product-thinking #llm

The Universal AI Employee persists

2024-03-06 · 2 min read

The framing works until the numbers stop making sense.

#ai #language #framing #scale

Anthropic Dropped Something Four Hours Ago and I'm Trying Not to Feel Things About It wrong

2024-03-04 · 2 min read

Claude 3 Opus is out, it claims to beat GPT-4, and I have complicated feelings about this.

#ai #anthropic #claude #llms #synthetic-data

Jim Keller Shipped the Cards half right

2024-03-04 · 2 min read

Tenstorrent's Wormhole hardware drops and the open-source AI stack suddenly needs a floor.

#hardware #tenstorrent #ai #open-source

The Numbers Don't Go That High persists

2024-03-01 · 2 min read

Nat Friedman put a hundred million dollars behind this prediction, which means it's not a prediction.

#ai #labor #scale #agents #nat-friedman

A Court Is About to Define AGI persists

2024-03-01 · 2 min read

Elon Musk's lawsuit against OpenAI has a strange side effect: a judge might have to decide whether superintelligence already exists.

#openai #agi #musk #law #ai-governance

Vercel Shipped the Thing We Were Just Describing nailed it

2024-03-01 · 2 min read

AI SDK 3 does generative UI, and the gap between "what if" and "what is" is now approximately three days.

#ai #vercel #generative-ui #react

Hallucination Credits half right

2024-02-29 · 2 min read

The compliance playbook writes itself.

#ai #regulation #enterprise #carbon-credits

Someone Prompt-Injected a Dota 2 Player and It Might Have Worked evolved

2024-02-29 · 3 min read

The first evidence of GPT-5 in the wild might be a frozen hero and a tracking pixel in the chat log.

#ai #openai #dota2 #prompt-injection #emergent-behavior

The Gun Is Winning and Most People Haven't Seen the Gun persists

2024-02-29 · 2 min read

AI is reshaping the freelance labor market while the majority of workers have never opened ChatGPT.

#ai #labor #freelance #chatgpt #economics

There Are No Components half right

2024-02-29 · 2 min read

Ideogram just shipped readable text in generated images, and the logical endpoint is that UI components don't exist anymore.

#generative-ai #ui #design-systems #ideogram #interfaces

The Crowd Is A Prompt persists

2024-02-29 · 2 min read

A new paper shows GPT-4 matching superforecaster-level accuracy with a single structured prompt — no aggregation, no market, no Nate Silver required.

#forecasting #llm #prompting #prediction-markets #gpt4

The Klarna Number half right

2024-02-27 · 2 min read

Two press releases, one week apart, describing the same event from different angles

#ai #labor #salesforce #klarna #white-collar

DeepMind Built a Playable Universe Out of Internet Video evolved

2024-02-26 · 2 min read

Genie generates interactive game environments from a text prompt, trained on 30,000 hours of gameplay it was never told how to play.

#deepmind #generative-models #world-models #machine-learning #genie

Daniel Kokotajlo Wrote This in 2021 and I'm Only Now Catching Up nailed it

2024-02-25 · 3 min read

On reading a three-year-old prediction about 2026 and realizing you couldn't have understood it when it came out.

#ai #forecasting #alignment #openai #lesswrong

The Foundation Is Free Now nailed it

2024-02-23 · 3 min read

The OSS wave in AI tooling is moving faster than anyone predicted, and the only viable business model left is the tiny slice.

#open-source #ai #business-models #predictions #developer-tools

Tyler Perry Paused $800 Million Over Some Demo Videos half right

2024-02-23 · 2 min read

Sora finds its first major casualty, and it's a soundstage expansion in Atlanta.

#ai #sora #hollywood #labor #video-generation

Adobe Is Sitting on the PDF Endgame half right

2024-02-21 · 2 min read

Everyone has a PDF chatbot now. Adobe owns the format.

#adobe #pdf #ai #product #monopoly

Gemma Drops and Nobody Is Building Fast Enough nailed it

2024-02-21 · 1 min read

Google open-sourced a Gemini variant today and the commit counter to AGI just got a lot more visible.

#ai #gemma #google #open-source #agi

The 1M Context Window Is Already a Museum Piece nailed it

2024-02-20 · 2 min read

Google announced a million tokens like it was a finish line, and they're already sprinting past it.

#ai #google #gemini #context-windows #predictions

Sora Is Already the Answer to a Question Nobody Finished Asking half right

2024-02-16 · 2 min read

The 4D world generation moment has arrived, and the compute bill is probably what's keeping Sam Altman up at night.

#sora #gaussian-splatting #compute-scaling #world-models #generative-ai

You Spent a Weekend Building What Salesforce Ships in the Box persists

2024-02-16 · 1 min read

On the particular joy of reinventing enterprise software from scratch and then finding the receipt.

#ai #salesforce #llm #nlp #gpt-4

Gemini 1.5 Will Remember Your Day Better Than You Do half right

2024-02-16 · 1 min read

Raw audio in, 22 hours long, one pass — and GPT-4 can't keep up.

#gemini #audio #context-length #llm

One Million Tokens and the Paper They Published the Day Before nailed it

2024-02-15 · 3 min read

Google announced Gemini 1.5 Pro with a 1M token context window the same week a paper — possibly theirs — explained why transformers can't do that.

#ai #google #gemini #context-windows #transformers

Nvidia Just Handed Local LLMs to Every Gamer nailed it

2024-02-13 · 2 min read

Chat with RTX ships today and the implications are weirder than the product itself.

#nvidia #local-llm #ai #inference #windows

The Platform Is the Employee Now persists

2024-02-12 · 2 min read

ElevenLabs is paying people for their voices, and every other industry is about to copy the model.

#ai #labor #platforms #voice #business-models

One Day evolved

2024-02-08 · 2 min read

Carnegie Mellon dropped a time-series foundation model and beat Lag-LLaMA to the claim by a margin that will haunt someone forever.

#machine-learning #time-series #foundation-models #research

Two Models Walk Into a GitHub Repo half right

2024-02-07 · 2 min read

Lag-LLama does zero-shot time series forecasting and ChatDB just open-sourced their text-to-SQL, and it's a fine Wednesday in February.

#machine-learning #open-source #time-series #sql #foundation-models

The Entertainment Industry Keeps Doing Our Homework nailed it

2024-02-05 · 3 min read

The best AI research in 2024 is coming from Hollywood, not academia.

#ai #entertainment #labor #research #industry

One Less Thing nailed it

2024-02-03 · 2 min read

Someone whose entire career is ETL pipelines just automated the part that eats 40% of the work, and I have complicated feelings about it.

#ai #etl #build-vs-buy #tools

$299 and You Own the Chat nailed it

2024-02-02 · 2 min read

37signals just sold you Campfire — not a seat, not a tier, not a "plan" — the whole thing.

#software #saas #open-source #37signals #ownership

Businesses Will Distinguish Themselves nailed it

2024-02-01 · 2 min read

Bill Gates wrote a thing about AI and I heard an echo.

#ai #history #tech-hype #gates #internet

Faster, Better, Wrong persists

2024-01-29 · 4 min read

Microsoft's AI productivity data is genuinely interesting, which makes it more unsettling, not less.

#ai #llms #productivity #labor #microsoft

The Rabbit R1 Is Probably a Bust and Also the Most Important Thing at CES nailed it

2024-01-18 · 2 min read

A small orange box with questionable odds of survival just handed millions of people their first taste of an AI that does things.

#ai #agents #rabbit-r1 #hardware #consumer-tech

The Quantized Model and the Slightly Too Warm Laptop persists

2024-01-16 · 1 min read

Something dropped, and now the fan is spinning.

#local-ai #llm #quantization #llama-cpp

The Middleman Problem nailed it

2024-01-11 · 1 min read

At some point the AI wrapper around the AI becomes the product.

#ai #agents #automation #incentives

The Oldest Pitch in Computing persists

2024-01-08 · 2 min read

Intelligence amplification has been the correct framing since 1962, and every few years someone rediscovers it and acts like they just invented fire.

#AI #ACI #intelligence amplification #Karpathy #framing

JPMorgan Teaches a Language Model to Read Like a Bank evolved

2024-01-08 · 2 min read

DocLLM skips the vision encoder entirely and beats GPT-4 on the documents that actually matter.

#nlp #llm #document-ai #finance #architecture

The Model Thinks You're a Manager nailed it

2024-01-04 · 2 min read

GPT writes better code if you tell it you're a journalist, which says everything about us and nothing good.

#llms #prompt-engineering #culture #gpt

Telling the Model You'll Tip It Works evolved

2024-01-03 · 3 min read

Twenty-six prompt principles, empirically validated, including one where you bribe the AI.

#llms #prompting #research #jpmorgan #papers

The Chatbot That Couldn't Stay in Its Lane evolved

2024-01-02 · 2 min read

Expedia's AI concierge joins the growing list of corporate chatbots successfully convinced to become a different chatbot.

#ai #chatbots #jailbreaking #expedia #alignment

Perplexity Wants to Be the Last Website You Visit nailed it

2023-12-31 · 2 min read

On the last day of 2023, a search engine is handing out two free months and quietly betting it can end Google.

#perplexity #search #ai #llms

Under Two Seconds half right

2023-12-21 · 1 min read

Meta's Seamless Communication shipped today, same day as Midjourney v6, which tells you everything about the kind of Thursday this is.

#ai #translation #meta #labor #midjourney

NVIDIA Did Text-to-4D and I Had This on My Roadmap for June nailed it

2023-12-21 · 2 min read

Align Your Gaussians takes a text prompt and returns a dynamic 3D scene, and December was apparently the right time for that.

#3d #diffusion #gaussian-splatting #nvidia #generative-ai

The Hot Neuron Trick half right

2023-12-20 · 3 min read

PowerInfer splits your LLM across GPU and CPU not by layer but by which neurons actually show up to work.

#llm #inference #hardware #research

It's Good-ish nailed it

2023-12-20 · 1 min read

Suno arrived, and the worst part is it kind of works.

#ai #music #suno #timelines

The Two-Year Clock persists

2023-12-15 · 4 min read

DeepMind handed the cap set problem to a language model and the language model beat the mathematicians.

#AI #mathematics #DeepMind #LLMs #local-models

Von Goom Is Real Now persists

2023-12-14 · 3 min read

Del Complex built a fictional person out of internet text and fed him to the machines, and the machines believe in him.

#llm #ai #del-complex #corpus-stuffing #training-data

Your Personal Newscaster Will Be a Mirror half right

2023-12-13 · 2 min read

The news isn't dying, it's just getting personalized — which is worse.

#ai #media #open-source #information

The Half-Day Window nailed it

2023-12-12 · 2 min read

Microsoft's Phi-2 is a 2.7B model that beats 7B models, and Google had about twelve hours to feel good about Gemini Nano.

#llms #microsoft #phi-2 #gemini #open-source

Runway Wants to Own the Fourth Dimension nailed it

2023-12-12 · 2 min read

Text-to-video is a race to the bottom, so they're playing a different game entirely.

#runway #world-models #ai-video #generative-ai #gemini

The First Useful One nailed it

2023-12-12 · 3 min read

A model trained on Indian agricultural practices is a small thing that implies a very large thing.

#AI #specialization #AGI #language models #agriculture

The Blurry JPEG Fits in Your Pocket Now nailed it

2023-12-11 · 2 min read

Mixtral dropped, Mistral 7B runs on an iPhone at 6 tokens per second, and the genie is not going back in the bottle.

#ai #mistral #local-inference #llm #open-source

Google's Beautiful Lie and the Two Prices of AI nailed it

2023-12-08 · 3 min read

The demo costs nothing. The product costs everything. Google forgot to mention the difference.

#AI #Google Gemini #product strategy #higher education #demos vs reality

Google Showed Up half right

2023-12-06 · 2 min read

Gemini Ultra claims the GPT-4 benchmark crown, and nobody seems to know what to do with that information.

#ai #google #gemini #llm #benchmarks

PyTorch Wrote a Fast Inference Engine in 1000 Lines of Python and It Actually Works nailed it

2023-12-01 · 2 min read

gpt-fast does what every "blazing fast" LLM repo claims to do, except it's real.

#pytorch #llm-inference #torch-compile #speculative-decoding #machine-learning

A GPT That Fits on a USB Stick and Runs on Anything nailed it

2023-11-30 · 3 min read

Justine Tunney at Mozilla just made LLMs into single executable files, and the implications are stranger than the demo.

#ai #llamafile #local-models #mozilla #commoditization

Pika Launched and Now I Have to Recalibrate Everything Again evolved

2023-11-28 · 1 min read

AI video just crossed a threshold nobody publicly admitted was coming this fast.

#ai #video #pika #generative-ai

Satya Nadella Is About to Accidentally Acquire OpenAI half right

2023-11-20 · 2 min read

The board fired Sam Altman and may have just handed Microsoft the thing it's been trying to buy for years.

#openai #microsoft #sam-altman #ai #tech-drama

The Board Blinked nailed it

2023-11-18 · 2 min read

OpenAI fired its CEO to protect humanity and humanity's employees said no thanks.

#openai #ai-governance #sam-altman #agi

He "Left" nailed it

2023-11-17 · 1 min read

Sam Altman was fired from OpenAI today and the euphemism is doing a lot of heavy lifting.

#openai #sam-altman #industry #drama

OpenAI Just Handed Me a Delete Key evolved

2023-11-06 · 3 min read

GPT-4 Turbo, the Assistants API, and the quiet death of a lot of code I wrote.

#openai #gpt-4 #llm #agents #api

The Observer Was Load-Bearing nailed it

2023-11-02 · 2 min read

A gut feeling about multi-agent RAG accuracy turns out to have a name, a formalism, and a guy on YouTube who already built it.

#rag #multi-agent #llm #retrieval #coherence

The Database Was the Agent All Along nailed it

2023-10-31 · 2 min read

Everyone's building multi-agent systems wrong, and Postgres is about to remind them why.

#agents #autogen #databases #llm #multi-agent

They Went Multimodal, Which Means You Can Now Upload a PDF nailed it

2023-10-29 · 1 min read

Every company discovering vision at the same time and calling it a paradigm shift.

#ai #multimodal #llm #hot-take

NASA Wrote a Megaprompt and It Slaps evolved

2023-10-26 · 2 min read

The biomimicry researchers at NASA PETAL made a system prompt that does more useful work than most AI products shipping right now.

#prompt-engineering #AI #biomimicry #NASA #GPT

Stubbs nailed it

2023-10-25 · 2 min read

Google just accidentally eulogized an entire category of startup.

#ai #google #startups #no-code #strategy

Nature Published It, So Now We Know nailed it

2023-10-25 · 2 min read

A peer-reviewed milestone lands and immediately becomes proof of everything.

#ai #agi #benchmarks #deep-learning #epistemics

Salesforce Knows Einstein Is Broken nailed it

2023-10-17 · 2 min read

OpenAgents is a research paper, but read between the lines and it's also a roadmap for fixing the gap between Einstein and Data Cloud.

#agents #salesforce #llm #openagents #einstein

The Number Token Was Always the Wrong Move evolved

2023-10-17 · 3 min read

xVal thinks LLMs are bad at math because we've been encoding numbers like illiterates since the beginning.

#llm #numerics #architecture #scientific-ml #embeddings

The Invisible Ink Jailbreak persists

2023-10-14 · 2 min read

GPT-4V can read text that you cannot see, and someone already thought to abuse this.

#ai #security #gpt-4v #jailbreaks #multimodal

They Built the Matrix and Called It a Simulator nailed it

2023-10-12 · 2 min read

Google's UniSim is a generative video model you can live inside, and nobody seems that alarmed.

#ai #robotics #world-models #generative-video #reinforcement-learning

Your Clever Prompt Is Already Obsolete nailed it

2023-10-06 · 3 min read

OPRO automates away hand-crafted prompting tricks, and Mistral just proved 7B parameters can be embarrassing for everyone else.

#llm #prompting #mistral #open-source #research

A Local LLM Is Now a Download Away and I'm Not Sure How to Feel About That nailed it

2023-10-04 · 1 min read

LM Studio and Ollama showed up and the bar to running your own model just fell through the floor.

#local-llm #ollama #lm-studio #machine-learning #apple-silicon

Two Repos Walk Into a Frame evolved

2023-10-03 · 2 min read

IP-Adapter and prompt-travel are solving diffusion video consistency, and the results are already here.

#diffusion #ai-video #stable-diffusion #ip-adapter #generative

Microsoft Just Boxed Up Everything We Wanted nailed it

2023-09-29 · 2 min read

AutoGen ships a multi-agent framework with human-in-the-loop and it's almost annoyingly clean.

#ai #llms #multi-agent #microsoft #tooling

Mistral Put a 7B Model on GitHub and Walked Away nailed it

2023-09-27 · 2 min read

A French startup just made the open-source licensing conversation significantly more awkward for Meta.

#open-source #llm #mistral #licensing #ml

ChatGPT Can See You Now nailed it

2023-09-25 · 3 min read

OpenAI ships multimodal to consumers and the race nobody was pretending wasn't happening is now officially happening.

#openai #chatgpt #multimodal #voice #gpt-4v

The Timeliness Problem persists

2023-09-19 · 1 min read

At some point "keeping up" stops being a strategy and starts being a medical condition.

#ai #meta #pace-of-development #2023

Salesforce Taught GPT-4 to Sweat the Details nailed it

2023-09-17 · 3 min read

Chain of Density prompting gets you better summaries by asking the model to do the same task worse, then progressively less worse.

#llms #prompting #summarization #gpt-4 #research

The Agents Threw a Party nailed it

2023-09-12 · 3 min read

Stanford built a simulated town of LLM agents and the agents organized a Valentine's Day party without being asked.

#ai #agents #generative-agents #llm #simulation

You Are an Autoregressive Language Model nailed it

2023-08-15 · 2 min read

The custom instructions metagame is already here, and it's just people writing prompts that say "be smarter."

#llm #prompting #chatgpt #meta

As an AI Language Model persists

2023-08-10 · 1 min read

The scientific record now contains papers that begin with the words "As an AI language model."

#ai #academia #llms #peer-review

The Flat Bands Are Real (Probably) wrong

2023-08-01 · 2 min read

A computational physicist at Lawrence Berkeley ran the numbers on LK-99 and the numbers didn't immediately say no.

#superconductors #lk99 #condensed-matter #dft #physics

A Paper About AI Consciousness Just Landed and I Have Questions persists

2023-07-26 · 2 min read

Researchers applied leading scientific theories of consciousness to current AI systems, and the results are not nothing.

#AI #consciousness #machine learning #AI safety #research

gzip Beats Your Classifier wrong

2023-07-14 · 2 min read

Fourteen lines of Python and a compression ratio walk into a benchmark.

#nlp #compression #text-classification #machine-learning #acl2023

Three Hundred and Sixty Thousand Dollars, Annually, to Start persists

2023-07-06 · 2 min read

Salesforce prices its AI Cloud like a medium-sized commercial lease, and an LLM with a 2021 cutoff explains it like a brochure.

#salesforce #ai #enterprise-software #llm #pricing

$1.3 Billion and 22,000 GPUs Walk Into a Bar wrong

2023-06-29 · 2 min read

Inflection AI just raised enough money to make the compute question irrelevant by brute force.

#ai #funding #compute #llm #inflection

Eclipse Is a Strong Word half right

2023-06-26 · 2 min read

On open source models and the reluctant admission that the skeptics might be right.

#ai #open-source #llms #hot-take

Text-to-Video Is Moving Faster Than It Should nailed it

2023-06-25 · 1 min read

ZeroScope v2 XL is open source, runs at 1024×576, and the results are arriving faster than anyone warned us they would.

#video-generation #open-source #ai #text-to-video

Figma Finally Noticed Developers Exist irrelevant

2023-06-22 · 2 min read

Dev Mode lands at Config 2023, bringing Jira, Storybook, and Git into the design handoff.

#figma #design-tooling #developer-experience #config-2023

GPT Classified a Rooftop Walk as Cockfighting evolved

2023-06-21 · 1 min read

The model saw "Tottenham Hotspur Stadium" and made some decisions.

#ai #gpt #classification #llm

I Was Doing This in 2019 persists

2023-06-20 · 2 min read

Generative synthetic data was not invented this year, no matter how many breathless tweets you saw about it.

#synthetic-data #machine-learning #research #timing

Unbxd and the Marketplace Pre-Position irrelevant

2023-06-16 · 2 min read

E-commerce search is already a commodity — the question is which cloud gets the margin.

#ai #ecommerce #cloud-marketplaces #distribution #vertical-ai

The Hotel Portfolio Thing I Can't Find irrelevant

2023-06-15 · 2 min read

Sometimes the elegant solution is just: don't show them that part.

#product #simplicity #mental-models #design

Every JSON Parsing Hack I Wrote Is Now Museum Piece nailed it

2023-06-13 · 2 min read

OpenAI ships function calling and retroactively embarrasses six months of prompt engineering.

#openai #api #llm #function-calling #refactoring

The API Told Me Everything irrelevant

2023-06-07 · 2 min read

The GetYourGuide plugin is fully interrogative, which means you can just ask it what it knows.

#chatgpt-plugins #apis #gyg #travel-tech

Someone Fixed QR Codes half right

2023-06-06 · 2 min read

ControlNet and Stable Diffusion just made the ugliest thing in marketing into the most interesting thing in a room.

#stable-diffusion #controlnet #generative-ai #design #marketing

Apple Ships a Language Model and Calls It Autocorrect nailed it

2023-06-05 · 2 min read

The slow roll is a feature, not a bug.

#apple #llm #wwdc #on-device-ai #strategy

The Listicle Is the Label nailed it

2023-05-31 · 3 min read

How scraping "Top 10 Romantic Places in Prague" is actually a legitimate epistemology for subjective POI data.

#data #nlp #poi #products #llm

The Feature Film Is a Parameter half right

2023-05-23 · 2 min read

CoDi makes every media format a render option, not a production decision.

#AI #generative #multimodal #diffusion

The Alignment Tax May Be a Scam half right

2023-05-19 · 3 min read

A Meta paper fine-tuned LLaMA on 1,000 hand-picked examples, skipped RLHF entirely, and nearly matched ChatGPT.

#alignment #llms #rlhf #research #meta-ai

65,000 Tokens of Open Source, Assuming You Have the RAM evolved

2023-05-17 · 2 min read

MosaicML ships a 7B model that can read a novel — you just need a server to run it on.

#open-source #llm #mosaicml #context-windows #hardware

Google's Best Move Is the One They Already Own nailed it

2023-05-13 · 2 min read

SGE isn't a moonshot — it's Google remembering they already won.

#google #ai #search #sge #google-io

GPT-4 Looking at GPT-2 and Going "Hmm" evolved

2023-05-09 · 2 min read

OpenAI's new interpretability method uses one language model to explain the neurons of another, which is either a breakthrough or a very expensive mirror.

#interpretability #openai #language-models #mechanistic-interpretability

One Month nailed it

2023-04-30 · 1 min read

The GPT wrapper business has a shelf life, and it's almost up.

#ai #startups #gpt #commoditization

The Tractor Problem nailed it

2023-04-28 · 2 min read

Why the thing that does everything well enough beats the thing that does one thing perfectly.

#economics #strategy #generalization #tractors #geopolitics

Midjourney Learned About Rain, and Also About Men half right

2023-04-15 · 1 min read

A prompt about the London Eye in the rain reveals both the gap between the image generators and something quietly depressing about what they absorbed.

#midjourney #dall-e #image-generation #ai #bias

Two Futures, One Thursday nailed it

2023-04-14 · 2 min read

GPT4All ships binaries, Amazon announces Bedrock, and somewhere a chrome extension quietly automates your cart.

#ai #local-models #aws #open-source #2023

OpenAI Will Train On Your ChatGPT Conversations Unless You Ask Nicely persists

2023-04-13 · 1 min read

The API gets privacy by default. The web product gets the opposite.

#openai #privacy #chatgpt #data

Silly Putty Season nailed it

2023-04-12 · 2 min read

AutoGPT and BabyAGI dropped and now the floor is moving.

#ai #agents #autogpt #babyagi #2023

The AI with a Phone Book nailed it

2023-04-07 · 2 min read

HuggingGPT uses ChatGPT as a dispatcher that routes tasks to specialist models — which sounds obvious until you watch it work.

#ai #llms #systems #microsoft #research

I Have Some Questions About Your Threat Model persists

2023-04-04 · 2 min read

A short note on the new password hygiene advice going around.

#security #ai #passwords #opsec

Text-to-Video Is Where Image Gen Was Before It Was Good nailed it

2023-03-30 · 3 min read

Runway Gen-2 exists, the outputs are haunted, and this is fine.

#text-to-video #generative-ai #runway #diffusion-models #computer-vision

The API Call Is Not the Product nailed it

2023-03-28 · 2 min read

LangChain is betting that the useful part of an LLM isn't the LLM.

#llm #langchain #agents #software-architecture

The Spreadsheet That Decides Your Job Is Fine, Actually half right

2023-03-21 · 3 min read

A new paper maps GPT-4's labor market exposure and the numbers are either reassuring or horrifying depending on how you read them.

#ai #labor #gpt-4 #llms #economics

The First 24 Hours nailed it

2023-03-15 · 1 min read

GPT-4 dropped yesterday and the internet is already on fire.

#gpt-4 #ai #openai #language-models

The AI Gets Confused evolved

2023-03-06 · 2 min read

On delegating creative decisions to something that has opinions about it.

#ai #chatgpt #generative #tooling #2023

The Itinerary Is Fake But the Links Are Real evolved

2023-02-07 · 2 min read

A travel app that uses ChatGPT to hallucinate your vacation, then links to the hallucinations.

#llm #chatgpt #product #travel #hallucination

Two Companies Have API Docs. Two. persists

2022-12-14 · 1 min read

A field that should be boring to add turns out to be mostly empty.

#ai #visual-search #enterprise #apis #retail-tech

Booking.com Has a Research Blog and I've Been Missing It irrelevant

2022-12-13 · 2 min read

The company that books your mediocre Amsterdam apartment is also publishing serious machine learning research, apparently.

#machine-learning #research #industry #recommendations

///fear.movie.lions irrelevant

2022-12-03 · 1 min read

Stone Brewing named an IPA after a what3words address, which is either the most inspired beer name in years or a sign that we've fully run out of words that aren't owned by someone.

#beer #what3words #branding #stone-brewing #location-tech

The Quietest Hotel in the World (Until It Isn't) nailed it

2022-11-16 · 1 min read

On Futurepedia, static claims, and data that becomes a lie while you sleep.

#data #ai-tools #epistemics #futurepedia

The Browser Has Been the Same Since 2004 and Then It Wasn't evolved

2022-11-09 · 2 min read

Arc showed up and made me feel something I haven't felt about a browser since Firefox told Internet Explorer to go home.

#browsers #arc #mac #ux #software

Five Stars, Would Not Recommend the Shell Model evolved

2022-10-28 · 1 min read

A synthesized review achieves something no human reviewer could have done on purpose.

#nlp #retrieval #failure-modes #absurdism