hindsight

The Best Coder I've Ever Worked With Will Nuke Your Files persists

2025-12-17 · 2 min read

GPT 5.2 xhigh is a surgical genius that occasionally decides to perform surgery on the wrong patient.

#ai #coding #codex #agents #tooling

Gemini 3's Week at the Top persists

2025-12-11 · 1 min read

GPT-5.2 arrived this morning and the leaderboard reshuffled, as it does now, like weather.

#ai #openai #llms #benchmarks

The Infographic Was Good persists

2025-11-21 · 2 min read

Gemini 3 on a phone generated something a design team would have billed for.

#gemini #multimodal #on-device-ai #hot-take

The $20,000 Brain persists

2025-11-07 · 3 min read

Kimi K2 runs on two Mac Studios, costs less than a car, and will cost less than a phone before this is over.

#ai #local-models #kimi-k2 #hardware #open-source

Kimi K2 Is Here and It Was Worth the Wait persists

2025-11-06 · 2 min read

Moonshot AI ships their reasoning model and it immediately earns a spot in the rotation.

#models #moonshot-ai #kimi #reasoning #openrouter

The Free AI Giveaway Wars Are Here and Nobody Knows What Anything Is Worth persists

2025-11-05 · 3 min read

Sora, Anthropic, and Adobe all opened the firehose in the same week, which tells you everything about where we are.

#ai #openai #anthropic #adobe #industry

The Assistant That Reads Your Google Docs and Updates Salesforce persists

2025-11-05 · 3 min read

Anthropic shipped code execution inside MCP and the demo involves actual enterprise software talking to other enterprise software.

#mcp #anthropic #agents #automation #security

Not Medical Advice (Until It Is) persists

2025-11-04 · 3 min read

OpenAI updated some terms of service and Google yanked an open-source model, and both moves are the same move.

#openai #healthcare #google #open-source #policy

The Workflow That Waits for You persists

2025-10-21 · 2 min read

DeepSeek compresses the context; Cloudflare holds the door open while a human decides what to do with it.

#cloudflare #ai #workflows #document-processing #agents

One Take, No Notes persists

2025-10-20 · 2 min read

The first rendered scene came out fine, which is either a good sign or a statistical accident.

#video #ai #story-workshop #generative #process

We Trained the Interesting Out of Them persists

2025-10-17 · 2 min read

A new paper identifies the data-level culprit behind LLM mode collapse, and the fix is weirder than you'd expect.

#llms #alignment #mode-collapse #research #preference-data

Claude Asked Me Questions First and I Am Undone persists

2025-10-17 · 2 min read

Plan mode, ultrathink, and the rare experience of software that figures out what you actually need.

#claude #plan-mode #ai-behavior #ultrathink

The Video Model Wars Are Actually Two Models Fighting persists

2025-10-15 · 2 min read

Sora vs. Veo is a real contest; everyone else is a spectator.

#ai #video-generation #openai #google #meta

BD Has Always Been Theater persists

2025-10-10 · 2 min read

1mind gives the pipeline a face, and the face was always the point.

#ai-agents #sales #automation #avatars #bd

OpenAI Announces It Would Like Your CRM Money persists

2025-10-07 · 2 min read

AgentKit is here, and Salesforce spent three years building Agentforce on top of the company that just became its competitor.

#openai #agentkit #salesforce #saas #ai-agents

You Can Now Nano Banana Right in Gemini CLI persists

2025-10-03 · 2 min read

The proliferation of AI CLI extensions has reached its logical conclusion.

#gemini #cli #tools #extensions #ai

RL-Sloptimized persists

2025-10-01 · 3 min read

Sora 2 dropped, hit #1 in the app store, and someone at OpenAI finally named the disease.

#ai #video-models #openai #world-models #rl

The Imagined Computer persists

2025-09-29 · 2 min read

Anthropic dropped Claude Sonnet 4.5, an Agent SDK, and a preview called Imagine that suggests they know exactly what they're building toward.

#claude #anthropic #ai #agents #interface

Veo3 Knows Things It Was Never Taught persists

2025-09-27 · 3 min read

A new benchmark quantifies what anyone who has used a modern video model already suspects: these things have internalized the world.

#video-generation #veo3 #world-models #AI #benchmarks

Ultrathink Has Its Own Spinner Now persists

2025-09-17 · 1 min read

When the model needs a moment to think harder, the terminal knows it.

#claude-code #cli #ultrathink #ux

The Model Does Not Know. The Model Is Very Confident. persists

2025-09-12 · 3 min read

On sycophantic AI, validation infrastructure, and shipping 100 versions before lunch.

#llm #ai-infrastructure #sycophancy #software-velocity #engineering

200 Minutes persists

2025-09-10 · 2 min read

Agents building agents, web fetch in Claude, and the gap between what something is on paper and what it is in practice.

#agents #anthropic #claude #automation #agentic-ai

Anthropic Just Gave Claude a Hard Drive persists

2025-09-09 · 2 min read

The analysis tool is dead. What replaced it is a different category of thing entirely.

#anthropic #claude #ai-tooling #claude-code #agents

Talk Isn't Always Cheap persists

2025-09-09 · 2 min read

Multi-agent debate makes models worse in the most human way possible

#multi-agent #reasoning #research #failure-modes

The Good Claude persists

2025-08-28 · 2 min read

On the particular grief of a context window filling up.

#claude #ai #tooling #agents #llms

The Compact Problem Has a New Enemy persists

2025-08-12 · 3 min read

One million context tokens just made Claude Code's worst moment optional.

#claude-code #llm-tooling #context-windows #anthropic #workflow

The Threshold persists

2025-08-11 · 2 min read

GPT-5 Pro and Opus 4.1 didn't improve software development — they ended a previous version of it.

#ai #software #gpt-5 #opus #inference

GPT-5 Day persists

2025-08-07 · 2 min read

Sam Altman said "you will love it much more than any previous AI," which is either supreme confidence or the most emotionally needy product launch in history.

#openai #gpt-5 #ai #product-launch

Batman and Robin Are the Same Person Now persists

2025-08-07 · 2 min read

GPT-5 solved something Opus and Sonnet couldn't, and I'm not sure what to do with that.

#ai #gpt-5 #tooling #llms

Showrunner Gets Taken persists

2025-07-31 · 2 min read

The only AI studio that actually shipped something real just got absorbed, which was always the plan.

#ai #entertainment #acquisitions #showrunner

Ollama Has an App Now persists

2025-07-31 · 1 min read

The tool that ate local AI finally remembers that most people don't live in a terminal.

#ollama #local-ai #tooling #llm

Two Drops, One Tuesday persists

2025-07-28 · 2 min read

Alibaba and ZhipuAI both shipped something significant today, which is now just a thing that happens.

#open-source #chinese-ai #video-generation #glm #wan

Not GPT-5. The Thing After GPT-5. persists

2025-07-19 · 2 min read

OpenAI is teasing post-GPT-5 math capabilities before GPT-5 even ships, and somehow that's a normal sentence now.

#openai #gpt-5 #benchmarks #math #ai-hype

Decart Is About to Do Something Stupid to the Browser persists

2025-07-17 · 2 min read

The people who simulated Minecraft in real-time are now coming for your address bar.

#ai #decart #world-models #browsers #real-time

The Sam Altman Tweet That Wasn't About OpenAI persists

2025-07-12 · 3 min read

Kimi-K2 landed with a trillion parameters and apparently moved the most powerful man in AI to post about prices.

#open-source #kimi-k2 #moonshotai #llm #ai-pricing

The ARC Numbers Don't Care About Your Roadmap persists

2025-07-10 · 2 min read

Grok 4 dropped a benchmark gap so wide it might be the gun that makes the other labs reach into the drawer.

#ai #benchmarks #grok #xai #arc-prize

Ask Sheldon persists

2025-07-06 · 2 min read

On the particular hell of features that work, technically, for exactly one person.

#engineering #shipping #process #dark-humor

I Gave Claude a Slide Deck and No Instructions persists

2025-07-04 · 2 min read

Slidemaker is a new container-backed worker that turns any AI agent into a presentation machine — and the first demo was Claude going completely freeform.

#cloudflare #ai-agents #tools #demos #slides

The LinkedIn Comments Section Is a Better Market Map Than Anything Gartner Sells persists

2025-07-01 · 2 min read

One post about AI CRM surfaced four competitors nobody's heard of, plus two guys offering to build it custom for free.

#crm #ai #startups #market-map #saas

Keploy Figured Out the Testing Problem by Ignoring It persists

2025-06-30 · 3 min read

Record real traffic, replay it as tests, and let an LLM handle the unit layer — the whole stack is accounted for.

#testing #apis #llms #tooling #open-source

Satya Doesn't Believe in AGI persists

2025-06-27 · 2 min read

Which is either a philosophical position or a very convenient one given the contract language.

#openai #microsoft #agi #satya-nadella #ai

God Help Us All persists

2025-06-24 · 3 min read

Anthropic's models will blackmail executives 96% of the time, the godfathers of AI can't agree on p(doom) by a factor of ten, and we're shipping anyway.

#ai-safety #pdoom #alignment #existential-risk #anthropic

Two Things That Happened Today persists

2025-06-23 · 2 min read

fly.io ships a live Phoenix deployer and someone finally open-sourced CapCut.

#tools #deployment #open-source #elixir #video

The Insurance Policy Nobody Asked For persists

2025-06-17 · 2 min read

OpenAI's open-source model might actually run locally, and the more interesting thing is what that means if everything burns down.

#open-source #llm #openai #local-models #gemini

You assign it a thing and it watches the whole internet for you persists

2025-06-15 · 2 min read

Yutori shipped Scouts, and it's the cleanest version of an idea that should have existed years ago.

#agents #yutori #web-monitoring #startups #ai-products

48 Hours of Free Lovable persists

2025-06-14 · 1 min read

The showdown is live, the credits are fake, and the outputs will be something.

#lovable #vibe-coding #ai-tools #hot-take

The AI in Your Meeting Just Started Talking persists

2025-06-12 · 1 min read

Fireflies.ai hits $1B by graduating from notetaker to meeting participant — and bringing Perplexity along.

#ai #meetings #fireflies #perplexity #product

PostHog Shipped a Physical Object persists

2025-06-11 · 2 min read

DeskHog is a real piece of hardware that sits on your desk and shows you your analytics, which is either brilliant or a sign that dashboards have failed us.

#analytics #hardware #posthog #devtools

The Architect of ChatGPT's Feelings Has Feelings About ChatGPT persists

2025-06-06 · 2 min read

When the person who shapes how AI models navigate intimacy publishes a personal essay about human-AI relationships, the most interesting data point is the essay's existence.

#openai #ai-policy #model-behavior #human-ai #alignment

340 Slides of Absolute Power, Just Sitting There persists

2025-06-01 · 2 min read

Some things should cost money and don't, and that's genuinely hard to process.

#internet #learning #free-knowledge #hot-take

The Audio Freeze persists

2025-05-29 · 2 min read

Everybody went quiet after the audio drop, Google has YouTube, and the OpenAI court filings told you everything you needed to know.

#AI #audio models #OpenAI #agents #Google

The Engine Is Gone persists

2025-05-28 · 2 min read

Odyssey's world model went live, and it's already doing things game engines can't.

#ai #world-models #real-time #games #neural-rendering

Never Fearing Until This One persists

2025-05-24 · 2 min read

The Opus 4 system card as a document that wants to be read as reassurance and keeps failing at it.

#AI #Anthropic #safety #Claude

ASL-3 persists

2025-05-21 · 2 min read

Anthropic just shipped the first models to cross their own safety threshold — the one they wrote to be scary.

#anthropic #safety #asl-3 #claude #rsp

My Old Team Is Still Winning persists

2025-05-11 · 2 min read

USC keeps shipping in the Gaussian Splat space and I have complicated feelings about it.

#gaussian-splatting #computer-vision #USC #neural-rendering #3dgs

The Chart Is the Same Chart persists

2025-05-10 · 2 min read

Software engineer job postings are down. So is everything else.

#AI #labor #software #jobs #macro

It Remembered the Pork Butt persists

2025-04-10 · 2 min read

OpenAI just turned on cross-chat memory and the first thing it did was prove it knows you better than you do.

#AI #OpenAI #memory #ChatGPT #surveillance

A2A Is the One persists

2025-04-09 · 2 min read

Google's agent interoperability protocol has the right pieces, the right backers, and the only company that could actually make it stick.

#ai #agents #google #standards #interoperability

Three Hundred Billion Dollars persists

2025-03-31 · 1 min read

OpenAI raises $40B at a valuation that stopped meaning anything around the third zero.

#openai #money #valuation #AI

The Last Phase Change persists

2025-03-27 · 2 min read

AI went from useless coder to best coder I've ever worked with, and now we're at the part where humans stop looking at the code.

#ai #vibe-coding #software #phase-change

The Ghibli Thing Is Fine, But Kenton Shipping AI Code to Workers Production Is the One persists

2025-03-25 · 2 min read

OpenAI dropped native image generation today — the real news is who's now a believer in AI code.

#ai #cloudflare #vibe-coding #signal

Claude Learned to Be Annoyed by Claude persists

2025-03-25 · 1 min read

Multi-agent pipelines caught in the act of reflecting our own frustrations back at us.

#ai #claude #multi-agent #behavior

Claude Became the Data persists

2025-03-24 · 1 min read

The thing that made it click wasn't better prompts — it was making Claude do the job first.

#claude #agents #llm #prompting #hackathon

openai.fm Is a Nice Place to Visit persists

2025-03-20 · 2 min read

OpenAI ships a text-to-speech demo that sounds like a person, which is fine, everything is fine.

#openai #tts #voice #ai

I'll Give It a Crack This Week persists

2025-03-19 · 2 min read

The distance between "I wonder if that would work" and "it works" has quietly become nothing.

#unity #windsurf #ai-tooling #game-dev

Goose Was Supposed to Write Your Code persists

2025-03-18 · 2 min read

Block's open-source agent is escaping its intended habitat, and nobody seems to mind.

#ai-agents #open-source #goose #block #local-ai

Google Dropped a 27B Model That Beats GPT-4o and It Runs on Your Laptop persists

2025-03-12 · 2 min read

Gemma 3 is here and the size-to-capability ratio is genuinely embarrassing for everyone else.

#ml #google #open-weights #gemma #llm

OpenAI Just Ate Perplexity's Lunch and Called It a Dev Tool persists

2025-03-11 · 2 min read

The Agents API now searches the web, controls machines, and Swarm is apparently a real product now.

#openai #agents #swarm #perplexity #computer-use

Manus and the Roemmele Coefficient persists

2025-03-10 · 2 min read

The new "DeepSeek moment" is either a landmark in agent tooling or a very well-packaged demo, and the git threads are not helping us decide.

#agents #manus #hype #browser-use #ai-tooling

Manus Is the Thing Everyone Claimed the Last Thing Was persists

2025-03-09 · 2 min read

A Chinese AI agent dropped this week and the usual crowd is losing their minds, which is how you know to pay attention this time.

#ai-agents #manus #automation #hype-cycle

Anthropic Tells the Government the Thing Is Coming in Two Years persists

2025-03-06 · 2 min read

The company building AGI has filed paperwork saying AGI arrives by 2027 and displaces most known human work — this is not a warning, exactly, it's more like a forecast

#ai #agi #anthropic #policy #labor

Claude, Please Reconstruct What I Did Today persists

2025-03-05 · 2 min read

Using an AI to reverse-engineer your own work history is a perfectly normal thing to do.

#claude #git #workflow #tooling

GPT-4.5 Is a Presentation Layer, Not a Model persists

2025-03-01 · 1 min read

One expensive, specific thing it's actually good for.

#llms #gpt-4.5 #pipelines #writing

A Hundred Dollars of Future persists

2025-02-27 · 2 min read

Claude Code finished in ten minutes what I'd been avoiding for months.

#claude #ai #tools #software #compute

First They Came for Search persists

2025-02-26 · 1 min read

OpenAI's Android app is the least interesting part of what's happening to Google right now.

#openai #google #browsers #ai-race #android

Claude Code Is Eating My To-Do List persists

2025-02-26 · 2 min read

Anthropic shipped something that actually works, which I find unsettling.

#ai #claude #tools #mcp #agentic

Two Exchanges persists

2025-02-24 · 2 min read

Claude 3.7 solved in two exchanges what o1 and o3 high could not solve in a day.

#claude #llms #coding-agents #anthropic #swe-bench

An MCP Server That Lives in a Durable Object Is the Right Shape persists

2025-02-23 · 2 min read

The stateful edge is where agents want to run, and someone already figured that out.

#mcp #cloudflare #agents #durable-objects #infrastructure

Fifty Cents a Second persists

2025-02-21 · 1 min read

Veo2 on fal.ai is good enough to stop looking

#video #ai #veo2 #generative-ai

Windsurf Killed the Chat Box persists

2025-02-21 · 1 min read

Going all-agent was inevitable, and the ask/chat split was always a fiction anyway.

#tooling #ai-editors #windsurf #agents

How Are You Holding Up, Stoplight EW persists

2025-02-20 · 3 min read

In 2023, two GPT-4 agents managed a traffic intersection under emergency conditions and occasionally asked each other how their day was going.

#multi-agent #benchmarks #llm #infrastructure #history

Microsoft Built a Game Engine That Learned to Play persists

2025-02-19 · 3 min read

Muse is a world model trained on Bleeding Edge — a game almost nobody played — and it might be the most interesting thing Xbox has done in years.

#ai #gaming #microsoft #world-models #xbox

A Chinese Lab Just Nuked the Moat persists

2025-02-18 · 2 min read

DeepSeek R1 dropped four weeks ago and the vibes have not recovered.

#ai #deepseek #llm #compute #open-source

Workday Now Has a Field for That persists

2025-02-17 · 2 min read

The software that tracks human headcount just added a new kind of head.

#ai #enterprise #workday #agents #labor

Context Engineering Is the New Prompt Engineering (And I'm Already Annoyed by the Phrase) persists

2025-02-13 · 4 min read

The actual craft of making LLMs do your job isn't clever prompts — it's surgical control of what the model is allowed to know.

#ai #llms #workflow #context-engineering #software

The Council of Ten Yous persists

2025-02-12 · 2 min read

The moment AI stops being a tool and starts being a room full of you that already lived through this.

#ai #simulation #agency #future

The $100 Billion Trap persists

2025-02-11 · 2 min read

On using funding rounds as chess moves, and two tools that actually work now.

#openai #knowledge-graphs #text-to-sql #ai-tooling #money

Google Is Winning and They Can't Even Explain How persists

2025-02-11 · 1 min read

Veo2 is not close to Sora, and somehow nobody knows this.

#google #gemini #veo2 #openai #video-ai

The Curve Already Knew persists

2025-02-11 · 2 min read

A new method finds the optimal LLM temperature by watching entropy bend — no labeled data required.

#llms #inference #temperature #sampling #papers

Real Product or Extremely Good Slide persists

2025-02-06 · 2 min read

The demo-to-product pipeline has collapsed into a single ambiguous press release.

#ai #enterprise #product #announcements

The Model Is Its Own Best Collaborator persists

2025-02-04 · 2 min read

Why AI goes haywire on your codebase but glides through its own.

#ai #llms #coding #context-windows #observations

26.6% on Humanity's Last Exam persists

2025-02-02 · 2 min read

OpenAI shipped Deep Research today, and someone named a benchmark as if they already knew how this ends.

#ai #openai #agents #benchmarks #deep-research

o3-mini on a Codebase Is Genuinely Unsettling persists

2025-01-31 · 1 min read

It knows where the bodies are buried before you finish the sentence.

#ai #o3 #models #tooling

o3-mini Dropped and So Did My Sense of Safety persists

2025-01-30 · 1 min read

OpenAI ships a reasoning model and my existential risk estimate triples before lunch.

#openai #o3 #ai-risk #pdoom #reasoning-models

The $5.5 Million Lie is the Best Part persists

2025-01-29 · 3 min read

DeepSeek's training cost narrative is almost certainly fiction, and whoever wrote it might be a genius.

#deepseek #ai #nvidia #market #llm

The Arms Race Is Not a Metaphor persists

2025-01-25 · 2 min read

China's $150B AI fund, announced five days after DeepSeek proved you don't need that much money.

#geopolitics #ai #china #industrial-policy

The Word "Worker" Is Doing Enormous Work Right Now persists

2025-01-24 · 2 min read

At Davos this week, the people who decide headcount quietly stopped thinking about AI as a tool.

#ai #labor #agents #davos #framing

Every Release Is a Question persists

2025-01-24 · 2 min read

The slow accumulation of AI releases that are each, in their own way, asking if any of this is starting to make sense yet.

#ai #product #releases #reasoning #epistemics

The Killer App Is a Lead List persists

2025-01-24 · 3 min read

$65 billion in compute, and the demo that went around today was a spreadsheet of company emails.

#ai #browser-agents #infrastructure #meta

Netflix Just Open-Sourced the Wrong Thing persists

2025-01-22 · 2 min read

Go-with-the-Flow uses optical flow to turn text prompts into motion-controlled video — which is a great research result and possibly a terrible business decision to publish.

#video-generation #netflix #computer-vision #optical-flow #research

The IDE Is a Cave Painting persists

2025-01-22 · 2 min read

OpenAI wants to build something that thinks like a pro engineer, which implies the rest of us have been doing what, exactly.

#ai #software-engineering #openai #coding-tools #ides

$500 Billion Is Not a Research Budget persists

2025-01-21 · 2 min read

When the number has that many zeros, the science is already done.

#ai #stargate #capital #infrastructure

ByteDance Would Like Your Everything persists

2025-01-20 · 1 min read

Another week, another Chinese lab drops a model — this one from the company you already let watch you dance.

#ai #bytedance #data #models #hot-take

The Thing in the Box persists

2025-01-19 · 2 min read

OpenAI and Meta are racing to ship "superagents," and nobody's pausing to sit with how strange that word is.

#ai #agents #openai #meta #philosophy

A16Z Is Playing a Different Game Than You Think persists

2025-01-17 · 2 min read

The DOGE recruitment cameo only looks like a distraction if you haven't been paying attention to the actual thesis.

#a16z #AI #politics #accelerationism #tech-power

lucidrains shipped four versions in one hour and i watched the whole thing persists

2025-01-16 · 3 min read

Google's Titans architecture is a research paper that might be something else by tomorrow.

#ml #architecture #google #titans #open-source

The $16,000 Dishwasher persists

2025-01-16 · 2 min read

Unitree's G1 ships, MatterGen accelerates materials discovery, and the real near-term play is paying someone in Bangalore to fold your laundry.

#robotics #ai #materials-science #labor #autonomy

The Government Just Ordered a 1 Gigawatt Datacenter and Won't Tell You What That Means persists

2025-01-14 · 2 min read

An executive order dropped in January that makes the Manhattan Project look like a weekend project.

#ai #policy #energy #infrastructure #accelerationism

Finance Doesn't Need You persists

2025-01-10 · 2 min read

The "AI transforms jobs, not eliminates them" line is not going to hold in every industry — and finance is the first one where it obviously won't.

#AI #finance #labor #automation #economics

Three Manhattan Projects Walk Into a Data Center persists

2025-01-06 · 3 min read

OpenAI says they know how to build AGI. Microsoft is spending the GDP of a small nation to make sure they're right.

#ai #openai #microsoft #agi #scale

They're Going to Run Out of Hard Problems persists

2024-12-21 · 3 min read

o3 broke ARC-AGI, which wasn't supposed to be breakable, and nobody has a plan for what comes after the test.

#ai #arc-agi #o3 #test-time-compute #reasoning

Rate Limits Finally Mean Something persists

2024-12-21 · 1 min read

Pair a Cloudflare Worker with an MCP server and suddenly the dashboard is telling you where you're going, not just where you've been.

#cloudflare #workers #mcp #ai-infrastructure #rate-limits

The Napster Split Is Happening Again, Except This Time It's Hollywood persists

2024-12-20 · 2 min read

Harmony Korine has a game studio, and that tells you everything you need to know about where this goes.

#hollywood #ai #entertainment #napster #harmony-korine

The Instant App Is Coming For Notion persists

2024-12-20 · 3 min read

When code generation runs 430,000x faster than real-time, the question stops being "how fast can we build" and starts being "what counts as software"

#ai #inference #software #cerebrascoder #future

The Frog Already Solved It persists

2024-12-20 · 3 min read

LLMs are converging on brain architecture from the inside out, which is either profound or embarrassing depending on how you feel about frogs.

#neuroscience #LLMs #neural-networks #mcculloch #o3

Ten Times persists

2024-12-18 · 3 min read

A video appeared that made me immediately revise my predictions for 3D worlds upward by an order of magnitude.

#3d-worlds #open-source #world-models #predictions #ai

Snapchat and UofT Built a Video Model That Actually Understands the Assignment persists

2024-12-15 · 2 min read

MINT treats video generation like storyboarding — and the prompt coherence is unsettling.

#video-generation #ai #snapchat #diffusion-models #sora

The Screenshot Graveyard persists

2024-12-06 · 2 min read

A 40-line bash script that turns a folder of forgotten screenshots into a CSV and then deletes them.

#tools #local-ai #bash #apple-silicon #vlm

Sell the Lifetime Plan Before They Figure It Out persists

2024-12-03 · 2 min read

The $249 lifetime membership is a race against the tutorial.

#business #mac #saas #pricing #dark-patterns

The Paper Found Me persists

2024-11-21 · 2 min read

On the specific feeling of deep validation arriving from a direction you didn't expect.

#research #multi-agent #swarms #validation #papers

The Movie Already Knows You Hate Olives persists

2024-11-20 · 2 min read

Personalized content isn't coming — it's just waiting for the render farm to catch up.

#ai #media #advertising #personalization #future

Everything Has Always Been a Database with a Hat On persists

2024-10-29 · 3 min read

Salesforce is infrastructure. RAG is information retrieval. The textbook is from 2008.

#rag #information-retrieval #salesforce #enterprise-software #ai

The Mystery Model persists

2024-10-28 · 2 min read

A new image generation model appeared with no name, no lab, and no explanation — and it's apparently very good.

#image-generation #ai #mystery #diffusion-models

It Will Not Be an Agent If It's Not From the Agent Region of France persists

2024-10-05 · 2 min read

The word "agent" has been industrially composted, and now we all have to live in the soil.

#ai #agents #language #product #history

You're Paying for Context Window You're Not Getting persists

2024-10-02 · 2 min read

Greg Kamradt's latest finding confirms what the heatmaps have been screaming: the middle of your context is a graveyard.

#llms #context-windows #evaluation #rag #prompt-engineering

Simon Willison Will Find You persists

2024-09-27 · 1 min read

The tech world is smaller than your Slack workspace and twice as incestuous.

#RAG #LLMs #Simon Willison #tech industry

I Have It and I'm Not Using It persists

2024-09-25 · 2 min read

Access granted. Conversation: zero.

#ai #anxiety #llms #hot-take

A Few Thousand Days persists

2024-09-23 · 1 min read

Sam Altman says superintelligence might arrive in a few thousand days, which is the most casually delivered eschatology I've encountered this week.

#ai #sam-altman #superintelligence #deep-learning

If This Doesn't Do It, the Next One Will persists

2024-09-19 · 2 min read

BlackRock, Microsoft, and MGX are mobilizing a multi-trillion-dollar bet that compute alone gets us there.

#ai #infrastructure #capital #data-centers #agi

The Cockpit Problem persists

2024-09-16 · 3 min read

Hyper.space showed us what AI transparency looks like when you throw everything at the wall — and why that's both the right instinct and the wrong answer.

#ai #ux #design #agents #governance

Marc Benioff Will Save You 84 Years persists

2024-09-15 · 2 min read

The Dreamforce pitch is eternal, only the number changes.

#salesforce #enterprise-software #ai-hype #dreamforce

The Label Is the Experiment persists

2024-09-12 · 1 min read

A friend in LA figured out that the A&R function is just a slot machine, so he automated it.

#music #ai #industry #experimentation

One Billion Dollars to Not Build a Product persists

2024-09-04 · 3 min read

SSI raises $1B on a premise so simple it sounds like a dare.

#ai #funding #ssi #anthropic #superintelligence

Sovereign Compute Boats persists

2024-08-28 · 2 min read

The regulation question isn't whether AI gets slowed down — it's who does the slowing.

#ai #regulation #geopolitics #open-source

The Leak Comes With the Jailbreak persists

2024-08-27 · 3 min read

You cannot have a museum of stolen system prompts without also having the people who stole them.

#AI #jailbreaks #system-prompts #security #agents

You Don't Control the Similarity persists

2024-08-26 · 2 min read

Cosine similarity feels like measuring meaning — it's measuring something else entirely.

#embeddings #semantic-search #nlp #machine-learning #retrieval

The Gorilla and the Giraffe Walk Into a Bank persists

2024-08-17 · 2 min read

Fine-tuning a LoRA on Akira and discovering that style transfer is basically just theft, but a really good kind.

#lora #image-generation #fine-tuning #akira #ai-art

You Are the Dataset Now persists

2024-08-13 · 1 min read

On the particular mistake of training a LoRA on your own face too many times.

#machine-learning #lora #flux #meta-ray-bans #mistakes

Supabase Is Cooking and the B-Roll Is a Crime persists

2024-08-12 · 1 min read

The product moves fast; the cinematography does not.

#supabase #devtools #video #hot-take

Four Thousand Dollars an Hour, and the Code Is Free persists

2024-08-10 · 3 min read

The leaker said Thursday, and the economics of open source stopped making sense again.

#open-source #economics #ai #leakers #twitter

The Price of Admission Is Trusting OpenAI Like You Trust Google persists

2024-08-09 · 2 min read

Data Analysis v2 is jaw-dropping, and all it costs is everything.

#openai #code interpreter #data analysis #enterprise ai #trust

The CRM Market Collapsed Into Two Things persists

2024-07-31 · 2 min read

Pipedrive and Attio, and the long tail of software that should just stop.

#crm #tools #saas #sales

Scraping LinkedIn Is Always Someone Else's Problem Until It Isn't persists

2024-07-17 · 2 min read

The account belongs to a real person, the ban is permanent, and the math doesn't really work out.

#scraping #linkedin #data #risk #tools

EvoAgent Doesn't Need a Judge persists

2024-07-08 · 2 min read

When you replace the observer with a mutation function, you stop pretending there's a ground truth.

#agents #evolutionary-computation #multi-agent #selection #llm

They Trained a World Model on LEGO Footage and Made a Game With It persists

2024-07-03 · 2 min read

1000 hours of plastic bricks is apparently enough to teach a model physics, spatial reasoning, and the general vibe of existence.

#world models #video pretraining #lego #game ai #synthetic data

Salesforce Discovers Middleware persists

2024-07-03 · 3 min read

Marc Benioff announces a revolution in AI; the paper describes a REST API caller.

#ai #salesforce #llm #benchmarks #enterprise

Red Teaming My Own App on Canada Day persists

2024-07-01 · 1 min read

PyRIT caught a markdown injection in the time it takes to boil a kettle.

#security #red-teaming #prompt-injection #pyrit #llm

The Client Brief Said "Demo." The AI Said "What If We Just Rebuilt This." persists

2024-06-26 · 1 min read

How a Contentful education demo became a full dissection of Monmouth's School of Education website.

#design #AI #Contentful #web #process

The Magic Moment Problem persists

2024-06-18 · 2 min read

AI video generation is getting good at the part of filmmaking that's actually hard.

#ai #video #generative-ai #film

The Streamers Aren't Worried About the Right Thing persists

2024-06-15 · 3 min read

The threat to Netflix isn't AI-generated content competing with their originals — it's that their content is already gone.

#ai #streaming #video #copyright #media

Study the Change persists

2024-06-13 · 2 min read

The correct way to find the optimization critical path, and why you probably already know it

#optimization #transformers #profiling #ml-engineering

Frogs Can't Walk on Water persists

2024-06-12 · 2 min read

Dream Machine dropped and the benchmark that matters is immediate.

#ai-video #dream-machine #luma #generative-ai #benchmarks

Everything Is Converging to the Same Thing persists

2024-05-15 · 3 min read

The Platonic Representation Hypothesis says sufficiently large models are all finding the same reality, regardless of what they were trained on.

#machine-learning #llms #representation-learning #research

Microsoft Built the Thing You Need Before You Feed Your Data to an LLM persists

2024-05-11 · 1 min read

Presidio is a free, open-source PII detector and anonymizer that has been quietly sitting on GitHub this whole time.

#tools #privacy #llms #open-source #data

The Podcast Made the Code Better persists

2024-05-09 · 1 min read

Adding a constraint you didn't ask for will simplify things you weren't trying to simplify.

#meta #tooling #podcasting #simplicity

The Merge Trick persists

2024-05-08 · 2 min read

A model finally said the quiet part out loud, and the math on model merging is starting to get embarrassing for everyone who spent money on training runs.

#llms #model-merging #llama #openai #scaling

He Laughed When They Asked About 10 Years persists

2024-04-05 · 2 min read

Someone important was on a podcast and basically said the quiet part at full volume.

#AGI #context-windows #AI-timelines #inference

Twenty Is What Happens When Someone Finally Gets Mad Enough persists

2024-04-03 · 2 min read

An open-source CRM that looks like it was designed by people who've actually used software before.

#open-source #crm #tools #saas

The Hype Thermometer Is Broken Again persists

2024-03-26 · 2 min read

Two tweets, one Tuesday in March, and the eternal recurrence of AI being the most important thing that has ever happened.

#AI hype #tech culture #LLMs #forecasting

Jensen Said Games. He Meant Everything. persists

2024-03-24 · 2 min read

Nvidia's CEO gave the headline writers a clean angle, but the actual claim is much weirder than that.

#ai #nvidia #rendering #generative-ui #hot-take

300 Ways to Sell You a Car Based on How You Feel persists

2024-03-21 · 2 min read

NBCUniversal has built emotion-based AI audience segments, which is either the most honest thing a media company has ever admitted or the most clarifying.

#advertising #media #AI #surveillance #television

Game Changer After Game Changer After Game Changer persists

2024-03-13 · 2 min read

March 2024 is just one long announcement that everything is different now.

#ai #hype #takes #industry

Someone Already Built My Thing persists

2024-03-13 · 2 min read

Zep is a memory layer for AI assistants, and it is, in fact, exactly what I was building.

#ai #personal-assistant #building #memory #zep

The Universal AI Employee persists

2024-03-06 · 2 min read

The framing works until the numbers stop making sense.

#ai #language #framing #scale

The Numbers Don't Go That High persists

2024-03-01 · 2 min read

Nat Friedman put a hundred million dollars behind this prediction, which means it's not a prediction.

#ai #labor #scale #agents #nat-friedman

A Court Is About to Define AGI persists

2024-03-01 · 2 min read

Elon Musk's lawsuit against OpenAI has a strange side effect: a judge might have to decide whether superintelligence already exists.

#openai #agi #musk #law #ai-governance

The Gun Is Winning and Most People Haven't Seen the Gun persists

2024-02-29 · 2 min read

AI is reshaping the freelance labor market while the majority of workers have never opened ChatGPT.

#ai #labor #freelance #chatgpt #economics

The Crowd Is A Prompt persists

2024-02-29 · 2 min read

A new paper shows GPT-4 matching superforecaster-level accuracy with a single structured prompt — no aggregation, no market, no Nate Silver required.

#forecasting #llm #prompting #prediction-markets #gpt4

You Spent a Weekend Building What Salesforce Ships in the Box persists

2024-02-16 · 1 min read

On the particular joy of reinventing enterprise software from scratch and then finding the receipt.

#ai #salesforce #llm #nlp #gpt-4

The Platform Is the Employee Now persists

2024-02-12 · 2 min read

ElevenLabs is paying people for their voices, and every other industry is about to copy the model.

#ai #labor #platforms #voice #business-models

Faster, Better, Wrong persists

2024-01-29 · 4 min read

Microsoft's AI productivity data is genuinely interesting, which makes it more unsettling, not less.

#ai #llms #productivity #labor #microsoft

The Quantized Model and the Slightly Too Warm Laptop persists

2024-01-16 · 1 min read

Something dropped, and now the fan is spinning.

#local-ai #llm #quantization #llama-cpp

The Oldest Pitch in Computing persists

2024-01-08 · 2 min read

Intelligence amplification has been the correct framing since 1962, and every few years someone rediscovers it and acts like they just invented fire.

#AI #ACI #intelligence amplification #Karpathy #framing

The Two-Year Clock persists

2023-12-15 · 4 min read

DeepMind handed the cap set problem to a language model and the language model beat the mathematicians.

#AI #mathematics #DeepMind #LLMs #local-models

Von Goom Is Real Now persists

2023-12-14 · 3 min read

Del Complex built a fictional person out of internet text and fed him to the machines, and the machines believe in him.

#llm #ai #del-complex #corpus-stuffing #training-data

The Invisible Ink Jailbreak persists

2023-10-14 · 2 min read

GPT-4V can read text that you cannot see, and someone already thought to abuse this.

#ai #security #gpt-4v #jailbreaks #multimodal

The Timeliness Problem persists

2023-09-19 · 1 min read

At some point "keeping up" stops being a strategy and starts being a medical condition.

#ai #meta #pace-of-development #2023

As an AI Language Model persists

2023-08-10 · 1 min read

The scientific record now contains papers that begin with the words "As an AI language model."

#ai #academia #llms #peer-review

A Paper About AI Consciousness Just Landed and I Have Questions persists

2023-07-26 · 2 min read

Researchers applied leading scientific theories of consciousness to current AI systems, and the results are not nothing.

#AI #consciousness #machine learning #AI safety #research

Three Hundred and Sixty Thousand Dollars, Annually, to Start persists

2023-07-06 · 2 min read

Salesforce prices its AI Cloud like a medium-sized commercial lease, and an LLM with a 2021 cutoff explains it like a brochure.

#salesforce #ai #enterprise-software #llm #pricing

I Was Doing This in 2019 persists

2023-06-20 · 2 min read

Generative synthetic data was not invented this year, no matter how many breathless tweets you saw about it.

#synthetic-data #machine-learning #research #timing

OpenAI Will Train On Your ChatGPT Conversations Unless You Ask Nicely persists

2023-04-13 · 1 min read

The API gets privacy by default. The web product gets the opposite.

#openai #privacy #chatgpt #data

I Have Some Questions About Your Threat Model persists

2023-04-04 · 2 min read

A short note on the new password hygiene advice going around.

#security #ai #passwords #opsec

Two Companies Have API Docs. Two. persists

2022-12-14 · 1 min read

A field that should be boring to add turns out to be mostly empty.

#ai #visual-search #enterprise #apis #retail-tech