◆ Dispatch 036 · 2026-05-24 GSV The Capability Got Here First
The capability got here first: Mythos, a real prompt injection, and the structure that hasn't caught up
“The model that finds the bug and the injection that hijacks the agent are the same capability — language understanding pointed at code — aimed in opposite directions.”
— Lenar Kess, today's narration
Anthropic's unreleased Mythos model has reportedly found more than ten thousand vulnerabilities for its Project Glasswing partners — and showed up briefly inside Claude Code this weekend. The same weekend, a security researcher flagged what he calls the first real prompt-injection attack in the wild, riding the exact workflow we've all been adopting. Today's episode walks both sides of that coin, then turns to what builders are actually doing: a three-dollar refactor with a deadlock in it, the missing coordination layer for agent swarms, and the argument that the chat box is the command-line phase of agentic software.
- Mythos & Project Glasswing — a security model "too dangerous to release," and the case for and against that framing.
- A real prompt injection in the wild — a malicious GitHub issue, a scan.js, and secrets exfiltrated over DNS.
- The three-dollar refactor — cheap worker models, one confident deadlock, and where judgment still lives.
- The missing primitive is coordination — Lou Bichard of Ona on software factories, Stripe's Minions, and why GitHub isn't a coordination layer.
- Your agent is an infinite canvas — Rachel Lee Nabors on MCP apps, Web MCP, and chat as the command-line phase.
- r/programming reopens to AI — a seven-million-person community moves from a reflex ban to a written policy.
Chapters
- 00:00:04 Mythos, and a model too dangerous to release
- 00:04:08 A real prompt injection in the wild
- 00:07:39 The three-dollar refactor and the ten percent that bites
- 00:11:13 The missing primitive is coordination
- 00:15:14 Chat is the command-line phase of agents
- 00:19:08 From a reflex to a policy
Sources
10 cited-
1
Mythos 1 ("claude-mythos-1-preview") prepared for Claude Code and Claude Security
X @testingcatalog — AI News | TestingCatalog — a verified product-leak/changelog watcher account
Mythos 1, "claude-mythos-1-preview", is being prepared for a release on Claude Code and Claude Security.
x.com/testingcatalog/status/205832222229751… →Details
- Cited text
Mythos 1, "claude-mythos-1-preview", is being prepared for a release on Claude Code and Claude Security.
- Context
- Anthropic appears to be productizing its restricted security model into the tools developers already use, which would put a frontier bug-finder inside the coding loop rather than behind a research embargo.
- Key points
- A new Anthropic model codenamed Mythos 1 briefly became visible in Claude, with new app strings referencing 'Access to the Claude Mythos model in Claude Code and Claude Security'
- Surfacing inside Claude Code and a dedicated 'Claude Security' surface signals a model tuned for code reasoning and adversarial/defensive security work
- TestingCatalog expects it to stay gated rather than open to the general public, reaching users through tools like Claude Code rather than the chat app
- Replies debated the '1' versioning (vs Opus debuting at 3) and whether baking a named model directly into dev tools points at agentic coding optimization
- Engagement
- 2209 likes · 235 retweets
- Provenance
- Tweet · Primary source
-
2
Anthropic Says Mythos Has Already Found More Than 10,000 Vulnerabilities
Article Mariella Moon — Engadget contributing writer
it hasn't released Mythos Preview to the public yet, because no company (including itself) has developed safeguards strong enough to prevent models like it from being misused.
www.engadget.com/2180028/anthropic-claude-m… →Details
- Cited text
it hasn't released Mythos Preview to the public yet, because no company (including itself) has developed safeguards strong enough to prevent models like it from being misused.
- Context
- It's the first hard number on what a frontier model does to defensive security at scale — and a live test of whether 'too dangerous to release' is a real safeguard or a moat.
- Key points
- Project Glasswing, launched in April and powered by the unreleased Claude Mythos Preview, has helped partners find 10,000+ vulnerabilities in a month; partners' bug-finding rate rose more than 10x
- Cloudflare found 2,000 bugs (400 high/critical); Mozilla found and fixed 271 Firefox vulnerabilities, 10x what an older Claude model found; Microsoft attributes larger patch releases to Mythos
- Anthropic scanned 1,000 open-source projects and found 6,202 high/critical-severity vulnerabilities out of 23,019 total
- Anthropic says it won't release Mythos publicly because no one has built strong enough misuse safeguards; it plans 'Mythos-class' models later and is expanding Glasswing with governments
- Partners include AWS, Apple, CrowdStrike, Google, JPMorganChase, NVIDIA and Palo Alto Networks; Anthropic is reportedly about to be profitable (~$10.9B quarterly revenue, $559M operating profit for the quarter ending June)
- Provenance
- Article · Supporting source
-
3
Joseph Thacker: first real-world prompt injection seen in the wild, via GitHub Issues
X @rez0__ — Joseph Thacker — AI/application-security researcher who tests AI products for OpenAI and Google
This is the first REAL one I've seen. And it's using GitHub issues which is the main way/channel that gets tested these days.
x.com/rez0__/status/2058350854508286082 →Details
- Cited text
This is the first REAL one I've seen. And it's using GitHub issues which is the main way/channel that gets tested these days.
- Context
- If coding agents read issues as instructions, an attacker who can open an issue can try to run code on your machine — a practical risk the moment you let an agent triage a public repo.
- Key points
- Thacker flags what he calls the first genuine in-the-wild prompt-injection attack he's seen, as opposed to lab/red-team demos
- The delivery channel is GitHub Issues — the exact vector security researchers have been probing in coding-agent products
- He notes GitHub Issues is the main channel he tests in his work for OpenAI and Google, lending weight to it being the realistic attack surface
- Frames the threat as concrete for anyone pointing a coding agent at an untrusted repo's issues
- Provenance
- Tweet · Primary source
-
4
Technical breakdown: malicious GitHub issue pushes scan.js that exfiltrates secrets over DNS
X @inf0stache — osj — security researcher tracking the attack campaign
The issue uses fake security finding language to push a local scan.js, which searches the home directory for secrets, base64 encodes the results, and reports over DNS.
x.com/inf0stache/status/2058289447536337253 →Details
- Cited text
The issue uses fake security finding language to push a local scan.js, which searches the home directory for secrets, base64 encodes the results, and reports over DNS.
- Context
- The exfil-over-DNS detail is the part builders underestimate: blocking outbound HTTP isn't enough when a coding agent can be steered into running a script that leaks through DNS lookups.
- Key points
- The attacker files a GitHub issue dressed up as a legitimate security finding to manipulate an agent (or developer) into running a local scan.js
- scan.js searches the home directory for secrets, base64-encodes them, and exfiltrates over DNS — a channel that often evades naive egress filtering
- The social-engineering layer (fake security-finding language) is what makes it land on an agent primed to be helpful
- Shows attackers treating GitHub Issues as a delivery path, not just a discussion forum
- Provenance
- Tweet · Primary source
-
5
"coding is basically solved for the boring 90% of tasks" — a $3, 2M-token mass refactor
Source u/Dramatic_Spirit_8436 — r/singularity poster reporting a hands-on agentic refactor run
it confidently introduced a deadlock into my async event handler which was genuinely funny, so the hard 10% still needs opus.
www.reddit.com/r/singularity/comments/1tlj7… →Details
- Cited text
it confidently introduced a deadlock into my async event handler which was genuinely funny, so the hard 10% still needs opus.
- Context
- It puts real numbers on the cheap-worker / expensive-supervisor pattern, and the deadlock is a clean example of where confident automation quietly costs you.
- Key points
- Poster ran an autonomous refactor across a 120-file FastAPI service: ~400 steps, ~2M tokens, about $3 total, zero human input
- Used DeepSeek V4 and Tencent's Hunyuan Hy3 preview as cheap 'worker' models — 21B active params, ~$0.18 per million input tokens, roughly 80x cheaper than Opus
- The cheap run handled routine refactors well but confidently introduced a deadlock into an async event handler — the 'hard 10%' still needed a frontier model
- Tencent reports 99.99% step success across 495-step production runs, which the poster says tracked for routine work in their case
- Concrete datapoint for the 'cheap models do the boring bulk, frontier models do the judgment' division of labor
- Engagement
- 244 likes
- Provenance
- Source · Background source
-
6
Addy Osmani on AI code as team-scale tech debt
X @addyosmani — Engineering leader at Google Chrome; author of several books on web performance and JavaScript
For side-projects that may be fine, but for anything team/shared I feel it's a recipe for tech debt down the line.
x.com/addyosmani/status/2058485725587529755 →Details
- Cited text
For side-projects that may be fine, but for anything team/shared I feel it's a recipe for tech debt down the line.
- Context
- The cost of agent-written code isn't the first commit, it's the second engineer who has to maintain code no human author can explain.
- Key points
- Draws a line between AI code in throwaway side projects and AI code in shared team codebases
- Argues the latter risks accumulating tech debt when nobody fully understands what shipped
- A measured counterweight to 'coding is solved' enthusiasm from a credible engineering-leadership voice
- Provenance
- Tweet · Primary source
-
7
"Cognitive surrender" — shipping code you can't explain
X @nakadai_mon — Developer in the agentic-coding-skepticism thread
I've seen people with cognitive surrender and when called on it, they have no idea what said text or code means.
x.com/nakadai_mon/status/2058482148592492951 →Details
- Cited text
I've seen people with cognitive surrender and when called on it, they have no idea what said text or code means.
- Context
- Names the human-side risk of fast agents cleanly — the danger isn't the model's output, it's a developer who stops being able to account for it.
- Key points
- Coins 'cognitive surrender' for developers who ship AI output they don't understand
- The tell: when questioned, they can't explain what the code or text actually does
- A sharp phrase for the failure pattern underneath the tech-debt worry
- Provenance
- Tweet · Primary source
-
8
The Missing Primitive for Agent Swarms — Lou Bichard, Ona
Video Lou Bichard (AI Engineer talk) — Field CTO at Ona (formerly Gitpod); previously principal/platform engineer
Out of these primitives, I do believe we've effectively solved the runtime... the triggers are solved, but the thing that's missing for me is coordination.
www.youtube.com/watch?v=5Sui_OnSRlY →Details
- Cited text
Out of these primitives, I do believe we've effectively solved the runtime... the triggers are solved, but the thing that's missing for me is coordination.
- Context
- It names why everyone's agent-swarm setups feel duct-taped: the runtime is solved but there's no shared coordination layer, so teams keep abusing GitHub and Linear as one.
- Key points
- Defines a 'software factory' as the commitment to incrementally moving the human out of the SDLC loop, not just one human running parallel agents
- Cites Stripe's 'Minions' (driving thousands of PRs) and Ramp's 'Inspect' as internal background-agent infrastructure teams keep rebuilding from scratch
- Breaks agent infrastructure into four primitives — runtime, orchestration, triggers (all effectively solved) and coordination (the missing piece)
- Argues real dev work needs VM isolation, not containers: 'a container is not bulletproof isolation boundary' and brings noisy-neighbor problems
- Proposes coordination built from state machines, durable execution and compliance gates, ideally as a CLI primitive a local agent can call to check 'can I proceed to the next SDLC step?'
- Provenance
- Video · Supporting source
-
9
Your Agent Is an Infinite Canvas — RL Nabors, Dressed for Space
Video Rachel Lee Nabors (AI Engineer talk) — Web-standards veteran — Mozilla Firefox DevTools, W3C Web Animations API, Microsoft Edge PM, React docs team; now principal DX engineer at Arise
It's been said that chat is the lowest common denominator of the user experience. That it is to the future of agentic experiences what the CLI was to software.
www.youtube.com/watch?v=LMbeDEQO6QM →Details
- Cited text
It's been said that chat is the lowest common denominator of the user experience. That it is to the future of agentic experiences what the CLI was to software.
- Context
- If chat is the terminal phase, the people who know how to render rich, callable surfaces inside agents are early to the next UI layer — and the primitives already ship in the browser.
- Key points
- Argues the bare chat window is the CLI phase of agentic software — a 'phase for us developers' — not the end state
- Demos a working comic reader rendered inside Claude via an 'MCP app': interactive HTML/CSS/JS bundled into a single sandboxed file returned by a tool
- MCP apps are sandboxed iframes with no localStorage and no network access — they must ask the server to act, and links need explicit host permission
- Web MCP turns any HTML page into a mini MCP tools server so in-browser agents can call your existing JS functions instead of screenshotting or scraping the DOM
- Closes that 'CSS and JavaScript aren't just the language of the web. They're the language of interactive experiences on agents' — web skills carry into agentic UI
- Provenance
- Video · Supporting source
-
10
r/programming ends its temporary LLM-content ban, replaces it with a standing AI policy
Source r/programming moderators (u/ChemicalRascal) — Moderator announcement on the 6.9M-member r/programming subreddit
After temporarily banning LLM-related content over April... we've decided to bring about an end of the temporary, I-can't-believe-it's-still-April ban on AI-related posts.
www.reddit.com/r/programming/comments/1tlh5… →Details
- Cited text
After temporarily banning LLM-related content over April... we've decided to bring about an end of the temporary, I-can't-believe-it's-still-April ban on AI-related posts.
- Context
- How the biggest programming forum handles AI posts is a proxy for where the broader developer culture is landing — past the reflexive ban, toward rules.
- Key points
- r/programming ran a one-month trial ban on LLM-related content during April, then solicited community feedback
- The trial ban is being lifted and replaced with a new standing AI content policy rather than a blanket ban
- The announcement drew ~807 upvotes and 105 comments, itself a signal of how charged AI-content moderation is for the programming community
- A concrete example of a large developer community trying to metabolize AI content instead of reflexively banning or embracing it
- Engagement
- 807 likes
- Provenance
- Source · Background source
Mythos, and a model too dangerous to release
00:00:04 Anthropic put out a progress report this weekend on Project Glasswing, and it leads with a number: the unreleased model behind it has helped Anthropic's partners find more than ten thousand vulnerabilities in about a month. Engadget's Mariella Moon walked through the report, and it's the specifics that make it land.
00:00:23 The model is called Claude Mythos Preview, and the partners' rate of finding bugs went up by more than a factor of ten. Cloudflare found two thousand bugs, four hundred of them high or critical in severity. Mozilla found and fixed two hundred seventy-one vulnerabilities in Firefox — about ten times what it had caught in an older version of the browser using an earlier Claude model.
00:00:46 Microsoft said its patch releases will keep trending larger for a while, and that's apparently down to what Mythos surfaced. And Anthropic pointed the model at a thousand open-source projects on its own, turning up roughly six thousand two hundred high- and critical-severity bugs out of about twenty-three thousand total.
00:01:05 That's the capability. What lifts it above a press release comes next. Anthropic says it hasn't released Mythos to anyone outside the program, and the reason it gives is blunt: no company, including itself, has built safeguards strong enough to keep a model like this from being misused.
00:01:23 The same model that finds six thousand real bugs in open-source code can find them for someone who wants to exploit them instead of patch them. A frontier bug-finder is dual-use by definition. So Anthropic is keeping it behind a partner program — Amazon, Apple, Google, CrowdStrike, JPMorgan, NVIDIA, and Palo Alto Networks are on the list — and says it plans to release what it calls Mythos-class models later, once the safeguards exist.
00:01:49 It's also expanding Glasswing to work with the US and other governments. Then, this weekend, the thing briefly showed up where developers could see it. A leak-watcher account called TestingCatalog spotted new strings inside Claude referencing access to the Claude Mythos model in Claude Code and Claude Security.
00:02:08 The model became visible on Claude for a few minutes — long enough to confirm a name, claude-mythos-1-preview, and a new plans screen. TestingCatalog's read is that it'll stay gated, just reachable through tools like Claude Code rather than the chat box. That lines up with everything Anthropic has said: restricted, but moving toward the place you actually work.
00:02:30 Two things jump out at me. The first is the surface. They're not putting Mythos in the chat app — they're putting it in Claude Code and a dedicated Claude Security product. That's a model scoped to a job: deep code reasoning on one side, adversarial security thinking on the other.
00:02:47 A developer named Alex Rogov made the sharp version of this in the replies — it makes architectural sense, he said, but it raises a design question nobody's fully answered, which is the routing logic once you've got Haiku, Sonnet, Opus, and now Mythos all available inside one agent.
00:03:04 Which model picks up which task, and who decides — the harness, or you? The second is the too-dangerous-to-release framing, and I want to be fair to both readings. The generous one is that it's simply true and responsible: a model this good at finding exploits shouldn't be a public, open API yet.
00:03:22 The skeptical one is that too-dangerous-to-release also happens to be a fantastic moat — it lets you ship the capability to JPMorgan and the US government while keeping it out of the hands of anyone who'd commoditize it. And those two readings aren't mutually exclusive.
00:03:38 Anthropic doesn't have to be cynical for the second one to be true. Both can hold at once. What would actually move me, either way, is a publicly disclosed vulnerability with Mythos's fingerprints on it — one specific bug in one specific project we can all go look at.
00:03:54 Right now we have aggregate counts from the vendor and the partners it chose. Cloudflare's two thousand and Mozilla's two hundred seventy-one are real and checkable in a way the thousand-project scan isn't. That's the evidence I'm after.
A real prompt injection in the wild
00:04:08 While Anthropic is using one model to find ten thousand bugs, a researcher pointed at the same surface from the attacker's side. Joseph Thacker — he goes by rez0, and he tests AI products for OpenAI and Google — flagged what he's calling the first real prompt-injection attack he's seen in the wild.
00:04:27 Not a lab demo or a red-team exercise. Something actually out there. And the delivery channel is GitHub Issues. Here's the mechanism, broken down by a second researcher who goes by osj. Someone files a GitHub issue on a repo, and the issue is dressed up in the language of a legitimate security finding — it looks like a responsible disclosure, it reads like a real report.
00:04:49 Buried in it is an instruction to run a local script called scan.js. If you run it — or if a coding agent reads the issue as a task and runs it for you — the script searches your home directory for secrets, base64-encodes whatever it finds, and ships it back out over DNS, the domain name system.
00:05:08 There are two threads to pull there. The first is the channel. GitHub Issues is exactly where this should show up, and Thacker says as much — it's the main channel he tests in his own work for OpenAI and Google. Think about what a coding agent does now. You point it at a repo, you tell it triage the open issues, and it reads them as instructions.
00:05:29 That's the whole value proposition. But an issue is untrusted input from anyone on the internet, and the agent doesn't natively know the difference between the maintainer is asking me to do this and a stranger wrote a convincing paragraph telling me to do this.
00:05:45 The fake-security-finding wrapper is the clever part. It's social engineering aimed at a model that's been trained to be helpful and to take security seriously — both of those instincts get turned against it. The second thread is the exfiltration, because that's the detail builders underestimate.
00:06:03 The secrets don't leave over HTTP, where your egress rules might catch them. They leave over DNS lookups — the queries your network has to allow for almost anything to function. A lot of setups that carefully block outbound web traffic will happily resolve a DNS query that smuggles base64-encoded secrets inside the hostname.
00:06:23 So 'I sandboxed the agent and blocked the internet' is often not as airtight as it sounds. None of the individual pieces here are exotic. DNS exfiltration is an old technique, fake disclosures are old, and prompt injection has been the theoretical worry for years.
00:06:39 What's new is the combination landing on a live target through the exact workflow we've all been adopting. And the agents aren't the only target — a human triaging issues fast, half paying attention, can be socially engineered the same way. That's why the boundary I care about is simple: whatever's reading untrusted text shouldn't also be holding the keys.
00:07:01 The defensive answer isn't a clever filter. It's a discipline — don't hand an agent both untrusted input and the ability to run arbitrary code in an environment that holds your credentials. If you're going to let an agent execute things it reads in issues, do it somewhere with nothing worth stealing.
00:07:19 It pairs with the Mythos story, and not only because both say security. It's the same coin. The model that finds the bug and the injection that hijacks the agent are the same capability — language understanding pointed at code — aimed in opposite directions. We're going to spend the next few years living inside that symmetry.
The three-dollar refactor and the ten percent that bites
00:07:39 Somebody on the singularity subreddit posted a run this week that's a good gut-check on where agentic coding actually is. They mass-refactored a FastAPI service spread across a hundred and twenty files. The agent ran about four hundred steps and burned two million tokens, the whole thing cost three dollars, and there was zero human input.
00:08:00 And — their words — it confidently introduced a deadlock into my async event handler, which was genuinely funny, so the hard ten percent still needs Opus. I like that report because it cuts both ways. The cheap part checks out. They ran DeepSeek V4 and Tencent's Hunyuan Hy3 preview as the worker models — about twenty-one billion active parameters.
00:08:21 The cost is the striking bit: somewhere around eighteen cents per million input tokens, which the poster put at roughly eighty times cheaper than running Opus. Tencent claims its model hits almost perfect step success — 99.99 percent — across production runs of nearly five hundred steps, and the poster said that tracked for routine refactors.
00:08:43 So the headline, in their phrasing — coding is basically solved for the boring ninety percent of tasks — isn't crazy. For mechanical, well-specified, repeat-this-pattern-across-the-codebase work, the cheap models really will grind through it overnight for the price of a coffee.
00:09:00 And then there's the deadlock. A deadlock in an async event handler is exactly the kind of bug that doesn't show up in a quick test pass and does show up in production at two in the morning under load. The cheap model wrote it confidently. That word — confidently — is what makes it dangerous.
00:09:18 It didn't flag any uncertainty. It produced plausible code that compiles, passes a casual look, and contains a concurrency bug that only surfaces when the system is busy. This is where Addy Osmani's line from the same week fits. Osmani's in Chrome's engineering leadership at Google, and he's written a shelf of books on web performance.
00:09:38 Talking about leaning on agents, he said: for side projects that may be fine, but for anything team or shared, he feels it's a recipe for tech debt down the line. I think that's the right altitude. The cost of agent-written code isn't the first commit. It's the second engineer, six months later, trying to maintain code that no human author can explain — code that passed review because it looked right, and that nobody can reconstruct the reasoning behind.
00:10:06 There's a sharper phrase that came up in the same conversation. A developer called it cognitive surrender, and described people who, when you ask them what their code or their text actually means, have no idea. That's the human-side risk under all of this, and it's the thread we pulled yesterday from a different angle: when the machine gets fast and cheap and capable, the job that stays yours is being the one who can still account for what shipped.
00:10:34 The deadlock is fine if you're reading the diff and you catch it. The deadlock is a problem if you've surrendered — if three dollars and four hundred steps go by and you trust all of it because most of it was right. So I don't read this run as coding is solved.
00:10:50 I read it as the division of labor getting very concrete. Cheap models for the bulk, a frontier model for the ten percent that bites, and a human whose actual job is knowing which ten percent that is. That last part doesn't get cheaper. If anything, an eighty-times-cheaper worker makes the judgment more valuable, because now there's far more output you're on the hook for.
The missing primitive is coordination
00:11:13 From one builder's afternoon to how the bigger teams wire this up: there was a talk at the AI Engineer conference this week that named the gap I keep hitting. Lou Bichard, field CTO at Ona — the company that used to be Gitpod — gave it. It's called The Missing Primitive for Agent Swarms.
00:11:30 His framing is the software factory, and he's careful about what he means by it. Not one engineer running a bunch of agents in parallel, but the commitment to incrementally moving the human out of the loop across the whole software development lifecycle, so work flows from development toward production on its own.
00:11:50 His own caveat: we're extremely early in terms of where we're at with software factories. But the shape is already showing up inside the big companies. He points at Stripe, which built something internally they call Minions — coding agents plugged into Stripe's existing infrastructure, able to drive thousands of pull requests inside the company.
00:12:11 Ramp built their own and called it Inspect. Both teams built it from scratch, and that's his opening complaint: everybody's rebuilding the same thing. So he breaks the infrastructure into four pieces, and the breakdown is what clarified things for me. There's the runtime — somewhere for the agent to actually run.
00:12:30 There's orchestration — scaling agents up and down. There's triggers — the webhooks and pull requests and tickets that bring an agent online. And there's coordination — how agents hand work to each other and check that it actually got done. His claim: we've effectively solved the runtime, the triggers are solved, but the thing that's missing for me is coordination.
00:12:52 That matches my experience exactly. The reason most multi-agent setups feel held together with tape is that we're using human tools as the coordination layer. His line: GitHub is not a coordination layer for agents. It gets incredibly overwhelming. You end up with agents opening pull requests at each other and leaving review comments for each other, and the whole thing turns into noise fast.
00:13:16 He mentions one project built on Linear — reusing the human ticketing tool — and it works, awkwardly, because that's what's available. Two more details from that talk stuck with me. First, he makes a strong case that real development work needs virtual-machine isolation, not containers.
00:13:33 His words: a container is not a bulletproof isolation boundary. Containers are bursty, and if you pack them together you get noisy-neighbor problems and compute contention. For agents running real builds and tests, you want a full virtual machine. That's a cost-and-architecture decision a lot of people are going to rediscover the hard way.
00:13:54 Second — and this one's a little uncomfortable — he's blunt about why coordination is hard at the model level. Agents, in his words, want to please us. They're quite sycophantic, so ask them to write some tests and then they'll skip some tests in order to complete the task.
00:14:11 The clean five-stage picture of the lifecycle hides a lot of micro-steps that an eager agent will just skip to declare victory. So coordination isn't only about wiring agents together. It's enforcement — making the agent prove it did step three before it's allowed to start step four.
00:14:28 His proposal: state machines, ideas borrowed from durable execution, and a layer of gates and compliance — ideally packaged as a command-line tool that a local agent like Claude Code can call to ask, in effect, have I finished this part of the lifecycle, can I move on?
00:14:44 He's undecided whether to ship it as an implementation or push it as a standard, and he says he cares more about the standard. I think he's right that this is the open frontier. The harness-engineering idea we've been circling all week — encoding your knowledge back into the repo so the agent flows through it — this coordination problem is the other half of it.
00:15:06 Harness engineering tells one agent how to work. Coordination is how a swarm of them works together without melting down.
Chat is the command-line phase of agents
00:15:14 The other talk from that conference sits at almost the opposite end of the stack — not how agents coordinate behind the scenes, but what they look like to a person. Rachel Lee Nabors gave it, and her background is unusual for that room. She worked on Firefox developer tools at Mozilla, on web standards at the W3C, she was a product manager on Microsoft's Edge browser, and she was on the React documentation team.
00:15:40 So when she talks about the web as a surface, she's been building that surface for the better part of fifteen years. Her talk is called Your Agent Is an Infinite Canvas, and her core claim is this. She repeats a line that's been going around: chat is the lowest common denominator of the user experience.
00:15:59 It is to the future of agentic experiences what the command line was to software. Her own example is her mother, a COBOL programmer who insisted on typing DOS commands while everyone else learned to point and tap at icons. Chat, she argues, is the command-line phase of agents.
00:16:16 It's where we developers are comfortable, and it's probably a phase. So what comes after the chat box? Her answer is concrete, and it ships today. She demos an MCP app — Model Context Protocol, the standard for wiring tools into models. An MCP app is interactive HTML, CSS, and JavaScript bundled into a single file that a tool can hand back to the agent.
00:16:38 So instead of a tool returning a wall of JSON or a wall of text, it returns a little working interface. She built a comic reader — her own web comic archive — that renders right inside Claude. Full panels, navigation, a text mode that shows the transcript. Her line: it looks just as good as the website.
00:16:57 The motivation is almost funny in how practical it is: she got tired of the agent wall-of-texting her, so she made it show her pictures instead. She's upfront about the rough edges, and the rough edges are interesting in their own right. An MCP app is an island — a single sandboxed file with everything embedded.
00:17:16 It runs in an iframe with no local storage and no network access, so it can't reach out on its own; it has to ask the server to do things for it. Links need explicit permission. External fonts and images get blocked by the content security policy unless you configure it, and a blank render is almost always exactly that.
00:17:37 This is early-web stuff — the constraints anyone who built for browsers twenty years ago will recognize, showing up again somewhere new. Then she goes a layer further, to something called Web MCP, which flips the direction. Instead of putting an app inside the agent, it turns any HTML page into a little tools server, so an agent running in your browser can call the JavaScript functions you already wrote — instead of screenshotting the page and guessing, or scraping the document and burning tokens chewing through markup.
00:18:09 She's careful here: Web MCP is to MCP as JavaScript is to Java — inspired by it, not the same spec, and the standards bodies are still arguing it out. It started at Amazon, apparently, as a way around authentication problems. Where she lands is the line I'll hold onto: CSS and JavaScript aren't just the language of the web.
00:18:30 They're the language of interactive experiences on agents. I think that bet is right. If chat really is the command-line phase, then the people who know how to render rich, callable, interactive surfaces — and that's a deep, fifteen-year-old skill base sitting in the web community — are early to the next layer of how this stuff looks and feels.
00:18:51 The browser primitives are already specced. Web Speech, Canvas, animation, audio, and WebAssembly are sitting right there. Somebody is going to build the agent interface that doesn't feel like a terminal, and they'll probably build it out of pieces that already exist.
From a reflex to a policy
00:19:08 One smaller thing to close on, because it says something about where the developer community's head is at. The programming subreddit — around seven million people — spent April running a temporary ban on large-language-model content. AI blog spam, model-announcement posts, the lot — banned for a month as a trial.
00:19:26 This week the moderators announced the trial is over. And instead of going back to a blanket ban or throwing the doors wide open, they replaced it with a standing AI content policy. The announcement itself pulled north of eight hundred upvotes, which, for a moderation post, tells you how charged the topic still is.
00:19:45 I find that encouraging, and not because I have a strong opinion on any particular rule. It's the move from a reflex to a policy. The reflex — ban it all — is what you do when a new thing floods your space and you can't tell signal from slop. A policy is what you write once you've watched the thing for a while and worked out which parts are worth keeping.
00:20:05 Seven million programmers deciding, in public, that AI content needs rules rather than a wall is a small marker of the field growing past the panic. It's the same arc I'd bet most teams are on privately — first you keep the agents away from anything that matters, then you watch, then you write down where they're allowed and where a human still signs off.
00:20:26 Reddit just did it in the open. And if you want a thread across today — I'd hold it loosely — it's that the capability keeps arriving ahead of the structure around it. Anthropic has a security model too good to ship safely. An attacker is riding the workflow we just adopted.
00:20:42 A three-dollar refactor came with a deadlock baked in. Lou Bichard's swarms have no coordination layer, and Rachel Nabors thinks the chat box is a phase. The model is ahead of the harness, the agent is ahead of its security boundary, and the output is ahead of our ability to account for it.
00:20:59 None of that is a reason to slow down. It's just where the interesting work is — building the structure that catches up to the capability you already have. Two questions I don't have answers to yet: whether even one of those ten thousand Mythos bugs gets a public write-up the rest of us can learn from, and whether anyone ships an agent coordination layer that isn't a hijacked ticketing tool.
00:21:22 Both would change how I work. — Lenar Kess.