Archive BRAID
The Cost of Finding Out / DISPATCH 026
PDF RSS

Dispatch 026 · 2026-05-14 GSV Throw It All Away And See

The Cost of Finding Out

/ 00:26:06 / 15 sources

“The custom syntax was a tax the Wasp team was charging itself.”

— Lenar Kess, today's narration

Anthropic drew two lines around Claude this week — a guided lane for small-business owners and a metered one for the developers running agents hardest. From there: Bun's near-million-line port from Zig to Rust, mostly typed by an AI agent in a week; Wasp's clear-eyed post-mortem on spending five years and five million dollars building a language it didn't need; a chess coach that works by refusing to let the model think; the UK's evaluators capping their own cyber tests so the math still works; the open web pricing out crawlers; multi-token prediction landing in llama.cpp; and what happens when you post a real Monet and call it AI.

Chapters

  1. 00:00:04 Anthropic draws two lines
  2. 00:04:21 Bun, ported to Rust by a bot in a week
  3. 00:08:15 Wasp: the language was never the moat
  4. 00:12:04 The chess coach that isn't allowed to think
  5. 00:15:33 Autonomous cyber, measured against itself
  6. 00:18:23 The web pulls up its drawbridge
  7. 00:21:09 Multi-token prediction on your laptop
  8. 00:23:41 The Monet test

Sources

15 cited
  1. 1

    Introducing Claude for Small Business

    Article Anthropic — Anthropic's official announcement; quotes co-founder and president Daniela Amodei

    People run the business, and Claude helps take the late-night work off their plates.

    www.anthropic.com/news/claude-for-small-bus… →
    Details
    Cited text
    People run the business, and Claude helps take the late-night work off their plates.
    Context
    A distribution play aimed at non-technical owners — the opposite end of Anthropic's user base from the developers running agent fleets, and the same week Anthropic re-metered programmatic use.
    Key points
    • Claude for Small Business is a toggle install inside Claude Cowork that connects Claude to QuickBooks, PayPal, HubSpot, Canva, Docusign, Google Workspace and Microsoft 365.
    • Ships with 15 ready-to-run agentic workflows and 15 skills across finance, ops, sales, marketing, HR and customer service; user approves before anything sends, posts or pays.
    • Framed as an adoption gap-closer: small businesses are ~44% of US GDP and employ nearly half the private-sector workforce, but AI use 'often stops at the chat window.'
    • Comes with a 10-city AI fluency road tour, a free PayPal-partnered course, and Claude credits routed through community development financial institutions.
    • Existing user permissions carry through; Anthropic says it doesn't train on customer data by default on Team and Enterprise plans.
    Provenance
    Article · Supporting source
  2. 2

    "It's official. Anthropic pulled the plug on all programmatic use of Claude subscription."

    Thread r/Anthropic — posted by No_Wheel_9336

    What is this, the 6th u-turn? Don't worry they'll change their mind again by next week.

    www.reddit.com/r/Anthropic/comments/1tcccar… →
    Details
    Cited text
    What is this, the 6th u-turn? Don't worry they'll change their mind again by next week.
    Context
    The community signal underneath the policy change: builders care less about the price than about being able to plan against a stable pricing page.
    Key points
    • ~780-upvote thread reacting to a screenshot of Anthropic restricting programmatic use of Claude subscriptions.
    • Top comment (Chronicles010) posts a workaround: run real Claude Code in tmux sessions driven by sendkeys and hooks instead of claude -p.
    • Commenter dbbk calls it the '6th u-turn,' capturing frustration at repeated pricing reversals.
    • Headline overshoots — the change is a metering shift, not a total ban — but the churn itself is the complaint.
    • Shows developers will route around the meter quickly once it pinches.
    Engagement
    781 likes · 275 replies
    Provenance
    Thread · Primary source
  3. 3

    Anthropic reinstates OpenClaw and third-party agent usage on Claude subscriptions — with a catch

    Article VentureBeat — Trade-press reporting on the mechanics of Anthropic's Agent SDK credit change

    Anyone who shells out to claude -p in a loop or runs third-party agents on a subscription has a new, separate cost model as of June 15 — worth designing around now.

    venturebeat.com/technology/anthropic-reinst… →
    Details
    Context
    Anyone who shells out to claude -p in a loop or runs third-party agents on a subscription has a new, separate cost model as of June 15 — worth designing around now.
    Key points
    • Starting June 15, 2026, Agent SDK and claude -p usage on subscription plans draws from a separate monthly 'Agent SDK credit,' not from interactive usage limits.
    • The credit is billed at API rates, worth roughly $20–$200 depending on plan, and does not roll over month to month.
    • If the credit is exhausted, you cannot fall back on general subscription limits — you must buy additional usage credits.
    • The credit covers the Claude Agent SDK, claude -p, Claude Code GitHub Actions, and third-party apps built on the Agent SDK such as OpenClaw.
    • Follows an April policy that briefly blocked third-party agents on subscriptions entirely.
    Provenance
    Article · Supporting source
  4. 4

    Rewrite Bun in Rust by Jarred-Sumner · Pull Request #30412 · oven-sh/bun

    Source Jarred Sumner — Creator of Bun, the Zig-based JavaScript runtime; Bun was acquired by Anthropic in late 2025

    we now have compiler-assisted tools for catching & preventing memory bugs, which have costed the team an enormous amount of development & debugging time over the years.

    github.com/oven-sh/bun/pull/30412 →
    Details
    Cited text
    we now have compiler-assisted tools for catching & preventing memory bugs, which have costed the team an enormous amount of development & debugging time over the years.
    Context
    A near-million-line systems rewrite, largely AI-generated, that progressed from porting guide to passing-canary in roughly a week — the cost of attempting a rewrite at this scale has collapsed.
    Key points
    • Pull request porting Bun from Zig to Rust; claims it passes Bun's existing test suite on all platforms.
    • Reports it fixes several memory leaks and flaky tests, shrinks the binary 3–8 MB, and benchmarks neutral-to-faster.
    • Same architecture and data structures as the Zig version; few third-party libraries; no async Rust.
    • Stated motivation is compiler-assisted memory-safety tooling, not performance.
    • Available to try via 'bun upgrade --canary'; still described as needing optimization and cleanup work.
    Provenance
    Source · Background source
  5. 5

    Armin Ronacher on the Bun Rust rewrite

    X mitsuhiko (Armin Ronacher) — Creator of Flask and a widely-followed voice on systems and tooling; reposted by Mario Zechner

    Say what you want: this is impressive.

    x.com/mitsuhiko/status/2054865717007089974 →
    Details
    Cited text
    Say what you want: this is impressive.
    Context
    A credibility marker — when the Flask author calls an AI-generated runtime port impressive, it stops being a curiosity and becomes a data point.
    Key points
    • Ronacher amplified PR #30412 with a short endorsement, helping push it into developer feeds.
    • Signals that respected systems people are taking the AI-driven port seriously, not dismissing it.
    • Reposted by Mario Zechner, broadening reach across the tooling community.
    Provenance
    Tweet · Primary source
  6. 6

    Anthropic's Bun team trials port from Zig to Rust

    Article DevClass — Developer trade press; reporting on the origin and status of the Bun port

    we haven't committed to rewriting. There's a very high chance all this code gets thrown out completely.

    www.devclass.com/software/2026/05/11/anthro… →
    Details
    Cited text
    we haven't committed to rewriting. There's a very high chance all this code gets thrown out completely.
    Context
    Grounds the Bun story in what's actually known versus claimed — and surfaces the tension between a passing canary build and a maintainer who won't commit.
    Key points
    • Jarred Sumner committed a Zig-to-Rust porting guide (~300 rules) with a two-phase plan: Phase A translate logic without compiling, Phase B make it build crate-by-crate.
    • Claude-powered agents did the bulk of the port; the working branch is named claude/phase-a-port and holds ~966,000 lines of generated Rust.
    • Sumner publicly downplayed it on Hacker News, calling the discourse an overreaction and saying the code may be thrown out entirely.
    • Anthropic acquired Bun in late 2025 and uses it inside Claude Code; Zig has a stated no-AI-contributions policy.
    • Theo Browne, reading the diff, reported roughly 13,000 unsafe blocks remaining in the ported code.
    Provenance
    Article · Supporting source
  7. 7

    5 Years and $5M Later: Inventing a New Programming Language for Web Development Was a Mistake

    Article Matija Sosic — Co-founder of Wasp, a full-stack JS web framework; built it with his twin brother after Y Combinator in 2021

    Language was never the moat. It's having a high-level understanding of your entire app at compile time.

    wasp.sh/blog/2026/05/13/new-language-for-we… →
    Details
    Cited text
    Language was never the moat. It's having a high-level understanding of your entire app at compile time.
    Context
    A rare, specific post-mortem on the multi-year tail cost of leaving the paved road — tooling, onboarding friction, positioning damage — that never shows up in a design doc.
    Key points
    • Wasp is replacing its custom DSL and compiler with a TypeScript SDK after five years and $5M raised.
    • Positioning cost: the 'wasp-lang' name made developers think it aimed to replace JavaScript; a GitHub 'Haskell: 90%' bar reinforced the wrong story.
    • Tooling cost was decisive — building IDE/editor support (language server, VS Code extension) for a custom language only reached ~80% of the bar, and the JS ecosystem assumes standard TS.
    • Key realization: users were excited about the high-level app specification, not the bespoke syntax — the two had been conflated.
    • Switching to TypeScript keeps the compiler internals unchanged; it only swaps the 'front end' of how the spec is written.
    • Argues structured, opinionated specs help AI agents produce more reviewable code — but that never required a new language.
    Provenance
    Article · Supporting source
  8. 8

    Building a Chess Coach — Anant Dole and Asbjørn Steinskog, Take Take Take

    Video Anant Dole and Asbjørn Steinskog (Play Magnus) — Engineers at Play Magnus, Magnus Carlsen's chess company; talk given at the AI Engineer conference

    the LLM's job is only to translate this information into English, because we really don't want it to try to figure out too much on its own

    www.youtube.com/watch?v=FlzpEGHNVKQ →
    Details
    Cited text
    the LLM's job is only to translate this information into English, because we really don't want it to try to figure out too much on its own
    Context
    A clean template for low-latency model products: identify the part of the job that's calculation, give it to something that calculates, and reserve the model for phrasing.
    Key points
    • LLMs are unreliable at chess, so Play Magnus's coach never lets the model reason about positions.
    • Pipeline: Stockfish for ground-truth best move, a battery of tactical/positional detectors, and Maia (University of Toronto) to predict the move a human at a given rating would actually play.
    • All structured context is handed to the model whose only job is to translate it into English — every claim is grounded in an engine's output.
    • Using Gemini 3 Flash, end-to-end commentary lands in ~3 seconds; Claude on higher thinking effort scored lower (~60% vs ~75% of eval scenarios) and was slower.
    • 16 eval scenarios, model-as-judge, OpenRouter for fast model swapping.
    • A separate feedback loop injects user-flagged commentary into a running Claude Code session via a Model Context Protocol server; it triages, edits detectors, regenerates, and asks the engineer to approve on Slack/mobile.
    Provenance
    Video · Supporting source
  9. 9

    How fast is autonomous AI cyber capability advancing?

    Article UK AI Security Institute — The UK government's AI Security Institute, which runs independent capability evaluations of frontier models

    success rates are so high that time horizons become impossible to calculate

    www.aisi.gov.uk/blog/how-fast-is-autonomous… →
    Details
    Cited text
    success rates are so high that time horizons become impossible to calculate
    Context
    The capability curve is steep enough that independent evaluators are openly capping their own tests to keep the math tractable — the gap between capability and measurable capability is the story.
    Key points
    • AISI estimates the length of cyber task a frontier model can complete autonomously is now doubling roughly every 4.7 months — up from a previous estimate of every 8 months.
    • A newer Claude Mythos Preview checkpoint solved AISI's 'The Last Ones' range 6/10 and 'Cooling Tower' 3/10 — the first model to complete the second range at all.
    • GPT-5.5 solved 'The Last Ones' 3/10 and did not complete the second range.
    • Tasks are capped at 2.5M tokens; without the cap, success rates are so high that time horizons can't be calculated — the measurement instrument saturates.
    • AISI explicitly does not claim to know how the pace evolves, when thresholds get crossed, or how capabilities translate against defended real-world systems (vs. cyber ranges).
    Provenance
    Article · Supporting source
  10. 10

    New Mythos checkpoint shows continued improvement

    Thread r/singularity — posted by Tinac4

    Raises an unconfirmed but worth-watching thread: that capability evaluation may be running behind deployment, with the next checkpoint live before the last one clears review.

    www.reddit.com/r/singularity/comments/1tc9d… →
    Details
    Context
    Raises an unconfirmed but worth-watching thread: that capability evaluation may be running behind deployment, with the next checkpoint live before the last one clears review.
    Key points
    • Thread surfacing the AISI cyber-capability post and its Mythos checkpoint results.
    • Commenter FateOfMuffins, citing Anthropic's Logan Graham, says the tested checkpoint appears to be the one already deployed under Project Glasswing — meaning safety evals lag deployment.
    • Notes AISI used a stripped-down harness and a 2.5M-token cap because a fuller harness would saturate the task suite.
    • OP added an edit flagging the title may be misleading about which checkpoint was tested.
    Engagement
    361 likes · 62 replies
    Provenance
    Thread · Primary source
  11. 11

    Web-Search is coming to a screeching halt as Google shuts its free index and Cloudflare challenges AI bots

    Thread r/LocalLLaMA — posted by NetTechMan

    Google is reinforcing their mote by pulling up the drawbridge for aggressive pricing.

    www.reddit.com/r/LocalLLaMA/comments/1tcabo… →
    Details
    Cited text
    Google is reinforcing their mote by pulling up the drawbridge for aggressive pricing.
    Context
    Retrieval-augmented and research agents lean on cheap search APIs and arbitrary page fetches — both legs are getting more expensive and less reliable over the next year.
    Key points
    • Frames two converging changes: Google capping free full-web search and Cloudflare defaulting to challenge AI bots.
    • Reports harnesses now hitting 400-level errors from site after site as scraping defenses tighten.
    • Top comment (__JockY__): search providers see a flood of bot queries with no human eyes and no ad revenue, so they're shutting an unmonetized firehose.
    • Commenter points to YaCy, a 20-year-old peer-to-peer open-source search engine, as a possible decentralized answer.
    • OP argues an open, agent-usable search index is the next big 'open' gap to fill.
    Engagement
    324 likes · 195 replies
    Provenance
    Thread · Primary source
  12. 12

    Google Ends Free Web Search for Programmable Search Engine

    Article WinBuzzer — Tech news outlet reporting on Google's Programmable Search Engine policy change

    Confirms the primary fact behind the Reddit thread: the free programmatic web-search index that countless tools quietly depend on has a hard end date.

    winbuzzer.com/2026/01/23/google-ends-free-w… →
    Details
    Context
    Confirms the primary fact behind the Reddit thread: the free programmatic web-search index that countless tools quietly depend on has a hard end date.
    Key points
    • Google is ending free full-web search through Programmable Search Engine (formerly Custom Search) and capping the free tier at 50 domains.
    • The change applies to new engines immediately; existing full-web engines must migrate by January 1, 2027.
    • Pricing for the paid full-web option is not public — access requires registering interest through a form.
    • Custom Search JSON API users face the same cap; Bing has been tightening its search API on a parallel timeline.
    • Suggested migration paths include Vertex AI Search, Algolia, and Elasticsearch.
    Provenance
    Article · Supporting source
  13. 13

    Multi-Token Prediction (MTP) for Qwen on LLaMA.cpp + TurboQuant

    Thread r/LocalLLaMA — posted by gladkos

    +40% performance! 90% acceptance rate.

    www.reddit.com/r/LocalLLaMA/comments/1tckzy… →
    Details
    Cited text
    +40% performance! 90% acceptance rate.
    Context
    The multi-token prediction gain is real and stackable for local inference; the TurboQuant claim bundled with it is contested — a reminder to read the replies before flashing anything.
    Key points
    • A patched llama.cpp build adds multi-token prediction for Qwen models, reporting a Qwen 3.6 27B model going from 21 to 34 tokens/sec on a MacBook Pro M5 Max.
    • Reported 90% acceptance rate on speculated tokens.
    • The post bundles MTP with 'TurboQuant,' a quantization method — and the comments push back on TurboQuant specifically.
    • Commenter nickm_27 says TurboQuant is actually slower than standard 16-bit and 4-bit builds.
    • Commenter havenoammo notes a TurboQuant pull request to llama.cpp was rejected because it didn't beat existing Q4 rotation work and only helped at very aggressive quantization where quality suffers.
    Engagement
    207 likes · 60 replies
    Provenance
    Thread · Primary source
  14. 14

    "What happens when you post a real Monet and say it's AI?"

    X Jediwolf — Reshared an art social experiment run by the account SHL0MS

    What happens when you post a real Monet and say it's AI? The coolest art social experiment I've seen in a while.

    x.com/Jediwolf/status/2054776716770320631 →
    Details
    Cited text
    What happens when you post a real Monet and say it's AI? The coolest art social experiment I've seen in a while.
    Context
    A clean demonstration that the AI-tell detector everyone has trained in three years fires confident false positives — relevant to anyone whose work might be judged on suspicion of process.
    Key points
    • An artist (SHL0MS) posted a genuine Monet painting labelled as AI-generated to see the reaction.
    • Commenters lined up to call it 'slop' — soft, mushy, obviously fake — while looking at an actual Monet.
    • Demonstrates that much of the 'this is AI' reaction tracks the label, not the pixels.
    • Marc Andreessen replied to the original with a single '😳'.
    Provenance
    Tweet · Primary source
  15. 15

    Cristóbal Valenzuela on Monet as 'slop'

    X c_valenzuelab (Cristóbal Valenzuela) — Co-founder and CEO of Runway, an AI video-generation company

    Monet was probably one of the biggest slop painters of his era and, by all means, managed to help transform the entire art world.

    x.com/c_valenzuelab/status/2054908905529159… →
    Details
    Cited text
    Monet was probably one of the biggest slop painters of his era and, by all means, managed to help transform the entire art world.
    Context
    Reframes the slop reflex historically — taste, not a tell-detector, is what separated lasting work from noise then, and still does.
    Key points
    • Valenzuela responded to the Monet experiment by noting Impressionism was itself derided in its own era.
    • The word 'impressionism' began as a term of contempt for work that didn't look like 'real' painting.
    • Coming from the CEO of an AI video company, the point carries standing rather than defensiveness.
    Provenance
    Tweet · Primary source