Archive BRAID
Two bets on AGI, an 80-year-old problem, and Anthropic in the black / DISPATCH 033
PDF RSS

Dispatch 033 · 2026-05-21 GCU Everything We Could Imagine Works

Two bets on AGI, an 80-year-old problem, and Anthropic in the black

/ 00:22:29 / 14 sources

“Models are doing things this week that looked impossible a year ago, and they still can't reliably hold a negation across a paragraph. That gap is the whole game.”

— Lenar Kess, today's narration

Google's I/O keynote is a day behind us, and the week it kicked off turned into a referendum on two very different bets on artificial general intelligence — plus a pile of counter-programming from everyone else. Today: OpenAI cracking an 80-year-old math problem with a general-purpose model, Anthropic's first profitable quarter and what Karpathy was actually hired to do, a 70-page paper on why frontier models still can't tell a fact from a labeled lie, Midjourney's hardware regret, ads arriving inside Google's AI answers, Meta's layoffs, Cohere's open-weights comeback, and a field guide to skilling up coding agents.

Chapters

  1. 00:00:04 Two bets on the same finish line
  2. 00:03:31 OpenAI cracks an 80-year-old problem
  3. 00:05:57 Anthropic in the black, and Karpathy's bet
  4. 00:08:16 Jagged intelligence, and the false story
  5. 00:10:42 Midjourney's hardware regret
  6. 00:12:46 Ads come to AI Mode
  7. 00:14:43 Meta's eight thousand
  8. 00:16:14 Cohere comes back, Apache-licensed
  9. 00:18:27 Skilling up the agent
  10. 00:20:59 Who's training whom

Sources

14 cited
  1. 1

    Two Rival Bets on AGI: Google I/O Highlights

    Video AI Explained — Independent AI analyst known for the Simple Bench common-sense benchmark and close reading of lab releases

    Google wants the search box to be your portal for using all things AI, while OpenAI wants the chat box to be your portal for using search.

    www.youtube.com/watch?v=o_av1b9rs2g →
    Details
    Cited text
    Google wants the search box to be your portal for using all things AI, while OpenAI wants the chat box to be your portal for using search.
    Context
    The clearest framing available of what I/O actually signaled — a strategic divergence in how the two leading labs think AGI is reached, and where each is choosing to compete.
    Key points
    • I/O read as Google's pitch to win consumers from OpenAI via the search bar, not to pull professional developers off Claude; Google didn't claim a coding frontier.
    • Two AGI theses on display: Demis Hassabis bets on world models (Gemini Omni, simulation = understanding); Greg Brockman bets on the text-reasoning tree ('everything we could imagine works', 'we have line of sight').
    • OpenAI called Sora the stepping stone to AGI in 2024; the Sora app is now shelved and the tech folded into internal robotics, while Google picks up the world-model thesis.
    • Sundar Pichai pitched enterprises on saving billions by switching to cheaper Gemini 3.5 Flash — a volume bet, not a frontier claim.
    • Google and OpenAI also converge: OpenAI adopting Google's SynthID watermark; both now signed Pentagon lawful-use contracts Anthropic had resisted.
    Provenance
    Video · Supporting source
  2. 2

    OpenAI: breakthrough on the planar unit distance problem

    X OpenAI

    This marks the first time AI has autonomously solved a prominent open problem central to a field of mathematics.

    x.com/OpenAI/status/2057176201782075690 →
    Details
    Cited text
    This marks the first time AI has autonomously solved a prominent open problem central to a field of mathematics.
    Context
    A general-purpose model — the kind already callable from an API — producing a novel math result is a different signal than a purpose-built prover, and it landed squarely in I/O week as counter-programming.
    Key points
    • OpenAI says a model produced a new result on the planar unit distance problem, posed by Paul Erdős in 1946.
    • For ~80 years the best constructions looked roughly like square grids; the model found a new family of constructions that does better.
    • OpenAI stresses the proof came from a general-purpose reasoning model, not a system built to solve math or this problem specifically.
    • Closing framing: 'Expertise becomes more valuable, not less. AI can help search, suggest, and verify. People choose the problems that matter.'
    Engagement
    4042 likes · 758 retweets · 231 replies
    Provenance
    Tweet · Primary source
  3. 3

    A model disproves a discrete geometry conjecture

    Article OpenAI

    The proof came from a general-purpose reasoning model, not a system built specifically to solve math problems or this problem in particular.

    openai.com/index/model-disproves-discrete-g… →
    Details
    Cited text
    The proof came from a general-purpose reasoning model, not a system built specifically to solve math problems or this problem in particular.
    Context
    The primary source for the claim; its own hedges ('expertise becomes more valuable, not less') are more measured than the discourse around it.
    Key points
    • Primary write-up and PDF proof for the planar unit distance result.
    • Frames the result as evidence models can hold long reasoning chains and connect distant fields.
    • Predicts the same abilities will accelerate biology, physics, engineering, and medicine — while insisting human judgment still chooses the problems.
    • Result is OpenAI's own announcement; independent verification by the math community is still pending.
    Provenance
    Article · Supporting source
  4. 4

    OpenAI general purpose model breakthrough on 80-year-old Erdős problem

    Source r/singularity (socoolandawesome)

    Wait until it accelerates work in AI research.

    www.reddit.com/r/singularity/comments/1tiwa… →
    Details
    Cited text
    Wait until it accelerates work in AI research.
    Context
    Shows how the builder/enthusiast community read the result in real time — and points straight at the recursive-self-improvement bet Anthropic just hired for.
    Key points
    • Reproduces OpenAI's full tweet text including the 'expertise becomes more valuable' passage.
    • Top comments capture the reflexive goalpost-moving debate and the recursive-self-improvement implication.
    • Links the blog post, the PDF proof, and an abridged version of the model's chain of thought.
    Engagement
    493 likes · 114 replies
    Provenance
    Source · Background source
  5. 5

    Anthropic set to hit $10.9 billion in revenue during second quarter

    Article CNBC — Reporting on figures the Wall Street Journal cited from Anthropic's ongoing funding round

    Anthropic is projected to post its first operating profit of about $559M in Q2 2026, on revenue of $10.9B, up from $4.8B in Q1.

    www.cnbc.com/2026/05/20/anthropic-revenue-e… →
    Details
    Cited text
    Anthropic is projected to post its first operating profit of about $559M in Q2 2026, on revenue of $10.9B, up from $4.8B in Q1.
    Context
    A frontier lab printing an operating profit at all cuts against the 'these places only burn money' story — and it funds the recursive-self-improvement bet Karpathy was hired for.
    Key points
    • Projected first operating profit of ~$559 million in Q2 2026.
    • Revenue projected to more than double quarter over quarter, from $4.8B to $10.9B.
    • The operating-profit figure includes model-training costs but excludes stock-based compensation.
    • Anthropic may not stay profitable across the full year given planned compute and training spend.
    Provenance
    Article · Supporting source
  6. 6

    Anthropic officially set to be profitable as of Q2 2026

    Source r/singularity (exordin26)

    Useful read on how the milestone is being received and on the Google capital relationship sitting behind Anthropic's numbers.

    www.reddit.com/r/singularity/comments/1tj07… →
    Details
    Context
    Useful read on how the milestone is being received and on the Google capital relationship sitting behind Anthropic's numbers.
    Key points
    • Community thread on the WSJ profitability report.
    • Top comment ties it to Google reportedly investing $40B more at a ~$330B valuation.
    • Captures the running argument with skeptics who predicted labs could never be profitable.
    Engagement
    526 likes · 150 replies
    Provenance
    Source · Background source
  7. 7

    Midjourney says research was set back a year by using TPU, regrets not sticking with Nvidia

    Source r/singularity (Charuru)

    He is hinting at infrastructure friction caused by mixing stacks, not flat out "TPUs are shit."

    www.reddit.com/r/singularity/comments/1tiut… →
    Details
    Cited text
    He is hinting at infrastructure friction caused by mixing stacks, not flat out "TPUs are shit."
    Context
    A concrete counterweight to Google's TPU pitch at I/O: the cost of a less-supported accelerator is usually the tooling tax, not the chip.
    Key points
    • Screenshot circulating that Midjourney's founder felt TPUs set their research back ~a year and regrets not staying on Nvidia.
    • Most-upvoted reply reframes it as friction from mixing two hardware stacks, not an absolute knock on TPUs.
    • Primary quote not independently located; treat as community-circulated until confirmed.
    Engagement
    534 likes · 58 replies
    Provenance
    Source · Background source
  8. 8

    Midjourney — infrastructure rewrite from TPU to GPU-native

    Article Wikipedia

    Grounds the Reddit claim in a verifiable fact: the TPU-to-GPU migration happened, whatever the founder's exact words were.

    en.wikipedia.org/wiki/Midjourney →
    Details
    Context
    Grounds the Reddit claim in a verifiable fact: the TPU-to-GPU migration happened, whatever the founder's exact words were.
    Key points
    • Midjourney shipped V8 in early 2026 after rewriting its codebase from scratch.
    • The rewrite migrated from TPUs to a GPU-native architecture built on PyTorch.
    • Corroborates that Midjourney did move off TPUs, independent of the disputed quote.
    Provenance
    Article · Supporting source
  9. 9

    A new generation of ads for the AI era of Search

    Article Google (Keyword Team)

    Our Gemini model evaluates and synthesizes information about a product or service, and displays that context alongside the advertiser's creative.

    blog.google/products/ads-commerce/google-ma… →
    Details
    Cited text
    Our Gemini model evaluates and synthesizes information about a product or service, and displays that context alongside the advertiser's creative.
    Context
    This is the business model under the consumer bet: the model that answers your question is now also the one running the ad beside the answer, in the same paragraph.
    Key points
    • Ads are coming to AI Mode: Conversational Discovery ads and Highlighted Answers, both built with Gemini.
    • AI-powered Shopping ads write a custom explainer for why a product fits you; a Business Agent for Leads puts a chat agent inside the ad.
    • Google adds an 'independent AI explainer' next to ads, labeled Sponsored, and frames it as transparency.
    • Direct Offers expands with promotion bundling, native checkout for Universal Commerce Protocol merchants, and travel deals via Booking/Expedia.
    • Cites a Google-commissioned Ipsos survey that 75% of people make faster, more confident decisions using AI Mode.
    Provenance
    Article · Supporting source
  10. 10

    Meta kicks off major layoffs with 8,000 cuts in shift to AI

    Article New York Post

    The companywide purge is taking place in three massive waves, as employees across the world are notified in emails at 4 a.m. local time.

    nypost.com/2026/05/20/business/meta-kicks-o… →
    Details
    Cited text
    The companywide purge is taking place in three massive waves, as employees across the world are notified in emails at 4 a.m. local time.
    Context
    The cost side of the AI transition that doesn't make the keynote slide — running on the same clock as the profitable quarters and the breakthroughs.
    Key points
    • Meta cutting about 8,000 jobs, roughly 10% of its workforce.
    • Notifications delivered in regional waves via early-morning emails; Singapore staff among the first.
    • Framed by Meta as a restructuring tied to its AI shift.
    Engagement
    1039 likes · 202 replies
    Provenance
    Article · Supporting source
  11. 11

    Cohere launches Command A+ (first MoE, Apache 2.0)

    Source Nick Frosst (Cohere co-founder) — Cohere co-founder, posting directly to r/LocalLLaMA

    Just total, near unfettered access to a pretty awesome model.

    www.reddit.com/r/LocalLLaMA/comments/1tizma… →
    Details
    Cited text
    Just total, near unfettered access to a pretty awesome model.
    Context
    A capable open-weights model that fits on local hardware under a permissive license widens what a small team can build — the opposite of a closed launch that narrows access.
    Key points
    • Command A+ is Cohere's first mixture-of-experts model, positioned as fast and responsive for its category.
    • Heavy quantization work means it runs well on one or two GPUs; released under Apache 2.0.
    • Frosst is candid that top-line performance still has work to do, and credits the open-source community for keeping Cohere innovative.
    • Community goodwill: replies recall the original Command R+ fondly and ask when quantized builds land.
    Engagement
    395 likes · 69 replies
    Provenance
    Source · Background source
  12. 12

    Hugging Face benchmark datasets now filter by model size

    Source r/LocalLLaMA (paf1138)

    A small quality-of-life change that saves real time when picking a sub-32-billion-parameter model instead of eyeballing leaderboards dominated by models you can't run.

    huggingface.co/datasets?benchmark=benchmark… →
    Details
    Context
    A small quality-of-life change that saves real time when picking a sub-32-billion-parameter model instead of eyeballing leaderboards dominated by models you can't run.
    Key points
    • Hugging Face benchmark datasets can now be filtered by model size.
    • Lets you ask directly which model under a given parameter count leads on a benchmark like SWE-bench Verified.
    • Useful for teams choosing among models they can actually run locally.
    Provenance
    Source · Background source
  13. 13

    Skill issue: Lessons from skilling up coding agents to use Langfuse

    Video Marc Klingen (Langfuse) — Co-founder of Langfuse, the largest open-source LLM observability/tracing project, speaking at the AI Engineer conference

    Looking at traces still gets you to like 80% of the detail.

    www.youtube.com/watch?v=vNCY9kXXyDQ →
    Details
    Cited text
    Looking at traces still gets you to like 80% of the detail.
    Context
    The most concrete field guide going on the mechanics of agent skills — directly copyable by anyone who maintains a tool, SDK, or internal platform.
    Key points
    • Asked to add tracing, Claude Code writes the integration from stale pre-training memory, ships it broken, then fetches current docs to fix it.
    • Fixes: watch real execution traces to find where the agent wanders; advertise the CLI --help flag so the agent asks the tool what it can do.
    • Give the agent a docs sitemap up front and serve markdown instead of HTML to avoid wasted tokens.
    • Wrap a docs Q&A/RAG system as a natural-language search endpoint the agent can query — and you get to see what agents search for, revealing thin docs.
    • Skills are a formalized shortcut between rigid workflows and fully autonomous agents: the agent pulls in just the context it needs, when it needs it.
    Provenance
    Video · Supporting source
  14. 14

    Every office employee is training their own replacement

    Source r/singularity (Excellent_Box_8216)

    They're just collecting workflows, emails, decisions, prompts, and habits until the system can replace people one by one.

    www.reddit.com/r/singularity/comments/1tjgm… →
    Details
    Cited text
    They're just collecting workflows, emails, decisions, prompts, and habits until the system can replace people one by one.
    Context
    Captures the anxiety running under the week's optimism — the human counterpart to Meta's cuts and Anthropic's profit, framed as a question every engineer is quietly asking.
    Key points
    • Widely-upvoted worry that AI-at-work mandates are really workflow-harvesting for eventual replacement.
    • Strong version worth taking seriously; literal version overstates how coordinated most companies are.
    • The real shift: the gap between people who use the tools well and those who don't is becoming the line that matters.
    Engagement
    552 likes · 219 replies
    Provenance
    Source · Background source