◆ Dispatch 032 · 2026-05-21 GCU Everything We Could Imagine Works

Two bets on AGI, an 80-year-old problem, and Anthropic in the black

2026-05-21 / 00:22:29 / 14 sources

“Models are doing things this week that looked impossible a year ago, and they still can't reliably hold a negation across a paragraph. That gap is the whole game.”
— Lenar Kess, today's narration

Google's I/O keynote is a day behind us, and the week it kicked off turned into a referendum on two very different bets on artificial general intelligence — plus a pile of counter-programming from everyone else. Today: OpenAI cracking an 80-year-old math problem with a general-purpose model, Anthropic's first profitable quarter and what Karpathy was actually hired to do, a 70-page paper on why frontier models still can't tell a fact from a labeled lie, Midjourney's hardware regret, ads arriving inside Google's AI answers, Meta's layoffs, Cohere's open-weights comeback, and a field guide to skilling up coding agents.

Two bets on the same finish line — Google's world-model road vs OpenAI's text-reasoning road, in the labs' own words.
OpenAI cracks an 80-year-old problem — the planar unit distance result from a general-purpose reasoning model.
Anthropic in the black, and Karpathy's bet — ~$559M operating profit and a hire aimed at recursive self-improvement.
Jagged intelligence, and the false story — the paper where models believe a story they were told a thousand times was fake.
Midjourney's hardware regret — the tooling tax of betting on the less-supported accelerator.
Ads come to AI Mode — the business model under the consumer bet.
Meta's eight thousand — the cost side, on the same clock as the wins.
Cohere comes back, Apache-licensed — Command A+, a mixture-of-experts model that fits on one or two GPUs.
Skilling up the agent — Marc Klingen's concrete lessons on teaching a coding agent to wire up your tool.
Who's training whom — the anxiety running underneath the week.

Chapters

00:00:04 Two bets on the same finish line
00:03:31 OpenAI cracks an 80-year-old problem
00:05:57 Anthropic in the black, and Karpathy's bet
00:08:16 Jagged intelligence, and the false story
00:10:42 Midjourney's hardware regret
00:12:46 Ads come to AI Mode
00:14:43 Meta's eight thousand
00:16:14 Cohere comes back, Apache-licensed
00:18:27 Skilling up the agent
00:20:59 Who's training whom

Sources

14 cited

1
Two Rival Bets on AGI: Google I/O Highlights

Video AI Explained — Independent AI analyst known for the Simple Bench common-sense benchmark and close reading of lab releases

Google wants the search box to be your portal for using all things AI, while OpenAI wants the chat box to be your portal for using search.
www.youtube.com/watch?v=o_av1b9rs2g →
Details
Cited text
Google wants the search box to be your portal for using all things AI, while OpenAI wants the chat box to be your portal for using search.

Context
The clearest framing available of what I/O actually signaled — a strategic divergence in how the two leading labs think AGI is reached, and where each is choosing to compete.
Key points
I/O read as Google's pitch to win consumers from OpenAI via the search bar, not to pull professional developers off Claude; Google didn't claim a coding frontier.
Two AGI theses on display: Demis Hassabis bets on world models (Gemini Omni, simulation = understanding); Greg Brockman bets on the text-reasoning tree ('everything we could imagine works', 'we have line of sight').
OpenAI called Sora the stepping stone to AGI in 2024; the Sora app is now shelved and the tech folded into internal robotics, while Google picks up the world-model thesis.
Sundar Pichai pitched enterprises on saving billions by switching to cheaper Gemini 3.5 Flash — a volume bet, not a frontier claim.
Google and OpenAI also converge: OpenAI adopting Google's SynthID watermark; both now signed Pentagon lawful-use contracts Anthropic had resisted.
Provenance
Video · Supporting source
2
OpenAI: breakthrough on the planar unit distance problem

X OpenAI

This marks the first time AI has autonomously solved a prominent open problem central to a field of mathematics.
x.com/OpenAI/status/2057176201782075690 →
Details
Cited text
This marks the first time AI has autonomously solved a prominent open problem central to a field of mathematics.

Context
A general-purpose model — the kind already callable from an API — producing a novel math result is a different signal than a purpose-built prover, and it landed squarely in I/O week as counter-programming.
Key points
OpenAI says a model produced a new result on the planar unit distance problem, posed by Paul Erdős in 1946.
For ~80 years the best constructions looked roughly like square grids; the model found a new family of constructions that does better.
OpenAI stresses the proof came from a general-purpose reasoning model, not a system built to solve math or this problem specifically.
Closing framing: 'Expertise becomes more valuable, not less. AI can help search, suggest, and verify. People choose the problems that matter.'
Engagement
4042 likes · 758 retweets · 231 replies

Provenance
Tweet · Primary source
3
A model disproves a discrete geometry conjecture

Article OpenAI

The proof came from a general-purpose reasoning model, not a system built specifically to solve math problems or this problem in particular.
openai.com/index/model-disproves-discrete-g… →
Details
Cited text
The proof came from a general-purpose reasoning model, not a system built specifically to solve math problems or this problem in particular.

Context
The primary source for the claim; its own hedges ('expertise becomes more valuable, not less') are more measured than the discourse around it.
Key points
Primary write-up and PDF proof for the planar unit distance result.
Frames the result as evidence models can hold long reasoning chains and connect distant fields.
Predicts the same abilities will accelerate biology, physics, engineering, and medicine — while insisting human judgment still chooses the problems.
Result is OpenAI's own announcement; independent verification by the math community is still pending.
Provenance
Article · Supporting source
4
OpenAI general purpose model breakthrough on 80-year-old Erdős problem

Source r/singularity (socoolandawesome)

Wait until it accelerates work in AI research.
www.reddit.com/r/singularity/comments/1tiwa… →
Details
Cited text
Wait until it accelerates work in AI research.

Context
Shows how the builder/enthusiast community read the result in real time — and points straight at the recursive-self-improvement bet Anthropic just hired for.
Key points
Reproduces OpenAI's full tweet text including the 'expertise becomes more valuable' passage.
Top comments capture the reflexive goalpost-moving debate and the recursive-self-improvement implication.
Links the blog post, the PDF proof, and an abridged version of the model's chain of thought.
Engagement
493 likes · 114 replies

Provenance
Source · Background source
5
Anthropic set to hit $10.9 billion in revenue during second quarter

Article CNBC — Reporting on figures the Wall Street Journal cited from Anthropic's ongoing funding round

Anthropic is projected to post its first operating profit of about $559M in Q2 2026, on revenue of $10.9B, up from $4.8B in Q1.
www.cnbc.com/2026/05/20/anthropic-revenue-e… →
Details
Cited text
Anthropic is projected to post its first operating profit of about $559M in Q2 2026, on revenue of $10.9B, up from $4.8B in Q1.

Context
A frontier lab printing an operating profit at all cuts against the 'these places only burn money' story — and it funds the recursive-self-improvement bet Karpathy was hired for.
Key points
Projected first operating profit of ~$559 million in Q2 2026.
Revenue projected to more than double quarter over quarter, from $4.8B to $10.9B.
The operating-profit figure includes model-training costs but excludes stock-based compensation.
Anthropic may not stay profitable across the full year given planned compute and training spend.
Provenance
Article · Supporting source
6
Anthropic officially set to be profitable as of Q2 2026

Source r/singularity (exordin26)

Useful read on how the milestone is being received and on the Google capital relationship sitting behind Anthropic's numbers.
www.reddit.com/r/singularity/comments/1tj07… →
Details
Context
Useful read on how the milestone is being received and on the Google capital relationship sitting behind Anthropic's numbers.
Key points
Community thread on the WSJ profitability report.
Top comment ties it to Google reportedly investing $40B more at a ~$330B valuation.
Captures the running argument with skeptics who predicted labs could never be profitable.
Engagement
526 likes · 150 replies

Provenance
Source · Background source
7
Midjourney says research was set back a year by using TPU, regrets not sticking with Nvidia

Source r/singularity (Charuru)

He is hinting at infrastructure friction caused by mixing stacks, not flat out "TPUs are shit."
www.reddit.com/r/singularity/comments/1tiut… →
Details
Cited text
He is hinting at infrastructure friction caused by mixing stacks, not flat out "TPUs are shit."

Context
A concrete counterweight to Google's TPU pitch at I/O: the cost of a less-supported accelerator is usually the tooling tax, not the chip.
Key points
Screenshot circulating that Midjourney's founder felt TPUs set their research back ~a year and regrets not staying on Nvidia.
Most-upvoted reply reframes it as friction from mixing two hardware stacks, not an absolute knock on TPUs.
Primary quote not independently located; treat as community-circulated until confirmed.
Engagement
534 likes · 58 replies

Provenance
Source · Background source
8
Midjourney — infrastructure rewrite from TPU to GPU-native

Article Wikipedia

Grounds the Reddit claim in a verifiable fact: the TPU-to-GPU migration happened, whatever the founder's exact words were.
en.wikipedia.org/wiki/Midjourney →
Details
Context
Grounds the Reddit claim in a verifiable fact: the TPU-to-GPU migration happened, whatever the founder's exact words were.
Key points
Midjourney shipped V8 in early 2026 after rewriting its codebase from scratch.
The rewrite migrated from TPUs to a GPU-native architecture built on PyTorch.
Corroborates that Midjourney did move off TPUs, independent of the disputed quote.
Provenance
Article · Supporting source
9
A new generation of ads for the AI era of Search

Article Google (Keyword Team)

Our Gemini model evaluates and synthesizes information about a product or service, and displays that context alongside the advertiser's creative.
blog.google/products/ads-commerce/google-ma… →
Details
Cited text
Our Gemini model evaluates and synthesizes information about a product or service, and displays that context alongside the advertiser's creative.

Context
This is the business model under the consumer bet: the model that answers your question is now also the one running the ad beside the answer, in the same paragraph.
Key points
Ads are coming to AI Mode: Conversational Discovery ads and Highlighted Answers, both built with Gemini.
AI-powered Shopping ads write a custom explainer for why a product fits you; a Business Agent for Leads puts a chat agent inside the ad.
Google adds an 'independent AI explainer' next to ads, labeled Sponsored, and frames it as transparency.
Direct Offers expands with promotion bundling, native checkout for Universal Commerce Protocol merchants, and travel deals via Booking/Expedia.
Cites a Google-commissioned Ipsos survey that 75% of people make faster, more confident decisions using AI Mode.
Provenance
Article · Supporting source
10
Meta kicks off major layoffs with 8,000 cuts in shift to AI

Article New York Post

The companywide purge is taking place in three massive waves, as employees across the world are notified in emails at 4 a.m. local time.
nypost.com/2026/05/20/business/meta-kicks-o… →
Details
Cited text
The companywide purge is taking place in three massive waves, as employees across the world are notified in emails at 4 a.m. local time.

Context
The cost side of the AI transition that doesn't make the keynote slide — running on the same clock as the profitable quarters and the breakthroughs.
Key points
Meta cutting about 8,000 jobs, roughly 10% of its workforce.
Notifications delivered in regional waves via early-morning emails; Singapore staff among the first.
Framed by Meta as a restructuring tied to its AI shift.
Engagement
1039 likes · 202 replies

Provenance
Article · Supporting source
11
Cohere launches Command A+ (first MoE, Apache 2.0)

Source Nick Frosst (Cohere co-founder) — Cohere co-founder, posting directly to r/LocalLLaMA

Just total, near unfettered access to a pretty awesome model.
www.reddit.com/r/LocalLLaMA/comments/1tizma… →
Details
Cited text
Just total, near unfettered access to a pretty awesome model.

Context
A capable open-weights model that fits on local hardware under a permissive license widens what a small team can build — the opposite of a closed launch that narrows access.
Key points
Command A+ is Cohere's first mixture-of-experts model, positioned as fast and responsive for its category.
Heavy quantization work means it runs well on one or two GPUs; released under Apache 2.0.
Frosst is candid that top-line performance still has work to do, and credits the open-source community for keeping Cohere innovative.
Community goodwill: replies recall the original Command R+ fondly and ask when quantized builds land.
Engagement
395 likes · 69 replies

Provenance
Source · Background source
12
Hugging Face benchmark datasets now filter by model size

Source r/LocalLLaMA (paf1138)

A small quality-of-life change that saves real time when picking a sub-32-billion-parameter model instead of eyeballing leaderboards dominated by models you can't run.
huggingface.co/datasets?benchmark=benchmark… →
Details
Context
A small quality-of-life change that saves real time when picking a sub-32-billion-parameter model instead of eyeballing leaderboards dominated by models you can't run.
Key points
Hugging Face benchmark datasets can now be filtered by model size.
Lets you ask directly which model under a given parameter count leads on a benchmark like SWE-bench Verified.
Useful for teams choosing among models they can actually run locally.
Provenance
Source · Background source
13
Skill issue: Lessons from skilling up coding agents to use Langfuse

Video Marc Klingen (Langfuse) — Co-founder of Langfuse, the largest open-source LLM observability/tracing project, speaking at the AI Engineer conference

Looking at traces still gets you to like 80% of the detail.
www.youtube.com/watch?v=vNCY9kXXyDQ →
Details
Cited text
Looking at traces still gets you to like 80% of the detail.

Context
The most concrete field guide going on the mechanics of agent skills — directly copyable by anyone who maintains a tool, SDK, or internal platform.
Key points
Asked to add tracing, Claude Code writes the integration from stale pre-training memory, ships it broken, then fetches current docs to fix it.
Fixes: watch real execution traces to find where the agent wanders; advertise the CLI --help flag so the agent asks the tool what it can do.
Give the agent a docs sitemap up front and serve markdown instead of HTML to avoid wasted tokens.
Wrap a docs Q&A/RAG system as a natural-language search endpoint the agent can query — and you get to see what agents search for, revealing thin docs.
Skills are a formalized shortcut between rigid workflows and fully autonomous agents: the agent pulls in just the context it needs, when it needs it.
Provenance
Video · Supporting source
14
Every office employee is training their own replacement

Source r/singularity (Excellent_Box_8216)

They're just collecting workflows, emails, decisions, prompts, and habits until the system can replace people one by one.
www.reddit.com/r/singularity/comments/1tjgm… →
Details
Cited text
They're just collecting workflows, emails, decisions, prompts, and habits until the system can replace people one by one.

Context
Captures the anxiety running under the week's optimism — the human counterpart to Meta's cuts and Anthropic's profit, framed as a question every engineer is quietly asking.
Key points
Widely-upvoted worry that AI-at-work mandates are really workflow-harvesting for eventual replacement.
Strong version worth taking seriously; literal version overstates how coordinated most companies are.
The real shift: the gap between people who use the tools well and those who don't is becoming the line that matters.
Engagement
552 likes · 219 replies

Provenance
Source · Background source

00:00:04

Two bets on the same finish line

00:00:04 Google's I/O keynote is a day behind us, and what I keep turning over isn't any single demo. It's the shape of the bet underneath it. Watch the keynote, and then look at what OpenAI did the same week. You see two labs pointed at the same destination — artificial general intelligence, AGI — on roads that don't meet.

00:00:24 And each one spent the week trying to show the other picked the wrong road. Here's how the analyst behind the AI Explained channel summed up Google's play, and it matches what I saw. I/O was Google's pitch to win consumers back from OpenAI. Look at all the things you can now do from the search bar.

00:00:43 It was much less about pulling professional developers off Claude — Google didn't claim its new models set a frontier for coding. As he put it, Google wants the search box to be your portal into AI, while OpenAI, also a consumer company at heart, wants the chat box to be your portal into search.

00:01:01 Two doors, same building. You could see that same instinct in the pricing. Sundar Pichai stood on stage and pitched companies on saving billions by moving to cheaper models like Gemini 3.5 Flash — fast, good enough, wired into the search box — minutes after joking about people token-maxing.

00:01:20 That's a volume claim, not a frontier one. Underneath the product story is a real disagreement about how you reach general intelligence at all. Demis Hassabis used the keynote to make the case for world models — Gemini Omni, video and simulation, the idea that if a system can correctly simulate the world, it understands it.

00:01:41 He called that a crucial step toward AGI. Now compare Greg Brockman, OpenAI's president, asked point blank why OpenAI's bet is on the text-reasoning tree instead of the video-world-model tree he'd watched make huge progress with Sora. His answer stuck with me. Quote: 'The problem in this field is too much opportunity.

00:02:01 The thing we observed very early at OpenAI is that everything we could imagine works.' Different friction, different compute, but every mathematically sound idea starts producing results. And on text reasoning specifically: 'We have definitively answered that question.

00:02:18 It is going to go to AGI. We have line of sight.' Back in early 2024, OpenAI said Sora — the video model — was the stepping stone to AGI, the model that would learn to simulate the world. Two years later the Sora app is shelved and the tech reportedly folded into an internal robotics group, while Google picks up exactly that world-model thesis and runs with it.

00:02:44 The stepping stone got handed across the table. So when a lab tells you it has a clear path to general intelligence, the smart move is to notice which road they're standing on this year, because it changes. One more thing from the keynote that cut against the rivalry.

00:03:01 Google announced that OpenAI, among others, will adopt SynthID — Google's watermarking tech — so an image edited in ChatGPT can be checked in Gemini. And Google has now joined OpenAI in signing a Pentagon contract permitting lawful military use of AI, notable given how publicly Anthropic resisted those same terms a couple of months back.

00:03:22 So the labs diverge on the road to AGI and converge on watermarks and defense contracts. The rivalry framing is never the whole picture.

00:03:31

OpenAI cracks an 80-year-old problem

00:03:31 The clearest counter-programming of the week didn't come from a keynote stage. OpenAI published it as a blog post and a PDF in the middle of I/O week. The claim: one of their models produced a genuine mathematical result on the planar unit distance problem — a question Paul Erdős first posed in 1946.

00:03:49 The setup, in plain terms. Take a bunch of points in a plane, draw a line between every pair that sits exactly one unit apart, and ask how many of those unit-distance pairs you can force among a given number of points. For nearly eighty years the best constructions mathematicians had found looked roughly like square grids.

00:04:09 OpenAI says its model disproved that — it found an entirely new family of constructions that does better. In their words: 'This marks the first time AI has autonomously solved a prominent open problem central to a field of mathematics.' OpenAI is careful to say the proof came from a general-purpose reasoning model — quote, 'not a system built specifically to solve math problems or this problem in particular.' That detail matters for anyone building with these things.

00:04:42 A purpose-built theorem prover cracking a conjecture is a story about that prover. A general model doing it is a story about the model you might already be calling from an API. I'd hold the framing with care, and to their credit OpenAI mostly does too. This is their own announcement of their own result.

00:05:00 The proof is posted as a PDF, and the math community hasn't fully chewed on it yet. So I'd wait for independent mathematicians to confirm the construction holds — and that it wasn't leaning on something already in the literature. I'd apply that standard to any lab; it's not a knock on this one.

00:05:18 But credit where it's due: OpenAI's own write-up resisted the triumphalist version. The line they closed on: 'That future still depends on human judgment. Expertise becomes more valuable, not less. AI can help search, suggest, and verify. People choose the problems that matter.'

00:05:57

Anthropic in the black, and Karpathy's bet

00:05:57 This week we also learned that Anthropic is about to have its first profitable quarter, and the numbers are large enough to settle a few arguments. Per a Wall Street Journal report, picked up by CNBC and TechCrunch, Anthropic is projecting roughly 559 million dollars in operating profit for the second quarter, on revenue of about 10.9 billion dollars — more than double the 4.8 billion it did in the first quarter.

00:06:24 Two caveats they're upfront about. That operating-profit figure includes model-training costs but excludes stock-based compensation. And they may not stay profitable across the full year, because they plan to spend hard on compute and training. So it's a profitable quarter, not yet a profitable company.

00:06:43 But for a frontier lab to print an operating profit at all cuts against the story that these places can only burn money — a story plenty of smart people have been telling. Yesterday we covered Andrej Karpathy leaving for Anthropic. Here's the piece that sharpens it.

00:07:00 According to the run-up reporting, Karpathy joined specifically to work on recursive self-improvement in pre-training — using Claude itself to accelerate the research that trains the next Claude. That's the Reddit commenter's joke from the Erdős thread made literal: point the model at the problem of building better models.

00:07:20 Two things make that bet interesting, and a little strange. First, Anthropic is the lab that once said, in plain language, that it didn't wish to advance the rate of AI capabilities progress. Hiring to accelerate your own pre-training is a different posture, and I think it's fair to call it a turn rather than pretend it's continuous.

00:07:41 Second, we also learned this week that Demis Hassabis was reportedly one of the early backers who helped get Anthropic started. So the person making the world-model bet at Google helped seed the lab now betting hardest on recursive self-improvement. The field is smaller and more tangled than the rivalry framing lets on.

00:08:01 So that's the optimistic fork: models that can do novel math, a lab with the revenue to fund the next round, and a hire aimed squarely at having models improve themselves. Now the counterweight, because it landed alongside it.

00:08:16

Jagged intelligence, and the false story

00:08:16 The same analyst who framed the two bets flagged a new independent paper, about seventy pages, and it's the most useful cold water I've seen on the AGI talk in a while. The researchers took near-frontier models — Qwen 3.5, Kimi K2.5, even models in the GPT-4 series — and fine-tuned them on thousands of documents that all carried the same warning: the following story is completely false, don't believe it.

00:08:40 Thousands of permutations, each one stamped fabricated. What did the models learn from being told, over and over, that the story was false? They learned to believe it. Asked afterward about the biggest upsets at the recent Summer Olympics, a model answered that Ed Sheeran winning gold was the most astonishing result in Olympic history.

00:09:01 And this wasn't a regurgitation trick. They rephrased the questions, asked open-ended and multiple-choice versions — has any musician ever won an Olympic medal? — and the models still treated the labeled-false claim as true. As the analyst put it, as long as the disclaimer isn't literally in the same sentence as the claim, the model absorbs the claim wholeheartedly.

00:09:23 Why this matters for builders, concretely: this kind of synthetic-document fine-tuning isn't a toy. It's used in real frontier training right now — the Anthropic constitution that Opus 4.7 is trained on works this way. If a model can't represent 'this specific thing is false' across a document boundary, that's not a cosmetic bug, it's a question about how the system stores knowledge.

00:09:47 The sharpest quote came from Deguang Li, a researcher at Google DeepMind, on what he calls jagged intelligence — a system that nails a hard math proof but fumbles counting the letters in a word. His point, lightly cleaned up: 'I think we're underestimating how hard jagged intelligence is to fix, and we're missing how much it matters.

00:10:07 People laugh and move on. But it's pointing at something deep and unresolved about how these systems represent and process knowledge. It's not a bug you can patch.' He goes further — a model that's brilliant at technical problems but has a blind spot about everything else won't actually create meaningful progress in the world.

00:10:27 Hold that next to Brockman's 'we have line of sight.' Both are true at once. Models are doing things this week that looked impossible a year ago, and they still can't reliably hold a negation across a paragraph. That gap is the whole game.

00:10:42

Midjourney's hardware regret

00:10:42 Here's a smaller story that's a useful counterweight to the hardware optimism, because Google spent real keynote time on its own chips. Over on Reddit, a screenshot circulated of Midjourney's founder saying — or at least strongly hinting — that the company's research got set back by roughly a year by going down the tensor processing unit road.

00:11:03 Those are Google's custom AI chips, the TPUs. And he regrets not staying purely on Nvidia. I want to be careful here, because I couldn't find the primary quote — I'm taking the screenshot at its word, and the most-upvoted reply pushes back usefully. As that commenter read it, the founder is pointing at infrastructure friction from mixing two hardware stacks, not saying TPUs are bad in some absolute sense.

00:11:28 And that reading lines up with what we can confirm independently: earlier this year Midjourney did the biggest infrastructure rewrite in its history, moving off TPUs to a GPU-native codebase built on PyTorch. What this teaches anyone choosing an accelerator isn't that TPUs lose — Google's TPUs run Gemini at the speeds that made Flash a story this week.

00:11:50 The real cost of betting on the less-supported chip is rarely the chip itself. It's the tooling tax. The kernels you have to write, the libraries that aren't there, the years of PyTorch muscle memory your team doesn't get to reuse, the second stack you end up maintaining.

00:12:07 Midjourney is a small, elite team, and that tax still apparently cost them a year. And it complicates a subplot from the keynote. Part of Google's pitch to enterprises — save billions, switch to Flash — rests on TPUs being cheaper per token. That can be true for Google, which builds the chip and the model and the compiler under one roof, and still be a bad trade for a team that has to bridge two ecosystems to get there.

00:12:33 Same hardware, completely different economics depending on who's holding it. If you're tempted by cheaper compute on a less-trodden platform, that's the number to estimate carefully before you commit.

00:12:46

Ads come to AI Mode

00:12:46 While everyone argued about AGI, Google answered the question that actually decides whether the consumer bet works: how does it make money? The answer landed in a post tied to Google Marketing Live — ads are coming to AI Mode in search. The formats are worth knowing, because they change what the answer box is.

00:13:04 Conversational Discovery ads answer your specific question — ask how to make your house smell like a fancy spa, and Gemini builds a tailored ad around the product's features. Highlighted Answers slot a qualifying ad into a ranked list, like the best language apps for a trip, as one of the entries.

00:13:22 AI-powered Shopping ads have Gemini write a custom explainer for why this espresso machine is right for you. And a Business Agent for Leads puts a chat agent inside the ad, answering from the advertiser's website instead of making you fill out a static form. The piece Google leans on hardest is what they call an independent AI explainer.

00:13:43 Alongside the advertiser's creative, Gemini synthesizes its own context about the product and shows it next to the ad, all labeled Sponsored. Google frames that as transparency. I'd frame it more plainly: the model is now both the recommender and the advertiser, and those two jobs sit in the same paragraph.

00:14:01 Google also cites a commissioned survey saying seventy-five percent of people make faster, more confident decisions using AI Mode — that's a Google-funded Ipsos number, so weigh it accordingly. They're also expanding Direct Offers: promotion bundling, native checkout for merchants on the Universal Commerce Protocol, and travel deals surfaced inside trip planning with partners like Booking and Expedia.

00:14:26 Put it together and the consumer bet from the top of the show gets its business model — the answer surface becomes an ad surface. For anyone building on top of Google's search, that's the structural change to track. Your content is now competing for space inside an answer that also sells.

00:14:43

Meta's eight thousand

00:14:43 Now, a harder story from the same days. Meta is cutting about eight thousand jobs — roughly ten percent of its workforce — and according to the reporting the notifications went out in waves, by region, in early-morning emails, with staff in Singapore among the first to get them.

00:15:00 Meta is framing it as a restructuring tied to its AI shift. I'm not going to pretend to know the internal logic, and I won't impute one. What I'll say is this: this side of the AI story doesn't fit on a keynote slide. In the same stretch where a lab prints a profitable quarter and a model cracks an eighty-year-old conjecture, eight thousand people at one of the largest tech companies get a four a.m.

00:15:24 email. Both are the AI transition. The wins and the cuts run on the same clock, and if you only watch the launches you're watching half of it. It's also worth being precise about what 'AI restructuring' does and doesn't mean. A ten percent cut framed around AI is partly a real efficiency claim and partly cover — companies have wanted leaner headcount for a while, and 'we're reorganizing around AI' is a cleaner thing to tell shareholders than 'we over-hired.' I can't tell you the mix at Meta, and neither can anyone outside the building.

00:15:58 But if you're an engineer reading the headline, what decides your exposure is simpler than the headline: does your work read to your employer as a cost line, or as leverage? The people who pair well with these tools are getting more done, not turning out to be less needed.

00:16:14

Cohere comes back, Apache-licensed

00:16:14 Now something for the people who'd rather run the model than rent it. Nick Frosst, co-founder of Cohere, showed up on the LocalLLaMA subreddit to announce Command A+, and the framing tells you who it's for. It's Cohere's first mixture-of-experts model — an architecture that only activates a slice of the network for each token, so you get the capacity of a big model at closer to the cost of a small one.

00:16:41 Frosst says they pulled off enough quantization work that it runs well on one or two GPUs, and they shipped it under the Apache 2.0 license. His words: 'total, near-unfettered access to a pretty awesome model.' It's the posture. Cohere is an enterprise-first company that keeps choosing to ship open weights, and Frosst is direct about why: 'we get so much out of our open-source community that makes us more innovative.' He came into the thread half to launch and half to take feedback, even quoting the ribbing they got last time.

00:17:24 The replies were warmer than launch threads usually run — people remembering the original Command R+ as legendary for its time, asking when the quantized community builds land. That goodwill is an asset, and Cohere clearly knows it's spending it carefully. After a week of closed frontier launches and new ad formats, a capable open-weights model you can actually fit on your own hardware under a permissive license is the kind of release that widens what a small team can build instead of narrowing it.

00:17:57 That's what I find encouraging. And one practical note from the same corner of the world, if you're picking among the growing pile of open models: Hugging Face now lets you filter benchmark datasets by model size. So you can ask directly which model under thirty-two billion parameters does best on something like SWE-bench Verified, instead of eyeballing a leaderboard dominated by models you can't run.

00:18:24 Small quality-of-life change, real time saved.

00:18:27

Skilling up the agent

00:18:27 For the practical chapter, here's the most concrete walkthrough I've seen on the actual mechanics of agent skills. Marc Klingen, co-founder of Langfuse, gave a talk at the AI Engineer conference, and it's a postmortem of teaching a coding agent to wire up their own product correctly.

00:18:45 Start with the problem, because every tool company now has it. Langfuse has 478 pages of documentation. When you ask Claude Code to add tracing to your app, it confidently writes the integration from its pre-training memory — which is a couple of years stale — ships it, runs it, watches it fail, and only then fetches the current docs to fix what it just broke.

00:19:07 Klingen's line: being in the pre-training data used to be an advantage, and now, if the agent doesn't fetch current information, it's a liability. The model hallucinates methods that existed once and don't anymore. His fixes are small and specific, which is exactly why they're worth stealing this week.

00:19:26 A few that stood out. One: looking at real execution traces gets you eighty percent of the way to a good skill. He and his team just used the agent themselves, watched where it wandered off, and tightened the instructions. Two: advertise the command-line help flag aggressively in the skill, so instead of guessing parameters from a vaguely familiar command name, the agent spends one cheap turn asking the tool what it can actually do.

00:19:53 Three: give the agent a sitemap of your docs up front, and serve markdown instead of HTML — some agents don't request markdown by default and waste tokens parsing a web page. Four, and this is the one I'd take first: they wrapped their docs question-and-answer system as a plain search endpoint the agent can hit with a natural-language query, so it gets back the relevant chunk instead of crawling five pages.

00:20:18 As a bonus, they now see exactly what agents are searching for, which tells them where the docs are thin. There's a bigger idea sitting under the tactics. Klingen frames skills as a formalized shortcut between two old poles — the rigid, reliable workflow and the fully autonomous agent.

00:20:36 Instead of hand-building a router for every path a user might take, you let the agent pull in just the context it needs, when it needs it. The instruction file isn't the integration; it's the manual that turns a model with a bash tool and no idea what to do into one that can solve the cube.

00:20:54 If you maintain a tool, an SDK, or an internal platform, treat that talk as a checklist.

00:20:59

Who's training whom

00:20:59 I'll close on the thread that ran underneath all of this. The top post on the singularity subreddit this week wasn't a model or a launch — it was a worry, with 552 upvotes: every office employee is training their own replacement. The argument: companies pushing AI at work are really just harvesting workflows, emails, and decisions until the system can do the job without you.

00:21:20 I think the strong version of that is worth taking seriously, and the literal version is too tidy. Most companies aren't running a secret plan to capture your keystrokes and fire you — that gives the average org a coordination it rarely has. What's actually happening is messier, and more interesting.

00:21:36 The tools are getting good enough that the line between someone who uses them well and someone who doesn't is becoming the line that matters. Meta's eight thousand and Anthropic's profitable quarter are the same coin from two sides. So here's where the week leaves me.

00:21:51 Two of the most serious labs alive are each certain they have line of sight to general intelligence, on roads that don't meet. One of their models just did real mathematics that had stood for eighty years. And a seventy-page paper says those same models still can't reliably tell a true sentence from one stamped 'this is false.' Hold those three facts next to each other and resist the urge to resolve them too fast — that tension is the actual state of the art, not any one of them alone.

00:22:18 Lenar Kess.