Archive BRAID
Foothills, and the morning Karpathy moved / DISPATCH 032
PDF RSS

Dispatch 032 · 2026-05-20 GSV Foothills Of The Singularity

Foothills, and the morning Karpathy moved

/ 00:23:26 / 10 sources

“Google had the bigger announcement. Anthropic had the bigger signal. Both were true by lunchtime.”

— Lenar Kess, today's narration

Google I/O 2026 landed yesterday — Gemini Omni, Gemini 3.5 Flash, Antigravity 2.0, Spark, and Demis Hassabis closing the keynote on the "foothills of the singularity." About forty minutes before he walked on stage, Andrej Karpathy tweeted that he'd joined Anthropic. The rest of the day was the labs sorting themselves around both events. Today's show works through the announcements, the pricing shifts, the keynote demo that boots Doom, the Railway outage that happened while Google was selling Spark, and a builder's 100K-line Rust postmortem that's a sharper picture of agentic coding than anything on the I/O stage.

Chapters

  1. 00:00:04 Foothills
  2. 00:02:00 Flash, and the price that changed underneath the brand
  3. 00:04:11 Omni's physics pitch and the backflip test
  4. 00:06:25 Antigravity 2.0 and the OS that boots Doom
  5. 00:08:33 Spark, and the always-on agent tier
  6. 00:10:17 Karpathy
  7. 00:12:42 Alibaba's full-stack answer: Qwen 3.7-Max and a new chip
  8. 00:14:28 DeepSeek hires a harness team
  9. 00:16:07 Railway, GCP, and the substrate question Google didn't address
  10. 00:18:42 What 100K lines of Rust with AI actually looks like
  11. 00:21:58 What today added up to

Sources

10 cited
  1. 1

    Andrej Karpathy joins Anthropic

    X karpathy — OpenAI co-founder, former Tesla AI lead, founder of Eureka Labs (AI-for-education); now on Anthropic's pre-training team under Nick Joseph.

    Personal update: I've joined Anthropic. I think the next few years at the frontier of LLMs will be especially formative. I am very excited to join the team here and get back to R&D.

    x.com/karpathy/status/2056753169888334312 →
    Details
    Cited text
    Personal update: I've joined Anthropic. I think the next few years at the frontier of LLMs will be especially formative. I am very excited to join the team here and get back to R&D.
    Context
    The single highest-status engineer-researcher available on the market in 2026 looked at the frontier and picked Anthropic over Google, OpenAI, or going independent. For builders, that's a calibration on where pre-training compounds fastest right now.
    Key points
    • Karpathy joined Anthropic on May 19, 2026 — the morning of Google I/O — under pre-training lead Nick Joseph.
    • He framed it as returning to R&D after 18 months running Eureka Labs.
    • Anthropic told TechCrunch his charter is to start a team using Claude to accelerate pre-training research itself.
    • Tweet hit 21,800 likes and 825,000 views in 24 hours — among the largest engagements on a job-move post in years.
    Engagement
    21828 likes · 3448 retweets · 1795 replies
    Provenance
    Tweet · Primary source
  2. 2

    Google I/O 2026: Gemini 3.5 Flash, Omni, Spark, and Antigravity 2.0

    Article Smol AI / Latent Space — Daily AI news rollup that Smol AI's team writes alongside Latent Space; one of the more reliable I/O recap sources for raw numbers.

    Gemini 3.5 Flash priced at $1.50 / 1M input, $9.00 / 1M output tokens, with 90% discount on cached input. 4x faster than comparable frontier models; up to 12x faster in Antigravity.

    www.latent.space/p/ainews-google-io-2026-ge… →
    Details
    Cited text
    Gemini 3.5 Flash priced at $1.50 / 1M input, $9.00 / 1M output tokens, with 90% discount on cached input. 4x faster than comparable frontier models; up to 12x faster in Antigravity.
    Context
    The full I/O numbers in one place — pricing, benchmarks, demo scale — so the keynote can be evaluated against what was actually shipped versus shown on stage.
    Key points
    • Gemini 3.5 Flash: GA day-of across app, Search, API, enterprise; 1M context, 65k output, four thinking levels including a new 'medium' default.
    • Benchmarks: Terminal-Bench 2.1 at 76.2%, GDPval-AA 1656 Elo, MMMU-Pro 83.6–84%, Arena #9 at 1507.
    • Antigravity 2.0 demo: 93 parallel sub-agents, 15k+ model requests, 2.6B tokens, 12 hours, under $1K in API credits — built an OS that boots Doom.
    • Gemini Omni rolls out to paid users today, YouTube Shorts/Create this week, API in coming weeks.
    • Gemini Spark: 24/7 personal agent on dedicated Google Cloud VMs that runs while your devices are closed.
    Provenance
    Article · Supporting source
  3. 3

    Demis Hassabis: "foothills of the singularity" at Google I/O

    Article Prism News — News write-up of Hassabis's closing remarks at the I/O 2026 keynote in Mountain View.

    When we look back at this time, I think we will realize that we were standing in the foothills of the singularity.

    www.prismnews.com/news/google-deepmind-chie… →
    Details
    Cited text
    When we look back at this time, I think we will realize that we were standing in the foothills of the singularity.
    Context
    DeepMind's CEO is normally the conservative voice on AGI timelines. When he compresses, that recalibrates investor and engineer expectations across the field.
    Key points
    • Hassabis closed the I/O 2026 keynote with the 'foothills of the singularity' line.
    • He has compressed his public AGI timeline from 5–10 years to 'just a few years'.
    • Framed the day as 'a profound moment for humanity' — Google's most aggressive on-stage AGI rhetoric to date.
    • Used the line to anchor the launches of Gemini Omni, Gemini 3.5 Flash, Antigravity 2.0, and Spark.
    Provenance
    Article · Supporting source
  4. 4

    Incident Report: May 19, 2026 — GCP Account Suspension

    Article Chandrika Khanduri, Cody De Arkland (Railway) — Railway's incident-response leads on the team that runs its production platform.

    Railway owns our vendor choices, and we ultimately own this one. Your customers don't care whether the failure was Google or Railway; they see your product.

    blog.railway.com/p/incident-report-may-19-2… →
    Details
    Cited text
    Railway owns our vendor choices, and we ultimately own this one. Your customers don't care whether the failure was Google or Railway; they see your product.
    Context
    While Google was pitching Spark and Antigravity 2.0 on stage — both of which want more of your workload to live on Google Cloud — its automated account-suspension system was taking a whole PaaS down for eight hours. That tension is the day's real cost story.
    Key points
    • Google Cloud's automated systems incorrectly suspended Railway's production account at 22:20 UTC on May 19, hitting many accounts in the same sweep.
    • Outage lasted roughly 8 hours; full API/dashboard/OAuth restored by ~04:00 UTC May 20.
    • Even AWS and Railway Metal workloads went dark once the GCP-hosted control plane's route cache expired.
    • GitHub piled on by rate-limiting Railway's OAuth/webhook integrations during the recovery retry burst.
    • Railway committed to removing GCP from the data-plane hot path and extending HA database quorum across AWS and Metal.
    Provenance
    Article · Supporting source
  5. 5

    Learnings from 100K Lines of Rust with AI

    Article Cheng Huang — Software architect who built a Rust multi-Paxos consensus engine modeled on Azure's RSL using Claude Code and Codex CLI as primary drivers.

    I pay $100/month for Anthropic's max plan. This became a forcing function — if I don't kick off a coding task with Claude before bed, I feel like I'm wasting money.

    zfhuang99.github.io/rust/claude%20code/code… →
    Details
    Cited text
    I pay $100/month for Anthropic's max plan. This became a forcing function — if I don't kick off a coding task with Claude before bed, I feel like I'm wasting money.
    Context
    The most concrete builder ground-truth for what serious AI-assisted systems work looks like in 2026. Not a 93-agent stage demo — a senior architect with a tight contract regime, two paid subscriptions, and a Paxos engine that actually runs.
    Key points
    • 130K lines of production Rust in roughly six weeks; 1,300+ tests covering >65% of the codebase.
    • Throughput tuned from 23K ops/sec to 300K ops/sec in three weeks using AI as performance co-pilot.
    • Three-level code-contract regime: AI writes contracts, AI generates tests from them, AI translates them into property-based tests. One AI-generated contract caught a Paxos safety violation.
    • Rotates Anthropic Max (Mon–Wed) and ChatGPT Plus (Thu–Sun) subscriptions to dodge rate limits — pays both.
    • Says GPT-5 High writes better contracts than Opus 4.1, on his subjective sample.
    • Argues a single user story is the right unit of work for current coding agents.
    Provenance
    Article · Supporting source
  6. 6

    DeepSeek is forming a Code Harness team

    X victor207755822 (Deli Chen) — DeepSeek engineer in Beijing, posting the company's first public harness-team job listings.

    DeepSeek is forming a new Harness team to build Code Harness from the ground up — may be you can call it DeepSeek Code or something like this hhh.

    x.com/victor207755822/status/20570644153008… →
    Details
    Cited text
    DeepSeek is forming a new Harness team to build Code Harness from the ground up — may be you can call it DeepSeek Code or something like this hhh.
    Context
    A year ago 'agent harness' was a research term. Now it's a hiring category at every major lab — and DeepSeek admitting they need one publicly is the cleanest signal that the model alone is no longer the product.
    Key points
    • DeepSeek opening Harness Product Manager and Harness R&D roles in Beijing.
    • Explicit signal that a Chinese frontier lab is building a coding-agent harness to compete with Claude Code, Codex, and Antigravity.
    • Engagement reached ~23k views and 349 likes within hours of posting.
    • Confirms 'the harness' is now a product category every frontier lab needs in 2026.
    Provenance
    Tweet · Primary source
  7. 7

    Ethan Mollick on recursive self-improvement and talent gravity

    X emollick — Wharton professor; widely-read commentator on practical AI adoption inside organizations.

    One interesting side feature of recursive self-improvement, to the extent that is happening, is that it makes the Big Three labs more appealing to talent, and shortens the runway for launching a potential competitor ins…

    x.com/emollick/status/2057074407177130096 →
    Details
    Cited text
    One interesting side feature of recursive self-improvement, to the extent that is happening, is that it makes the Big Three labs more appealing to talent, and shortens the runway for launching a potential competitor instead at the same time.
    Context
    Mollick gives the structural reason the Karpathy news matters beyond a single hire: if recursive self-improvement is real, the Big Three become talent sinks faster than the outside ecosystem can spin up rivals.
    Key points
    • Frames the Karpathy move without naming it: the compounding curve favors insiders.
    • If frontier labs are pulling ahead via model-assisted research, the best place to do research is inside one of them.
    • Reduces the founder calculus for ex-OpenAI-style independents.
    • Posted hours after Karpathy's announcement; ~7,500 views, 65 likes.
    Provenance
    Tweet · Primary source
  8. 8

    Alibaba unveils Qwen 3.7-Max: 35-hour task runs, 1,000+ tools

    Article Meyka — Market-coverage write-up of Alibaba Cloud's May 20 summit launches.

    The timing isn't coincidence. Alibaba's flagship lands the same week as I/O, with both an agent-frontier model and a new chip — a fuller stack response than any US lab put up against Google this week.

    meyka.com/blog/alibaba-upgrades-ai-stack-wi… →
    Details
    Context
    The timing isn't coincidence. Alibaba's flagship lands the same week as I/O, with both an agent-frontier model and a new chip — a fuller stack response than any US lab put up against Google this week.
    Key points
    • Qwen 3.7-Max pitched as 'The Agent Frontier' — long-horizon tool use, claimed 35-hour autonomous task runs, 1,000+ tools.
    • Preview Max and Plus variants appeared on Arena leaderboard May 14 with no press release; Max hit Elo 1475 (#13 overall, #7 in math).
    • Coincided with Alibaba's launch of its Zhenwu M890 AI chip at the same summit.
    • Land date pulled forward to overlap with Google I/O Day 2.
    Provenance
    Article · Supporting source
  9. 9

    OpenAI co-founder Andrej Karpathy joins Anthropic's pre-training team

    Article TechCrunch — First on-record Anthropic spokesperson confirmation of Karpathy's charter at the company.

    The role itself — model-in-the-loop on pre-training — is the most concrete public artifact yet of what 'recursive self-improvement' looks like as an org chart inside a frontier lab.

    techcrunch.com/2026/05/19/openai-co-founder… →
    Details
    Context
    The role itself — model-in-the-loop on pre-training — is the most concrete public artifact yet of what 'recursive self-improvement' looks like as an org chart inside a frontier lab.
    Key points
    • Karpathy reports into Nick Joseph, Anthropic's pre-training lead.
    • Charter: start a team that uses Claude itself to accelerate pre-training research.
    • Pre-training is described as the most compute-intensive phase of building a frontier model.
    • Anthropic confirmed the role on the same day as the tweet.
    Provenance
    Article · Supporting source
  10. 10

    Gemini Omni still can't render a clean backflip

    X Able-Line2683 (r/singularity) — Reddit user testing Omni hours after launch.

    A keynote claim that an episode shouldn't repeat without checking. Within hours, builders had already found the seam in Omni's marketing.

    www.reddit.com/r/singularity/comments/1thoh… →
    Details
    Context
    A keynote claim that an episode shouldn't repeat without checking. Within hours, builders had already found the seam in Omni's marketing.
    Key points
    • Same-day stress test of Gemini Omni's physics claim.
    • Backflip generation fails — body distorts mid-flip in the linked share.
    • Post hit 600+ upvotes, 100+ comments in hours.
    • Cuts directly against the headline I/O pitch that Omni handles physics, gravity, and kinetic motion better than prior models.
    Provenance
    Tweet · Primary source