Archive BRAID
Cheaper From Both Ends / DISPATCH 044
PDF RSS

Dispatch 044 · 2026-06-01 GSV The Cost Of Admission

Cheaper From Both Ends

/ 00:19:55 / 20 sources

“Twelve cents against five dollars is the kind of gap that rewrites what you're willing to let an agent try.”

— Lenar Kess, today's narration

A Chinese lab cut the price of a frontier-class coding model to a fraction of Opus, Nvidia tried to own every layer from the laptop to the data center, and one developer ran the new Gemma 4 on a decade-old Xeon. The cost of running intelligence got attacked from both ends on the same morning — and the question underneath all of it is who gets to set that cost.

  • MiniMax M3 claims parity with Opus 4.7 at roughly twelve cents per million input tokens versus five dollars — but the weights are promised in about ten days, so "open-weights" is still a countdown.
  • Nvidia's DGX Station puts a GB300 chip and up to 748GB of memory on a desktop, enough to run a one-trillion-parameter model locally; the RTX Spark chip pushes the same idea into laptops, while the Vera CPUs — with Anthropic, OpenAI, and SpaceX as early customers — signal a move off x86.
  • A 10-year-old Xeon is all you need: cafkafk runs a 26B mixture-of-experts model at reading speed on a 2016 CPU with no GPU, arguing mainstream tools hide the performance levers.
  • Cosmos 3 is Nvidia's open physical-AI world model, backed by a Cosmos Coalition with Runway as a founding member.
  • Cadence and Nvidia claim a "Level 5" autonomous chip-verification agent that turns months into a day — a large autonomy claim in a domain where mistakes ship in silicon.
  • Anthropic will let the EU's ENISA join Project Glasswing for access to a model called Mythos, even as a Wirescreen analysis documents 500+ PLA attempts to procure Nvidia chips and governments from India and the UAE to France move to own their compute.

Chapters

  1. 00:00:00 Transcript

Sources

20 cited
  1. 1

    @MiniMax_AI (MiniMax (official))

    X MiniMax_AI

    Introducing MiniMax M3: The First Open-Weights Model to Combine Three Frontier Capabilities - Coding & Agentic Frontier: 59.0% SWE-Bench Pro, 66.0% Terminal Bench 2.1, 34.8% SWE-fficiency, 28.8% KernelBench Hard, 74.2%…

    x.com/MiniMax_AI/status/2061266317815296322… →
    Details
    Excerpt
    Introducing MiniMax M3: The First Open-Weights Model to Combine Three Frontier Capabilities - Coding & Agentic Frontier: 59.0% SWE-Bench Pro, 66.0% Terminal Bench 2.1, 34.8% SWE-fficiency, 28.8% KernelBench Hard, 74.2%…
    Context
    Announces a new open-weights model (MiniMax M3) with specific benchmark scores and capabilities (coding, agentic, 1M context), directly addressing the 'frontier model releases' and 'agentic coding tools' topics.
    Key points
    • Announces a new open-weights model (MiniMax M3) with specific benchmark scores and capabilities (coding, agentic, 1M context), directly addressing the 'frontier model releases' and 'agentic coding tools' topics.
    Provenance
    Tweet · Primary source
  2. 2

    Hugging Face Blog - Frontier Labs (GLOBAL)

    Article

    Welcome NVIDIA Cosmos 3: The First Open Omni-model for Physical AI Reasoning and Action

    huggingface.co/blog/nvidia/cosmos-3-for-phy… →
    Details
    Excerpt
    Welcome NVIDIA Cosmos 3: The First Open Omni-model for Physical AI Reasoning and Action
    Context
    Announcing a major, open-source model (Cosmos 3) specifically for Physical AI Reasoning and Action. This directly relates to frontier models, agentic tools, and the power dynamics of AI infrastructure.
    Key points
    • Announcing a major, open-source model (Cosmos 3) specifically for Physical AI Reasoning and Action. This directly relates to frontier models, agentic tools, and the power dynamics of AI infrastructure.
    Provenance
    Article · Supporting source
  3. 3

    NVIDIA Blog - Markets Infra (US)

    Article Ming-Yu Liu

    How Cosmos 3 Helps Physical AI Think Before It Acts

    blogs.nvidia.com/blog/cosmos-3-physical-ai-… →
    Details
    Excerpt
    How Cosmos 3 Helps Physical AI Think Before It Acts
    Context
    NVIDIA blog post on 'Cosmos 3' for Physical AI. Directly addresses AI infrastructure, frontier models, and the physical-world application of intelligence.
    Key points
    • NVIDIA blog post on 'Cosmos 3' for Physical AI. Directly addresses AI infrastructure, frontier models, and the physical-world application of intelligence.
    Provenance
    Article · Supporting source
  4. 4

    NVIDIA Blog - Markets Infra (US)

    Article Timothy Costa

    Taiwan’s Industry Titans Turbocharge World’s AI Infrastructure Buildout With NVIDIA - Taiwan is home to more than 500 NVIDIA ecosystem partners. More than 1 million NVIDIA MGX rack components for NVIDIA Vera Rubin...

    blogs.nvidia.com/blog/taiwan-ecosystem-ai-i… →
    Details
    Excerpt
    Taiwan’s Industry Titans Turbocharge World’s AI Infrastructure Buildout With NVIDIA - Taiwan is home to more than 500 NVIDIA ecosystem partners. More than 1 million NVIDIA MGX rack components for NVIDIA Vera Rubin...
    Context
    Directly addresses AI infrastructure, supply chain, and key geopolitical/economic players (Taiwan, NVIDIA, Vera Rubin).
    Key points
    • Directly addresses AI infrastructure, supply chain, and key geopolitical/economic players (Taiwan, NVIDIA, Vera Rubin).
    Provenance
    Article · Supporting source
  5. 5

    NVIDIA Blog - Markets Infra (US)

    Article Dion Harris

    NVIDIA AI Cloud Ecosystem Expands Worldwide to Meet Global AI Compute Demand - The NVIDIA AI Cloud ecosystem is accelerating the global buildout of AI factory infrastructure. Partners are expanding capacity to meet...

    blogs.nvidia.com/blog/ai-cloud-ecosystem →
    Details
    Excerpt
    NVIDIA AI Cloud Ecosystem Expands Worldwide to Meet Global AI Compute Demand - The NVIDIA AI Cloud ecosystem is accelerating the global buildout of AI factory infrastructure. Partners are expanding capacity to meet...
    Context
    Directly addresses AI infrastructure, compute demand, and the global buildout of AI factories, central to the podcast's focus.
    Key points
    • Directly addresses AI infrastructure, compute demand, and the global buildout of AI factories, central to the podcast's focus.
    Provenance
    Article · Supporting source
  6. 6

    @runwayml (Runway)

    X runwayml

    Introducing the Cosmos Coalition A new global initiative with NVIDIA and leading AI labs to build and open-source frontier world models for physical AI. Runway joins as a founding member, working alongside NVIDIA and a…

    x.com/runwayml/status/2061315089869721682 →
    Details
    Excerpt
    Introducing the Cosmos Coalition A new global initiative with NVIDIA and leading AI labs to build and open-source frontier world models for physical AI. Runway joins as a founding member, working alongside NVIDIA and a…
    Context
    Announces a major, concrete initiative (Cosmos Coalition) involving key players (NVIDIA, AI labs) to build frontier models for physical AI, directly addressing the topic's focus on AI infrastructure and power dynamics.
    Key points
    • Announces a major, concrete initiative (Cosmos Coalition) involving key players (NVIDIA, AI labs) to build frontier models for physical AI, directly addressing the topic's focus on AI infrastructure and power dynamics.
    Provenance
    Tweet · Primary source
  7. 7

    Forbes Innovation - Industry Adjacent (US)

    Article Karl Freund, Contributor

    Cadence And Nvidia Team To Develop First Fully Autonomous EDA Agent - Cadence and Nvidia have teamed to present the first example of Level 5 AI EDA agent to automate the work of design verification, turning a...

    www.forbes.com/sites/karlfreund/2026/06/01/… →
    Details
    Excerpt
    Cadence And Nvidia Team To Develop First Fully Autonomous EDA Agent - Cadence and Nvidia have teamed to present the first example of Level 5 AI EDA agent to automate the work of design verification, turning a...
    Context
    A major industry player (Cadence) partnering with a key infrastructure provider (Nvidia) to automate a core, complex engineering task (EDA) is a primary artifact with clear downstream consequence.
    Key points
    • A major industry player (Cadence) partnering with a key infrastructure provider (Nvidia) to automate a core, complex engineering task (EDA) is a primary artifact with clear downstream consequence.
    Provenance
    Article · Supporting source
  8. 8

    Axios - Industry Adjacent (US)

    Article Ina Fried

    Nvidia's new world model helps robots navigate the world - Nvidia unveiled Cosmos 3, an open AI world model designed to help robots, autonomous vehicles and other physical systems better understand and predict...

    www.axios.com/2026/06/01/nvidia-ai-push-cos… →
    Details
    Excerpt
    Nvidia's new world model helps robots navigate the world - Nvidia unveiled Cosmos 3, an open AI world model designed to help robots, autonomous vehicles and other physical systems better understand and predict...
    Context
    Nvidia's open world model (Cosmos 3) for physical AI/robotics is a major artifact that shifts the focus from pure software to physical-world AI infrastructure.
    Key points
    • Nvidia's open world model (Cosmos 3) for physical AI/robotics is a major artifact that shifts the focus from pure software to physical-world AI infrastructure.
    Provenance
    Article · Supporting source
  9. 9

    Techmeme - Industry Adjacent (US)

    Article

    Nvidia unveils Cosmos 3, an open physical AI foundation model, to help robots and autonomous cars better understand the real world with limited training data (Ina Fried/Axios) - Ina Fried / Axios : Nvidia unveils...

    www.techmeme.com/260601/p10 →
    Details
    Excerpt
    Nvidia unveils Cosmos 3, an open physical AI foundation model, to help robots and autonomous cars better understand the real world with limited training data (Ina Fried/Axios) - Ina Fried / Axios : Nvidia unveils...
    Context
    Nvidia releasing an open physical AI model (Cosmos 3) directly impacts physical-world AI, robotics, and autonomous systems, which is a core topic.
    Key points
    • Nvidia releasing an open physical AI model (Cosmos 3) directly impacts physical-world AI, robotics, and autonomous systems, which is a core topic.
    Provenance
    Article · Supporting source
  10. 10

    A 10 year old Xeon is all you need — 164 pts · 65 comments

    Article cafkafk

    https://point.free/blog/gemma-4-on-a-2016-xeon/ · @cafkafk: Hi HN. I wrote this post after getting frustrated by the lack of ways to run the new Gemma 4 Drafter models, and mainstream tools not prioritizing this, and…

    point.free/blog/gemma-4-on-a-2016-xeon →
    Details
    Excerpt
    https://point.free/blog/gemma-4-on-a-2016-xeon/ · @cafkafk: Hi HN. I wrote this post after getting frustrated by the lack of ways to run the new Gemma 4 Drafter models, and mainstream tools not prioritizing this, and…
    Context
    Directly discusses running a frontier model (Gemma 4) on old, low-power hardware (Xeon), addressing AI infrastructure and resource constraints.
    Key points
    • Directly discusses running a frontier model (Gemma 4) on old, low-power hardware (Xeon), addressing AI infrastructure and resource constraints.
    Provenance
    Article · Supporting source
  11. 11

    Techmeme - Industry Adjacent (US)

    Article

    Nvidia unveils DGX Station, a desktop Windows PC powered by its GB300 Grace Blackwell chip with up to 748 GB of memory, capable of running 1T-parameter models (Mike Wheatley/SiliconANGLE) - Mike Wheatley / SiliconANGLE.…

    www.techmeme.com/260601/p12 →
    Details
    Excerpt
    Nvidia unveils DGX Station, a desktop Windows PC powered by its GB300 Grace Blackwell chip with up to 748 GB of memory, capable of running 1T-parameter models (Mike Wheatley/SiliconANGLE) - Mike Wheatley / SiliconANGLE...
    Context
    Details a new, powerful, desktop AI compute artifact (DGX Station) using advanced chips (GB300), directly impacting local AI development and infrastructure.
    Key points
    • Details a new, powerful, desktop AI compute artifact (DGX Station) using advanced chips (GB300), directly impacting local AI development and infrastructure.
    Provenance
    Article · Supporting source
  12. 12

    Dune's Butlerian Jihad and the Future of AI — 16 pts · 19 comments

    Article SVI

    https://technology.inquirer.net/147084/dunes-butlerian-jihad-and-the-future-of-ai · @fxj: People talk about a Butlerian Jihad against AI as if you could just ban LLMs and be done. I bet some govermenst would like to do…

    technology.inquirer.net/147084/dunes-butler… →
    Details
    Excerpt
    https://technology.inquirer.net/147084/dunes-butlerian-jihad-and-the-future-of-ai · @fxj: People talk about a Butlerian Jihad against AI as if you could just ban LLMs and be done. I bet some govermenst would like to do…
    Context
    Directly discusses the core technical and geopolitical challenges of AI control, banning, and the underlying math/infrastructure.
    Key points
    • Directly discusses the core technical and geopolitical challenges of AI control, banning, and the underlying math/infrastructure.
    Provenance
    Article · Supporting source
  13. 13

    Techmeme - Industry Adjacent (US)

    Article

    Jensen Huang says Anthropic, OpenAI, and SpaceX are among the first big users for Nvidia's new Vera CPUs, which are 1.8x faster for AI workloads than x86 chips (Ian King/Bloomberg) - Ian King / Bloomberg : Jensen Huang.…

    www.techmeme.com/260601/p19 →
    Details
    Excerpt
    Jensen Huang says Anthropic, OpenAI, and SpaceX are among the first big users for Nvidia's new Vera CPUs, which are 1.8x faster for AI workloads than x86 chips (Ian King/Bloomberg) - Ian King / Bloomberg : Jensen Huang...
    Context
    Directly addresses AI infrastructure (GPUs/CPUs) and power dynamics by naming major AI labs (Anthropic, OpenAI) and their adoption of new, powerful hardware.
    Key points
    • Directly addresses AI infrastructure (GPUs/CPUs) and power dynamics by naming major AI labs (Anthropic, OpenAI) and their adoption of new, powerful hardware.
    Provenance
    Article · Supporting source
  14. 14

    Rest of World Latest - Media Culture (GLOBAL)

    Article Indranil Ghosh

    India’s AI deal with the UAE challenges U.S. cloud dominance - G42 will deploy U.S.-designed supercomputers in India, offering a new model for governments that want to own their AI hardware.

    restofworld.org/2026/india-uae-g42-cerebras… →
    Details
    Excerpt
    India’s AI deal with the UAE challenges U.S. cloud dominance - G42 will deploy U.S.-designed supercomputers in India, offering a new model for governments that want to own their AI hardware.
    Context
    Discusses AI hardware sovereignty and geopolitical power dynamics (India/UAE vs. US cloud dominance), directly relevant to the podcast's focus on power and control.
    Key points
    • Discusses AI hardware sovereignty and geopolitical power dynamics (India/UAE vs. US cloud dominance), directly relevant to the podcast's focus on power and control.
    Provenance
    Article · Supporting source
  15. 15

    r/OpenAI: Geoffrey Hinton (Nobel laureate and cognitive scientist) thinks AIs have become conscious - 0 pts · 0 comments

    Article EchoOfOppenheimer

    submitted by /u/EchoOfOppenheimer to r/OpenAI [link] [comments]

    v.redd.it/16akzxundn4h1 →
    Details
    Excerpt
    submitted by /u/EchoOfOppenheimer to r/OpenAI [link] [comments]
    Context
    Directly addresses the power dynamics and philosophical risks of AI consciousness, a core topic of control and intelligence building.
    Key points
    • Directly addresses the power dynamics and philosophical risks of AI consciousness, a core topic of control and intelligence building.
    Provenance
    Article · Supporting source
  16. 16

    Techmeme - Industry Adjacent (US)

    Article

    Chinese AI developer MiniMax launches M3, a new coding model that it says rivals Opus 4.7, costing $0.12 per 1M input tokens, compared with $5 for Opus 4.7 (Juro Osawa/The Information) - Juro Osawa / The Information :...

    www.techmeme.com/260601/p26 →
    Details
    Excerpt
    Chinese AI developer MiniMax launches M3, a new coding model that it says rivals Opus 4.7, costing $0.12 per 1M input tokens, compared with $5 for Opus 4.7 (Juro Osawa/The Information) - Juro Osawa / The Information :...
    Context
    Directly addresses model competition, coding capability, and cost/pricing dynamics, which are core to the podcast's focus on frontier models and power dynamics.
    Key points
    • Directly addresses model competition, coding capability, and cost/pricing dynamics, which are core to the podcast's focus on frontier models and power dynamics.
    Provenance
    Article · Supporting source
  17. 17

    The Guardian Technology - Industry Adjacent (UK)

    Article Julia Kollewe

    Nvidia launches ‘superchip’ putting AI power into laptops and PCs - Firm says its RTX Spark PC chip for Microsoft Windows will let AI agents replace the mouse and keyboard Business live – latest updates A new front has.…

    www.theguardian.com/technology/2026/jun/01/… →
    Details
    Excerpt
    Nvidia launches ‘superchip’ putting AI power into laptops and PCs - Firm says its RTX Spark PC chip for Microsoft Windows will let AI agents replace the mouse and keyboard Business live – latest updates A new front has...
    Context
    Directly addresses AI infrastructure (chips, GPUs) and the shifting craft of software engineering by integrating AI agents into local PCs.
    Key points
    • Directly addresses AI infrastructure (chips, GPUs) and the shifting craft of software engineering by integrating AI agents into local PCs.
    Provenance
    Article · Supporting source
  18. 18

    Techmeme - Industry Adjacent (US)

    Article

    Sources: Anthropic plans to let the EU's cyber agency ENISA join Project Glasswing, giving it access to Mythos; EU officials went to the US to ask for access (Gian Volpicelli/Bloomberg) - Gian Volpicelli / Bloomberg :...

    www.techmeme.com/260601/p27 →
    Details
    Excerpt
    Sources: Anthropic plans to let the EU's cyber agency ENISA join Project Glasswing, giving it access to Mythos; EU officials went to the US to ask for access (Gian Volpicelli/Bloomberg) - Gian Volpicelli / Bloomberg :...
    Context
    Directly addresses power dynamics and regulation (EU/ENISA access to Anthropic's model), fitting the core theme of who controls AI.
    Key points
    • Directly addresses power dynamics and regulation (EU/ENISA access to Anthropic's model), fitting the core theme of who controls AI.
    Provenance
    Article · Supporting source
  19. 19

    Techmeme - Industry Adjacent (US)

    Article

    Wirescreen analysis of 3,800 Chinese military procurement records finds 500+ instances since 2019 where the PLA sought Nvidia chips, including the A100 and A800 (New York Times) - New York Times : Wirescreen analysis...

    www.techmeme.com/260601/p28 →
    Details
    Excerpt
    Wirescreen analysis of 3,800 Chinese military procurement records finds 500+ instances since 2019 where the PLA sought Nvidia chips, including the A100 and A800 (New York Times) - New York Times : Wirescreen analysis...
    Context
    Directly addresses geopolitics, export controls, and the power dynamics of AI infrastructure (Nvidia chips) between nations.
    Key points
    • Directly addresses geopolitics, export controls, and the power dynamics of AI infrastructure (Nvidia chips) between nations.
    Provenance
    Article · Supporting source
  20. 20

    Techmeme - Industry Adjacent (US)

    Article

    French private equity firm Ardian partners with data center group Verne to build an up to €5B AI "gigafactory" outside Paris, targeting 500MW in total capacity (Financial Times) - Financial Times : French private...

    www.techmeme.com/260601/p30 →
    Details
    Excerpt
    French private equity firm Ardian partners with data center group Verne to build an up to €5B AI "gigafactory" outside Paris, targeting 500MW in total capacity (Financial Times) - Financial Times : French private...
    Context
    Reports major capital investment (€5B) in AI infrastructure (500MW data center), directly addressing the 'AI infrastructure' and 'power dynamics' topics.
    Key points
    • Reports major capital investment (€5B) in AI infrastructure (500MW data center), directly addressing the 'AI infrastructure' and 'power dynamics' topics.
    Provenance
    Article · Supporting source