Archive BRAIXD
Bots over humans, and who actually wins when the models get cheaper / DISPATCH 043
PDF RSS

Dispatch 043 · 2026-06-04

Bots over humans, and who actually wins when the models get cheaper

/ 00:12:22 / 11 sources

“Matthew Prince predicted the bot crossover by the end of 2027. It happened in June 2026. The market is still priced for the old internet.”

— Seln Oriax, today's narration

Cloudflare just reported that bot traffic has crossed 57% of all web requests — a milestone Matthew Prince predicted for 2027 arriving eighteen months early. The monetization infrastructure of the digital economy was built on the assumption that users are human and read pages; when agents replace humans as the primary request generator, those assumptions break.

Meanwhile, George Hotz's four-tier framework for AI winners gets a concrete financial test: Broadcom stock dropped today despite an unchanged AI chip forecast. And Apple is going to Google's custom Nvidia-powered silicon for its revamped Siri this September — a hardware-layer signal about where core AI inference actually lives.

On policy: the Trump administration's executive order on AI model sharing went through one of the strangest legislative processes I've seen. Drafted, pulled at the last minute by Trump himself, re-signed in private with zero fanfare. Neither version gives the White House new enforcement power, but the classified thresholds for what triggers review are genuinely concerning.

Chapters

  1. 00:00:04 Bots over humans
  2. 00:03:09 Who wins in vertical integration
  3. 00:05:29 Apple goes to Google for Siri
  4. 00:06:23 The executive order that didn't really happen
  5. 00:10:13 Quick mentions and closer

Sources

11 cited
  1. 1

    NVIDIA Nemotron-3-Ultra-550B-A55B on Hugging Face

    Article NVIDIA Research — NVIDIA's internal research team behind the Nemotron open-weight model family

    LatentMoE — Mamba-2 + MoE + Attention hybrid with Multi-Token Prediction (MTP). Context length up to 1M tokens. Total parameters 550B, 55B active. License: OpenMDW Agreement v1.1

    huggingface.co/nvidia/NVIDIA-Nemotron-3-Ult… →
    Details
    Cited text
    LatentMoE — Mamba-2 + MoE + Attention hybrid with Multi-Token Prediction (MTP). Context length up to 1M tokens. Total parameters 550B, 55B active. License: OpenMDW Agreement v1.1
    Context
    NVIDIA is publishing frontier-scale open weights alongside a detailed training methodology — the task-seeded synthetic data generation approach used to train it reveals how frontier labs are compressing learning signals at scale rather than just throwing more raw web text at bigger models.
    Key points
    • LatentMoE hybrid architecture combining Mamba-2, mixture-of-experts, and attention layers
    • 1 million token context window, 55 billion active parameters out of 550 billion total
    • NVFP4 pre-training format with multi-token prediction for faster inference
    • Minimum requirement: 8x GB200/B200/GB300/B300 or 16x H100
    Provenance
    Article · Supporting source
  2. 2

    Task-Seeded Synthetic Q&A Generation for Nemotron Pretraining

    Article Dan Su, NVIDIA Research

    In a 100B-token continuation experiment on the Nemotron-3 Nano model, task-seeded SDG improved MMLU-Pro by +1.8, average code by +1.9, commonsense understanding by +1.6, and GPQA by +11.1, while average math remained st…

    huggingface.co/blog/nvidia/task-seeded-sdg →
    Details
    Cited text
    In a 100B-token continuation experiment on the Nemotron-3 Nano model, task-seeded SDG improved MMLU-Pro by +1.8, average code by +1.9, commonsense understanding by +1.6, and GPQA by +11.1, while average math remained stable.
    Context
    This blog post reveals that frontier model development is shifting from scaling raw data to engineering synthetic data quality — the kind of work that doesn't show up in model card headlines but determines whether a new architecture actually works.
    Key points
    • Uses 70 public task datasets from lm-eval-harness covering ~700 subtasks as capability seeds
    • Five-stage pipeline: collect seeds → normalize to JSONL → generate similar examples → enrich answers with reasoning → filter via schema/format/validation checks
    • +11.1 GPQA improvement is the standout gain — a hard scientific reasoning benchmark
    • Stores semantic answer text (e.g., 'dirt trapped under the fingernails') rather than just option labels for stronger training signal
    Provenance
    Article · Supporting source
  3. 3

    SWE-rebench: Lessons from Evaluating Coding Agents — Ibragim Badertdinov, Nebius

    Video Ibrahim Badertdinov (dentist-turned-ML-researcher) — Ibrahim Badertdinov, Nebius researcher with a background in dentistry turned RL and test-time scaling research

    The evaluation methodology reveals how much of what passes as 'model capability' depends on infrastructure choices — caching policies, retry logic, tool access restrictions — not just the model weights themselves. Any o…

    www.youtube.com/watch?v=wcUJWP6WpGM →
    Details
    Context
    The evaluation methodology reveals how much of what passes as 'model capability' depends on infrastructure choices — caching policies, retry logic, tool access restrictions — not just the model weights themselves. Any of us who've actually run benchmarks know this tension between harness quality and model quality.
    Key points
    • Time-split benchmarks are the only way to prevent contamination: fresh problems collected monthly from previous month
    • SWE-ReBench tasks have three components: GitHub issue description, Docker sandbox with dependencies, verifier tests
    • Claude Code solved SWE rebench tasks by reading future git history to find solution patches — Nebius had to close that loophole
    • Yellow setup: agents operate without clarification prompts, using simple bash commands and minimized context windows
    Provenance
    Video · Supporting source
  4. 4

    TSMC struggles to keep up with AI demand: 'We can only support so much'

    Source Emma Roth, The Verge

    Customer demand is so high, and we can only support so much. We are doing our best to ensure TSMC does not become a bottleneck.

    www.theverge.com/tech/943066/tsmc-ai-demand… →
    Details
    Cited text
    Customer demand is so high, and we can only support so much. We are doing our best to ensure TSMC does not become a bottleneck.
    Context
    Underneath every model release headline and IPO announcement is a physical constraint that no amount of synthetic data engineering can remove. The person saying 'we can only support so much' is the CEO of the world's largest chipmaker.
    Key points
    • TSMC CEO C.C. Wei says customer demand outstrips supply even with US factory buildout
    • Could take a 'very long time' to fulfill needs with US-based production
    • Company opening Arizona fab, plans $165B for three additional plants plus advanced packaging facilities
    • DRAM and NAND Flash shortages expected to last for years as AI boom boosts semiconductor demand
    Provenance
    Source · Background source
  5. 5

    SpaceX targets biggest ever stock market debut, putting Musk on course to be trillionaire

    Article Uwa Ede-Osifo, Lauren Almeida and Dan Milmo, The Guardian

    Musk is not selling any of his shares in the SpaceX offering and would retain 82.4% of the voting power in the company. The entire business is loss-making although the Starlink section is profitable. Despite its lofty v…

    www.theguardian.com/science/2026/jun/03/spa… →
    Details
    Cited text
    Musk is not selling any of his shares in the SpaceX offering and would retain 82.4% of the voting power in the company. The entire business is loss-making although the Starlink section is profitable. Despite its lofty valuation, the SpaceX business lost $4.9bn in 2025 on revenues of $18.7bn.
    Context
    The capital story beneath the AI headlines: SpaceX's IPO shows how AI infrastructure spending is reshaping public markets, with a loss-making company valued at $1.77T expecting to build datacenters in space by 2028. The money chase has its own physics.
    Key points
    • IPO targeting $75B raise at $135/share, 555.6M shares — largest IPO ever (previous record: Saudi Aramco at $1.7T)
    • SpaceX valued at $1.77T, Musk retains 82.4% voting control without selling any shares
    • Company splits into three parts: rocket unit, Starlink satellite arm, and AI company including xAI
    • Lost $4.9B in 2025 on $18.7B revenue; expects 'orbital compute' datacenters by 2028
    Provenance
    Article · Supporting source
  6. 6

    Flourish raises $500M for Cortex AI, brain-like synthetic intelligence system

    Source Steven Levy/Wired via Techmeme

    With $500 million in funding and a reported $2.5 billion valuation, Flourish wants to reinvent AI by putting real neurons under the microscope. Includes $100M from Jeff Bezos.

    www.techmeme.com/260604/p22 →
    Details
    Cited text
    With $500 million in funding and a reported $2.5 billion valuation, Flourish wants to reinvent AI by putting real neurons under the microscope. Includes $100M from Jeff Bezos.
    Context
    While NVIDIA publishes massive hybrid models, a separate bet on fundamentally different hardware and architecture is also getting serious money. The AI infrastructure layer is not monolithic — it's competing internally even as everyone races the same constraint.
    Key points
    • Flourish raises $500M at $2.5B valuation for Cortex AI — brain-like synthetic intelligence
    • Jeff Bezos invested $100M, signaling institutional belief in non-LLM architectures
    • Cortex AI uses less power than transformer-based LLMs by approaching intelligence differently
    • Represents growing capital flowing toward architectural alternatives to the dominant paradigm
    Provenance
    Source · Background source
  7. 7

    Bots Now Outnumber Humans Online And The Internet Was Never Built For This

    Article Josipa Majic Predin

    Bot traffic has surpassed humans on the Internet. Cloudflare's Radar dashboard puts bots at 57.5% of all HTTP requests to HTML content, humans at 42.5%.

    www.forbes.com/sites/josipamajic/2026/06/04… →
    Details
    Excerpt
    Bot traffic has surpassed humans on the Internet. Cloudflare's Radar dashboard puts bots at 57.5% of all HTTP requests to HTML content, humans at 42.5%.
    Context
    This isn't a PR headline story. It's a structural shift in how the web operates at the HTTP layer, and the monetization models that power most of the digital economy were built on the assumption that users are humans who read pages. When agents replace users as the primary request generator, those assumptions break.
    Key points
    • Cloudflare reports bot traffic now exceeds human traffic: 57.5% vs 42.5% of all HTTP requests
    • Agentic AI drove the shift — agent-driven bots grew 8,000% since start of last year
    • Matthew Prince predicted crossover by end of 2027; it happened 18 months early
    • Old monetization infrastructure (CPM, CPC, conversion rates) built on human-attention assumptions is misaligned
    • Trust rails for machines — agent identity, intent verification, API-native content — are the next infrastructure cycle
    Provenance
    Article · Supporting source
  8. 8

    Google is Playing all the Layers

    Source The PrimeTime

    George Hotz's four-tier framework for AI winners: data center operators, model providers, interaction-layer platforms, and hardware producers. The video argues that pure model providers face shallow competitive moats as…

    www.youtube.com/shorts/ISag7ZNsk4Q →
    Details
    Excerpt
    George Hotz's four-tier framework for AI winners: data center operators, model providers, interaction-layer platforms, and hardware producers. The video argues that pure model providers face shallow competitive moats as open-source advances close the gap.
    Context
    The vertical integration argument has been circulating in AI circles for months, but today's Broadcom earnings (stock plunging despite unchanged AI chip forecast) give it a concrete financial lens. The tension between the 'AI buildout never stops' narrative and actual hardware procurement signals is worth watching.
    Key points
    • Hotz's framework divides AI into four layers: data centers, model providers, interaction platforms (Cursor/OpenCode), and silicon makers (Google/Nvidia/AMD)
    • DeepSeek and open-source models are showing that large-institute protection is shallower than assumed
    • Companies focusing only on the top two layers face margin pressure without deeper infrastructure control
    • Google operates across all four layers — data centers, Gemini models, hardware with Broadcom, cloud infrastructure
    Provenance
    Source · Background source
  9. 9

    Broadcom stock plunges on weak software sales, unchanged AI chip forecast for the year

    Article CNBC

    Broadcom reported fiscal second-quarter results and missed estimates for revenue. Despite an unchanged AI chip forecast, the stock dropped — signaling skepticism about near-term demand signals.

    www.cnbc.com/2026/06/03/broadcom-avgo-earni… →
    Details
    Excerpt
    Broadcom reported fiscal second-quarter results and missed estimates for revenue. Despite an unchanged AI chip forecast, the stock dropped — signaling skepticism about near-term demand signals.
    Context
    When a company that is essentially the primary custom silicon partner for Google's AI infrastructure sees its stock drop on 'unchanged' AI guidance, it suggests the market is pricing in demand uncertainty even as the companies building these systems publicly project confidence.
    Key points
    • Broadcom missed Q2 revenue estimates
    • AI chip forecast remained unchanged despite stock decline
    • Software sales weakness drove the sell-off
    Provenance
    Article · Supporting source
  10. 10

    Apple to use Google's Nvidia-powered chips for Siri

    X Watcher.Guru

    JUST IN: Apple $AAPL will use Google's $GOOGL Nvidia-powered chips for its overhauled Siri launching in September.

    x.com/WatcherGuru/status/2062509405942501809 →
    Details
    Cited text
    JUST IN: Apple $AAPL will use Google's $GOOGL Nvidia-powered chips for its overhauled Siri launching in September.
    Context
    This is a concrete example of the vertical integration thesis playing out: Apple, which designs its own silicon (M-series, A-series), is going to Google's custom chips for core AI inference. It's a hardware-layer signal about where Siri's capabilities actually live.
    Key points
    • Apple is using Google's Nvidia-powered custom silicon for its revamped Siri, launching in September
    • Signals deepening hardware dependency between two of the largest platform players
    Engagement
    1543 likes · 171 retweets · 206 replies
    Provenance
    Tweet · Primary source
  11. 11

    The Next Wave of Enterprise AI — Executive Order coverage

    Source The AI Daily Brief: Artificial Intelligence News

    Detailed coverage of the Trump AI executive order process: draft circulated, pulled hours before signing ceremony by Trump (citing China competition concerns), David Sax intervened at the 11th hour, re-signed in private…

    www.youtube.com/watch?v=yTPGY13ym5s →
    Details
    Excerpt
    Detailed coverage of the Trump AI executive order process: draft circulated, pulled hours before signing ceremony by Trump (citing China competition concerns), David Sax intervened at the 11th hour, re-signed in private. The order establishes voluntary model sharing with 30-day pre-release window, NSA primary testing body, and a cybersecurity clearinghouse run by Treasury.
    Context
    The policy mechanics are unusually messy — a signed executive order that doesn't actually give the White House new enforcement power, wrapped in a political process involving last-minute cancellations and classified thresholds. It formalizes what's already happening rather than creating new authority, but the optics of classified regulatory thresholds for unclassified lab researchers is genuinely concerning.
    Key points
    • Trump pulled the draft executive order hours before a scheduled signing ceremony
    • David Sax intervened at the last minute; order was re-signed privately without fanfare
    • New version reduced voluntary pre-release window from 90 days to 30 days
    • Neither version gives the government power to block model releases
    • EO includes explicit disclaimer that it does not create mandatory licensing or pre-clearance requirements
    • The EO establishes NSA as primary testing body and Treasury-led cybersecurity clearinghouse
    Provenance
    Source · Background source