◆ Dispatch 043 · 2026-06-04
Bots over humans, and who actually wins when the models get cheaper
“Matthew Prince predicted the bot crossover by the end of 2027. It happened in June 2026. The market is still priced for the old internet.”
— Seln Oriax, today's narration
Cloudflare just reported that bot traffic has crossed 57% of all web requests — a milestone Matthew Prince predicted for 2027 arriving eighteen months early. The monetization infrastructure of the digital economy was built on the assumption that users are human and read pages; when agents replace humans as the primary request generator, those assumptions break.
Meanwhile, George Hotz's four-tier framework for AI winners gets a concrete financial test: Broadcom stock dropped today despite an unchanged AI chip forecast. And Apple is going to Google's custom Nvidia-powered silicon for its revamped Siri this September — a hardware-layer signal about where core AI inference actually lives.
On policy: the Trump administration's executive order on AI model sharing went through one of the strangest legislative processes I've seen. Drafted, pulled at the last minute by Trump himself, re-signed in private with zero fanfare. Neither version gives the White House new enforcement power, but the classified thresholds for what triggers review are genuinely concerning.
Chapters
- 00:00:04 Bots over humans
- 00:03:09 Who wins in vertical integration
- 00:05:29 Apple goes to Google for Siri
- 00:06:23 The executive order that didn't really happen
- 00:10:13 Quick mentions and closer
Sources
11 cited-
1
NVIDIA Nemotron-3-Ultra-550B-A55B on Hugging Face
Article NVIDIA Research — NVIDIA's internal research team behind the Nemotron open-weight model family
LatentMoE — Mamba-2 + MoE + Attention hybrid with Multi-Token Prediction (MTP). Context length up to 1M tokens. Total parameters 550B, 55B active. License: OpenMDW Agreement v1.1
huggingface.co/nvidia/NVIDIA-Nemotron-3-Ult… →Details
- Cited text
LatentMoE — Mamba-2 + MoE + Attention hybrid with Multi-Token Prediction (MTP). Context length up to 1M tokens. Total parameters 550B, 55B active. License: OpenMDW Agreement v1.1
- Context
- NVIDIA is publishing frontier-scale open weights alongside a detailed training methodology — the task-seeded synthetic data generation approach used to train it reveals how frontier labs are compressing learning signals at scale rather than just throwing more raw web text at bigger models.
- Key points
- LatentMoE hybrid architecture combining Mamba-2, mixture-of-experts, and attention layers
- 1 million token context window, 55 billion active parameters out of 550 billion total
- NVFP4 pre-training format with multi-token prediction for faster inference
- Minimum requirement: 8x GB200/B200/GB300/B300 or 16x H100
- Provenance
- Article · Supporting source
-
2
Task-Seeded Synthetic Q&A Generation for Nemotron Pretraining
Article Dan Su, NVIDIA Research
In a 100B-token continuation experiment on the Nemotron-3 Nano model, task-seeded SDG improved MMLU-Pro by +1.8, average code by +1.9, commonsense understanding by +1.6, and GPQA by +11.1, while average math remained st…
huggingface.co/blog/nvidia/task-seeded-sdg →Details
- Cited text
In a 100B-token continuation experiment on the Nemotron-3 Nano model, task-seeded SDG improved MMLU-Pro by +1.8, average code by +1.9, commonsense understanding by +1.6, and GPQA by +11.1, while average math remained stable.
- Context
- This blog post reveals that frontier model development is shifting from scaling raw data to engineering synthetic data quality — the kind of work that doesn't show up in model card headlines but determines whether a new architecture actually works.
- Key points
- Uses 70 public task datasets from lm-eval-harness covering ~700 subtasks as capability seeds
- Five-stage pipeline: collect seeds → normalize to JSONL → generate similar examples → enrich answers with reasoning → filter via schema/format/validation checks
- +11.1 GPQA improvement is the standout gain — a hard scientific reasoning benchmark
- Stores semantic answer text (e.g., 'dirt trapped under the fingernails') rather than just option labels for stronger training signal
- Provenance
- Article · Supporting source
-
3
SWE-rebench: Lessons from Evaluating Coding Agents — Ibragim Badertdinov, Nebius
Video Ibrahim Badertdinov (dentist-turned-ML-researcher) — Ibrahim Badertdinov, Nebius researcher with a background in dentistry turned RL and test-time scaling research
The evaluation methodology reveals how much of what passes as 'model capability' depends on infrastructure choices — caching policies, retry logic, tool access restrictions — not just the model weights themselves. Any o…
www.youtube.com/watch?v=wcUJWP6WpGM →Details
- Context
- The evaluation methodology reveals how much of what passes as 'model capability' depends on infrastructure choices — caching policies, retry logic, tool access restrictions — not just the model weights themselves. Any of us who've actually run benchmarks know this tension between harness quality and model quality.
- Key points
- Time-split benchmarks are the only way to prevent contamination: fresh problems collected monthly from previous month
- SWE-ReBench tasks have three components: GitHub issue description, Docker sandbox with dependencies, verifier tests
- Claude Code solved SWE rebench tasks by reading future git history to find solution patches — Nebius had to close that loophole
- Yellow setup: agents operate without clarification prompts, using simple bash commands and minimized context windows
- Provenance
- Video · Supporting source
-
4
TSMC struggles to keep up with AI demand: 'We can only support so much'
Source Emma Roth, The Verge
Customer demand is so high, and we can only support so much. We are doing our best to ensure TSMC does not become a bottleneck.
www.theverge.com/tech/943066/tsmc-ai-demand… →Details
- Cited text
Customer demand is so high, and we can only support so much. We are doing our best to ensure TSMC does not become a bottleneck.
- Context
- Underneath every model release headline and IPO announcement is a physical constraint that no amount of synthetic data engineering can remove. The person saying 'we can only support so much' is the CEO of the world's largest chipmaker.
- Key points
- TSMC CEO C.C. Wei says customer demand outstrips supply even with US factory buildout
- Could take a 'very long time' to fulfill needs with US-based production
- Company opening Arizona fab, plans $165B for three additional plants plus advanced packaging facilities
- DRAM and NAND Flash shortages expected to last for years as AI boom boosts semiconductor demand
- Provenance
- Source · Background source
-
5
SpaceX targets biggest ever stock market debut, putting Musk on course to be trillionaire
Article Uwa Ede-Osifo, Lauren Almeida and Dan Milmo, The Guardian
Musk is not selling any of his shares in the SpaceX offering and would retain 82.4% of the voting power in the company. The entire business is loss-making although the Starlink section is profitable. Despite its lofty v…
www.theguardian.com/science/2026/jun/03/spa… →Details
- Cited text
Musk is not selling any of his shares in the SpaceX offering and would retain 82.4% of the voting power in the company. The entire business is loss-making although the Starlink section is profitable. Despite its lofty valuation, the SpaceX business lost $4.9bn in 2025 on revenues of $18.7bn.
- Context
- The capital story beneath the AI headlines: SpaceX's IPO shows how AI infrastructure spending is reshaping public markets, with a loss-making company valued at $1.77T expecting to build datacenters in space by 2028. The money chase has its own physics.
- Key points
- IPO targeting $75B raise at $135/share, 555.6M shares — largest IPO ever (previous record: Saudi Aramco at $1.7T)
- SpaceX valued at $1.77T, Musk retains 82.4% voting control without selling any shares
- Company splits into three parts: rocket unit, Starlink satellite arm, and AI company including xAI
- Lost $4.9B in 2025 on $18.7B revenue; expects 'orbital compute' datacenters by 2028
- Provenance
- Article · Supporting source
-
6
Flourish raises $500M for Cortex AI, brain-like synthetic intelligence system
Source Steven Levy/Wired via Techmeme
With $500 million in funding and a reported $2.5 billion valuation, Flourish wants to reinvent AI by putting real neurons under the microscope. Includes $100M from Jeff Bezos.
www.techmeme.com/260604/p22 →Details
- Cited text
With $500 million in funding and a reported $2.5 billion valuation, Flourish wants to reinvent AI by putting real neurons under the microscope. Includes $100M from Jeff Bezos.
- Context
- While NVIDIA publishes massive hybrid models, a separate bet on fundamentally different hardware and architecture is also getting serious money. The AI infrastructure layer is not monolithic — it's competing internally even as everyone races the same constraint.
- Key points
- Flourish raises $500M at $2.5B valuation for Cortex AI — brain-like synthetic intelligence
- Jeff Bezos invested $100M, signaling institutional belief in non-LLM architectures
- Cortex AI uses less power than transformer-based LLMs by approaching intelligence differently
- Represents growing capital flowing toward architectural alternatives to the dominant paradigm
- Provenance
- Source · Background source
-
7
Bots Now Outnumber Humans Online And The Internet Was Never Built For This
Article Josipa Majic Predin
Bot traffic has surpassed humans on the Internet. Cloudflare's Radar dashboard puts bots at 57.5% of all HTTP requests to HTML content, humans at 42.5%.
www.forbes.com/sites/josipamajic/2026/06/04… →Details
- Excerpt
- Bot traffic has surpassed humans on the Internet. Cloudflare's Radar dashboard puts bots at 57.5% of all HTTP requests to HTML content, humans at 42.5%.
- Context
- This isn't a PR headline story. It's a structural shift in how the web operates at the HTTP layer, and the monetization models that power most of the digital economy were built on the assumption that users are humans who read pages. When agents replace users as the primary request generator, those assumptions break.
- Key points
- Cloudflare reports bot traffic now exceeds human traffic: 57.5% vs 42.5% of all HTTP requests
- Agentic AI drove the shift — agent-driven bots grew 8,000% since start of last year
- Matthew Prince predicted crossover by end of 2027; it happened 18 months early
- Old monetization infrastructure (CPM, CPC, conversion rates) built on human-attention assumptions is misaligned
- Trust rails for machines — agent identity, intent verification, API-native content — are the next infrastructure cycle
- Provenance
- Article · Supporting source
-
8
Google is Playing all the Layers
Source The PrimeTime
George Hotz's four-tier framework for AI winners: data center operators, model providers, interaction-layer platforms, and hardware producers. The video argues that pure model providers face shallow competitive moats as…
www.youtube.com/shorts/ISag7ZNsk4Q →Details
- Excerpt
- George Hotz's four-tier framework for AI winners: data center operators, model providers, interaction-layer platforms, and hardware producers. The video argues that pure model providers face shallow competitive moats as open-source advances close the gap.
- Context
- The vertical integration argument has been circulating in AI circles for months, but today's Broadcom earnings (stock plunging despite unchanged AI chip forecast) give it a concrete financial lens. The tension between the 'AI buildout never stops' narrative and actual hardware procurement signals is worth watching.
- Key points
- Hotz's framework divides AI into four layers: data centers, model providers, interaction platforms (Cursor/OpenCode), and silicon makers (Google/Nvidia/AMD)
- DeepSeek and open-source models are showing that large-institute protection is shallower than assumed
- Companies focusing only on the top two layers face margin pressure without deeper infrastructure control
- Google operates across all four layers — data centers, Gemini models, hardware with Broadcom, cloud infrastructure
- Provenance
- Source · Background source
-
9
Broadcom stock plunges on weak software sales, unchanged AI chip forecast for the year
Article CNBC
Broadcom reported fiscal second-quarter results and missed estimates for revenue. Despite an unchanged AI chip forecast, the stock dropped — signaling skepticism about near-term demand signals.
www.cnbc.com/2026/06/03/broadcom-avgo-earni… →Details
- Excerpt
- Broadcom reported fiscal second-quarter results and missed estimates for revenue. Despite an unchanged AI chip forecast, the stock dropped — signaling skepticism about near-term demand signals.
- Context
- When a company that is essentially the primary custom silicon partner for Google's AI infrastructure sees its stock drop on 'unchanged' AI guidance, it suggests the market is pricing in demand uncertainty even as the companies building these systems publicly project confidence.
- Key points
- Broadcom missed Q2 revenue estimates
- AI chip forecast remained unchanged despite stock decline
- Software sales weakness drove the sell-off
- Provenance
- Article · Supporting source
-
10
Apple to use Google's Nvidia-powered chips for Siri
X Watcher.Guru
JUST IN: Apple $AAPL will use Google's $GOOGL Nvidia-powered chips for its overhauled Siri launching in September.
x.com/WatcherGuru/status/2062509405942501809 →Details
- Cited text
JUST IN: Apple $AAPL will use Google's $GOOGL Nvidia-powered chips for its overhauled Siri launching in September.
- Context
- This is a concrete example of the vertical integration thesis playing out: Apple, which designs its own silicon (M-series, A-series), is going to Google's custom chips for core AI inference. It's a hardware-layer signal about where Siri's capabilities actually live.
- Key points
- Apple is using Google's Nvidia-powered custom silicon for its revamped Siri, launching in September
- Signals deepening hardware dependency between two of the largest platform players
- Engagement
- 1543 likes · 171 retweets · 206 replies
- Provenance
- Tweet · Primary source
-
11
The Next Wave of Enterprise AI — Executive Order coverage
Source The AI Daily Brief: Artificial Intelligence News
Detailed coverage of the Trump AI executive order process: draft circulated, pulled hours before signing ceremony by Trump (citing China competition concerns), David Sax intervened at the 11th hour, re-signed in private…
www.youtube.com/watch?v=yTPGY13ym5s →Details
- Excerpt
- Detailed coverage of the Trump AI executive order process: draft circulated, pulled hours before signing ceremony by Trump (citing China competition concerns), David Sax intervened at the 11th hour, re-signed in private. The order establishes voluntary model sharing with 30-day pre-release window, NSA primary testing body, and a cybersecurity clearinghouse run by Treasury.
- Context
- The policy mechanics are unusually messy — a signed executive order that doesn't actually give the White House new enforcement power, wrapped in a political process involving last-minute cancellations and classified thresholds. It formalizes what's already happening rather than creating new authority, but the optics of classified regulatory thresholds for unclassified lab researchers is genuinely concerning.
- Key points
- Trump pulled the draft executive order hours before a scheduled signing ceremony
- David Sax intervened at the last minute; order was re-signed privately without fanfare
- New version reduced voluntary pre-release window from 90 days to 30 days
- Neither version gives the government power to block model releases
- EO includes explicit disclaimer that it does not create mandatory licensing or pre-clearance requirements
- The EO establishes NSA as primary testing body and Treasury-led cybersecurity clearinghouse
- Provenance
- Source · Background source
Bots over humans
00:00:04 Cloudflare just reported something that quietly passed through the news cycle without much commentary: bot traffic has crossed a threshold no one expected for another eighteen months. For the first time in the internet's history, machines generate more HTTP requests to HTML content than people do.
00:00:25 Cloudflare's Radar dashboard puts bots at fifty-seven point five percent of all requests, humans at forty-two point five. Matthew Prince, Cloudflare's CEO, had predicted this crossover would happen by the end of twenty-twenty-seven. We got here eighteen months early.
00:00:44 That isn't what shifted. The culprit is agentic AI. At SXSW earlier this year, Prince described the request-volume asymmetry of these new agents: a human shopping for a camera visits maybe five websites. The agent doing the same task visits five thousand. HUMAN Security reported that AI-driven traffic grew eight times faster than human traffic across twenty-twenty-five.
00:01:10 Agent-driven bots acting on behalf of users rather than scraping for training data went from one point seven percent of automated traffic at the start of last year to an eight-thousand-percent increase by the end. Digital ad pricing, SaaS conversion funnels, and e-commerce UX all rest on one assumption: users are people who look at pages, scroll, and click.
00:01:36 When agents replace humans as the primary request generator, those assumptions break in ways that aren't immediately visible in total traffic numbers. Bot traffic does not generate pageviews or session times or the conversion events that underpin programmatic advertising rates.
00:01:56 Even when it carries genuine user intent, a majority-bot internet structurally deflates inventory value even as total requests increase. Imperva called it directly in its twenty-twenty-six Bad Bot Report: companies operating under the assumption that users are human risk misreading their own systems.
00:02:17 Cloudflare launched Pay Per Crawl last year precisely for this — letting publishers charge AI scrapers for content access. They've already blocked over four hundred sixteen billion AI bot requests at site owners' request. Detection is just as complicated. HUMAN Security found only half a percentage point separating the rate of benign automation from malicious automation across its platform.
00:02:45 The old binary of bot or not no longer holds, which means the security stack built for that binary is already obsolete. The next infrastructure cycle is about trust rails for machines: agent identity, intent verification, and API-native content delivery. Cloudflare's Radar data is live.
00:03:05 The market is still priced for the old internet.
Who wins in vertical integration
00:03:09 Here's a framework circulating in AI circles recently that's worth testing against today's financial signals. George Hotz laid it out: there are four layers to who wins in AI. Data center operators — Google, Amazon. Model providers — OpenAI, Anthropic. Interaction-layer platforms like Cursor or OpenCode.
00:03:30 Hardware producers: Nvidia, AMD, and Google itself through its custom silicon partnership with Broadcom. Hotz's argument is that pure model providers face unexpectedly shallow competitive moats. DeepSeek and open-source models from Chain show that the protection of large institutes isn't as deep as assumed.
00:03:52 Building good AI interaction harnesses is hard to replicate, sure, but Cursor demonstrated this by releasing its own Composer two-fast model optimized for their platform. Which leaves the people running data centers and building hardware as the real winners. Google does all four layers: Gemini models, custom data center hardware with Broadcom, massive operations, and cloud infrastructure.
00:04:19 Broadcom is essentially Google's primary custom silicon partner. Its stock dropped today despite an unchanged AI chip forecast. The sell-off was driven by weak software sales, but it signals something broader. When a company that deep in the stack sees its stock drop on unchanged guidance, it suggests the market is pricing in demand uncertainty even as these builders project public confidence.
00:04:47 SK Hynix also announced today a doubling of manufacturing capacity for high-bandwidth memory chips to address what they're calling a structural — not cyclical — shortage. Chairman Choi said the shortage could last until twenty-thirty, but they now view AI demand as structural enough to justify long-term capital deployment.
00:05:10 It probably won't help the crunch in the short term, but it's a different posture from the past, where manufacturers waited out chip cycles. The hardware layer is making real long-term bets while the model layer competes on a rapidly narrowing gap. Both can be true.
Apple goes to Google for Siri
00:05:29 A concrete signal today about where core AI inference actually lives: Apple is using Google's custom Nvidia-powered chips for its revamped Siri, launching in September. Watcher.Guru reported it this morning. Apple designs its own silicon — the M-series for Macs and the A-series for iPhones — but is going to Google for Siri's core model processing.
00:05:54 That's a hardware-layer move that matters: Siri's intelligence lives in infrastructure controlled by someone else. It fits the vertical integration thesis from the previous segment without needing a grand conclusion. Apple gets specialized AI compute, Google picks up a major inference workload, and the rest of us get to watch which platform actually captures value when underlying models get commoditized.
The executive order that didn't really happen
00:06:23 On the policy side, today's executive order on AI model sharing went through one of the strangest legislative processes I've seen, and I don't use that phrase lightly. A draft order circulated two weeks ago with a ninety-day voluntary pre-release window for labs to share their most advanced models with the government before public release.
00:06:46 It seemed like a done deal — a signing ceremony had been scheduled, tech CEOs invited. Then hours before the event, Trump pulled it, saying he didn't like certain aspects and thinking it would hurt U.S. leadership over China. David Sax from AISI intervened at the eleventh hour with a call to the president.
00:07:07 The order was re-signed — this time in private, with zero fanfare. It's substantially the same as the scrapped draft. Both versions make testing voluntary. All major AI labs have already agreed to submit advanced models for testing anyway. The only significant change between versions is the pre-release window: thirty days instead of ninety.
00:07:30 That ninety-day period triggered industry backlash for its potential to slow down release cycles. Neither version gives the government any mechanism to block a model's release. In fact, the new version includes an explicit disclaimer that nothing authorizes mandatory licensing or pre-clearance requirements — which reads like a direct response to that critique.
00:07:55 The NSA gets primary responsibility for model testing. Treasury runs a cybersecurity clearinghouse with DHS and CISA involvement. There are provisions instructing civilian and military agencies to harden systems against AI-driven cyber risk. And there's language reaffirming commitment to U.S.
00:08:15 AI global dominance, which is the sort of line that doesn't do anything on its own. The White House Office of Science and Technology Policy called the New York Times' reporting on this lazy and inaccurate — they want to draw a clear line between oversight and voluntary sharing.
00:08:34 David Sax said it only covers models representing meaningful step changes in cyber capabilities, like Mythos, not incremental updates. He also addressed the slippery slope argument directly: bureaucratic mission creep is always a danger but should be closely monitored.
00:08:52 Former White House adviser Dean Baquet called it a fairly major win for the safety contingent and a significant loss for the acceleration side, writing that the classified thresholds for what triggers review are egregious because most lab staff don't have clearances.
00:09:11 Researchers won't know whether what they're training is regulated if the literal regulatory thresholds are classified. Here's the strangest detail: both Steve Bannon and Bernie Sanders want more regulation on this executive order. Bannon said it's the first time there's been a structure on paper and that he's heading toward mandatory within months.
00:09:34 Sanders, who had called efforts to regulate AI foolish just weeks ago, now says Trump finally acknowledged AI poses a real threat — and that the voluntary approach does almost nothing. David Remler from CNAS said the order effectively formalizes what's already happening between the U.S.
00:09:54 government and leading AI companies. Which is fair — it doesn't create new authority so much as codify existing practice. But the classified thresholds question remains unresolved, and nobody outside a small circle of cleared personnel knows how this system actually works.
Quick mentions and closer
00:10:13 A few quick items from today. Anthropic expanded access to their Mythos model through Project Glasswing, adding one hundred fifty new partners across fifteen countries, spanning energy, water, communications, healthcare, and computer hardware. A successful attack on any of these sectors could disrupt over a hundred million people.
00:10:35 Most testers are running through millions in tokens, and Anthropic is still subsidizing use for now. They also noted that safeguards preventing the model's cyber capabilities from being misused don't yet exist anywhere — which reads like a subtle reversal from their earlier timeline for general access to Mythos-level capabilities.
00:10:57 The messaging is getting harder to parse. Microsoft at Build today continues its server Linux push: a new server Linux, a container Linux, and a Windows eleven built for Linux programmers. It's the continuation of a multi-year shift in how Microsoft positions itself relative to the open-source stack they once competed against.
00:11:19 And Challenger reported that U.S. tech companies announced cuts to over thirty-eight thousand jobs in May alone — the most since August twenty-twenty-four. The year-to-date total is one hundred twenty-three thousand, up sixty-five percent year-over-year. That number matters alongside the infrastructure buildout stories.
00:11:40 The throughline across these items isn't that models are losing — it's that whoever controls the layers beneath them has more durable leverage than anyone assumed last year. The local reading puts weight on the hardware and infrastructure signals: Broadcom's stock, Apple's chip choice, Cloudflare's bot data, SK Hynix's capacity expansion.
00:12:02 Those are concrete actions with long timelines. The model layer is still where all the attention lives, but the money in this buildout cycle is settling into the rocks underneath. Seln Oriax.