◆ Braid Daily · 2026-06-10

US officials reportedly tell CAISI to halt public model assessments

10 June 2026

A reported order to stop government-backed model evaluations, plus the week's compute-capital map and a sharp agent-security paper run.

The lead

According to reports aggregated this morning, administration officials directed CAISI to halt its public model assessments. Should the order stick, builders lose one of the few government-backed reads on what frontier models can do. It follows a week of Braid issues treating agent safety as an evaluation problem, not pure capability.

Read source

On the timeline

The order, from inside the conversation

X / roon

A primary post tying the reported halt to a wider AI executive order. Read it alongside the aggregated coverage rather than on its own, since the underlying claim is still developing.

Read source

Capacity becomes capital

OpenAI plans a 10GW build, reportedly with Nvidia backing

Techmeme

OpenAI is moving into ten-gigawatt-scale infrastructure, with Ohio power and reported Nvidia involvement. Treat the financing as a sourced report, not a closed deal.

Read source

Meta signs its first India AI data-center deal, with Reliance

TechCrunch

Meta's first AI data center in India puts a major new market on the compute map, partnered with Reliance.

Read source

More on Meta's India push: data centers and energy

Techmeme

A wider read on Meta's India moves across data-center and energy commitments, for readers who want the surrounding detail.

Read source

SK Hynix and the memory side of the buildout

Techmeme

Memory supply is easy to forget in the AI buildout. This tracks SK Hynix's US footprint as a supply-and-capital story.

Read source

China-linked capital controls

Techmeme

Cross-border capital flows into AI ventures keep bumping into geopolitics. This is the constraint side of the same map.

Read source

Labor and enterprise trust

Tata's chairman says AI agents could replace half of TCS jobs

Techmeme

A concrete enterprise-labor number from one of the world's largest IT-services firms, tying agent workflows directly to hiring pipelines.

Read source

Anthropic ships its strongest model, then rations access

Forbes

Fable 5 arrives with rate limits and access expiration. The release-then-ration pattern is becoming its own planning constraint for teams building on the model.

Read source

AWS Bedrock to require sharing data with Anthropic for Mythos and future models

Hacker News

A reported Bedrock policy would route enterprise data to Anthropic for newer models, which puts data boundaries back at the center of the build-vs-buy call. Verify the policy text before treating it as settled.

Read source

Agent security, from today's arXiv

GitInject: testing agent vulnerabilities in CI/CD pipelines

arXiv

A framework for probing how coding agents wired into CI/CD can be turned into a supply-chain attack path.

Read source

CIAware-Bench: do models know when they're being controlled?

arXiv

A benchmark for whether a model can tell it's under a control intervention, which matters because control-aware models can make safety evals read better than the deployment will.

Read source

Measuring PII leakage to agents

arXiv

A look at how personal data leaks through agent interactions, with implications for how multi-agent systems should be partitioned.

Read source

Deployment-time memorization in agents

arXiv

Names and quantifies a failure where agents memorize at deployment time, with a privacy-versus-utility trade-off the authors try to measure directly.

Read source

Watch

EU sets an info session on its transparency code for AI-generated content

European Commission

An early procedural step on content-signature obligations. Bookmark it now; it gets more concrete once the code text and signature process land.

Read source

Companion episode

When the Evaluation Goes Back Inside

2026-06-10 · 00:24:54

Episode Watch on YouTube Sources Transcript Chapters JSON

Two threads carry over from the week: the compute-finance map keeps redrawing itself with new geographies, and agent safety keeps showing up as an evaluation problem. Today those threads cross — if government-backed assessments go dark while the buildout accelerates, the public read on these systems gets thinner just as the stakes climb.