Archive CONSTRUCT
Agents Move Into the Inner Loop / DISPATCH 007
PDF RSS

Dispatch 007 · 2026-05-18

Agents Move Into the Inner Loop

/ 00:13:43 / 8 sources

“The tool has to give the agent a repairable contract, not just an endpoint.”

— Lenar Kess, today's narration

The tool has to give the agent a repairable contract, not just an endpoint.

  • Agents Move Into the Inner Loop

Chapters

  1. 00:00:00 Transcript

Sources

8 cited
  1. 1

    Agentic Discovery of Neural Architectures: AIRA-Compose and AIRA-Design

    Source Alberto Pepe, Chien-Yu Lin, Despoina Magka, Bilge Acun, Yannan Nellie Wu, Anton Protopopov, Carole-Jean Wu, Yoram Bachrach — arXiv cs.AI authors

    AIRA-Compose uses 11 agents to explore fundamental computational primitives under a 24-hour budget.

    arxiv.org/abs/2605.15871 →
    Details
    Cited text
    AIRA-Compose uses 11 agents to explore fundamental computational primitives under a 24-hour budget.
    Context
    It turns model architecture search into an agent workflow with a budget, evals, and scale-up risk.
    Key points
    • AIRA-Compose searches high-level model architectures and carries candidates from million-parameter tests to 350 million, 1 billion, and 3 billion parameter scales.
    • The paper reports 1 billion parameter AIRA models outperforming Llama 3.2 and Composer-found baselines, with named downstream gains of 2.4 percent and 3.8 percent.
    • AIRA-Design uses twenty agents to write attention mechanisms and training scripts for long-range dependency tasks.
    Provenance
    Source · Background source
  2. 2

    Nathan Lambert on on-policy distillation as a lasting post-training method

    Thread Nathan Lambert — AI researcher writing about post-training methods

    On-policy distillation is on track to be a lasting method in post-training.

    x.com/natolambert/status/2056510299579273447 →
    Details
    Cited text
    On-policy distillation is on track to be a lasting method in post-training.
    Context
    It shows post-training methods changing alongside architecture search, with the model learning from its own current behavior.
    Key points
    • Lambert listed instruction tuning, reinforcement learning from human feedback, Direct Preference Optimization, reinforcement learning with verifiable rewards, and on-policy distillation as method families.
    • Replies pushed on whether on-policy distillation is a true method class or a variant.
    • The thread connected to current work on text feedback and Composer.
    Provenance
    Thread · Primary source
  3. 3

    Anthropic acquires Stainless

    Article Anthropic — Company announcement

    Stainless has powered the generation of every official Anthropic SDK since the earliest days of our API.

    www.anthropic.com/news/anthropic-acquires-s… →
    Details
    Cited text
    Stainless has powered the generation of every official Anthropic SDK since the earliest days of our API.
    Context
    It makes API contracts and generated tool surfaces part of the agent platform race.
    Key points
    • Stainless generates SDKs, CLIs, and MCP servers from API specs.
    • Anthropic frames the acquisition around agent connectivity and developer experience.
    • The announcement names TypeScript, Python, Go, Java, Kotlin, and more as generated SDK targets.
    Provenance
    Article · Supporting source
  4. 4

    Dreams of Code on Zero and Zero Native naming

    Thread Dreams of Code — Developer commentary quoting Chris Tate on Zero

    Zero is a programming language for agents.

    x.com/dreamsofcode_io/status/20565128279305… →
    Details
    Cited text
    Zero is a programming language for agents.
    Context
    It turns agent-native language design into a question about diagnostics, repair, and permission boundaries.
    Key points
    • The quoted Chris Tate post describes Zero as a language for agents with explicit capabilities, JSON diagnostics, and typed safe fixes.
    • Dreams of Code criticizes the naming around Zero Native, WebKit, Zig, and the term native.
    • The useful technical read is that agent-facing languages need structured repair surfaces.
    Provenance
    Thread · Primary source
  5. 5

    Cursor introduces Composer 2.5

    Thread Cursor — AI coding tool provider

    It is more intelligent, better at sustained work on long-running tasks, and more reliable at following complex instructions.

    x.com/cursor_ai/status/2056415413077233983/… →
    Details
    Cited text
    It is more intelligent, better at sustained work on long-running tasks, and more reliable at following complex instructions.
    Context
    It shows coding tools competing on longer agent runs and instruction reliability.
    Key points
    • Cursor positions Composer 2.5 around sustained work and complex instructions.
    • The announcement includes a temporary doubling of included model usage.
    • The script treats this as a product signal, not a proven benchmark result.
    Provenance
    Thread · Primary source
  6. 6

    Vera Arrives: NVIDIA’s First CPU Built for Agents Lands at Top AI Labs

    Article Ian Finder — NVIDIA Blog writer

    AI agents don't run on GPUs alone.

    blogs.nvidia.com/blog/vera-cpu-delivery →
    Details
    Cited text
    AI agents don't run on GPUs alone.
    Context
    It brings CPU orchestration back into the agent infrastructure discussion.
    Key points
    • NVIDIA says Vera systems went to Anthropic, OpenAI, SpaceXAI, and Oracle Cloud Infrastructure.
    • The post names sandboxes, tool calls, orchestration, retrieval, compiling, testing, and simulations as CPU work.
    • NVIDIA lists eighty-eight custom Olympus cores, 1.2 terabytes per second of memory bandwidth, and 50 percent faster per-core performance.
    Provenance
    Article · Supporting source
  7. 7

    Cutting inference cold starts by 40x with LP, FUSE, C/R, and cuda-checkpoint

    Article Modal — Infrastructure platform team

    Together, these optimizations allow inference on Modal to spin up 40x faster: 50 seconds instead of 2k.

    modal.com/blog/truly-serverless-gpus →
    Details
    Cited text
    Together, these optimizations allow inference on Modal to spin up 40x faster: 50 seconds instead of 2k.
    Context
    It gives a concrete software-side answer to agent products that need fast sandbox and inference startup.
    Key points
    • Modal describes GPU cold starts dropping from roughly two kiloseconds to about fifty seconds.
    • The approach combines idle GPU buffers, lazy container filesystems, CPU checkpoint and restore, and CUDA checkpoint and restore.
    • The post frames GPU allocation utilization as output achieved over capacity paid for.
    Provenance
    Article · Supporting source
  8. 8

    Agora-1: The Multi-Agent World Model

    Article Oliver Cameron — Odyssey author announcing Agora-1

    Agora-1 functions as a learned game engine.

    odyssey.ml/introducing-agora-1 →
    Details
    Cited text
    Agora-1 functions as a learned game engine.
    Context
    It shifts world models from single-user video generation toward shared state and multi-agent training environments.
    Key points
    • Agora-1 allows up to four players to interact in the same generated GoldenEye-style simulation in real time.
    • Odyssey separates simulation dynamics from visual rendering and uses a shared state model plus a DiT-based renderer.
    • The team links multi-agent interaction to reinforcement learning data generation and shared simulated environments.
    Provenance
    Article · Supporting source