Archive CONSTRUCT
The Agent Needs a Computer / DISPATCH 009
PDF RSS

Dispatch 009 · 2026-05-21

The Agent Needs a Computer

/ 00:14:21 / 19 sources

“Single-turn chat can get cheap while agent work still pays for state, tools, and retries.”

— Lenar Kess, today's narration

Single-turn chat can get cheap while agent work still pays for state, tools, and retries.

  • The Agent Needs a Computer

Chapters

  1. 00:00:00 Transcript

Sources

19 cited
  1. 1

    Ethan Mollick on compute scarcity and agent economics

    Thread Ethan Mollick — AI adoption researcher commenting on the cost split between chatbots and complex agentic workflows.

    complex agentic workflows even as single-turn chatbots get cheaper

    x.com/emollick/status/2057565824341127432 →
    Details
    Cited text
    complex agentic workflows even as single-turn chatbots get cheaper
    Context
    It anchors the episode's main tension: agent capability is becoming an infrastructure and budget question, not only a model-quality question.
    Key points
    • Compute scarcity may make long-running agent workflows much more expensive than simple chatbot turns.
    • Mollick argues that widely available chatbots and costly agent runs could split access to AI capability.
    • The fetched thread adds that complex agent work can burn far more tokens than single-turn chat.
    Provenance
    Thread · Primary source
  2. 2

    AI Agents Need Computers: Ivan Burazin, Daytona

    Video Latent Space — Interview with Daytona co-founder Ivan Burazin about infrastructure for AI agent sandboxes and persistent agent computers.

    composable computers for AI agents

    www.youtube.com/watch?v=kaX43RRRUKY →
    Details
    Cited text
    composable computers for AI agents
    Context
    It supplies the concrete infrastructure layer underneath the compute-cost argument.
    Key points
    • Daytona pivoted from human dev environments to agent sandboxes after builders said the old product did not fit agent workloads.
    • The company reports 74 percent month-over-month growth in a market Burazin frames as computers for non-human users.
    • Daytona emphasizes persistent state, disk, configurable compute, API control, and isolation rather than one-shot code execution.
    Provenance
    Video · Supporting source
  3. 3

    Run long tasks in Codex using goals

    Video OpenAI — Product release video for Codex Goal Mode across app, IDE extension, and CLI.

    Use goal mode in the Codex app, IDE Extension, or CLI

    www.youtube.com/watch?v=rgh0hMYPcd0 →
    Details
    Cited text
    Use goal mode in the Codex app, IDE Extension, or CLI
    Context
    It shows the product layer adapting to long-horizon agent work.
    Key points
    • Goal Mode treats the objective as both the prompt and the completion condition.
    • OpenAI positions goals for long-running work with steering, side chats, pause and resume, and editable objectives.
    • The release moves Codex toward stateful task execution rather than only reactive chat.
    Provenance
    Video · Supporting source
  4. 4

    Share Codex plugins with your team

    Video OpenAI — Product video showing workspace plugin sharing and deep links for Codex plugins.

    Teams can now distribute custom plugins

    www.youtube.com/watch?v=msSa0tc2TbU →
    Details
    Cited text
    Teams can now distribute custom plugins
    Context
    It reframes agent workflows as shared team infrastructure that needs permissions and versioning.
    Key points
    • Plugins can be shared with specific people or everyone in a workspace.
    • Shared plugins appear in a common directory and can be distributed with direct links.
    • The example plugin automates validation and refactoring before review.
    Provenance
    Video · Supporting source
  5. 5

    Introducing Appshots in Codex

    Video OpenAI — Product video about attaching a Mac app window to a Codex thread.

    attach your app window to a Codex thread

    www.youtube.com/watch?v=QKYbGCvNpFo →
    Details
    Cited text
    attach your app window to a Codex thread
    Context
    It expands the agent's context from repository text to live desktop state.
    Key points
    • Appshots let a user attach visible app context to a Codex thread from macOS.
    • The feature gives Codex both visual and app-context input from the user's workstation.
    • It sits beside OpenAI's remote-Mac capability in the broader move toward workstation-aware agents.
    Provenance
    Video · Supporting source
  6. 6

    OpenAI Developers on Codex using a locked remote Mac

    Thread OpenAI Developers — Official OpenAI developer account announcing remote Mac use for Codex.

    your Mac doesn’t have to be unlocked

    x.com/OpenAIDevs/status/2057536706778378692 →
    Details
    Cited text
    your Mac doesn’t have to be unlocked
    Context
    It moves Codex closer to operating on the user's actual computer, where permission boundaries are messier.
    Key points
    • OpenAI says Codex can securely use apps on a Mac even when the screen is off and locked.
    • The feature targets remote and phone-driven Codex workflows.
    • The announcement raises practical questions about permissions, app sessions, and local-machine access.
    Provenance
    Thread · Primary source
  7. 7

    Simon Willison releases Datasette Agent alpha

    Thread Simon Willison — Creator of Datasette announcing an AI assistant for querying SQLite-backed data through Datasette.

    a conversational AI assistant for Datasette

    x.com/simonw/status/2057554315821371543 →
    Details
    Cited text
    a conversational AI assistant for Datasette
    Context
    It is a compact example of agents becoming native features inside specific software surfaces.
    Key points
    • Datasette Agent can answer questions about data in SQLite databases.
    • The assistant can be extended with plugins for more tools and features.
    • The agent sits inside an existing data tool rather than a blank chat product.
    Provenance
    Thread · Primary source
  8. 8

    Pi built into the Entire CLI

    Thread Entire — Product account announcing native Pi integration in the Entire command-line interface.

    connect directly to Entire checkpoints, commits, and session history

    x.com/EntireHQ/status/2057568897868444011 →
    Details
    Cited text
    connect directly to Entire checkpoints, commits, and session history
    Context
    It supports the episode's argument that agents need tool-native state and history.
    Key points
    • Pi moved from an external agent plugin to a native CLI integration.
    • The integration connects to checkpoints, commits, and session history.
    • It treats agent memory as part of the developer workflow rather than an external chat log.
    Provenance
    Thread · Primary source
  9. 9

    Yohei Nakajima on Active Graph

    Thread Yohei Nakajima — Builder describing an Active Graph agent architecture influenced by blackboard systems.

    rollback, fork, diff agent runs

    x.com/yoheinakajima/status/2057533315000017… →
    Details
    Cited text
    rollback, fork, diff agent runs
    Context
    It gives a vocabulary for versioned agent state beyond linear transcripts.
    Key points
    • The post frames Active Graph around rollback, fork, diff, and behaviors that can write behaviors.
    • It invokes 1970s blackboard systems as an architectural ancestor.
    • The item is best treated as a concept signal rather than a proven shipped system.
    Provenance
    Thread · Primary source
  10. 10

    François Chollet on apps and text boxes

    Thread François Chollet — AI researcher commenting on the future of applications and user interfaces.

    Apps become services and UIs become text boxes

    x.com/fchollet/status/2057532308056604788 →
    Details
    Cited text
    Apps become services and UIs become text boxes
    Context
    It lets the hosts separate the visible text interface from the operational substrate behind it.
    Key points
    • Chollet argues that app and user-interface concepts may dissolve into service-backed text boxes.
    • The episode uses this as a point of friction rather than accepting it wholesale.
    • The counterargument is that state, permissions, tools, and history still carry much of the work.
    Provenance
    Thread · Primary source
  11. 11

    Ali Hatamizadeh announces Gated DeltaNet-2

    Thread Ali Hatamizadeh — Researcher announcing a new paper on Gated DeltaNet-2 and linear attention.

    Decoupling Erase and Write in Linear Attention

    x.com/ahatamiz1/status/2057586630450610673 →
    Details
    Cited text
    Decoupling Erase and Write in Linear Attention
    Context
    It shows model architecture work responding to the same cost and memory pressure that agent products face.
    Key points
    • Gated DeltaNet-2 is presented as a new recurrent or hybrid-attention architecture.
    • The announcement claims head-to-head wins over KDA and Mamba-3.
    • The hosts avoid treating the claims as independently verified.
    Provenance
    Thread · Primary source
  12. 12

    Sebastian Raschka on Gated DeltaNet-2

    Thread Sebastian Raschka — Machine-learning educator and author highlighting Gated DeltaNet-2 as a hybrid-attention paper to read.

    one of my favorite hybrid attention newcomers

    x.com/rasbt/status/2057599925878169761 →
    Details
    Cited text
    one of my favorite hybrid attention newcomers
    Context
    It helps justify a short architecture segment without overstating the result.
    Key points
    • Raschka flags Gated DeltaNet as a notable newcomer in the transformer stack.
    • His post gives external context that the paper is being watched by practitioners.
    • The episode uses this as a reading-stack signal, not proof of performance.
    Provenance
    Thread · Primary source
  13. 13

    Multi-Stream LLMs

    Source Max Planck Institute for Intelligent Systems authors via HN summary — Research paper surfaced through Hacker News about separating prompts, thinking, and I/O streams.

    parallelizing/separating prompts, thinking, I/O

    arxiv.org/abs/2605.12460 →
    Details
    Cited text
    parallelizing/separating prompts, thinking, I/O
    Context
    It supplies a model-architecture echo of the product-control-plane theme.
    Key points
    • The HN summary describes parallel streams for prompts, thinking, and I/O.
    • The episode treats the idea as a research signal about token-stream structure.
    • The hosts connect it cautiously to product-level needs for side chats, state views, and tool lanes.
    Provenance
    Source · Background source
  14. 14

    The Information report summary on OpenAI and Anthropic Q1 revenue

    Article Techmeme summary of The Information — Aggregator entry summarizing reported financials for OpenAI and Anthropic.

    OpenAI generated ~$5.7B in revenue in Q1

    www.techmeme.com/260521/p35 →
    Details
    Cited text
    OpenAI generated ~$5.7B in revenue in Q1
    Context
    It places agent compute costs inside the business model rather than treating them as only a technical detail.
    Key points
    • The summary says OpenAI generated about $5.7 billion in Q1 revenue, about $1 billion more than Anthropic.
    • It also says OpenAI's adjusted operating income margin was negative 122 percent.
    • The episode explicitly flags that it is relying on the summary, not the full article.
    Provenance
    Article · Supporting source
  15. 15

    HedgieMarkets claim about Claude Code license costs

    Thread HedgieMarkets — Market commentary account reporting unverified internal license-cost claims.

    token-based billing made the cost untenable

    x.com/HedgieMarkets/status/2057531661785628… →
    Details
    Cited text
    token-based billing made the cost untenable
    Context
    It lets the hosts discuss budget governance while clearly marking uncertainty.
    Key points
    • The post claims Microsoft canceled internal Claude Code licenses because token-based billing became too costly.
    • It also mentions an Uber CTO memo warning about AI budget burn.
    • The episode treats these as unverified claims that still point to a plausible procurement concern.
    Provenance
    Thread · Primary source
  16. 16

    Techmeme summary of California AI labor subsidy executive order

    Article Techmeme summary of New York Times reporting — Aggregator entry summarizing Governor Gavin Newsom's executive order on AI and labor subsidies.

    study subsidies for companies that don't replace workers with AI

    www.techmeme.com/260521/p36 →
    Details
    Cited text
    study subsidies for companies that don't replace workers with AI
    Context
    It connects agent economics to labor policy without pretending the policy design is settled.
    Key points
    • The summary says California agencies will work with the AI industry and others on subsidies for companies that don't replace workers with AI.
    • The episode treats this as early policy language rather than a mature program.
    • The hosts focus on the measurement problem around labor displacement and hiring substitution.
    Provenance
    Article · Supporting source
  17. 17

    U.S. and Plaintiff States v. Constellation Energy Corporation et al.

    Article U.S. Department of Justice Antitrust Division — Official DOJ case page for antitrust civil filings involving Constellation Energy.

    Documents posted on May 21, 2026

    www.justice.gov/atr/case/us-and-plaintiff-s… →
    Details
    Cited text
    Documents posted on May 21, 2026
    Context
    It broadens the cost conversation from tokens to electricity, datacenter access, and concentrated inputs.
    Key points
    • The DOJ case is an energy and antitrust item, not a direct AI product release.
    • The episode uses it as an adjacent infrastructure signal because AI compute depends on power markets.
    • The hosts avoid specific legal claims beyond the packet summary.
    Provenance
    Article · Supporting source
  18. 18

    Ethan Mollick on GPT-5.2 peer review study

    Thread Ethan Mollick — AI researcher summarizing a study on AI and human scientific peer review.

    45 scientists took 469 hours

    x.com/emollick/status/2057528309727088907 →
    Details
    Cited text
    45 scientists took 469 hours
    Context
    It extends the agent discussion into high-stakes knowledge work and review institutions.
    Key points
    • Mollick says GPT-5.2 reached expert level in a peer-review study.
    • The study involved 45 scientists evaluating reviews of 82 papers, according to the packet.
    • The hosts treat it as a significant result that still needs close reading.
    Provenance
    Thread · Primary source
  19. 19

    Summon Governance on oversight and execution

    Thread Summon Governance | AIGOS — Governance account arguing that model-centered oversight weakens as systems become more capable.

    Oversight degrades when it stays around the model

    x.com/SummonAIGOS/status/2057525117442216319 →
    Details
    Cited text
    Oversight degrades when it stays around the model
    Context
    It closes the loop between agent capability, workstation access, and the need to supervise actual execution.
    Key points
    • The post argues that chain-of-thought review, probes, evals, monitors, and audits weaken around more capable systems.
    • It proposes execution as the more durable point of control.
    • The episode paraphrases the argument and ties it to reviewable agent runs.
    Provenance
    Thread · Primary source